Hi,
I hope you are doing well.
My name is Karishma, and I am a Sr. Recruiter, Talent Acquisition, at K-Tek Resourcing. If you are interested in and available for this position, please feel free to contact me at 18329571806.
Please attach your updated resume.
Key Responsibilities:-
Design and implement resilient IT infrastructure solutions, focusing on high availability and performance.
Establish and monitor Service Level Objectives (SLOs) and Service Level Agreements (SLAs).
Incident and Problem Management:
Proactively monitor and troubleshoot infrastructure issues to minimize Mean Time to Recovery (MTTR).
Conduct root cause analysis (RCA) for P1/P2 incidents and implement preventive measures.
Automation and Toil Reduction:
Develop and implement Infrastructure-as-Code (IaC) solutions using tools like Terraform, Ansible, or similar.
Automate repetitive tasks to improve operational efficiency and reduce human intervention.
Observability and Monitoring:
Set up and manage observability tools such as Grafana, Prometheus, ELK, or Azure Monitor.
Ensure comprehensive logging, metrics, and alerting for all critical systems.
DevOps Integration:
Collaborate with DevOps teams to integrate CI/CD pipelines into infrastructure workflows.
Support containerized environments (e.g., Docker, Kubernetes) and orchestration platforms.
Required Skills and Qualifications:-
Bachelor’s degree in Computer Science, IT, or a related field.
Proven experience in managing and scaling IT infrastructure in on-premises, cloud, or hybrid environments.
Proficiency in one or more cloud platforms (e.g., AWS, Azure, GCP).
Strong scripting and programming skills (e.g., Python, Bash, PowerShell).
Hands-on experience with automation tools such as Terraform, Ansible, or Chef.
Familiarity with observability tools (e.g., Grafana, ELK).
Solid understanding of networking, virtualization, and storage concepts.