Senior AWS DevOps Engineer
Location: USA/Canada, Remote
Title: DevOps & Infra – Shared Services & Replication Engineer (AWS & On-Premise)
The ideal candidate will play a key role in architecting and implementing robust replication strategies to ensure seamless failover capabilities and data consistency between cloud and on-premise systems. This role will assist with building out an on-premises failover environment and onboarding all critical Renaissance applications in the clearing, settlement, and risk pillars. The customer is required by regulators to demonstrate the ability to run all clearing and risk systems through its internal datacenter (“On-Prem”) and to switch all production functionality to On-Prem within the predefined SLA. The newly developed clearing and risk systems take advantage of the scalability, ease of provisioning, and reduced cost offered by cloud services (AWS), and of the cloud's Multi-Region offering as the Disaster Recovery (DR) strategy.
Required skill set: Groovy scripts, Terraform, Jenkins, On-Prem, Automation, VMware, Gradle & Shared infra services
Responsibilities:
· Design, implement, and maintain shared infrastructure services for AWS and On-Premise VMware environments.
· Collaborate with cross-functional teams to establish connectivity and integration between cloud and on-premise resources.
· Design and implement data replication, sync, and failover strategies to ensure high availability and disaster recovery across environments.
· Develop and maintain CI/CD pipelines for automated deployment and configuration management.
· Monitor and optimize shared services performance, scalability, and reliability.
· Troubleshoot and resolve issues related to shared infrastructure services and replication mechanisms.
· Document configurations, processes, and procedures related to shared services and replication engineering.
· Stay updated with industry best practices and emerging technologies related to cloud and on-premise infrastructure.
· As a DevOps engineer, set up CI/CD pipelines for deploying applications and infrastructure components using Terraform.
· Work in the DevOps team to build new shared infrastructure services for the on-premises failover environment: Kubernetes, HashiCorp Vault, S3, Kafka, Databases.
· Work with other teams to establish connectivity between the new failover shared services and existing shared services: Secret, identity, LDAP, DNS, Artifactory, Jenkins, Splunk services.
· Work with the DevOps team to automate deployment of the DR strategy, including automation of the data replication between the cloud and the failover environment required for all applications.
· Continuously improve the processes and the DevOps team using thoughtful, calculated approaches to identify opportunities, and challenge those around you to strive for perfection.
· Ideate solutions to complex technical challenges; code, test, troubleshoot, debug, and document the solutions you develop. Use an agile software development model to produce well-designed programs, scripts, and tools required to provision, configure, and monitor new shared infrastructure services for the on-premises failover environment.
Key Skills:
Must have:
· Strong experience with HashiCorp Vault and Terraform along with AWS services such as EC2, S3, RDS, VPC, IAM, etc.
· Proficiency in VMware virtualization technologies and managing On-Premise VMware environments.
· In-depth knowledge of data replication technologies and strategies, including asynchronous replication, synchronous replication, and log shipping.
· Hands-on experience with CI/CD tools like Jenkins, Harness, Helm Charts, SonarQube, etc.
· Hands-on experience with Maven and Gradle builds.
· Expertise in scripting languages such as Groovy, Python, Bash, or PowerShell, along with YAML, for automation and orchestration.
· Familiarity with networking concepts, including TCP/IP, DNS, VPN, and firewalls.
· Excellent troubleshooting and problem-solving skills in complex distributed systems environments.
· Strong communication and collaboration skills to work effectively in cross-functional teams.
· Deep expertise in automating certificate management using Python and corresponding libraries.
· Strong experience setting up CI/CD pipelines using DevOps tools like Jenkins, Artifactory, Vault, SonarQube, GitHub, Terraform, Rancher, and Harness. Experience deploying on Kubernetes using Rancher or Harness. Knowledge of deploying Kafka MRC and of monitoring tools like LogicMonitor and Splunk. Experience with AWS services is a plus, especially setting up S3 and AWS Artifactory and configuring S3 replication.
· Experience with Kubernetes (K8s) and S3/Cloudian for shared infra.
· Specialization in cloud infrastructure modernization, virtualization, data center setup, DR & BC strategies, and DevOps.
· Experience in on-premises data center operations, AWS hosted data center and operations management
· Pilot Kafka data replication between AWS and On-Prem
· Pilot running PostgreSQL in both AWS and On-Prem, with data replication between them
· Experience with continuous deployment tools, techniques, and automation frameworks – especially Terraform Enterprise and Ansible.
· Hands-on experience writing testable scripts using Python or other languages.
· Experience managing Helm charts and deploying into Kubernetes (k8s).
· Expertise with monitoring-related tools and frameworks like Splunk, LogicMonitor, SignalFx, and Prometheus.
· Worked on projects involving deployment and management of microservices and hybrid cloud/on-prem infrastructure.
· Intermediate working knowledge of development tools like Maven/Gradle, Java, and distributed message brokers like Kafka, RabbitMQ, ActiveMQ, Amazon Kinesis, etc.
Good to Have:
· Certification in AWS (e.g., AWS Certified DevOps Engineer) or VMware (e.g., VMware Certified Professional).
· Experience with containerization technologies such as Docker and Kubernetes.
· Knowledge of hybrid cloud architectures and multi-cloud environments.
· Familiarity with database replication technologies (e.g., MySQL replication, PostgreSQL streaming replication).
Thanks and Best Regards,
Praveen Kumar | SRI Tech Solutions INC
Certified Minority Business Enterprise | An E-Verify Employer
SRI Tech Solutions is an equal opportunity employer and does not discriminate on the basis of race, color, gender, religion, age, sexual orientation, national origin or citizenship status or ethnic origin, disability, marital status, veteran status, or any other occupationally irrelevant criteria.