Candidate should be able to work with Global operations & Core Practices teams to operate, industrialize & strengthen operational practices. Should have proven experience in handling large scale and growing infrastructure across Data Centers and heterogeneous Cloud platforms.
A Team player with good communication and problem-solving skills. Prior experience with DevOps practices & working in Global operations model (follow the sun), should be able to lead tasks independently & mentor junior team members.
MUST HAVE
Strong experience in container orchestration and server automation tools such as Kubernetes, Docker, AWS EKS, Ansible & Terraform
Experience in deploying and managing highly scalable fault resilient systems
o Infrastructure as code (IAC) using Terraform
o CI/CD pipeline automation using Jenkins/GitLab CI/Travis
o Scripting - Automation using Shell, Python, Groovy scripts
Strong Knowledge and experience of AWS services : ( 4 + years of Exp )
o Compute Services (EC2 Creation)
o AWS KeyPair creation
o Route 53
o Storage / IAM
o VPN setup
o ELB Creation
o CloudWatch, CloudTrail
o Cloud Formation
In depth knowledge of Monitoring tools like ICINGA, Prometheus
Able to design, maintain & support
o DR & Failover architectures
o OS/APP Patching & Upgrades
Incident management experience using runbooks & Troubleshooting
GOOD TO HAVE
Ticketing tools like Service Now, Jira
ITIL certification, AWS Certification, Kubernetes Administrator
Linux administration & commands knowledge.
Networking - Virtual Network, DNS, IPs, Security Concepts (like ACLs, firewalls etc.)
Familiar with various network protocols e.g., HTTP/FTP/SFTP/SMTP
Knowledge of SSL, SSH, LDAP/firewalls, Certificates
Total Experience Expected: 04-06 yearsQualifications
B.E./ B.Tech./ MCA
Keyskills: Automation Networking LDAP VPN DNS Incident management HTTP Troubleshooting Operations Python