Role
Participate in planning for high availability, disaster recovery, and compliance
Take responsibility for infrastructure maintenance, availability, performance, and cost reduction
Dive deep to resolve problems at their root and troubleshoot services in our AWS infrastructure
Enhance and maintain our monitoring infrastructure
Develop automation tools for managing our cloud infrastructure
Improve engineering standards, tooling, and processes
Participate in an on-call rotation alongside the engineers who build our production back ends
Eligibility
3+ years of experience managing and troubleshooting large-scale distributed systems, with a start-up mentality
Experience working with Kubernetes, EKS, or any other orchestration service
Experience building CI/CD pipelines
Excellent Linux and troubleshooting skills
Passion for solving problems using open source software
Strong experience working in AWS environments and with other server virtualization technologies
Knowledge of SQL, AWS Redshift, and AWS EMR