- Design, build, and maintain the core infrastructure of DeepSource, ensuring high availability, scalability, and performance.
- Minimize the risk of reliability-related downtimes by optimizing systems for durability, availability, latency, and efficiency.
- Debug production issues in cloud and customer environments across services and various levels of the stack, identifying and resolving root causes.
- Help plan the roadmap and future growth of DeepSource s infrastructure to support an expanding user base.
- Lead the design of software components and systems to ensure availability, scalability, and efficiency of DeepSources services.
- Collaborate with the engineering team to implement best practices and improve system reliability.
- Participate in on-call rotations to ensure service reliability.
Preferred Qualifications
- 2+ years of professional experience managing production infrastructure.
- Expertise in working with Kubernetes in production environments.
- Experience with one or more cloud providers, ideally GCP.
- Working knowledge of industry best practices with regard to information security.
- Comfortable with Python, Go, or any relevant programming language.
- A deep understanding of Linux is a huge advantage.
- Proven problem-solving skills and a proactive approach to identifying and mitigating risks.
- Ability to work collaboratively in a fast-paced environment.