Your browser does not support javascript! Please enable it, otherwise web will not work for you.

Cloud Site Reliability Engineer @ Zafin Software Centre

Home > IT & Information Security - Other

 Cloud Site Reliability Engineer

Job Description

Interested candidates please register using the link: https://forms.office.com/r/RxNdQsFB7Q


Walk-in details:


Date : 19th July 2025 - Saturday

Time : 9.30 AM - 4.30 PM

Venue: Zafin Software Centre of Excellence Private Limited,

7th floor, Niagara Building, Embassy Taurus Techzone,

Technopark Phase III, Kulathur, Thiruvananthapuram.Kerala, 695583


Zafin is seeking a Cloud Site Reliability Engineer II (CSRE II) to lead strategic initiatives in ensuring the reliability, scalability, and performance of our cloud infrastructure and applications. This advanced role requires mastery in cloud technologies, strategic planning, and incident management to drive innovative solutions and operational excellence.

As a CSRE II, you will influence the direction of cloud reliability strategies, mentor junior engineers, and lead significant projects that have a broad organizational impact. This position reports directly to the VP of Cloud Services and requires a proactive, collaborative mindset to achieve operational and strategic objectives.


Key Responsibilities

  • Lead and manage the resolution of complex technical issues involving Zafins products and Azure cloud environment.
  • Design and implement strategic operational enhancements to improve resiliency and system reliability.
  • Conduct in-depth Root Cause Analysis (RCA) for high-severity incidents and drive initiatives to reduce error recurrence.
  • Represent the organization in external client escalation calls, providing expert guidance and solutions.
  • Architect and optimize cloud infrastructure for high performance, scalability, and cost-effectiveness.
  • Provide thought leadership in managing and scaling container orchestration platforms such as AKS and OpenShift.
  • Oversee the implementation of advanced monitoring solutions and integrate predictive analytics for proactive issue resolution.
  • Develop and execute automation strategies to streamline operational workflows and incident responses.
  • Create and maintain comprehensive documentation of cloud architectures, processes, and incident management strategies.
  • Mentor and coach junior engineers, fostering a culture of continuous learning and innovation.
  • Drive strategic initiatives, collaborating with cross-functional teams to achieve organizational objectives.

Qualifications

  • Bachelors degree in Computer Science, Engineering, or a related field (Masters degree preferred).
  • 5 - 12 years of experience in cloud support, operations, or a related role.
  • Advanced expertise in Microsoft Azure (preferred) or equivalent cloud platforms.
  • Demonstrated experience in designing and scaling container orchestration systems like AKS or OpenShift.
  • Proven leadership in managing automated deployment pipelines, including Azure DevOps.
  • Mastery in enterprise monitoring platforms (e.g., Azure Insights, Grafana) and predictive analytics tools.
  • Advanced scripting skills with PowerShell, Python, or similar languages.
  • Extensive experience in incident management and defining SLAs for global production environments.
  • In-depth knowledge of database management, particularly Postgres.

Preferred Qualifications

  • Advanced certifications in cloud platforms (e.g., Azure Solutions Architect Expert).
  • Experience with ITSM tools and processes (e.g., ServiceNow).
  • Comprehensive understanding of security and compliance in cloud environments.

Soft Skills

  • Exceptional analytical and problem-solving abilities.
  • Strong leadership and mentoring skills.
  • Advanced communication and collaboration capabilities.
  • Visionary approach to operational innovation and strategic planning.

Job Classification

Industry: IT Services & Consulting
Functional Area / Department: IT & Information Security
Role Category: IT & Information Security - Other
Role: IT & Information Security - Other
Employement Type: Walk-ins

Contact Details:

Company: Zafin Software Centre
Location(s): Thiruvananthapuram

+ View Contactajax loader


Keyskills:   Azure Site Reliability Engineering Azure Cloud Shell Scripting Python

 Fraud Alert to job seekers!

₹ Not Disclosed

Similar positions

Cloud Operations Engineer II & III

  • Zafin Software Centre
  • 3 - 8 years
  • Thiruvananthapuram
  • 3 days ago
₹ Not Disclosed

Manager, Cloud Support

  • Zafin Software Centre
  • 12 - 20 years
  • Thiruvananthapuram
  • 4 days ago
₹ Not Disclosed

Manager, Cloud Support

  • Zafin Software Centre
  • 12 - 20 years
  • Thiruvananthapuram
  • 4 days ago
₹ Not Disclosed

Cloud Support Analyst II

  • Zafin Software Centre
  • 3 - 6 years
  • Thiruvananthapuram
  • 4 days ago
₹ Not Disclosed

Zafin Software Centre

Pluribus Networks