Your browser does not support javascript! Please enable it, otherwise web will not work for you.

Site Reliability Engineer-L3 @ Cloud Kinetics

Home > Devops

 Site Reliability Engineer-L3

Job Description

We are looking for a skilled TechOps Lead to manage and maintain our OTT platforms technical

Operation. The ideal candidate will have experience in Application Support, Content Delivery

Networks, Logging & Triaging, and Cloud-based technologies. You will be responsible for ensuring high availability, scalability, and performance of our platform. You will be responsible for triaging issues and finding issues using trend analysis.


Role & Responsibilities:


  • Must be aware of end to end incident handling.
  • Monitor, identify, and respond to incidents promptly to minimize business impact.
  • Prioritize, classify, and escalate incidents based on severity and urgency.
  • Coordinate and facilitate communication between stakeholders during incidents.
  • Perform root cause analysis and implement preventive measures.
  • Document incidents, resolutions, and generate performance reports.
  • Provide Technical support by handling and consulting on BAU, Incidents for respective applications.
  • Act as an escalation point for user issues and requests and from L1/L2 support.
  • Report issues to senior management.
  • Define, document, and maintain SLAs, technical documentation, and knowledge bases to support platform.
  • Monitor application performance, identifying areas for improvement.
  • Build and maintain effective and productive relationships with stakeholders in business, development, product, and third-party system providers.
  • Facilitate coordination across L1/L2 and L3/engineering Teams to investigate and resolve ongoing platform or application issues impacting business.
  • Candidate will have to work in shifts as part of Rota covering 24*7.
  • In event of major outage or issues we may ask for flexibility to help provide appropriate cover.
  • Weekend on-call coverage needs to be provided on rotational/need basis.
  • Understand reliability metrics and enhance automation solutions for auto-healing and incident resolution. Understand and improve applications and plan for faster MTTD, MTTR, and auto healing

Preferred candidate profile:


  • 4 to 7 years in Application Support/SRE or a related field.
  • Should have experience with any  API monitoring tool (Experience with Datadog and Cora Logix is ideal)
  • Knowledge of CDNs ( Akamai, Cloudflare etc.) and cloud-based technologies ( AWS,GCP, etc.)
  • Comfortable with large scale production systems, configurations management, load balancing & distributed systems.
  • Must be strong in backend development (80%) with some frontend experience (20% )
  • Experience with troubleshooting tools and techniques for FE,BE, API etc.
  • Familiar with job scheduling tools: cron and experience with application monitoring tools.
  • Knowledge of web services ( SOAP based and RESTful Web services )
  • Prior experience in L2/L3 support.
  • Well versed with anyone of the Scripting language ( Shell, Python etc. )
  • Strong Problem-Solving Skills and attention to detail

Should you be interested please share the updated copy of resume on Jy***********t@cl***********s.com


Job Classification

Industry: IT Services & Consulting
Functional Area / Department: Engineering - Software & QA
Role Category: DevOps
Role: Site Reliability Engineer
Employement Type: Full time

Contact Details:

Company: Cloud Kinetics
Location(s): Chennai

+ View Contactajax loader


Keyskills:   Application Performance Monitoring Log Analysis Coralogix latency Application Monitoring Datadog Monitoring Tools Performance Monitoring uptime

 Fraud Alert to job seekers!

₹ Not Disclosed

Similar positions

Site Reliability Engineer IV

  • Avalara Technologies
  • 5 - 10 years
  • Pune
  • 3 days ago
₹ Not Disclosed

Site Reliability Engineer Lead

  • Optum
  • 9 - 14 years
  • Hyderabad
  • 3 days ago
₹ 0-37.5 Lacs P.A.

DevOps / Site Reliability Engineer - IAC Terraform

  • Emperen Technologies
  • 6 - 9 years
  • Pune
  • 4 days ago
₹ Not Disclosed

Site Reliability Engineer

  • Camp Systems
  • 5 - 10 years
  • Hyderabad
  • 4 days ago
₹ Not Disclosed

Cloud Kinetics

Cloud Kinetics is a premier provider of digital solutions. We enable enterprises, service providers, and ISVs to drive their business objectives with minimal dependence on infrastructure elements. We offer unique platform-driven services aimed towards accelerating customers’ business tran...