Site Reliability Engineer @ GlobalLogic

Job Description

Description:

Hiring SRE Lead for the Hyderabad location


Requirements:

Qualifications:
Bachelor's or Master's degree in Computer Science, Information Systems, Engineering, or a related technical field.
12+ years of total experience in infrastructure, platform engineering, or software development roles, including at least 3-5 years in an SRE or DevOps leadership role.
Deep understanding of Linux/Unix systems, networking fundamentals, and containerized environments (Docker, Kubernetes).
Proven experience managing large-scale production systems, including high-availability, distributed, and event-driven architectures.
Strong hands-on experience with cloud platforms such as AWS, GCP, or Azure and infrastructure-as-code tools (e.g., Terraform, CloudFormation).
Proficiency in at least one scripting or programming language (Python, Go, Shell, Java, etc.).
Demonstrated experience building observability solutions (metrics, logs, traces) and integrating them into proactive monitoring and alerting systems.
Solid understanding of incident response practices, runbook automation, on-call rotation management, and disaster recovery planning.
Familiarity with modern CI/CD tools (Jenkins, GitLab CI, Argo CD, Spinnaker) and release automation best practices.
Strong problem-solving and debugging skills, especially in high-pressure, production-critical environments.
Excellent leadership, communication, and cross-functional collaboration skills.


Job Responsibilities:

Responsibilities:
Lead the SRE function, owning end-to-end service reliability, observability, incident management, capacity planning, and production readiness.
Establish SLOs, SLIs, and error budgets in collaboration with product and engineering teams to drive service quality goals.
Build and maintain highly available, fault-tolerant, and self-healing infrastructure leveraging IaC, automation, and scalable architectures.
Design and implement monitoring, alerting, and observability platforms using tools like Prometheus, Grafana, Datadog, ELK/EFK stack, or equivalent.
Drive the evolution of CI/CD pipelines, release automation, and safe deployment practices using GitOps or similar methodologies.
Lead and refine the incident management lifecycle, including root cause analysis (RCA), incident postmortems, and production runbooks.
Optimize cost, performance, and scalability of cloud infrastructure across hybrid or multi-cloud environments (AWS, GCP, Azure).
Champion DevSecOps and SRE best practices, advocating for early detection, chaos engineering, and continuous improvement in resilience engineering.
Mentor and develop a team of SREs and platform engineers; conduct performance reviews and technical coaching.
Serve as a key advisor in architectural reviews to ensure systems are built with reliability, scalability, and observability in mind.
Maintain strong partnerships with Security, Product, QA, and Engineering teams to support agile development and delivery.


What We Offer:

Exciting Projects: We focus on industries like High-Tech, communication, media, healthcare, retail and telecom. Our customer list is full of fantastic global brands and leaders who love what we build for them.

Collaborative Environment: You can expand your skills by collaborating with a diverse team of highly talented people in an open, laid-back environment, or even abroad in one of our global centers or client facilities!

Work-Life Balance: GlobalLogic prioritizes work-life balance, which is why we offer flexible work schedules, opportunities to work from home, and paid time off and holidays.

Professional Development: Our dedicated Learning & Development team regularly organizes communication skills training (GL Vantage, Toast Master), stress management programs, professional certifications, and technical and soft-skills training.

Excellent Benefits: We provide our employees with competitive salaries, family medical insurance, Group Term Life Insurance, Group Personal Accident Insurance, NPS (National Pension Scheme), periodic health awareness programs, extended maternity leave, annual performance bonuses, and referral bonuses.

Fun Perks: We want you to love where you work, which is why we host sports events, cultural activities, and corporate parties, and offer food at subsidized rates. Our vibrant offices also include dedicated GL Zones, rooftop decks, and a GL Club where you can drink coffee or tea with your colleagues over a game of table, and we offer discounts for popular stores and restaurants!

Job Classification

Industry: IT Services & Consulting
Functional Area / Department: Engineering - Software & QA
Role Category: DevOps
Role: Site Reliability Engineer
Employment Type: Full time

Contact Details:

Company: GlobalLogic
Location(s): Hyderabad


Keyskills: fundamentals, continuous integration, kubernetes, functional, python, ci/cd, networking, cloud platforms, monitoring, docker, scripting, unix, system environment, git, collaboration, linux, leadership, cloud infrastructure, debugging, aws, programming, devops tools, communication skills

₹ Not Disclosed

GlobalLogic

GlobalLogic is a full-lifecycle product development services leader that combines deep domain expertise and cross-industry experience to connect makers with markets worldwide. Using insight gained from working on innovative products and disruptive technologies, we collaborate with customers to...