Site Reliability Engineer III (SRE) - Cloud Applications @ Guidewire Software

Job Description

Summary
At Guidewire, we deliver the software that Property and Casualty (P&C) insurance companies rely on to protect their customers during crises, natural disasters, accidents, and cyber risks. Our core applications enable insurers to sell and underwrite policies, settle claims, and bill their customers. We also offer a suite of innovative products for data management, digital portals, and predictive analytics.
Hundreds of insurers worldwide use Guidewire's products, running on our cutting-edge Guidewire Cloud Platform, to handle billions of dollars in business. We are dedicated to providing the tools and technology that help insurers protect and support their customers when they need it most.
The Opportunity
We are seeking a Site Reliability Engineer III who is eager to contribute to the transformation of the insurance industry with our leading cloud platform. As a member of the SRE-Application team, you'll play a critical role in ensuring the reliability, performance, and scalability of applications running on our Guidewire Cloud Platform. This position offers a unique opportunity to apply your skills in automation, software engineering, and operational discipline to support our cloud-based solutions.
What You'll Do
  • Work with development teams to troubleshoot and resolve issues, minimizing customer impact.
  • Develop and maintain automated runbooks to manage issues proactively.
  • Apply engineering principles and automation to enhance our operating environments.
  • Monitor and improve the reliability and performance of applications on the Guidewire Cloud Platform.
  • Use your software engineering expertise to optimize systems and reduce manual toil.
  • Document incidents and develop processes to prevent future occurrences.
  • Stay current with industry trends, tools, and best practices in site reliability engineering.
  • Foster a culture of innovation, learning, and continuous improvement.
  • Participate in on-call rotations to ensure the availability and reliability of our services.
What You'll Bring
  • Experience as an SRE or in a similar role, with a focus on improving system reliability.
  • Strong problem-solving skills and the ability to analyze complex systems and devise effective solutions.
  • Effective collaboration and communication skills to work cross-functionally and document processes clearly.
  • Experience with automation, monitoring, and performance optimization tools and techniques.
  • Commitment to maximizing uptime and scalability and to delivering an exceptional end-user experience.
  • Passion for technology and a desire to continuously learn and grow your skills.
  • Alignment with Guidewire's mission to leverage technology to help protect and support others.
Required Skills:
  • Experience with designing and implementing SLIs, SLOs, and Error Budgets
  • Familiarity with application performance monitoring (APM) and telemetry tools to maintain expected service levels for applications
  • Proficiency with Linux system administration and the ability to program/script using Python, Go, Java, shell, or equivalent
  • Experience troubleshooting and debugging distributed systems on cloud infrastructure
  • Experience with CI/CD pipelines within Kubernetes (K8s) and legacy ecosystems
  • Experience creating monitors, dashboards, and synthetic transactions in monitoring tools like Datadog
  • Experience deploying and managing scalable infrastructure within AWS and Kubernetes ecosystems using Terraform and other cloud-native approaches
  • Experience with infrastructure configuration management using GitOps practices or tools such as Puppet or Ansible
  • Understanding of AWS cloud networking and security, with some hands-on experience remediating infrastructure vulnerabilities
Preferred Skills:
  • SRE Certification in one or more categories
  • AWS Certification in one or more categories
  • Experience with SQL, database administration, data pipelines, performance tuning, and schema design
  • Familiarity with pipelining tools such as TeamCity, Bitbucket Pipelines, Jenkins, or GitHub Actions
  • Exposure to distributed data processing frameworks and services, such as open-source Hadoop and Apache Spark, or Amazon Redshift

Job Classification

Industry: IT Services & Consulting
Functional Area / Department: Engineering - Software & QA
Role Category: DevOps
Role: Site Reliability Engineer
Employment Type: Full time

Contact Details:

Company: Guidewire Software
Location(s): Bengaluru


Key skills: Automation, Claims, Networking, Data management, Configuration management, Schema, Data processing, Continuous improvement, Open source, Operations

₹ Not Disclosed

Guidewire Software

Guidewire exists to deliver the software that P&C insurers need to adapt and succeed in a time of rapid industry change, and to ensure that every customer succeeds in the journey. We believe that P&C insurance plays a vital role in protecting people and business, and in enabling society to fun...