Your browser does not support javascript! Please enable it, otherwise web will not work for you.

Lead Site Reliability Engineer @ Optum

Home > Devops

 Lead Site Reliability Engineer

Job Description


Optum is a global organization that delivers care, aided by technology to help millions of people live healthier lives. The work you do with our team will directly improve health outcomes by connecting people with the care, pharmacy benefits, data and resources they need to feel their best. Here, you will find a culture guided by inclusion, talented peers, comprehensive benefits and career development opportunities. Come make an impact on the communities we serve as you help us advance health optimization on a global scale. Join us to start  Caring. Connecting. Growing together.
  •    Primary Responsibilities
  •  
  • Identifies all business and security risks and mitigations associated with the service, including coordination of Data Risk Assessments (DRAs) as needed, and ensures compliance with Minimum Security (MinSec) Standards
  • Provides or approves client communication for service launch and pending maintenance windows and coordinates internal communication for operational staff
  • Provides input into the development of Key Performance Indicators (KPIs) and metrics to provide status on service health
  • Generates reports and reviews them on a regular basis to identify actionable opportunities for service improvement
  • Influences the vendor in the development of the cloud service; advocates for new features and functionality
  • Participates in internal service review meetings
  • Participates in negotiating Service Level Agreements (SLAs) and Operational Level Agreements (OLAs) for the service
  • Represents the service across the organization
  • Run the production environment by monitoring availability and taking a holistic view of system health
  • Build software and systems to manage platform infrastructure and applications
  • Improve reliability, quality, and time-to-market of our suite of software solutions
  • Measure and optimize system performance, with an eye toward pushing our capabilities forward, getting ahead of customer needs, and innovating for continual improvement
  • Provide primary operational support and engineering for multiple large-scale distributed software applications
  • Gather and analyze metrics from operating systems as well as applications to assist in performance tuning and fault finding
  • Partner with development teams to improve services through rigorous testing and release procedures
  • Participate in system design consulting, platform management, and capacity planning
  • Create sustainable systems and services through automation and uplifts
  • Balance feature development speed and reliability with well-defined service-level objectives
  • Comply with the terms and conditions of the employment contract, company policies and procedures, and any and all directives (such as, but not limited to, transfer and/or re-assignment to different work locations, change in teams and/or work shifts, policies in regards to flexibility of work benefits and/or work environment, alternative work arrangements, and other decisions that may arise due to the changing business environment). The Company may adopt, vary or rescind these policies and directives in its absolute discretion and without any limitation (implied or otherwise) on its ability to do so
  •  Required Qualifications
  •  
  • Bachelors degree (or equivalent) in computer science or related discipline
  • Experience with distributed storage technologies such as NFS, HDFS, Ceph, and Amazon S3, as well as dynamic resource management frameworks (Apache Mesos, Kubernetes, Yarn)
  • Proven ability to program (structured and OOP) using one or more high-level languages, such as Python, Java, C/C++, Ruby, and JavaScript
  • Proactive approach to identifying problems, performance bottlenecks, and areas for improvement
  • #Gen
  • Job Classification

    Industry: Retail
    Functional Area / Department: Engineering - Software & QA
    Role Category: DevOps
    Role: Site Reliability Engineer
    Employement Type: Full time

    Contact Details:

    Company: Optum
    Location(s): Hyderabad

    + View Contactajax loader


    Keyskills:   c++ java python javascript ruby kubernetes nagios reliability redhat linux ansible docker mesos elastic search git devops oops linux jenkins shell scripting mysql hadoop cloud computing c ceph cassandra kafka nfs terraform openstack yarn aws

     Fraud Alert to job seekers!

    ₹ Not Disclosed

    Similar positions

    Tech Lead - Full Stack Development-Java Job

    • Yash Technologies
    • 4 - 8 years
    • Pune
    • 14 hours ago
    ₹ Not Disclosed

    Cloud Platform Engineer

    • Accenture
    • 7 - 12 years
    • Hyderabad
    • 3 days ago
    ₹ Not Disclosed

    Azure Devops Engineer

    • Vlink
    • 3 - 8 years
    • Noida, Gurugram
    • 3 days ago
    ₹ 20-25 Lacs P.A.

    Cloud Engineer

    • Cradlepoint
    • 4 - 8 years
    • Noida, Gurugram
    • 3 days ago
    ₹ Not Disclosed

    Optum

    About: OptumInsight India Pvt Ltd, a UnitedHealth group company is a leading health services and innovation company dedicated to help make the health system work better for everyone. With more than 115,000 people worldwide, Optum combines technology, data and expertise to improve the delivery, ...