The Staff Platform Operations Engineer is a part of our Product and Engineering team who are at the forefront of ensuring reliability, scalability and availability across our products so that our customers are safe from attacks and breaches. In this role you will be focussed ensuring our products are delivering a secure, available, reliable and scalable experience. You will be working on a high impact and cross-functional team, creating end-to-end security solutions that drive customer success.You will have the opportunity to further enhance your skills surrounded by a team of incredibly smart and experienced Engineers, whilst mentoring others.
In this role, you will:
Collaborate with multiple operations/SRE and product engineering teams and other key stakeholders to identify potential risks to availability/reliability .
Establish patterns and standards around testing and operations to improve reliability, quality, and time-to-market of our suite of software solutions
Design and establish methods for how teams should execute comprehensive test plans to identify software defects across functional, performance, usability, and regression testing areas
Be involved in the creation, design and planning of upcoming testing strategies, operational improvements and decisions around tooling and frameworks relating to SRE/Operations
Regularly monitor our applications/infrastructure and identify opportunities for improving efficiencies or MTTR
Deeply understand our products and make high impact decisions to support our customers
The skills you ll bring include:
A minimum of 8 years experience in software development using Java, terraform and Jenkins. Experience with Python, Go, ansible, spinnaker, or other equivalent programming languages would be advantageous
Experience with vulnerability or code quality frameworks such as snyk or sonarqube
A minimum of 2 years of working with observability tooling such as datadog or grafana
Experience using Cloud infrastructure, ideally AWS
Experience with testing frameworks such as Selenium, Cypress, Cucumber, Playwright would be advantageous
Excited by technology, curious and eager to learn, with the ability to mentor more junior members of the team
Customer focussed mindset, understanding customer needs, providing excellent service and focussed on delivering value
The attitude and ability to thrive in a high-growth, evolving environment
Collaborative team player who has the ability to partner with others and drive toward solutions
Strong creative problem solving skills
Solid communicator with excellent written and verbal communications skills both within the team and cross functionally and strong attention to detail
Demonstrable experience of delivering complex solutions to customers
Demonstrable experience of instigating continuous delivery and continuous integration patterns
Job Classification
Industry: Law Enforcement / Security ServicesFunctional Area / Department: Engineering - Software & QARole Category: DevOpsRole: Site Reliability EngineerEmployement Type: Full time