Job Summary
Lead deliver and monitor key improvements to the currently installed systems and infrastructure.
Work with product owners architects and others to implement world-class solutions that meet regulatory and customer needs.
Drive improvements and upgrades to the environment from conception through to implementation
Responsibilities
Install configure test and maintain operating systems application software and system management tools
Provide an advanced level of support to the existing environment
Identify prioritize and execute tasks in the software development life cycle (SDLC)
Proactively ensure the highest levels of systems and infrastructure availability
Maintain security backup and redundancy strategies
Maintain CI/CD pipelines to automate routine build and testing activities
Write and maintain custom scripts to increase system efficiency and lower human intervention time on any tasks
Work with an issue/problem management system to ensure services are provided according to relevant SLA(s)
Participate in the backlog grooming and sprint planning sessions analysing requirements providing complexity estimates and proposing low-level implementation plans.
Collaborate with a global group of internal teams that span Asia Europe and Americas.
Ensure software is up-to-date with latest technologies and standards
Assist front-line support teams in resolving customer and production issues.
Escalate risks and issues and provide status reports for management.
Write and maintain appropriate documentation for manual and automated processes.
Understand existing complex environments and be able to easily identify problem areas and undertake successful implementations to resolve and/or mitigate.
Perform occasional weekend work (e.g. patching upgrades VM migrations)
Minimum 5 years experience as Devops/Site Reliability Engineer
Working experience in installing configuring and troubleshooting Windows and Linux environments
Experience with Automation CI/CD Gitlab Ansible and Terraform
Scripting skills and (Bash Python)
Working experience in setting up SLI/SLO/SLA for any new services in the monitoring systems
Experience with monitoring systems Genios Big Panda Datadog)
Experience with virtualization and containerization (VMware Docker Kubernetes EKS)
Fair understanding of Agile methodology
Experience in AWS Azure Windows eco system is plus
Good working knowledge Jira Confluence and SNOW tools
Keyskills: continuous integration kubernetes confluence ci/cd eks docker ansible eco software development life cycle devops awsazure linux datadog big panda jira cd python vmware amazon sqs amazon ec2 system lambda expressions snow troubleshooting sns terraform bash gitlab agile aws sdlc