Job Description
Role Purpose
Are you looking for an opportunity to advance your tech career? Interested in creating next-generation cybersecurity and analytics capabilities? If you are, then apply to join HSBCs Cybersecurity Science & Analytics (CSA) team. We use our award-winning advanced analytics platform to develop innovative products for securing one of the largest technology estates in the world.
CSA are a unique and multi-skilled team of cybersecurity scientists, data and analytics professionals, and engineers. Our mission is to harness the power of data, analytics, AI/ML, and cybersecurity science to innovate and advance HSBCs cybersecurity capabilities.
The Lead Cybersecurity Analytics Cloud Infrastructure & Site Reliability Engineer will form part of the CSA Platform & Data Engineering Team, joining a global team of data technology professionals to deliver critical analytics engineering requirements for the strategic cybersecurity data lake and analytics platform.
The position is a mid-senior technical, hands-on delivery role, requiring knowledge of data engineering, cloud infrastructure and platform engineering, platform operations and production support.
The Lead Cybersecurity Analytics Cloud Infrastructure & Site Reliability Engineer role will work closely with Principal engineers within the Platform & Data Engineering Team, contributing to and coordinating data engineering, data acquisition, cloud infrastructure and platform engineering, platform operations, and production support activities using ground-breaking cloud and big data technologies. There is ample room for growth in the role, with opportunities to develop skills across multiple domains, starting from a strong foundation in one or more core technical areas e.g. data engineering or cloud infrastructure engineering. For this role, cybersecurity-specific knowledge is preferred but not essential.
The ideal candidate will possess strong technical skills, an eagerness to learn, a keen interest in Cybersecurity, the ability to work collaboratively in a fast-paced environment, and an aptitude for picking up new tools and techniques on the job, building on existing skillsets as a foundation.
Principal Accountabilities:
Cloud Infrastructure & Platform Engineering / Platform Operations & Production Support
- Operation and enhancement of the strategic cybersecurity data lake and advanced analytics platform in Azure cloud.
- Build and manage CSA Azure platform to enable application deployments.
- Engineer solutions on the Azure platform using Infrastructure-as-Code methods (e.g., Terraform)
- Manage Azure connectivity and networking e.g., Express Route, DNS, IP allocation and traffic routing, Network Security Gateways (NSG) and Azure Firewall.
- Gather requirements and ensure continuous feedback loops between CSA and the wider Azure Cloud Services team.
- Execute the functional pattern lifecycle process within the code library (design, document, develop, and maintain).
- Integrate, configure, deploy, and manage centrally provided common cloud services e.g., identity and access management (IAM), networking, monitoring, operating systems, containers.
- Build and support continuous integration (CI), continuous delivery (CD) and continuous testing activities.
- Providing maintenance and run support for landing zones/cloud workspaces operated on behalf of Cybersecurity teams.
- Ensuring reliable and resilient services that comply with applicable controls and regulations, meet stakeholder functional and non-functional requirements, and operate within cybersecurity's risk, cost, and quality tolerances.
- Ensure compliance with operational risk management standards (e.g., Network, Firewall, OS, Logging, Monitoring, Availability, Resiliency).
- Update support and operational documentation as required.
- Fault-find and support CSA delivery teams building applications within the data platform.
- Day-to-day troubleshooting and support of the CSA Azure tenant/infrastructure.
Required Skills
The successful candidate will be a student of the Google Site Reliability Engineering (SRE) philosophy as applied to managing large-scale cloud infrastructure, possess skills and experience within one or more of the following areas, and demonstrate a willingness to learn additional skills via certification and/or on-the-job learning where required.
Programming, Software & Network Principles
- Experience with SRE and Azure DevOps
- Ability to script (Bash/PowerShell, Azure CLI), code (Python, C#, Java), query (SQL, Kusto query language) coupled with experience with software versioning control systems (e.g., GitHub) and CI/CD systems.
- Programming experience in the following languages: PowerShell, Terraform, Python Windows command prompt and object orientated programming languages.
- Demonstrable experience of Linux administration and scripting (preferably Red Hat Systems)
- Understanding of hardware and software principles and storage technologies (SSD, HDD, NVMe), CPU architectures, and Memory & Operating system principles (especially network stack fundamentals)
- Understanding of network protocols and network design
Cloud Infrastructure & Platform Engineering
- Cloud Infrastructure & Platform Engineering (Azure preferred)
- Ability to build, operate, maintain and support cloud infrastructure & data services at scale
- Experience of engineering and deploying a range of Services in Azure
- Experience in dealing with multiple support groups that contribute to a Service
- Experience working with highly available/high-load web infrastructure e.g., web proxies, reverse proxies
- Data Quality, Data Management, Data Controls & Data Governance
- Security and Compliance, e.g., IAM and cloud compliance/auditing/monitoring tools
- Troubleshooting/service support experience
- A track record of constantly looking for ways to do things better and an excellent understanding of the mechanisms necessary to successfully implement change
- Demonstrated experience troubleshooting complex problems, especially those resulting from interactions across a cloud, services, and application stack.
- Strong documentation, change management, and agile development ethos
Technology Stack
- Technical knowledge and breadth of Azure technology services (Identity, Networking, Compute, Storage, Web, Containers, Databases)
- Cloud & Big Data Technologies such as Azure Cloud, Azure IAM, Azure Active Directory (Azure AD), Azure Data Factory, Azure Databricks, Azure Functions, Azure, Kubernetes, Service, Azure Logic App, Azure Monitor, Azure Log Analytics, Azure Compute, Azure Storage, Azure Data Lake Store, S3, Synapse Analytics and/or PowerBI
- Experience with server, operating system and infrastructure technologies such as Nginx/Apache, CosmosDB, Linux, Bash, PowerShell, Prometheus, Grafana, Elasticsearch)
- Experience with Infrastructure-as-Code and Automation tools such as Terraform, Chef, Ansible, CloudFormation/Azure Resource Manager (ARM)
- Streaming platforms such as Azure Event Hubs or Kafka, and stream processing services such as Spark streaming
- Experience with Security Information & Event Management (SIEM) and Security Orchestration, Automation & Response (SOAR) technologies, especially cloud based, is a significant asset
Education and Experience:
The successful candidate will possess:
- Positive attitude, strong work ethic and passion for learning.
- Bachelors degree in Computer Science, Software Engineering, Data Science, or a related field advantageous.
- Bachelors degree in any Science, Technology, Engineering, or Medical (STEM) field accompanied with data, technology and/or programming experience also considered.
- Applicants without a Bachelors degree will be considered if they have demonstrable career experience within one or more of the required areas and/or have relevant industry certifications in Cloud, Data, or Cybersecurity.
- CSA are a supportive team who welcome candidates from diverse skills and educational backgrounds. Thus, candidates with transferrable skills gained through software development or any other technology background will also be considered.
- Proficiency in programming languages such as Python, Java, C#, or similar preferred.
- Experience operating in highly regulated industry, e.g., Financial Services advantageous.
- Understanding of cybersecurity principles, global financial services business models, regional compliance regulations and applicable laws advantageous.
- Understanding and knowledge of common industry cybersecurity frameworks, standards, and methodologies, including OWASP, ISO2700x series, PCI DSS, GLBA, Global data security and privacy acts, FFIEC guidelines, CIS, and NIST standards advantageous.
Job Classification
Industry: IT Services & Consulting
Functional Area / Department: Engineering - Software & QA,
Role Category: DevOps
Role: Site Reliability Engineer
Employement Type: Full time
Contact Details:
Company: NTT DATA, Inc.
Location(s): Hyderabad
Keyskills:
Cloud
PowerShell
Azure CLI
C#
Java
Terraform
GitHub
Bash
Azure DevOps
Python
SQL