Job Summary
We are seeking an experienced z/OS Performance and Capacity Engineering Specialist to join our team. In this role, you will be responsible for monitoring, analyzing, and optimizing the performance of our mission-critical z/OS mainframe environment. You will work collaboratively with systems programmers, application teams, and infrastructure management to ensure our mainframe systems operate at peak efficiency while meeting business SLAs and planning for future growth. Responsibilities: o Performance Monitoring and Analysis: o Establish and maintain comprehensive performance monitoring for all key z/OS subsystems (e.g., CICS, DB2, IMS, MQ, TCP/IP). o Utilize industry-standard tools (e.g., SMF, RMF, Omegamon, MainView) to collect, analyze, and interpret performance data. o Proactively identify performance trends, anomalies, and potential bottlenecks. o Conduct in-depth root cause analysis of performance issues and develop effective solutions. o Tune z/OS parameters, workload management (WLM) policies, and subsystem configurations to optimize performance. o Capacity Planning and Management: o Develop and maintain accurate capacity plans for all z/OS resources (CPU, memory, DASD, network). o Forecast future capacity requirements based on business growth, application changes, and technology advancements. o Conduct capacity modelling and simulation studies to identify potential constraints and recommend proactive upgrades or optimizations. o Monitor resource utilization and provide timely reports on capacity trends and projections. o Collaborate with application development teams and infrastructure architects to ensure applications are designed and implemented with performance and scalability in mind. o Problem Diagnosis and Resolution: o Serve as a SME for z/OS performance and capacity-related issues. o Participate in incident management and problem management processes, providing expert guidance and troubleshooting support. o Collaborate with other technical teams (e.g., systems programming, database administration, network) to resolve complex issues. o Document problem resolutions and contribute to knowledge base articles. o System Optimization and Tuning: o Identify and implement opportunities for system optimization and performance improvements across the z/OS environment. o Evaluate new z/OS features and technologies and recommend their adoption where beneficial for performance and capacity. o Participate in the planning and execution of system upgrades and migrations, ensuring minimal performance impact. o Develop and maintain performance baselines and service level agreements (SLAs). o Collaboration and Communication: o Effectively communicate performance and capacity-related findings, recommendations, and risks to both technical and non-technical audiences. o Collaborate with application development teams on performance testing and tuning efforts. o Work closely with systems programmers on z/OS configuration and maintenance activities. o Participate in project planning and provide input on performance and capacity considerations. o Stay current with industry best practices and emerging trends in z/OS performance and capacity management. Technical Skills Required: o Deep understanding of z/OS architecture and its core components: o Workload Management (WLM) concepts and configuration. o Memory management (paging, swapping, virtual storage). o I/O subsystem and DASD performance. o z/OS security concepts (RACF, ACF2). o Proficiency in z/OS performance monitoring and analysis tools: o System Management Facilities (SMF) data and reporting. o Resource Measurement Facility (RMF) reports and interpretation. o Experience with at least one or more performance monitoring tools such as IBM Omegamon, BMC MainView, or Compuware Strobe. o Strong knowledge of key z/OS subsystems and their performance characteristics: o CICS transaction processing and performance tuning. o DB2 database performance monitoring and optimization (SQL analysis, EXPLAIN, etc.). o IBM MQ messaging performance. o Understanding of zIIP and zAAP specialty processors and their optimization. o TCP/IP networking performance on z/OS. o Experience with z/OS capacity planning methodologies and tools: o Understanding of capacity planning metrics and forecasting techniques. o Familiarity with capacity planning tools or scripting for data analysis and modelling (e.g., MXG, SAS, REXX, Python). o Solid scripting and automation skills: o Proficiency in REXX and/or other scripting languages (e.g., JCL, Python) for data extraction, analysis, and automation of tasks. o Strong analytical and problem-solving skills: o Ability to analyze complex performance issues, identify root causes, and develop effective solutions. o Data-driven approach to problem-solving and decision-making. o Experience with cloud integration and hybrid mainframe environments (a plus). Soft Skills o Strong communication skills to explain technical concepts to various audiences o Ability to work under pressure during critical performance issues o Collaborative approach to problem-solving across multiple teams o Proactive mindset for identifying potential issues before they impact production o Detail-oriented with excellent documentation habits o Self-motivated with ability to manage multiple prioritie
Keyskills: Capacity Management Capacity Planning Mainframes