Mainframe Engineer/Performance and Capacity Management
CYNET SYSTEMS
- Montreal, QC
- Permanent
- Full-time
- The Mainframe Performance and Capacity Engineer will be responsible for monitoring, analyzing, and optimizing z/OS system health and performance across large-scale environments.
- This includes capacity planning, performance tuning, cost modeling, and collaboration with cross-functional teams to ensure system stability, efficiency, and scalability.
- The role also involves data analysis, reporting, and providing actionable insights to leadership and stakeholders.
- Performance management and capacity planning include monitoring real-time z/OS system health and performance across CPU, memory, DASD, and WLM-managed workloads using RMF, SmartIS, IzPCA, MICS, and other internal tools.
- Analyze performance data to identify trends, bottlenecks, and potential issues.
- Detect, troubleshoot, and resolve resource anomalies, workload misbehaviors, and degradation risks in production systems.
- Partner with incident response teams to resolve performance issues quickly and accurately.
- Develop and implement performance tuning strategies by recommending changes to service definitions, dispatching priorities, and workload placement.
- Contribute to capacity planning by forecasting and modeling workload resource demand and capacity requirements.
- Support cost modeling, vendor reporting, infrastructure sizing, and resource optimization efforts.
- Data analysis responsibilities include collecting and analyzing system performance data to generate reports and dashboards, identifying key performance indicators (KPIs), and developing metrics to track system performance.
- Visualize, summarize, and present data findings, recommendations, and methodology to senior leadership, department leadership, and enterprise stakeholders.
- Collaboration and communication include working closely with cross-functional teams, including operations, development, and infrastructure teams.
- Provide technical support and guidance to team members and stakeholders.
- Participate in on-call rotations and provide timely responses to performance and observability issues.
- Contribute to the migration of performance and capacity tooling to Git change management and DevOps deployment pipelines.
- Bachelor's degree in information systems, mathematics, finance, or another quantitative or related subject.
- More than 10 years of mainframe systems experience with proficiency in performance management for large, multi-processor, multi-LPAR, Parallel Sysplex environments utilizing z/OS.
- Proven experience in mainframe performance monitoring, observability, capacity management, and data analysis.
- Proven experience resolving systems performance problems in real time through adjustments to WLM and batch initiators.
- Strong understanding of PR/SM.
- Proficiency in REXX, Python, Job Control Language (JCL), and DB2.
- Strong understanding of batch processing and job scheduling.
- Advanced user of MS Excel (charts, pivot tables, VLOOKUPs, PowerPivot) and PowerPoint for data visualization.
- Experience with mainframe monitoring tools and performance tuning techniques.
- Experience working with large, highly transactional datasets to draw insights and create organizational value.
- Experience with DevOps and ADABAS is a plus.
- Strong analytical, problem-solving, and strategic thinking skills, including the ability to prioritize.
- Proficiency in data analysis and dashboard creation to provide actionable insights for operations, engineering, and management.
- Working knowledge of the MS Office suite for management reporting using both PowerPoint and Excel.
- Ability to integrate data from disparate systems to generate meaningful insights.
- Ability to analyze and solve business problems at their root, while considering the broader context.
- Excellent written and verbal communication skills.
- Ability to work independently and as part of a team.
- Detail-oriented with strong organizational skills.
- Deep expertise with SMF and RMF data, WLM service definitions, and z/OS workload behavior.
- IBM ClientRT reporting experience is a plus.
- MICS product and reporting experience is a significant plus.
- iZPCA product experience is an advantage.
- Strong time management, independent thinking, and decision-making skills.
- High sense of urgency and ability to prioritize competing requirements in a fast-paced environment.
We are sorry but this recruiter does not accept applications from abroad.