
Senior Embedded Products and Software Reliability Engineer
- Markham, ON
- Permanent
- Full-time
- Perform reliability modeling and statistical analysis of energy management and protection/control products to predict field performance and act as authority for reliability design decisions.
- Develop and execute reliability test plans for hardware and software, including hardware-in-the-loop (HIL), environmental, and stress testing for grid components.
- Conduct and lead FMEA, FMECA, and Root Cause Analysis (RCA) for hardware/software failure events across embedded systems and communication layers.
- Work with product design, systems engineering, and software teams to ensure Design for Reliability (DfR) is embedded from concept through deployment.
- Analyze field data from deployed devices (e.g., relays, RTUs, control systems) to identify systemic issues and recommend corrective/preventive actions.
- Collaborate with utilities and customers to support grid asset performance programs and grid modernization initiatives, including IEC 61850, virtualization, and advanced diagnostics.
- Develop and maintain reliability KPIs (e.g., MTBF, FIT rate, availability) for product lines and integrate into product development and quality programs.
- Identify and drive adoption of predictive and condition-based maintenance strategies using remote diagnostics, SCADA, and sensor data.
- Consult and align with the Innovation Office and Applications team to ensure reliability priorities influence working group participation, standards leadership, and technology strategy.
- Stay current on regulatory compliance requirements (e.g., NERC, IEEE, IEC standards) impacting utility asset reliability and cybersecurity.
- Provide coaching, training, and guidance on reliability practices to internal teams and external stakeholders.
- Bachelor’s degree in electrical engineering, Systems Engineering, or a related discipline.
- Minimum of 7 years of experience in reliability, systems, or product engineering in the energy or utility sector.
- Familiar with protection & control systems, grid-edge devices, substation automation, and utility-grade software solutions.
- Hands-on experience with reliability engineering tools and standards (e.g., ReliaSoft, ISO 9001/55000, IEC 61025).
- Familiarity with communications protocols and standards used in T&D systems: IEC 61850, DNP3, Modbus, IEEE C37.118, etc.
- Strong analytical skills and experience with data analysis platforms (e.g., Python, R, Minitab, Power BI).
- Excellent written and verbal communication skills; able to engage cross-functional teams and utility customers.
- Understanding of grid reliability challenges in T&D environments & willingness to travel for customer engagements, conferences, and global collaboration.
- Collaborative leadership across engineering, field teams, and customers
- Data-driven problem-solving mindset with attention to long-term product health including systems thinking
- Proactive approach to emerging technologies and continuous improvement
- Master’s degree in Reliability or Systems Engineering.
- Certified Reliability Engineer (CRE) or similar professional certification.
- Familiar with model-based design and HIL testing (e.g., RTDS, PSCAD, or EMTP) for utility applications and simulation tools used in electronics and CAD design
- Experience working directly with T&D utilities, supporting product deployments, failure investigations, or asset strategies.
- Familiarity with cybersecurity risk and reliability considerations for critical infrastructure.