
Dynatrace Application Performance Monitoring SME
- Toronto, ON
- Contract
- Full-time
Duration: 6 Months
Location: Toronto, ON
JOB DESCRIPTION:
Responsibilities:
- Candidate should be experienced in architecting and implementing Dynatrace monitoring solutions and in liaising with multiple technology teams to ensure flawless onboarding of applications to Dynatrace.
- Have proficiency in monitoring applications deployed in Linux and Windows clustered environments.
- Possess excellent written and verbal communication skills and be able to work independently.
- Have strong analytical skills to analyze risks and foresee production issues through various monitoring tools and published reports. Proactively look for hardware, software, and environmental alerts, malfunctions, and problems.
- Able to participate in a 24/7 on-call rotation and project work during night shifts and/or weekends as required. Be able to monitor, coach, and educate a small team of IT professionals.
- Experience with monitoring performance data in Dynatrace and guiding application teams to interpret performance data. Support the ongoing Dynatrace and Splunk onboarding implementation efforts, including setting up custom dashboards, alerts, and reports. Ability to analyze dashboards and reports from monitoring tools to identify trends and patterns in application health and performance.
- Document information such as configuration, installation and user guides following process standards.
- Effectively lead and guide incident triage calls from a technical perspective, analyzing different components of the infrastructure and application environment using a variety of monitoring tools and processes. Have strong troubleshooting experience for root cause analysis using application performance management and event correlation monitoring tools.
- Undergraduate degree or Technical Certificate; Graduate degree preferred.
- 5 years of experience with Dynatrace.
- 5 years of experience in developing Operational and Business Dashboards and setting up alerting profiles.
- 3 years with Dynatrace Synthetic.
- 1 year with Grail.
- Dynatrace Certification preferred.
- Working knowledge of Distributed Server environments.
- Thorough knowledge of middleware technologies (JBoss, WebSphere, NodeJS) and web server technologies (IHS, Tomcat, IIS, Apache).
- Strong automation skills with proficiency in scripting languages (Bash shell scripting, Perl, Python, PowerShell).
- Knowledge of the Azure public cloud platform.
- Ability to analyze different components of the infrastructure and application environments during incident triage calls.
- Aptitude to influence other technical teams on incident triage calls and articulate troubleshooting steps effectively.
- Capture application baseline performance deviations and implement proactive monitoring and alerting of production systems.
- Leverage monitoring tools (such as Splunk, Dynatrace, and Datadog) and their capabilities to find root causes for incident and problem management.
- Provide input into strategies, capabilities, and integrations to improve the availability and performance of production applications.