Skills Required:
DevOps Monitoring and Management Dynatrace
Job Description:
Job Purpose Application Monitoring Specialist will work closely with infrastructure, development, support, and database teams to determine monitoring requirements and implement filters, events, alerts, and dashboards primarily using Dynatrace.
This role is responsible for recommending, installing, and maintaining the Dynatrace monitoring platform, and ensuring its optimal use across the organization.
Accountabilities Collaborate with infrastructure, support, development, and database teams to gather requirements and deploy monitoring solutions.
Design, configure, and maintain dashboards, filters, events, and alerts tailored to business needs.
Perform root cause analysis on events using logs, analytics, and available documentation.
Conduct proactive health checks and incident reviews, identify gaps or issues and remediate them.
Review infrastructure and application architecture to recommend appropriate monitoring checks.
Define, analyse, build, and maintain monitoring solutions and platforms, with Dynatrace as the central tool.
Remediate any risks discovered in the monitoring platform. Install, configure, and maintain Dynatrace agents across various environments.
Integrate modern technologies and applications into the monitoring platform.
Provide event correlation mechanisms to reduce alert noise.
Work closely with technical support to improve customer experience through insights.
Plan and execute disaster recovery tests on monitoring platforms.
Create and maintain documentation and templates related to monitoring.
Develop, maintain, and implement monitoring scripts and log queries.
At least 2 years of experience managing monitoring solutions, with a focus on Dynatrace. Hands-on experience with Dynatrace, including dashboard creation, alert configuration, and agent deployment.
Familiarity with other monitoring and logging tools (Nagios, Grafana, Splunk, Kibana) is an asset.
Experience with Linux and command line interface.
Scripting programming experience (Bash, Java, Python, PowerShell).
Working understanding of web infrastructure (web servers, load balancers, application servers).
Nice To have Understanding of infrastructure components (virtualization, operating systems, networks, storage).
Experience with cloud technologies (Azure, AWS).
Database experience (PostgreSQL, SQL).
Experience Required: 6 Years
Job Types: Full-time, Fixed term contract
Work Location: Hybrid remote in Toronto, ON