Role: Site Reliability Engineer / DevOps Engineer (Dynatrace + ELK)
Location: Sydney
Permanent (Full-time)
Job Description
Site Reliability Engineer with Observability Focus
This includes:
Designing and implementing SLIs/SLOs aligned to key customer journeys.
Strong knowledge of observability concepts: logs, metrics, traces, SLIs/SLOs.
Integrating observability tools like Dynatrace, Elasticsearch, and Nagios to provide deep insights into application performance and reliability.
Building alerting pipelines via PagerDuty to ensure timely and actionable notifications for support teams.
Collaborating with senior SREs and application teams to identify gaps and drive improvements in monitoring coverage and incident response.
ML/Anomaly detection strategies.
What You'll Work With
Tooling: Dynatrace, ELK stack, Nagios, PagerDuty, Unix, Java
Environments: Mix of on-prem and cloud-hosted applications
Practices: SRE principles, customer journey mapping, service level indicators/objectives (SLIs/SLOs), incident response automation
Desirable (good to have):
OTEL experience.
ITIL
Java
AWS Knowledge
Automation/Coding.
Experience with containerised environments.
Tertiary qualifications in computer science/engineering.
Interested candidates can send their updated resumes to sourabh.sood@carecone.com.au or reach me on +61 251 103 879.