Role:
Site Reliability Engineer
Location:
Sydney, NSW
Employment Type:
Permanent
Must Have:
Full working rights. No sponsorship available.
This includes:
Designing and implementing
SLIs/SLOs
aligned to key customer journeys.
Strong knowledge of
observability concepts
: logs, metrics, traces, SLIs/SLO.
Integrating observability tools like
Dynatrace, Elastic
, and
Nagios
to provide deep insights into application performance and reliability.
Building alerting pipelines via PagerDuty to ensure timely and actionable notifications for support teams.
Collaborating with senior SREs and application teams to identify gaps and drive improvements in monitoring coverage and incident response.
ML/Anomaly detection strategies.
Work With:
Tooling: Dynatrace, ELK stack, Nagios, PagerDuty,unix, java
Environments: Mix of on prem and cloud hosted applications
Practices: SRE principles, customer journey mapping, service level indicators/objectives (SLIs/SLOs), incident response automation
Desirable:
OTEL experience.
ITIL
JAVA
AWS Knowledge
ITIL
Automation/Coding.
Experience with containerised environments.
Tertiary qualifications, computer science/engineering
Interested consultants can share their updated resume at
vipul.chaudhary@carecone.com.au
or call
+61 283 195 538.