👨🏻‍💻 postech.work

Senior DevOps Engineer - Cloud & Microservices

Astra-North Infoteck Inc. ~ Conquering today’s challenges, achieving tomorrow’s vision! • 🌐 In Person

In Person Posted 2 days, 10 hours ago

Job Description

Job Description Role:

Senior DevOps Engineer - cloud, Microservices

Location:

Toronto, ON

Skills:

Digital: Cloud DevOps, Microservices

Experience Required:

8–10 years

Role Description

Hands on Cloud DevOps / Microservices Engineer to design, build, and operate secure, scalable, and observable cloud native platforms and services. You’ll own CI/CD, container orchestration, infrastructure as code, and runtime reliability for microservices partnering closely with software engineering, security, SRE, and product teams to deliver high quality releases with speed and confidence.

Design, provision, and maintain cloud infrastructure (

AWS, Azure

) including compute, networking, storage, IAM, and managed services.

Implement Infrastructure as Code (IaC) with

Terraform, CloudFormation, ARM, Bicep

; enforce GitOps practices (pull requests, code reviews, change history).

Build multi-account subscription landing zones, network segmentation, and controls for cost, security, and compliance.

Optimize cloud spend with right sizing, autoscaling, reserved savings plans, and cost governance dashboards.

Build robust CI/CD pipelines (

Azure DevOps, GitHub Actions, Jenkins, GitLab CI

) with automated build, test, security scans, artifact management, and progressive delivery (blue-green, canary).

Standardize pipelines as reusable templates, implement trunk-based development, and robust branching strategies.

Enable automated rollbacks and deployment health checks; integrate quality gates (unit, integration, contract tests).

Containerize services (

Docker

) and operate them on

Kubernetes (AKS, EKS, GKE)

or

OpenShift

.

Manage service discovery, ingress, service mesh (

Istio, Linkerd

), config and secret management, HPA, pod disruption budgets, and node pools.

Implement microservices best practices (circuit breakers, retries/backoff, idempotency, API gateways, rate limits, event driven patterns such as SQS, SNS, Kafka, Event Hub, PubSub).

Establish end-to-end observability: metrics, logs, traces with

Prometheus, Grafana, CloudWatch, Azure Monitor, ELK, Splunk, OpenTelemetry

.

Define and track SLI, SLO, SLA; create actionable alerts and on-call runbooks; participate in incident response and post-incident reviews (RCA).

Conduct load performance testing; remediate hotspots at infrastructure, container, and application layers.

Get job updates in your inbox

Subscribe to our newsletter and stay updated with the best job opportunities.