👨🏻‍💻 postech.work

DevOps Engineer

Hammehr Talent Consultants • 🌐 Remote • 💵 $120,000 - $140,000

Remote Posted 6 days, 1 hour ago

Job Description

Position: Permanent

Work environment: Remote, within Canada

Salary: $120,000 – $140,000 CAD

Position overview

Hammehr is currently working with a SaaS company that develops an integrated platform used by construction firms to manage logistics, scheduling, and communication to help recruit a DevOps Engineer.

Success in this role means being both a builder and a stabilizer, someone who keeps distributed systems resilient, deployments repeatable, and production environments observable. It’s a hands-on role where you’ll orchestrate modern infrastructure, automate recovery paths, and think about reliability as an engineering discipline rather than an afterthought.

Work here feels autonomous and pragmatic. The team is fully remote and operates with a high-trust culture where accountability is expected and supported.

What you’ll do

Design, maintain, and scale Kubernetes-based infrastructure to support smooth and consistent application releases.

Operate resilient, distributed environments that use consensus-based coordination (e.g., Raft) for leadership, state management, and fault tolerance.

Expand system observability by building and refining dashboards, alerts, and metrics pipelines using Prometheus and Grafana.

Configure, secure, and optimize workloads running on Google Cloud Platform, including autoscaling, networking, IAM policies, and cluster infrastructure.

Participate in an on-call rotation, handling incident triage, coordinating rollbacks when needed, and producing clear follow-up documentation.

Basic qualifications

3+ years managing Kubernetes in live production environments.

Hands-on experience with systems that rely on consensus protocols (such as Raft) for coordination or leader election.

Strong capability with monitoring and metrics tooling, particularly Prometheus and Grafana.

Solid understanding of core GCP services and infrastructure primitives—compute, networking, security, and scaling patterns.

Excellent written communication for distributed teams, including crisp updates, structured documentation, and accurate handoffs.

Ability to collaborate effectively with colleagues across multiple regions and time zones.

Preferred qualifications

Background supporting or operating multi-tenant SaaS platforms at scale.

Exposure to service meshes, tracing frameworks, or automated incident-response tooling.

Experience designing resilient deployment workflows, including automated recovery or near-zero-downtime releases.

Advanced troubleshooting skills to diagnose complex reliability issues under pressure.

The challenges

Managing interconnected systems where changes in one component can ripple across the entire environment.

Owning the full lifecycle of infrastructure while working with a high degree of independence.

Coordinating effectively across asynchronous communication channels and time-zone differences during critical events.

Your impact

Enhance overall service stability and performance across the platform.

Increase deployment confidence and reduce operational burden for the engineering teams.

Elevate monitoring and observability standards, leading to faster root-cause analysis and fewer unknowns.

Contribute to a team culture where reliability, clarity, and engineering discipline are central.

Get job updates in your inbox

Subscribe to our newsletter and stay updated with the best job opportunities.