Experience Required:
3 - 6 years
Location:
Gurgaon
Department:
Product and Engineering
Working Days:
Alternate Saturdays working (1st and 3rd), WFH
🔧
Key Responsibilities
Design, implement, and maintain highly available and scalable infrastructure using AWS Cloud Services.
Build and manage Kubernetes clusters (EKS and self-managed) to ensure reliable deployment and scaling of microservices.
Develop Infrastructure-as-Code using Terraform, ensuring modular, reusable, and secure provisioning.
Containerize applications and optimize Docker images for performance and security.
Ensure CI/CD pipelines (Jenkins, GitHub Actions, etc.) are optimized for fast and secure deployments.
Drive SRE principles including monitoring, alerting, SLIs/SLOs, and incident response.
Set up and manage observability tools (Prometheus, Grafana, ELK, Datadog, etc.).
Automate routine tasks with scripting languages (Python, Bash, etc.).
Lead capacity planning, auto-scaling, and cost optimization efforts across cloud infrastructure.
Collaborate closely with development teams to enable DevSecOps best practices.
Participate in on-call rotations, handle outages calmly, and conduct postmortems.
🧰
Must-Have Technical Skills
Kubernetes (EKS, Helm, Operators)
Docker & Docker Compose
Terraform (modular, state management, remote backends)
AWS (EC2, VPC, S3, RDS, IAM, CloudWatch, ECS/EKS)
Linux system administration
Database tuning based on hardware configuration
CI/CD pipelines (Jenkins, GitLab CI, GitHub Actions)
Logging & monitoring tools: ELK, Prometheus, Grafana, CloudWatch
Site Reliability Engineering practices
Load balancing, autoscaling, and HA architectures
💡
Good-To-Have
GCP or Azure exposure
Security hardening of containers and infrastructure
Chaos engineering exposure
Knowledge of networking (DNS, firewalls, VPNs)
👤
Soft Skills
Strong problem-solving attitude; calm under pressure
Good documentation and communication skills
Ownership mindset with a drive to automate everything
Collaborative and proactive with cross-functional teams