Contribute to the infrastructure roadmap and deliver critical infrastructure changes for the project
Configure and maintain the CICD pipelines incorporating streamlined change and release management process
Maintain tools for configuration management, build, continuous integration, and deployment, reporting, monitoring etc.
Implement and operate monitoring and alerting tools
Implement scripts to automate server, CICD and other relevant tasks
Participate and implement capacity planning and risk management
Explore and adopt new and creative DevOps approaches to improve production reliability and availability.
Implement security practices that comply with standards to protect the organization's data \& infrastructure.
Working in highly collaborative teams and building quality environments.
Ability to effectively prioritize and execute tasks in a high-pressure, fast paced environment
Requirements
5+ years of relevant DevOps experience
In-depth knowledge of programming languages such as
Python,Ruby,Go,Javascript and Typescript
Hands-on experience working on
AWS
Hands-on experience with containerization using
Docker, and ECS/Kubernetes
Good understanding of virtualization and networking concepts
Hands-on experience with configuration management systems such as Terraform, Packer and Ansible
Hands-on experience with production deployments such as ECS/Kubernetes and automating provisioning with IaC best practices.
Hands-on experience in linux server administration, and shell scripting with Bash, and good to have other scripting languages.
Knowledge in system architecture is a plus
Has knowledge with tools such as Prometheus, Thanos, Grafana, ELK
Experience with highly scalable distributed systems
Breadth of knowledge - OS, networking, distributed computing, cloud computing
Manage the operations of cloud infrastructure and application services
Strong technical aptitude including excellent troubleshooting and communications skills
Good understanding and working knowledge of Agile projects such as SCRUM and KANBAN
Bonus: Familiarity AI/ML infrastructure