Responsibility:
- Designing, implementing and maintaining a cloud infrastructure platform ensuring balance between high availability, reliability, complexity, security, scalability and cost depending on each stage of the product (using Infrastructure as Code and GitOps)
- Participating in CI/CD system designing and implementing
- Performing root cause analysis for errors and investigating and resolving technical issues in all development environments
- Optimizing performance, cost of the whole system
- Designing, implementing, maintaining, optimizing the observability systems (include monitoring, logging, distributed tracing, continuous profiling and APM)
- Supporting development team in designing, implementing, maintaining infrastructure for new services/applications
- Researching, designing, implementing toolchains and workflows that enable self-service capabilities for development team, as well as improve stability and scalability of the system
- Writing system documentations and training other team members
- Collaborating with security team to patch infrastructure vulnerabilities, implement security components
Must have:
- Ability to manage time efficiently, working independently and in a team
- Ability to learn quickly and apply new skills or technologies effectively
- Ability to debug and analyze problems quickly
- Ability to multitask and retain a high attention to detail
- 3+ YoE working with at least one of AWS, GCP, Azure (prefer AWS)
- 3+ YoE working with Container and Kubernetes
- Good understanding of computer networking
- Basic understanding of linux
- Basic experience with shell script and at least one programming language (prefer python, nodejs, go)
- Strong experience in building CI/CD process
- Strong experience in building and managing monitoring and logging system
- Knowledge of the source control and its related concepts
- Well-Knowledge in software development process
- Experience with building infrastructure for high available and scalable applications
- Bachelor's degree or higher in computer science or related field
Nice to have
- Hand-on experience with Infrastructure as Code (prefer Terraform or CloudFormation)
- Good knowledge of GitOps concept and hand-on experience with at least 1 GitOps tool
- Experience in developing applications
- Experience in building permission matrix
- Experience in integrating internal systems to a Single-Sign On system