We are seeking a
Senior DevOps Engineer
to join our Development team. In this role, you will be at the core of our technology stack — building, maintaining, and scaling high-availability systems that power our global services. You'll work with a diverse set of modern technologies, including Linux, Kubernetes, Infrastructure as Code (IaC) tools, monitoring systems, and distributed databases.
This position offers the opportunity to work at the heart of a high-performance global platform, directly influencing its scalability, reliability, and resilience. You will collaborate with experienced engineers, shape best practices, and contribute to the continuous evolution of our infrastructure — ensuring that our systems remain robust, secure, and efficient at scale.
Tasks to solve
Maintain and optimize core server infrastructure, including bare-metal servers, LXC containers, virtual machines, and cloud environments;
Operate and support core infrastructure services such as Nginx, Puppet, GitLab, Artifactory, Nexus, Harbor, Grafana, etc;
Manage and evolve infrastructure following the Infrastructure as Code (IaC) paradigm.
Handle and resolve incidents related to infrastructure operations;
Collaborate closely with cross-functional teams (network engineers, developers, and other technical stakeholders);
Design and implement high-availability, fault-tolerant, and scalable software solutions;
Monitor service performance and availability using modern observability tools, ensuring system reliability and optimal resource utilization.
This role is perfect for you if
4+ years of experience in Linux administration, DevOps, or Site Reliability Engineering (SRE);
Strong proficiency in automating tasks using Bash or similar scripting languages;
Solid understanding of networking fundamentals (TCP/IP stack, routing, DNS, etc.);
Hands-on experience managing bare-metal infrastructure in production environments;
Experience with configuration management systems such as Ansible or Puppet;
Experience with distributed databases (Elasticsearch, Cassandra, MongoDB, MySQL, PostgreSQL, etc.);
Expertise in Kubernetes administration and managing containerized workloads;
Experience with IaC tools such as FluxCD or ArgoCD;
Ability to design and implement high-performance, fault-tolerant, and secure infrastructure solutions;
Experience with monitoring and observability systems — Zabbix, VictoriaMetrics, Loki, Grafana, etc. — including building dashboards and configuring alerting.
What else we appreciate in our team
Programming experience in Python, Go;
Experience with cloud providers (AWS, GCP, Alibaba Cloud, or others);
Familiarity with distributed storage systems (Ceph);
Experience with service meshes (e.g., Istio);
Proven track record of working on high-load, large-scale distributed systems.