Site Reliability Engineer (SRE) – Blockchain Infrastructure
We are looking for a Site Reliability Engineer (SRE) to join our infrastructure team. Our mission is to ensure reliability, performance, and scalability of blockchain APIs and services that power users worldwide.
Your role - As an SRE Engineer, you will:
Support and evolve our monitoring and alerting systems for blockchain APIs.
Troubleshoot incidents, perform root cause analysis, and improve system reliability.
Automate deployment and operational workflows for blockchain nodes and services.
Collaborate with developers and infrastructure engineers to ensure smooth delivery of new features.
Help optimize system performance and resource usage across bare metal and cloud environments.
What we're looking for
Experience with monitoring/observability stacks (Prometheus, Grafana, Loki, VictoriaMetrics, or similar).
Experience with automated testing in infrastructure and/or services.
Basic programming skills in Go and/or Python.
Familiarity with containers and orchestration (Docker, Kubernetes is a plus).
Hands-on experience with CI/CD pipelines (GitHub Actions, ArgoCD, etc.).
Experience with high-availability systems and troubleshooting performance issues.
Interest in blockchain technologies and willingness to dive into new protocols.
Nice to have
Previous experience running blockchain nodes.
Linux systems administration skills.
Familiarity with infrastructure-as-code (Terraform, Ansible, or similar).
Knowledge of networking (load balancing, DNS, firewalls, BGP)