👨🏻‍💻 postech.work

Senior Data Platform Operations Engineer

EPAM Systems • 🌐 Remote

Remote Posted 1 day, 11 hours ago

Job Description

We are seeking a highly skilled Senior Data Platform Operations Engineer to ensure the stability, security, performance, and cost efficiency of our global enterprise data platform.

This role is pivotal in providing 8/5 operational coverage within a follow-the-sun 24x5 support model, ensuring the platform consistently supports business activities worldwide. The ideal candidate will demonstrate expertise in cloud-based data platforms, a strong operational mindset, and a proactive approach to optimizing performance, enhancing observability, and managing costs.

Responsibilities

Maintain a stable, secure, and performant enterprise data platform (Snowflake, AWS data stack, dbt, orchestration tools, BI/analytics, etc.)

Provide operational coverage within an 8/5 support model and participate in a 24/7 on-call rotation for critical incidents

Implement robust monitoring, alerting, and observability solutions to facilitate proactive incident detection and resolution

Perform platform upgrades, patching, and configuration management in alignment with security and compliance requirements

Continuously tune system performance to meet evolving business needs

Use holistic observability frameworks covering infrastructure, data pipelines, and platform services to execute monitoring activities

Deliver actionable operational insights through monitoring dashboards and reporting

Identify and execute process automation to improve efficiency and reduce manual interventions

Propose and implement continuous improvements to advance platform resilience, scalability, and cost-effectiveness

Contribute to infrastructure-as-code and configuration-as-code practices for consistent, repeatable operations

Requirements

Background in managing cloud-native data platforms for over 3 years (e.g., Snowflake, Databricks, BigQuery, or similar)

Expertise in cloud infrastructure (AWS) with emphasis on operations, automation, and cost governance

Skills in monitoring and observability tools (Datadog, Prometheus, Grafana, ELK, CloudWatch, etc.)

Knowledge of Infrastructure as Code (Terraform, Pulumi, Ansible) and configuration management practices

Understanding of networking, security, and compliance in cloud environments

Competency in problem-solving with a proactive, service-oriented mindset

Flexibility to work in a global operations environment with on-call responsibilities

Qualifications in clear communication and collaboration with engineering, data, and business stakeholders

Commitment to continuous improvement and operational excellence

Proficiency in English language at an Upper-Intermediate level (B2) or higher

Nice to have

Showcase of implementing FinOps frameworks and cost optimization practices

Background in working within regulated industries (pharma, healthcare, finance) in compliance-driven environments

Familiarity with modern data stack tools (dbt, Dagster/Airflow, ThoughtSpot, Tableau, Power BI)

Understanding of SRE (Site Reliability Engineering) principles and practices

We offer

Career plan and real growth opportunities

Unlimited access to LinkedIn learning solutions

Constant training, mentoring, online corporate courses, eLearning and more

English classes with a certified teacher

Support for employee’s initiatives (Algorithms club, toastmasters, agile club and more)

Enjoyable working environment (Gaming room, napping area, amenities, events, sport teams and more)

Flexible work schedule and dress code

Collaborate in a multicultural environment and share best practices from around the globe

Hired directly by EPAM \& 100% under payroll

Law benefits (IMSS, INFONAVIT, 25% vacation bonus)

Major medical expenses insurance: Life, Major medical expenses with dental \& visual coverage (for the employee and direct family members)

13 % employee savings fund, capped to the law limit

Grocery coupons

30 days December bonus

Employee Stock Purchase Plan

12 vacations days

Official Mexican holidays, plus 5 extra holidays (Maundry Thursday and Friday, November 2nd, December 24th \& 31st)

Monthly non-taxable amount for the electricity and internet bills

EPAM is a leading global provider of digital platform engineering and development services. We are committed to having a positive impact on our customers, our employees, and our communities. We embrace a dynamic and inclusive culture. Here you will collaborate with multi-national teams, contribute to a myriad of innovative projects that deliver the most creative and cutting-edge solutions, and have an opportunity to continuously learn and grow. No matter where you are located, you will join a dedicated, creative, and diverse community that will help you discover your fullest potential.

By applying to our role, you are agreeing that your personal data may be used as in set out in EPAM´s Privacy Notice and Policy.

Get job updates in your inbox

Subscribe to our newsletter and stay updated with the best job opportunities.