We are looking for a Machine Learning Engineer to join our team and contribute to the GenAI initiative. In this position, you will focus on creating, enhancing, and fine-tuning backend systems that drive LLM-powered applications utilizing OpenAI APIs. Your expertise in MLOps, CI/CD, observability, and cloud-native tools will be critical in ensuring the performance, reliability, and scalability of AI-driven solutions.

Responsibilities

Build and enhance backend systems for AI and LLM-powered applications

Integrate LLM applications into cloud platforms and manage their operations

Scale AI systems to meet performance and reliability goals

Create CI/CD pipelines to enable automated deployment processes

Monitor the performance of AI services to ensure system stability

Set up observability and logging to track the performance of LLM APIs

Work with DevOps teams to optimize workflows and improve system reliability

Collaborate with AI and Data Science teams to expand and refine application features

Utilize cloud platforms, particularly Azure, for hosting and scaling AI applications

Design APIs and microservices architecture to enable AI functionalities

Requirements

A minimum of 2 years of experience in Machine Learning Engineering with a focus on backend and software systems

Extensive experience in integrating OpenAI APIs and AI services

Proficiency with MLOps tools such as Orion, ArgoCD, and Opsera for automation of deployments

Experience using monitoring and observability platforms like Grafana, Dynatrace, or ThoughtSpot

Strong knowledge of cloud infrastructure, with a preference for Azure, as well as expertise in Apache Spark and Databricks

Advanced Python programming skills for backend development

Proven experience in developing APIs and designing microservices architectures

Fluency in English, both written and spoken, with a proficiency level of B2+ or higher

Nice to have

Understanding of Data Science concepts and methodologies

Experience working with Large Language Models (LLMs)

Familiarity with Natural Language Processing (NLP) techniques and tools

We offer

Career plan and real growth opportunities

Unlimited access to LinkedIn learning solutions

Constant training, mentoring, online corporate courses, eLearning and more

English classes with a certified teacher

Support for employee’s initiatives (Algorithms club, toastmasters, agile club and more)

Enjoyable working environment (Gaming room, napping area, amenities, events, sport teams and more)

Flexible work schedule and dress code

Collaborate in a multicultural environment and share best practices from around the globe

Hired directly by EPAM \& 100% under payroll

Law benefits (IMSS, INFONAVIT, 25% vacation bonus)

Major medical expenses insurance: Life, Major medical expenses with dental \& visual coverage (for the employee and direct family members)

13 % employee savings fund, capped to the law limit

Grocery coupons

30 days December bonus

Employee Stock Purchase Plan

12 vacations days

Official Mexican holidays, plus 5 extra holidays (Maundry Thursday and Friday, November 2nd, December 24th \& 31st)

Monthly non-taxable amount for the electricity and internet bills

EPAM is a leading global provider of digital platform engineering and development services. We are committed to having a positive impact on our customers, our employees, and our communities. We embrace a dynamic and inclusive culture. Here you will collaborate with multi-national teams, contribute to a myriad of innovative projects that deliver the most creative and cutting-edge solutions, and have an opportunity to continuously learn and grow. No matter where you are located, you will join a dedicated, creative, and diverse community that will help you discover your fullest potential.

By applying to our role, you are agreeing that your personal data may be used as in set out in EPAM´s Privacy Notice and Policy.

ML Engineer

Job Description

Login / Register

👋 Let's find you a Dream Job

Check Your Email!

Get job updates in your inbox