We are looking for a Senior Machine Learning Engineer to join our team and contribute to the success of the GenAI initiative. In this position, you will focus on building, enhancing, and optimizing backend systems to support LLM-powered applications using OpenAI APIs. Your expertise in MLOps, CI/CD, observability tools, and cloud-native platforms will be key to ensuring the scalability, reliability, and efficiency of AI-driven solutions.
Responsibilities
Design and enhance backend systems to support AI and LLM-based applications
Deploy and manage LLM applications within cloud environments
Optimize AI systems to meet performance and reliability standards
Develop automated deployment workflows using CI/CD pipelines
Monitor and maintain the stability of AI services
Establish observability and logging systems to track LLM API performance
Work with DevOps teams to enhance workflows and ensure system reliability
Collaborate with AI and Data Science teams to improve and expand application capabilities
Utilize cloud platforms, particularly Azure, to deploy and scale AI solutions
Create and implement APIs and microservices to enable AI-powered functionalities
Requirements
A minimum of 3 years of experience in Machine Learning Engineering with a focus on backend and software development
Extensive experience integrating and working with OpenAI APIs and similar AI services
Proficiency in using MLOps tools such as Orion, ArgoCD, and Opsera for deployment automation
Hands-on experience with observability and monitoring tools, including Grafana, Dynatrace, and ThoughtSpot
Strong knowledge of cloud platforms, especially Azure, along with expertise in Apache Spark and Databricks
Advanced Python skills for backend development and implementation
Demonstrated experience in designing and building APIs and microservices architectures
Fluency in English, both written and spoken, at a B2+ level or higher
Nice to have
Understanding of Data Science principles and methodologies
Experience working with Large Language Models (LLMs)
Familiarity with Natural Language Processing (NLP) techniques and tools
We offer
Career plan and real growth opportunities
Unlimited access to LinkedIn Learning solutions
Constant training, mentoring, online corporate courses, eLearning and more
English classes with a certified teacher
Support for employee initiatives (Algorithms club, Toastmasters, Agile club and more)
Enjoyable working environment (gaming room, napping area, amenities, events, sports teams and more)
Flexible work schedule and dress code
Collaborate in a multicultural environment and share best practices from around the globe
Hired directly by EPAM & 100% under payroll
Statutory benefits (IMSS, INFONAVIT, 25% vacation bonus)
Insurance: life, and major medical expenses with dental & vision coverage (for the employee and direct family members)
13% employee savings fund, capped at the legal limit
Grocery coupons
30-day December bonus
Employee Stock Purchase Plan
12 vacation days
Official Mexican holidays, plus 5 extra holidays (Maundy Thursday and Good Friday, November 2nd, December 24th & 31st)
Monthly non-taxable amount for the electricity and internet bills
EPAM is a leading global provider of digital platform engineering and development services. We are committed to having a positive impact on our customers, our employees, and our communities. We embrace a dynamic and inclusive culture. Here you will collaborate with multi-national teams, contribute to a myriad of innovative projects that deliver the most creative and cutting-edge solutions, and have an opportunity to continuously learn and grow. No matter where you are located, you will join a dedicated, creative, and diverse community that will help you discover your fullest potential.
By applying to our role, you are agreeing that your personal data may be used as set out in EPAM's Privacy Notice and Policy.