We are looking for a Machine Learning Engineer to join our team and contribute to the GenAI initiative. In this position, you will focus on creating, enhancing, and fine-tuning backend systems that drive LLM-powered applications utilizing OpenAI APIs. Your expertise in MLOps, CI/CD, observability, and cloud-native tools will be critical in ensuring the performance, reliability, and scalability of AI-driven solutions.
Responsibilities
Build and enhance backend systems for AI and LLM-powered applications
Integrate LLM applications into cloud platforms and manage their operations
Scale AI systems to meet performance and reliability goals
Create CI/CD pipelines to enable automated deployment processes
Monitor the performance of AI services to ensure system stability
Set up observability and logging to track the performance of LLM APIs
Work with DevOps teams to optimize workflows and improve system reliability
Collaborate with AI and Data Science teams to expand and refine application features
Utilize cloud platforms, particularly Azure, for hosting and scaling AI applications
Design APIs and microservices architecture to enable AI functionalities
Requirements
A minimum of 2 years of experience in Machine Learning Engineering with a focus on backend and software systems
Extensive experience in integrating OpenAI APIs and AI services
Proficiency with MLOps tools such as Orion, ArgoCD, and Opsera for automation of deployments
Experience using monitoring and observability platforms like Grafana, Dynatrace, or ThoughtSpot
Strong knowledge of cloud infrastructure, with a preference for Azure, as well as expertise in Apache Spark and Databricks
Advanced Python programming skills for backend development
Proven experience in developing APIs and designing microservices architectures
Fluency in English, both written and spoken, with a proficiency level of B2+ or higher
Nice to have
Understanding of Data Science concepts and methodologies
Experience working with Large Language Models (LLMs)
Familiarity with Natural Language Processing (NLP) techniques and tools
We offer
Career plan and real growth opportunities
Unlimited access to LinkedIn Learning solutions
Constant training, mentoring, online corporate courses, eLearning and more
English classes with a certified teacher
Support for employee initiatives (Algorithms club, Toastmasters, Agile club and more)
Enjoyable working environment (gaming room, napping area, amenities, events, sports teams and more)
Flexible work schedule and dress code
Collaborate in a multicultural environment and share best practices from around the globe
Hired directly by EPAM & 100% under payroll
Benefits required by law (IMSS, INFONAVIT, 25% vacation bonus)
Insurance: life and major medical expenses, with dental & vision coverage (for the employee and direct family members)
13% employee savings fund, capped at the legal limit
Grocery coupons
30-day December bonus
Employee Stock Purchase Plan
12 vacation days
Official Mexican holidays, plus 5 extra holidays (Maundy Thursday and Good Friday, November 2nd, December 24th & 31st)
Monthly non-taxable amount for the electricity and internet bills
EPAM is a leading global provider of digital platform engineering and development services. We are committed to having a positive impact on our customers, our employees, and our communities. We embrace a dynamic and inclusive culture. Here you will collaborate with multi-national teams, contribute to a myriad of innovative projects that deliver the most creative and cutting-edge solutions, and have an opportunity to continuously learn and grow. No matter where you are located, you will join a dedicated, creative, and diverse community that will help you discover your fullest potential.
By applying to our role, you are agreeing that your personal data may be used as set out in EPAM's Privacy Notice and Policy.