INTRODUCTION
SotaTek (State-of-the-Art-Technology) offers leading tech experts with strong determination, enthusiasm and commitment to providing the most hi-tech IT services, with the ultimate goal to enable your business success through digital transformation. We deliver sustainable Software Development, including Web/App, Blockchain, AI \& Machine Learning, ERP with cost-effective solutions.
During 9+ years of development, we have gathered over 1000 talented IT consultants and developers who share deep expertise to successfully provide full-cycle IT services to our Clients from 30+ nations worldwide, with 500+ projects in various industries, such as Finance, Health Care, Retail, Real Estate, Education, Media \& Entertainment.
SotaLabs is a member of the Sota Holdings Group. We specialize in developing exceptional web and mobile apps. We deliver our products including TopspotAI; ChatWOW AI, ShotX AI Headshot Studio, NoteX AI Note Taker. Our team comprises first principle thinkers who strive to create products rooted in customer needs, delivering true value to them.
What sets SotaLabs apart is how we: Encourage first-principle thinking; Prioritize customer obsession over competition-focused strategies; Think long-term; Empower employees to make decisions based on what's best for the company; Retain only the most efficient workers
Company size: 1000 - 2000
Headquarter: Hanoi, Vietnam
Representative offices: US, Japan, Korea and Australia
Main clients: US \& NA, EU, UK, ANZ, and Asia
JOB VACANCY
Position: AI Engineer (AI Agent Product, Global Market)
Level: Junior - Middle
Working model: Fulltime
Location: CIC Tower, No.2 Nguyen Thi Due, Yen Hoa, Cau Giay, Ha Noi
JOB DESCRIPTION
RAG Development
Design, build, and optimize end-to-end RAG pipelines (ingestion → indexing → retrieval → reranking → generation).
Integrate and tune vector databases (Pinecone, Weaviate, Milvus, Chroma, etc.).
Improve chunking strategies, embedding quality, and similarity search performance.
Build evaluation pipelines for RAG (precision/recall, context relevance, response quality).
Agentic AI / AI Agents
Develop AI Agents using frameworks like LangGraph, AutoGen, or LlamaIndex Agents.
Design multi-agent workflows (planner → executor → evaluator).
Implement tool/function calling, serverless integrations, and dynamic reasoning workflows.
Model Context Protocol (MCP)
Design and implement MCP servers exposing tools, APIs, and datasets to LLMs.
Integrate MCP into application stack (frontend ↔ backend ↔ LLM).
Build custom MCP tools (database connectors, internal API tools, scraping tools, etc.).
System Architecture \& MLOps
Build and maintain inference infrastructure (Docker, Kubernetes, GPU servers).
Implement logging, monitoring, and observability for RAG/Agent pipelines.
Optimize model serving cost and performance (quantization, vLLM, batching, TensorRT, etc.).
JOB REQUIREMENTS
Must have
Strong proficiency in Python (FastAPI / Flask). NodeJS is a plus.
Practical experience building production-level RAG systems.
Deep understanding of LLMs, embeddings, vector databases, and reranking.
Hands-on experience with RAG/Agent frameworks: LangChain, LlamaIndex, LangGraph, etc.
Familiarity with Docker, Linux, and basic DevOps.
Strong debugging and system thinking skills.
Nice to have
Experience with Agentic AI or multi-agent architectures.
Experience working with MCP servers \& custom tools.
Familiarity with GPU inference and optimization (vLLM, TensorRT, quantization).
Knowledge of self-hosting LLMs (Ollama, vLLM, HF TGI, LM Studio, etc.).
Experience building evaluation frameworks for RAG or Agents.
COMPENSATION \& BENEFITS
Flexible working regime and health care:
Flexible timekeeping (from 8:00 - 9:00 to 17:30 - 18:30)
Minimum 14 paid leaves per annum for all employees after probation
01-day remote work per month
A flexitime allowance of 90-180 minutes per month for employees
01 hour paid leave per day for women having children under 12 months
Social insurance, health insurance, unemployment insurance and MIC care insurance
Transparent and fair benefits:
Saturday \& Sunday OFF, Overtime pay is 150%, 200%, 300% as per labor law;
13th-month salary, Performance Bonus
Bonus Policy: Public holidays (2/9, 30/4, 1/5, 1/1,...); Personal Performances; Excellent Team; Performance bonus in Token of the project;..
Men’s Day, Women’s Day, Children’s Day, Mid-Autumn Festival and other benefits under the provisions of the company
Dynamic environment and open culture:
Year-end party, sports day, yearly company trip and quarterly team building,...with a generous budget
Socialize with colleagues through monthly Happy Hour
Monthly allowance when joining clubs: Soccer, Swimming, Yoga, Music,...
Nice \& modern working space with young, dynamic \& friendly colleagues and free coffee, tea, drinks,...
Flat, open and sharing culture with friendly management team; outsourcing company with product mindset
Strong learning culture:
Free training courses for technical and soft skills (presentation skills, communication skills, foreign language courses,...)
Account to log in to our online learning system, which contains thousands of valuable lectures (LMS)
Participate in workshops, seminars, tech talk,... with sharing from experts inside and outside the company
Working opportunities with technical gurus who built and operated world-class applications with millions of users.
Job Type: Full-time
Pay: 35,000,000₫ per month