👨🏻‍💻 postech.work

AI Engineer (AI Agent Product, Global Market)

CÔNG TY CỔ PHẦN CÔNG NGHỆ SOTATEK • 🌐 Remote

Remote Posted 4 days, 13 hours ago

Job Description

INTRODUCTION

SotaTek (State-of-the-Art-Technology) offers leading tech experts with strong determination, enthusiasm and commitment to providing the most hi-tech IT services, with the ultimate goal to enable your business success through digital transformation. We deliver sustainable Software Development, including Web/App, Blockchain, AI \& Machine Learning, ERP with cost-effective solutions.

During 9+ years of development, we have gathered over 1000 talented IT consultants and developers who share deep expertise to successfully provide full-cycle IT services to our Clients from 30+ nations worldwide, with 500+ projects in various industries, such as Finance, Health Care, Retail, Real Estate, Education, Media \& Entertainment.

SotaLabs is a member of the Sota Holdings Group. We specialize in developing exceptional web and mobile apps. We deliver our products including TopspotAI; ChatWOW AI, ShotX AI Headshot Studio, NoteX AI Note Taker. Our team comprises first principle thinkers who strive to create products rooted in customer needs, delivering true value to them.

What sets SotaLabs apart is how we: Encourage first-principle thinking; Prioritize customer obsession over competition-focused strategies; Think long-term; Empower employees to make decisions based on what's best for the company; Retain only the most efficient workers

Company size: 1000 - 2000

Headquarter: Hanoi, Vietnam

Representative offices: US, Japan, Korea and Australia

Main clients: US \& NA, EU, UK, ANZ, and Asia

JOB VACANCY

Position: AI Engineer (AI Agent Product, Global Market)

Level: Junior - Middle

Working model: Fulltime

Location: CIC Tower, No.2 Nguyen Thi Due, Yen Hoa, Cau Giay, Ha Noi

JOB DESCRIPTION

RAG Development

Design, build, and optimize end-to-end RAG pipelines (ingestion → indexing → retrieval → reranking → generation).

Integrate and tune vector databases (Pinecone, Weaviate, Milvus, Chroma, etc.).

Improve chunking strategies, embedding quality, and similarity search performance.

Build evaluation pipelines for RAG (precision/recall, context relevance, response quality).

Agentic AI / AI Agents

Develop AI Agents using frameworks like LangGraph, AutoGen, or LlamaIndex Agents.

Design multi-agent workflows (planner → executor → evaluator).

Implement tool/function calling, serverless integrations, and dynamic reasoning workflows.

Model Context Protocol (MCP)

Design and implement MCP servers exposing tools, APIs, and datasets to LLMs.

Integrate MCP into application stack (frontend ↔ backend ↔ LLM).

Build custom MCP tools (database connectors, internal API tools, scraping tools, etc.).

System Architecture \& MLOps

Build and maintain inference infrastructure (Docker, Kubernetes, GPU servers).

Implement logging, monitoring, and observability for RAG/Agent pipelines.

Optimize model serving cost and performance (quantization, vLLM, batching, TensorRT, etc.).

JOB REQUIREMENTS

Must have

Strong proficiency in Python (FastAPI / Flask). NodeJS is a plus.

Practical experience building production-level RAG systems.

Deep understanding of LLMs, embeddings, vector databases, and reranking.

Hands-on experience with RAG/Agent frameworks: LangChain, LlamaIndex, LangGraph, etc.

Familiarity with Docker, Linux, and basic DevOps.

Strong debugging and system thinking skills.

Nice to have

Experience with Agentic AI or multi-agent architectures.

Experience working with MCP servers \& custom tools.

Familiarity with GPU inference and optimization (vLLM, TensorRT, quantization).

Knowledge of self-hosting LLMs (Ollama, vLLM, HF TGI, LM Studio, etc.).

Experience building evaluation frameworks for RAG or Agents.

COMPENSATION \& BENEFITS

Flexible working regime and health care:

Flexible timekeeping (from 8:00 - 9:00 to 17:30 - 18:30)

Minimum 14 paid leaves per annum for all employees after probation

01-day remote work per month

A flexitime allowance of 90-180 minutes per month for employees

01 hour paid leave per day for women having children under 12 months

Social insurance, health insurance, unemployment insurance and MIC care insurance

Transparent and fair benefits:

Saturday \& Sunday OFF, Overtime pay is 150%, 200%, 300% as per labor law;

13th-month salary, Performance Bonus

Bonus Policy: Public holidays (2/9, 30/4, 1/5, 1/1,...); Personal Performances; Excellent Team; Performance bonus in Token of the project;..

Men’s Day, Women’s Day, Children’s Day, Mid-Autumn Festival and other benefits under the provisions of the company

Dynamic environment and open culture:

Year-end party, sports day, yearly company trip and quarterly team building,...with a generous budget

Socialize with colleagues through monthly Happy Hour

Monthly allowance when joining clubs: Soccer, Swimming, Yoga, Music,...

Nice \& modern working space with young, dynamic \& friendly colleagues and free coffee, tea, drinks,...

Flat, open and sharing culture with friendly management team; outsourcing company with product mindset

Strong learning culture:

Free training courses for technical and soft skills (presentation skills, communication skills, foreign language courses,...)

Account to log in to our online learning system, which contains thousands of valuable lectures (LMS)

Participate in workshops, seminars, tech talk,... with sharing from experts inside and outside the company

Working opportunities with technical gurus who built and operated world-class applications with millions of users.

Job Type: Full-time

Pay: 35,000,000₫ per month

Get job updates in your inbox

Subscribe to our newsletter and stay updated with the best job opportunities.