LLM Engineer
2 weeks ago
Role Overview
Own the design, fine-tuning, optimization, and production deployment of large language models (LLMs) for domain-specific use cases. You will build high-performance RAG systems, optimize prompts/agents, operate inference at scale, and champion engineering best practices while driving research and innovation.

Key Responsibilities
LLM Engineering: Design, fine-tune, and optimize models such as GPT, Claude, Gemini, LLaMA, and Falcon for domain-specific applications.
RAG Systems: Build and operate retrieval-augmented generation pipelines (ingestion, chunking, embedding, indexing, retrieval, re-ranking) using vector databases (FAISS, Pinecone, Weaviate, etc.).
Prompt/Agent Optimization: Develop prompt templates, chains, and agents with LangChain/LlamaIndex; implement guardrails, tool-use, and memory.
Model Deployment (LLMOps): Implement, monitor, and scale inference endpoints with MLflow, Docker, and Kubernetes; manage versioning/registry and safe rollouts (blue-green/canary).
Performance Optimization: Evaluate and continuously improve accuracy, latency, and cost (batching, caching/KV-cache, quantization, speculative decoding).
Collaboration & Mentoring: Review code, set best practices for AI software engineering, and mentor junior engineers.
Research & Innovation: Track advances in LLMs, multimodal AI, and open source; lead PoCs, benchmarking, and knowledge sharing.

Required Qualifications
Education: Bachelor's or Master's in Computer Science, Artificial Intelligence, or related field (PhD preferred).
Experience: 5+ years in machine learning/NLP; 2+ years working directly with LLMs or GenAI applications.
Technical Skills:
Proficiency in Python and ML frameworks (PyTorch/TensorFlow) and Hugging Face Transformers.
Hands-on with LangChain, LlamaIndex, or SDKs for OpenAI/Anthropic/Cohere/Gemini.
Strong understanding of embeddings, tokenization, and vector search/retrieval.
Familiarity with MLOps, CI/CD, and cloud (AWS/Azure/GCP); containerization with Docker/Kubernetes.
Experience integrating AI APIs (OpenAI, Anthropic, Cohere, Google Gemini).
Soft Skills: Excellent problem-solving and communication; comfortable leading projects and mentoring teammates.

Preferred/Bonus
Experience with model distillation and fine-tuning open-source LLMs (LoRA/QLoRA, PEFT).
Exposure to multimodal AI (text + image + audio/voice), TTS/ASR, VLMs.
Familiarity with AI safety, bias/fairness, privacy, and governance/compliance frameworks.
Cost/performance tuning: quantization (INT8/INT4), speculative decoding, throughput optimization.

Success Metrics (KPIs)
Model quality (task-specific metrics: accuracy/recall, hallucination rate, BLEU/ROUGE/WER as applicable).
System performance & cost (P95 latency, throughput, cost per request).
Reliability (SLO/SLA, error rates) and delivery velocity (lead time, deployment frequency).
Knowledge impact (PoC → production conversions, docs/best practices, mentoring outcomes).

Tools & Environment
Model/Serving: HF Transformers, vLLM/TensorRT-LLM, Triton, Ray/Modal (as applicable).
Vector/RAG: FAISS, Pinecone, Weaviate, Milvus; re-ranking (e.g., Cross-Encoder/ColBERT).
Ops/Observability: MLflow, Prometheus/Grafana, OpenTelemetry, Weights & Biases.
Data: Airflow/Prefect, dbt, Spark (as needed).

Benefits (customizable)
Competitive compensation with performance/PoC success bonuses.
Learning budget/certifications and conference attendance.
Dedicated GPU credits/resources for R&D; open-source-friendly environment.
Comprehensive insurance and flexible work arrangements.
-
LLM Engineer
1 week ago
Hanoi, Hanoi, Vietnam FPT Software AI Center Full time ₫60,000,000 - ₫120,000,000 per year
We are seeking a skilled professional to design, develop, and deploy scalable LLM and agent-based AI solutions for thousands of users. You will architect multi-agent workflows, build robust APIs and RAG pipelines, and deliver advanced multimodal features. The role requires expertise in frameworks such as LangChain, LangGraph, and OpenAI Agents SDK, as well...
-
Mid - Senior AI Engineer (LLM)
1 week ago
Hanoi, Hanoi, Vietnam Eastgate Software - We Drive Digital Transformation Full time $30,000 - $100,000 per year
Company Description
Eastgate Software is a Vietnamese software development company with consulting branches in Germany, Japan, Singapore, and Australia. Renowned for its quality-driven results, Eastgate Software has become a trusted outsourcing partner for SMEs and Fortune 500 companies, including a long-term strategic partnership with SIEMENS Mobility. The...
-
AI/LLM Intern
1 week ago
Hanoi, Hanoi, Vietnam EOV SOLUTIONS Full time ₫480,000 - ₫10,480,000 per year
The intern will be put in charge of an independent R&D project with the following main tasks:
Study AI/LLM system architecture and practical deployment methods.
Carry out in-depth research and experimentation on OSS projects, including:
AI OCR for technical documents.
A RAG AI assistant system for knowledge of...
-
Senior AI Engineer
1 week ago
Hanoi, Hanoi, Vietnam Akila Full time ₫80,000,000 - ₫200,000,000 per year
Top 3 reasons to join us
We are a green team, not a mean team
Attractive package
Flexible working time
Job description
Akila is looking for a Senior AI Engineer to work on machine learning algorithms and frontier theoretical research, spanning machine learning, deep learning, and data mining. The candidate will be working across multiple teams and...
-
Lead AI Engineer
1 week ago
Hanoi, Hanoi, Vietnam sun Full time ₫1,200,000 - ₫2,400,000 per year
WHAT WE DO:
At Sun Asterisk, we are committed to building cutting-edge AI solutions that empower businesses and transform industries.
As an AI Engineer Lead, you will:
Design and implement agentic AI workflows, integrating LLMs with APIs, tools, and external systems using frameworks such as LangChain, LlamaIndex, or Semantic Kernel
Collaborate with clients and...
-
Fullstack Engineer
1 week ago
Hanoi, Hanoi, Vietnam sun Full time $500,000 - $1,000,000 per year
WHO WE ARE:
We are looking for an AI Fullstack Engineer who specializes in integrating LLMs into production systems.
You will work on product engineering, leveraging existing AI platforms (OpenAI, Azure OpenAI, Microsoft AI services…) to deliver scalable, production-ready solutions.
WHAT WE DO
At Sun Asterisk, we build cutting-edge AI solutions that leverage...
-
Generative AI Engineer
4 days ago
Hanoi, Hanoi, Vietnam Qualcomm Full time $150,000 - $200,000 per year
Company: Qualcomm Vietnam Company Limited, Hanoi Branch Office
Job Area: Engineering Group, Engineering Group > Machine Learning Engineering
General Summary: Qualcomm AI Research is looking for world-class algorithm engineers in general domain machine learning, especially deep learning, generative AI, LLM, LVM. Come join a high-caliber team of engineers building...
-
Generative AI Engineer
2 days ago
Hanoi, Hanoi, Vietnam Qualcomm Full time $100,000 - $150,000 per year
Company: Qualcomm Vietnam Company Limited, Hanoi Branch Office
Job Area: Engineering Group, Engineering Group > Machine Learning Engineering
General Summary: Qualcomm AI Research is looking for world-class algorithm engineers in general domain machine learning, especially deep learning, generative AI, LLM, LVM. Come join a high-caliber team of engineers building...
-
Lead AI Engineer
1 week ago
Hanoi, Hanoi, Vietnam VinSmart Future Full time $120,000 - $240,000 per year
Top 3 reasons to join us
Competitive Salary & Benefits
Training course and certificates
The preferences when using the services of the Vin
Job description
About the Role
We are seeking a Lead AI Engineer to spearhead the development of large-scale AI systems and define the next generation of intelligent, human-centered products.
In this role, you will take...
-
AI Engineer
6 days ago
Hanoi, Hanoi, Vietnam CÔNG TY CỔ PHẦN PT HUB Full time ₫8,000,000 - ₫12,000,000 per year
Job description
Job objectives
Build, deploy, and optimize large language models (LLMs) for real-world applications such as chatbots, virtual assistants, data analysis, question-answering systems, and enterprise business process automation.
Main tasks
Research & Development (R&D):
Tìm...