Back to Jobs
Llamaindex

Multimodal AI Engineer, Document Understanding at Llamaindex

Llamaindex San Francisco, CA

Job Description

Join us and help shape the future of AI by redefining document workflows with AI agents.About the Role:We are seeking exceptional AI engineers to join our core document understanding team. You will work at the intersection of computer vision, natural language processing, and production ML systems to push the boundaries of what's possible in document parsing and understanding.Our document understanding team builds the intelligence behind LlamaParse, LlamaExtract, and our other processing products. These systems are processing millions of complex documents including PDFs, PowerPoints, Word documents, and spreadsheets. Your work will directly impact thousands of developers building RAG applications and document agents, while also contributing to our open-source frameworks that shape how the industry approaches document processing.Depending on your background and interests, you might focus more on data curation and evaluation, model fine-tuning and experimentation, or ML infrastructure and production systems. We're hiring multiple people and will work with you to find the best fit.Responsibilities:Develop, train, and optimize machine learning models for document structure understanding, table extraction, layout analysis, and multimodal content processingBuild robust data pipelines, evaluation frameworks, and experimentation infrastructureDesign and implement production ML systems that handle complex, real-world documents at scaleStay current with latest advances in vision-language models, document AI, and multimodal learningCollaborate with engineering teams to integrate ML innovations into production APIsContribute to both our open-source frameworks and enterprise offeringsDrive technical decisions while balancing research exploration with product deliveryRequired Qualifications:3-7 years of experience in machine learning engineering or applied researchStrong software engineering fundamentals with production Python experience (modern tooling: uv, ruff, mypy, Pydantic)Hands-on experience training, fine-tuning, or deploying ML models in productionDeep understanding of modern ML techniques, particularly in computer vision, NLP, or multimodal learningExperience with at least one of: data pipeline development, model training/fine-tuning, or ML infrastructureAbility to read and implement from research papers and technical specificationsTrack record of executing with high intensity in fast-paced environmentsStrong technical communication skills and comfort with open-source collaborationPreferred Qualifications:Experience with vision-language models, transformer architectures, or model fine-tuning (LoRA, QLoRA)Experience building evaluation frameworks, benchmarks, or data quality pipelinesExperience with model serving frameworks (vLLM, TensorRT, ONNX) or MLOps toolsExperience specifically with document understanding, OCR, or layout analysisContributions to open-source ML projects or frameworksExperience with LLM applications and RAG systemsStrong understanding of model optimization techniques (quantization, distillation, pruning)Experience with Docker/Kubernetes and distributed systemsActive participation in ML research communityLocation:We offer a hybrid-friendly culture based out of our downtown San Francisco office. Remote candidates will be considered for exceptional fits.Why Join Us?Impactful Mission: Work on innovative AI products that redefine how knowledge is accessed and utilized. Your models will process millions of documents and directly impact thousands of developers.Cutting-Edge Technology: Work with the latest vision-language models, contribute to open-source frameworks used industry-wide, and shape the future of document AI.Collaborative Team: Join a focused team of passionate engineers and researchers committed to pushing the boundaries of what's possible in document understanding.Technical Autonomy: Significant creative freedom to explore new approaches while maintaining focus on delivering high-quality, production-ready solutions.Growth Opportunities: Be at the forefront of the AI revolution, with ample opportunities to grow alongside our scaling organization. Shape your role based on your interests and strengths.Additional Benefits:Competitive base salary and equity compensationComprehensive medical/dental/vision coverage for you and your familyUnlimited paid time off policyDaily catered lunch and snacks in the San Francisco officeBudget for conferences, research materials, and professional developmentAccess to cutting-edge compute resources and research toolsPursuant to the San Francisco Fair Chance Ordinance, we will consider for employment qualified applicants with arrest and conviction records.LlamaIndex does not accept unsolicited agency resumes. Please do not forward resumes to our jobs alias, employees, or any other organization location. LlamaIndex is not responsible for any fees related to unsolicited resumes.

Resume Suggestions

Highlight relevant experience and skills that match the job requirements to demonstrate your qualifications.

Quantify your achievements with specific metrics and results whenever possible to show impact.

Emphasize your proficiency in relevant technologies and tools mentioned in the job description.

Showcase your communication and collaboration skills through examples of successful projects and teamwork.

Explore More Opportunities