
AI Scientist - Palo Alto (Internship, PhD)

Mistral AI
Posted on: Feb 23, 2026
Location: Palo Alto, California, United States (On-site)
Employment type: Contract
About Mistral 
At Mistral AI, we believe in the power of AI to simplify tasks, save time, and enhance learning and creativity. Our technology is designed to integrate seamlessly into daily working life.
We democratize AI through high-performance, optimized, open-source and cutting-edge models, products and solutions. Our comprehensive AI platform is designed to meet enterprise needs, whether on-premises or in cloud environments. Our offerings include le Chat, the AI assistant for life and work.
We are a dynamic, collaborative team passionate about AI and its potential to transform society.
Our diverse workforce thrives in competitive environments and is committed to driving innovation. Our teams are distributed between France, USA, UK, Germany and Singapore. We are creative, low-ego and team-spirited.
Join us to be part of a pioneering company shaping the future of AI. Together, we can make a meaningful impact. See more about our culture on https://mistral.ai/careers.
Mistral AI is hiring experts in pre-training and fine-tuning large language models.
Role Summary 
-You will work with the fine-tuning team on building state-of-the-art generative models.
-You will run autonomous work streams under the supervision of experienced scientists.
-The role is based in our Bay Area offices.
-Internship duration: 3 to 6 months. We will only consider candidates looking for end-of-studies internships (PhD).
What you will do
-Explore state-of-the-art algorithms for fine-tuning LLMs, under the supervision of top-level scientists.
-Assist in the design and implementation of machine learning models and algorithms.
-Conduct research on the latest advancements in natural language processing and LLMs.
-Contribute to the development and optimization of our LLM systems.
-Collaborate with cross-functional teams to integrate LLM technologies into various applications.
-Perform data analysis and visualization to support research and development efforts.
-Document research findings and contribute to technical reports and publications.
-Participate in team meetings and brainstorming sessions to share ideas and insights.
About you
-Currently pursuing a PhD at a tier 1 engineering school or university.
-Strong scientific understanding of the field of generative AI.
-Broad knowledge of the field of AI, and specific knowledge or interest in fine-tuning and using language models for applications.
-Strong programming skills in Python, with experience in libraries such as TensorFlow, PyTorch, or similar.
-Familiarity with natural language processing techniques and machine learning algorithms.
-Ability to design complex software and make it usable in production.
-Ability to navigate the full MLOps technical stack, with a focus on architecture development and model evaluation and usage.
-Previous experience with LLMs or related technologies.
-Knowledge of deep learning frameworks and techniques.
-Experience with version control systems (e.g., Git) and Linux shell environments.
Now, it would be ideal if you:
-Have experience in fine-tuning LLMs.
-Have used complex HPC infrastructure with full autonomy.

Mistral AI is a French AI company founded in 2023 that builds open-weight, frontier AI models to democratize artificial intelligence access for enterprises worldwide.
