LLM APIs Jobs

Browse 821 LLM APIs jobs on Inference Jobs.

4w

ML API Features SDET

Cerebras

Sunnyvale, California, United States (Hybrid)
2w

Member of Technical Staff

Interaction

San Francisco, California, United States (On-site)
2w

Senior/Staff Machine Learning Engineer, General Agents, Enterprise GenAI

Scale

San Francisco, California, United States (On-site) $218k – $273k Yearly
2w

Python OSS Engineer

LangChain

San Francisco, California, United States (On-site) $160k – $225k Yearly
2w

AI Inference Engineer (San Francisco)

Perplexity

San Francisco, California, United States (On-site) $210k – $385k Yearly
2w

Sr. Engineer, Inference Ecosystem Engineering

Cerebras

Sunnyvale, California, United States (On-site)
3w

Software Engineer, Encoding Libraries

Anthropic

San Francisco, California, United States (Hybrid) $300k – $405k Yearly
1w

Forward Deployed Engineer - Portuguese Speaking

HappyRobot

Madrid, Madrid, Spain or Remote (Madrid, Spain)
1w

Forward Deployed Engineer

HappyRobot

Paris, Paris, France or Remote (France)
1w

Full Stack Software Engineer, GTM Innovation

OpenAI

San Francisco, California, United States (On-site) $255k – $405k Yearly
2w

Applied AI, AI Engineer for Mistral

Mistral AI

Île de Ré, Charente-Maritime, France (On-site)
3w

Senior Grok Engineer

xAI

Palo Alto, California, United States (On-site) $180k – $440k Yearly
4w

Applied AI Engineer – Agentic Workflows (Korea)

Cohere

Seoul, Seoul, South Korea or Remote (South Korea)
5d

Solutions Architect, Applied AI

Anthropic

San Francisco, California, United States (Hybrid) $240k – $270k Yearly
5d

LLM Inference Frameworks and Optimization Engineer

Together AI

San Francisco, California, United States (On-site) $160k – $230k Yearly
5d

Lead DX Engineer - Documentation (SF / NYC)

Together AI

San Francisco, California, United States (On-site) $160k – $240k Yearly
1w

TLM, Machine Learning, Integrity

OpenAI

San Francisco, California, United States (On-site) $405k – $490k Yearly
2w

Full Stack Software Engineer - Applied AI

Perplexity

San Francisco, California, United States (On-site) $210k – $385k Yearly
3w

Member of Technical Staff, Voice (Enterprise)

xAI

London, England, United Kingdom (On-site) $180k – $440k Yearly
2w

Forward Deployed Engineer

HappyRobot

København, Capital Region of Denmark, Denmark or Remote (Denmark)