1. Home
  2. Jobs
  3. LLM Architecture

LLM Architecture Jobs

Browse 866 LLM Architecture jobs on Inference Jobs.

161-180 of 866 jobs

1wBI

AI Technical Operations Manager

Bioptimus

Île de Ré, Charente-Maritime, France or Remote (European Union)
2wOP

Backend Software Engineer (Evals) – Support Automation Engineering

OpenAI

San Francisco, California, United States (On-site)$255k – $405k Yearly
3dNV

Senior Software Engineer, Quantized Inference

NVIDIA

Redmond, Washington, United States (On-site)$152k – $287.5k Yearly
2wBR

Software Engineer, Product

Braintrust

San Francisco, California, United States (On-site)
1wAN

Manager, Solutions Architect

Anthropic

Seoul, Seoul, South Korea (Hybrid)
1wZU

Staff Engineer (Agentic AI & Data)

Zuma

San Francisco, California, United States (Hybrid)
2wOP

Technical Deployment Lead, Financial Services

OpenAI

New York, New York, United States (Hybrid)$280k – $345k Yearly
1wTM

Research Engineer, Infrastructure, Kernels

Thinking Machines Lab

San Francisco, California, United States (On-site)$350k – $475k Yearly
2wSI
1wCA

Solutions Engineer

Cartesia

San Francisco, California, United States (On-site)$160k – $220k Yearly
3wFU

Senior AI Engineer - Agent Team

FurtherAI

San Francisco, California, United States (On-site)$225k – $300k Yearly
1wAN

TPU Kernel Engineer

Anthropic

San Francisco, California, United States (Hybrid)$280k – $560k Yearly
3wPE

Research Engineering Manager - Model Training

Perplexity

San Francisco, California, United States (On-site)$300k – $470k Yearly
1wAN

Partner Solutions Architect, Applied AI

Anthropic

New York, New York, United States (Hybrid)$280k – $300k Yearly
3wCE

Inference Compiler and Frontend Engineer – Dubai

Cerebras

Dubai, Dubai, United Arab Emirates (On-site)
3wTA

Research Intern, Model Shaping (Summer 2026)

Together AI

San Francisco, California, United States (On-site)
2wRA
1wLA

Applied Research Intern

Labelbox

San Francisco, California, United States (Hybrid)$35 – $45 Yearly