Language Model Inference Jobs
Browse 569 Language Model Inference jobs on Inference Jobs.
2w
Inference Compiler and Frontend Engineer – Dubai
Cerebras
Dubai, Dubai, United Arab Emirates (On-site)
5d
Research Engineer, Discovery
Anthropic
San Francisco, California, United States (Hybrid)
$340k – $425k Yearly
4w
Python / PyTorch Developer — Frontend Inference Compiler – Dubai
Cerebras
United Arab Emirates (On-site)
4w
Machine Learning Intern - Dynamic KV-Cache Modeling for Efficient LLM Inference
d-Matrix
Campbell, California, United States or Remote (California, United States)
$30 – $59 Hourly
5d
Senior Deep Learning Engineer - Model Evaluation & AI Systems
NVIDIA
Santa Clara, California, United States (On-site)
$224k – $431.3k Yearly
1w
Member of Engineering (Pre-training and inference software)
Poolside
United Kingdom or Remote (Europe, Middle East, and Africa, North America)
1w
Member of Technical Staff, Model Efficiency
Cohere
New York, New York, United States or Remote (New York, United States + 3 more)
5d
Research Engineer, Infrastructure, Inference
Thinking Machines Lab
San Francisco, California, United States (On-site)
$350k – $475k Yearly
2w
Member of Technical Staff - Post-Training
Reflection AI
San Francisco, California, United States (On-site)
4d
Engineering Manager, Inference
Anthropic
San Francisco, California, United States (Hybrid)
$425k – $560k Yearly
4d
Senior Research Scientist, Fundamental LLM Research for Knowledge, Reasoning, and Agents
NVIDIA
Santa Clara, California, United States (On-site)
$224k – $356.5k Yearly