Large-Scale Models Jobs

Browse 1,032 Large-Scale Models jobs on Inference Jobs.

1w

Research Engineer, Frontier Speculative Decoding

Together AI

San Francisco, California, United States (On-site)

$190k – $270k Yearly

2w

Research Engineer / Research Scientist - Foundations Retrieval Lead

OpenAI

San Francisco, California, United States (Hybrid)

$460k – $555k Yearly

1w

Machine Learning Engineer - Model Evaluations, Public Sector

Scale

San Francisco, California, United States (On-site)

$216.3k – $300.3k Yearly

1w

Research Engineer, Core ML

Together AI

San Francisco, California, United States (On-site)

$200k – $280k Yearly

2w

Senior Member of Technical Staff, Multimodal AI

Cohere

San Francisco, California, United States or Remote (Worldwide)

3d

Senior Software Engineer, Quantized Inference

NVIDIA

Redmond, Washington, United States (On-site)

$152k – $287.5k Yearly

1w

Research, Pre-Training Science

Thinking Machines Lab

San Francisco, California, United States (On-site)

$350k – $475k Yearly

1w

Full Stack LLM Engineer

Cerebras

Toronto, Ontario, Canada (On-site)

1w

AI Infrastructure Engineer, Model Serving Platform

Scale

San Francisco, California, United States (On-site)

$179.4k – $224.3k Yearly

2w

Research Engineer - Midtraining

Periodic Labs

Menlo Park, California, United States (On-site)

1w

Machine Learning Research Scientist / Research Engineer, Post-Training

Scale

San Francisco, California, United States (On-site)

$252k – $315k Yearly

2w

Senior AI Application Developer - GPU and SOC Architecture Modeling

NVIDIA

Santa Clara, California, United States (On-site)

$152k – $287.5k Yearly

2w

Software Engineer, Model Inference

OpenAI

San Francisco, California, United States (On-site)

$325k – $490k Yearly

1w

Software Engineer, Gen AI

Scale

San Francisco, California, United States (Hybrid)

$179.4k – $224.3k Yearly

2w

Senior Full Stack LLM Engineer - Training

Cerebras

Sunnyvale, California, United States (On-site)

5d

Staff Software Engineer, ML Infrastructure

Decagon

San Francisco, California, United States (On-site)

$300k – $430k Yearly

1w

LLM Inference Frameworks and Optimization Engineer

Together AI

San Francisco, California, United States (On-site)

$160k – $230k Yearly

2w

Member of Technical Staff, Modeling

Cohere

London, England, United Kingdom or Remote (Worldwide)