1. Home
  2. Jobs
  3. Low-Precision Inference

Low-Precision Inference Jobs

Browse 291 Low-Precision Inference jobs on Inference Jobs.

21-40 of 291 jobs

3wCE

Inference Frontend

Cerebras

Sunnyvale, California, United States (On-site)
1wOP

Software Engineer, Inference – AMD GPU Enablement

OpenAI

San Francisco, California, United States (On-site)$325k – $490k Yearly
2wOP

Inference Technical Lead, Sora

OpenAI

San Francisco, California, United States (Hybrid)$380k – $380k Yearly
3dNV

Senior AI Inference Compiler Engineer

NVIDIA

Santa Clara, California, United States (On-site)$152k – $241.5k Yearly
6dVA

GPU Systems Engineer – HPC / Parallel Computing

Vast.ai

San Francisco, California, United States (On-site)$160k – $320k Yearly
1wD-

Analog Design Engineer, Senior Staff

d-Matrix

Santa Clara, California, United States (Hybrid)$196k – $300k Yearly
2wD-

Software Engineering Intern, Developer and Qualification Tools

d-Matrix

Santa Clara, California, United States (Hybrid)$30 – $59 Hourly
1wPO

Member of Engineering (Inference)

Poolside

United Kingdom or Remote (Europe + 1 more)
6dNV

Senior GPU Low Power Architect

NVIDIA

Santa Clara, California, United States (On-site)$136k – $264.5k Yearly
4dCE

Performance Reliability Engineer

Cerebras

Sunnyvale, California, United States (On-site)
6dTM

Research Engineer, Infrastructure, Inference

Thinking Machines Lab

San Francisco, California, United States (On-site)$350k – $475k Yearly
4wD-

Machine Learning Intern - Dynamic KV-Cache Modeling for Efficient LLM Inference

d-Matrix

Campbell, California, United States or Remote (California, United States)$30 – $59 Hourly
2wCR

Staff Product Manager, Managed Inference (SF/Sunnyvale/New York)

Crusoe

San Francisco, California, United States or Remote (California, United States + 1 more)$204k – $247k Yearly
5dAN

Engineering Manager, Inference

Anthropic

San Francisco, California, United States (Hybrid)$425k – $560k Yearly
1wPO

Member of Engineering (Pre-training and inference software)

Poolside

United Kingdom or Remote (Europe, Middle East, and Africa, North America)
2wCE

Senior Full Stack LLM Engineer - Training

Cerebras

Sunnyvale, California, United States (On-site)
3dNV

Senior Compiler Engineer, AI Inference Platforms

NVIDIA

Santa Clara, California, United States (On-site)$152k – $241.5k Yearly
4dMO

Forward Deployed ML Engineer

Modal

New York, New York, United States (On-site)$180k – $250k Yearly