1. Home
  2. Jobs
  3. Low-Precision Inference

Low-Precision Inference Jobs

Browse 294 Low-Precision Inference jobs on Inference Jobs.

61-80 of 294 jobs

3wNV

Platform Architecture Engineer, GeForce NOW

NVIDIA

Santa Clara, California, United States (On-site)$184k – $287.5k Yearly
3wNV

Low Power ASIC Engineer - New College Grad 2026

NVIDIA

Santa Clara, California, United States (On-site)$100k – $189.8k Yearly
2wMA

Research Engineer

Magic

San Francisco, California, United States (On-site)$225k – $550k Yearly
3wCO

Software Engineer, Inference AI/ML

CoreWeave

Sunnyvale, California, United States (Hybrid)$92k – $135k Yearly
3dNV

Principal Software Engineer - AI Inference

NVIDIA

Santa Clara, California, United States (On-site)$272k – $431.3k Yearly
6dVA

Systems/GPU Research Engineer

Vast.ai

San Francisco, California, United States (On-site)$160k – $320k Yearly
2wSE

ML Model Serving Engineer

Sesame

San Francisco, California, United States (On-site)$175k – $280k Yearly
3dNV

Senior Compiler Engineer - AI

NVIDIA

Santa Clara, California, United States (On-site)$184k – $287.5k Yearly
1wOP

Software Engineer, Load Balancing - Inference

OpenAI

San Francisco, California, United States (On-site)$325k – $490k Yearly
2wNV

Senior Deep Learning Performance Architect

NVIDIA

California, United States (Hybrid)$152k – $287.5k Yearly
4wNV

Technical Marketing Engineer, World Models - AV Physical AI

NVIDIA

Santa Clara, California, United States (On-site)$148k – $287.5k Yearly
2wCO

Product Marketing Manager, CoreWeave Inference

CoreWeave

Livingston, New Jersey, United States (Hybrid)$143k – $210k Yearly
5dNV

Power and Performance Engineer - System Memory

NVIDIA

Santa Clara, California, United States (Hybrid)$168k – $310.5k Yearly
5dCO

Director of Engineering, Inference Services

CoreWeave

Sunnyvale, California, United States (Hybrid)$206k – $303k Yearly
2wNE

Senior ML Engineer (Token Factory)

Nebius

Amsterdam, North Holland, Netherlands (On-site)
2wCO

Member of Technical Staff, Model Efficiency

Cohere

New York, New York, United States or Remote (New York, United States + 3 more)
2wBA

Engineering Manager - Forward Deployed Engineering (LLM)

Baseten

San Francisco, California, United States (On-site)$220k – $285k Yearly