1. Home
  2. Jobs
  3. Low-Precision Inference

Low-Precision Inference Jobs

Browse 287 Low-Precision Inference jobs on Inference Jobs.

41-60 of 287 jobs

2wNV

Senior Software Engineer, Deep Learning Inference - TensorRT

NVIDIA

Santa Clara, California, United States (Hybrid)$152k – $287.5k Yearly
1dNV

Senior Software Engineer, Quantized Inference

NVIDIA

Redmond, Washington, United States (On-site)$152k – $287.5k Yearly
2wPE

Inference Engineering Manager

Perplexity

San Francisco, California, United States (On-site)$300k – $385k Yearly
1wTA

Research Engineer, Core ML

Together AI

San Francisco, California, United States (On-site)$200k – $280k Yearly
4dNV

Senior Machine Learning Engineer, Quantized Inference

NVIDIA

Redmond, Washington, United States (On-site)$152k – $287.5k Yearly
2wAI

AI Infrastructure Engineer - Autonomy

Applied Intuition

Sunnyvale, California, United States (On-site)$153k – $222k Yearly
2wPE

UK Internship Program

Perplexity

London, England, United Kingdom (Hybrid)
3wXA

Software Engineer - Applied Inference

xAI

Palo Alto, California, United States (On-site)$180k – $440k Yearly
4dNV

Senior Compiler Engineer, AI Inference Performance

NVIDIA

Santa Clara, California, United States (On-site)$152k – $241.5k Yearly
6dCE

Senior Research Engineer - Inference ML

Cerebras

Sunnyvale, California, United States (Hybrid)
2wHA

LLM Inference Engineer

Hippocratic AI

Palo Alto, California, United States (On-site)
6dAC

Infrastructure Engineer, ML Systems

Applied Compute

San Francisco, California, United States (On-site)
3wXA

Member of Technical Staff, Inference

xAI

Palo Alto, California, United States (On-site)$180k – $440k Yearly
2wNV

Senior Software Engineer - Inference as a Service

NVIDIA

Santa Clara, California, United States (On-site)$200k – $391k Yearly
4wCE

ML API Features SDET

Cerebras

Sunnyvale, California, United States (Hybrid)
6dSC

ML Research Engineer, ML Systems

Scale

San Francisco, California, United States (On-site)$218.4k – $273k Yearly
2wNV

Principal Software Engineer - Inference as a Service

NVIDIA

Santa Clara, California, United States (On-site)$248k – $391k Yearly
2wBA

Software Engineer - Model Performance

Baseten

San Francisco, California, United States (On-site)$150k – $250k Yearly
2wAN

Research Compute Operations

Anthropic

San Francisco, California, United States (Hybrid)$270k – $290k Yearly