Low-Precision Inference Jobs

Browse 294 Low-Precision Inference jobs on Inference Jobs.

5d

TPU Kernel Engineer

Anthropic

San Francisco, California, United States (Hybrid) $280k – $560k Yearly
5d

Research Engineer, Infrastructure, Kernels

Thinking Machines Lab

San Francisco, California, United States (On-site) $350k – $475k Yearly
5d

Inference Runtime, Engineering Manager

OpenAI

San Francisco, California, United States (On-site) $455k – $555k Yearly
5d

Research, Audio Expertise

Thinking Machines Lab

San Francisco, California, United States (On-site) $350k – $475k Yearly
2w

Software Engineer, Technical Lead, Inference

Mistral AI

Île de Ré, Charente-Maritime, France (Hybrid)
2w

LLM Inference Engineer

Periodic Labs

Menlo Park, California, United States (On-site)
2w

Senior AI Software Engineer, GenAI Framework

NVIDIA

Santa Clara, California, United States (On-site) $152k – $287.5k Yearly
2w

Inference Compiler and Frontend Engineer – Dubai

Cerebras

Dubai, Dubai, United Arab Emirates (On-site)
4w

Product Manager - BioNeMo Inference

NVIDIA

New York, New York, United States (On-site) $168k – $258.8k Yearly
2w

Deployment Engineer, AI Inference

Cerebras

Sunnyvale, California, United States (On-site)
5d

Machine Learning Engineer - Inference

Together AI

San Francisco, California, United States (On-site) $160k – $230k Yearly
3d

Research Engineer

Crusoe

Tel Aviv-Yafo, Tel Aviv District, Israel (On-site)
2w

AI Inference Engineer (San Francisco)

Perplexity

San Francisco, California, United States (On-site) $210k – $385k Yearly
5d

LLM Inference Frameworks and Optimization Engineer

Together AI

San Francisco, California, United States (On-site) $160k – $230k Yearly
2w

AI Inference Engineer (London)

Perplexity

London, England, United Kingdom (On-site)