1. Home
  2. Jobs
  3. Low-Precision Inference

Low-Precision Inference jobs

Explore Low-Precision Inference roles on Inference Jobs and apply today.

101-120 of 287 jobs

NV2w

Senior Software Engineer – TensorRT Edge-LLM

NVIDIA

Santa Clara, California, United States (Hybrid)

$152k – $287.5k Yearly

NV3w

Senior Technical Program Manager, Deep Learning Libraries

NVIDIA

Santa Clara, California, United States (On-site)

$168k – $322k Yearly

NV3d

Senior ML Compiler Engineer

NVIDIA

Redmond, Washington, United States (On-site)

$152k – $287.5k Yearly

NV2w

Senior Deep Learning Engineer

NVIDIA

Warszawa, Masovian Voivodeship, Poland (Hybrid)

zł 292.5k – zł 507k Yearly

CO5d

Senior Software Engineer I, Inference

CoreWeave

Sunnyvale, California, United States (Hybrid)

$139k – $204k Yearly

TE4w

Software Engineer, Metal Runtime

Tenstorrent

Santa Clara, California, United States (Hybrid)

$100k – $500k Yearly

NV4w

Senior Power Methodology and Modeling Engineer

NVIDIA

Austin, Texas, United States (On-site)

$136k – $264.5k Yearly

NV2w

Senior Machine Learning Applications and Compiler Engineer

NVIDIA

Santa Clara, California, United States (Hybrid)

$152k – $287.5k Yearly

MO2w

Member of Technical Staff - Product (Backend)

Modal

New York, New York, United States (On-site)

$150k – $270k Yearly

CE3w

Sr. Engineer, Inference Ecosystem Engineering

Cerebras

Sunnyvale, California, United States (On-site)

NV2w

Senior Machine Learning Applications and Compiler Engineer

NVIDIA

Toronto, Ontario, Canada (Hybrid)

C$135k – C$220k Yearly

OP2w

Software Engineer, Model Inference

OpenAI

San Francisco, California, United States (On-site)

$325k – $490k Yearly

SE2w

ML Engineer

Sesame

New York, New York, United States (On-site)

$190k – $320k Yearly

DE5d

Staff Software Engineer, ML Infrastructure

Decagon

San Francisco, California, United States (On-site)

$300k – $430k Yearly

NV5d

Senior Machine Learning Applications and Compiler Engineer

NVIDIA

Cambridge, England, United Kingdom (Hybrid)

CE3w

Engineering Manager, Inference Platform

Cerebras

Sunnyvale, California, United States (On-site)

D-2w

Senior Runtime Software Engineer

d-Matrix

Sydney, New South Wales, Australia (Hybrid)

TE1w

Senior Physical Design Engineer

Tenstorrent

Austin, Texas, United States (Hybrid)

$100k – $500k Yearly

D-2w

Senior Staff ML Researcher - LLM Algorithmic Optimization

d-Matrix

Bengaluru, Karnataka, India (Hybrid)

₹4M – ₹6M Yearly

MO2w

Member of Technical Staff - ML Performance

Modal

New York, New York, United States (On-site)

$150k – $270k Yearly