Deep Learning Inference Jobs

Browse 555 Deep Learning Inference jobs on Inference Jobs.

21-40 of 555 jobs

4w

Machine Learning Intern - Dynamic KV-Cache Modeling for Efficient LLM Inference

d-Matrix

Campbell, California, United States or Remote (California, United States) · $30 – $59 Hourly

4w

Product Manager - BioNeMo Inference

NVIDIA

New York, New York, United States (On-site) · $168k – $258.8k Yearly

3w

Senior Technical Program Manager, Deep Learning Libraries

NVIDIA

Santa Clara, California, United States (On-site) · $168k – $322k Yearly

2w

Member of Engineering (Inference)

Poolside

United Kingdom or Remote (Europe + 1 more)

3w

Senior Applied Deep Learning Research Scientist, Efficiency

NVIDIA

Santa Clara, California, United States (On-site) · $192k – $356.5k Yearly

3d

Senior Machine Learning Engineer, Quantized Inference

NVIDIA

Redmond, Washington, United States (On-site) · $152k – $287.5k Yearly

2w

Senior Software Research Architect, AI Networking

NVIDIA

Tel Aviv-Yafo, Tel Aviv District, Israel (On-site)

1d

Senior Software Engineer, Quantized Inference

NVIDIA

Redmond, Washington, United States (On-site) · $152k – $287.5k Yearly

6d

ML/AI Engineer

Nebius

Amsterdam, North Holland, Netherlands (On-site)

5d

Senior Software Engineer, AI Inference Systems

NVIDIA

Toronto, Ontario, Canada (Hybrid) · C$170k – C$275k Yearly

6d

LLM Inference Frameworks and Optimization Engineer

Together AI

San Francisco, California, United States (On-site) · $160k – $230k Yearly

2w

LLM Inference Engineer

Periodic Labs

Menlo Park, California, United States (On-site)

2w

Senior AI Software Engineer, GenAI Framework

NVIDIA

Santa Clara, California, United States (On-site) · $152k – $287.5k Yearly

2w

Senior Open-Source Machine Learning Engineer, Computer Vision - EMEA Remote

Hugging Face

Île de Ré, Charente-Maritime, France or Remote (Europe, Middle East, and Africa)

1d

Senior ML Compiler Engineer

NVIDIA

Redmond, Washington, United States (On-site) · $152k – $287.5k Yearly

2w

Research Engineer, Privacy

OpenAI

San Francisco, California, United States (On-site) · $380k – $460k Yearly

6d

Research, Audio Expertise

Thinking Machines Lab

San Francisco, California, United States (On-site) · $350k – $475k Yearly