1. Home
  2. Jobs
  3. CUDA Programming

CUDA Programming Jobs

Browse 379 CUDA Programming jobs on Inference Jobs.

379 jobs

3dNV

Senior System Software Engineer - CUDA Chips

NVIDIA

Santa Clara, California, United States (On-site)$152k – $287.5k Yearly
2wNV

Senior AI Performance and Efficiency Engineer

NVIDIA

Santa Clara, California, United States (On-site)$184k – $356.5k Yearly
1wCO

Member of Technical Staff, Model Efficiency

Cohere

New York, New York, United States or Remote (New York, United States + 3 more)
5dAI

Senior Sensor Rendering Software Engineer

Applied Intuition

Sunnyvale, California, United States (On-site)$150k – $250k Yearly
5dTM

Research Engineer, Infrastructure, Kernels

Thinking Machines Lab

San Francisco, California, United States (On-site)$350k – $475k Yearly
1wCO

Member of Technical Staff, Modeling

Cohere

London, England, United Kingdom or Remote (Worldwide)
2wNV

Senior Software Engineer, AI Inference Systems

NVIDIA

Santa Clara, California, United States (Hybrid)$184k – $356.5k Yearly
2wNV

Principal System Integration Engineer - Autonomous Vehicles

NVIDIA

Santa Clara, California, United States (On-site)$272k – $431.3k Yearly
3wNV

Senior Applied Deep Learning Research Scientist, Efficiency

NVIDIA

Santa Clara, California, United States (On-site)$192k – $356.5k Yearly
3dNV

Senior Resiliency and Safety Architect, GPU Workloads and Failure Analysis

NVIDIA

Santa Clara, California, United States (On-site)$184k – $356.5k Yearly
2wPE

AI Inference Engineer (San Francisco)

Perplexity

San Francisco, California, United States (On-site)$210k – $385k Yearly
5dTA

LLM Inference Frameworks and Optimization Engineer

Together AI

San Francisco, California, United States (On-site)$160k – $230k Yearly
2wD-

Machine Learning Research Intern

d-Matrix

Santa Clara, California, United States (Hybrid)$30 – $59 Hourly
2wPE

UK Internship Program

Perplexity

London, England, United Kingdom (Hybrid)
2hNV

Senior Systems Software Engineer - Deep Learning Solutions

NVIDIA

Toronto, Ontario, Canada (On-site)C$225k – C$275k Yearly
2wNE

Senior ML Engineer (Token Factory)

Nebius

Amsterdam, North Holland, Netherlands (On-site)
2wNV

Senior Performance Architect - Heterogeneous Workload Optimization

NVIDIA

Santa Clara, California, United States (Hybrid)$184k – $356.5k Yearly