1. Home
  2. Jobs
  3. XLA/MLIR

XLA/MLIR Jobs

Browse 80 XLA/MLIR jobs on Inference Jobs.

61-80 of 80 jobs

2wOP

Research-Hardware Codesign Engineer

OpenAI

San Francisco, California, United States (Hybrid)$230k – $460k Yearly
2wPE

AI Inference Engineer (San Francisco)

Perplexity

San Francisco, California, United States (On-site)$210k – $385k Yearly
1wAN

TPU Kernel Engineer

Anthropic

San Francisco, California, United States (Hybrid)$280k – $560k Yearly
2wMO

Member of Technical Staff - ML Performance

Modal

New York, New York, United States (On-site)$150k – $270k Yearly
3dNV

Senior Systems Software Engineer - Deep Learning Solutions

NVIDIA

Toronto, Ontario, Canada (On-site)C$225k – C$275k Yearly
3wD-

Senior Staff Machine Learning Engineer -Frameworks

d-Matrix

Santa Clara, California, United States (Hybrid)$155k – $250k Yearly
1wTA

Research Engineer, Core ML

Together AI

San Francisco, California, United States (On-site)$200k – $280k Yearly
1wTE

Software Engineer

Tenstorrent

東京都, Tokyo Prefecture, Japan (On-site)
2wHA

LLM Inference Engineer

Hippocratic AI

Palo Alto, California, United States (On-site)
2wPL

LLM Inference Engineer

Periodic Labs

Menlo Park, California, United States (On-site)
2wNV

Senior Deep Learning Engineer

NVIDIA

Warszawa, Masovian Voivodeship, Poland (Hybrid)zł 292.5k – zł 507k Yearly
2wD-

Machine Learning Research Intern

d-Matrix

Santa Clara, California, United States (Hybrid)$30 – $59 Hourly
2wNV

Senior Software Engineer – TensorRT Edge-LLM

NVIDIA

Santa Clara, California, United States (Hybrid)$152k – $287.5k Yearly
6dCE

Kernel Engineer

Cerebras

Sunnyvale, California, United States (On-site)
3wNV

Senior Software Engineer, AI Inference Systems

NVIDIA

Santa Clara, California, United States (Hybrid)$184k – $356.5k Yearly
1wTA

Machine Learning Engineer - Inference

Together AI

San Francisco, California, United States (On-site)$160k – $230k Yearly
1wNV

Raytracing Compiler Engineer - Developer and Performance Technology

NVIDIA

Santa Clara, California, United States (On-site)$184k – $356.5k Yearly
4wD-

Machine Learning Intern - Dynamic KV-Cache Modeling for Efficient LLM Inference

d-Matrix

Campbell, California, United States or Remote (California, United States)$30 – $59 Hourly
3wNV

Senior Deep Learning Performance Architect

NVIDIA

California, United States (Hybrid)$152k – $287.5k Yearly