1. Home
  2. Jobs
  3. AI Inference Accelerator

AI Inference Accelerator Jobs

Browse 638 AI Inference Accelerator jobs on Inference Jobs.

638 jobs

1wD-

Analog Design Engineer, Senior Staff

d-Matrix

Santa Clara, California, United States (Hybrid)$196k – $300k Yearly
5dAN

Senior Software Engineer, Inference

Anthropic

Dublin, Dublin, Ireland (Hybrid)€235k – €295k Yearly
4dAN

Engineering Manager, Inference

Anthropic

San Francisco, California, United States (Hybrid)$425k – $560k Yearly
1wD-

AI Security Architect, Principal

d-Matrix

Santa Clara, California, United States or Remote (United States)$220k – $300k Yearly
2wPE

AI Inference Engineer (San Francisco)

Perplexity

San Francisco, California, United States (On-site)$210k – $385k Yearly
2wPE

AI Inference Engineer (London)

Perplexity

London, England, United Kingdom (On-site)
4dAN

Engineering Manager, ML Acceleration

Anthropic

San Francisco, California, United States (Hybrid)$425k – $560k Yearly
1wD-

Senior Staff ML Researcher - LLM Algorithmic Optimization

d-Matrix

Bengaluru, Karnataka, India (Hybrid)₹4M – ₹6M Yearly
3dNV

Principal Software Engineer - AI Inference

NVIDIA

Santa Clara, California, United States (On-site)$272k – $431.3k Yearly
3dNV

Senior AI Inference Compiler Engineer

NVIDIA

Santa Clara, California, United States (On-site)$152k – $241.5k Yearly
2wCE

Deployment Engineer, AI Inference

Cerebras

Sunnyvale, California, United States (On-site)
2wNV

Senior Software Research Architect, AI Networking

NVIDIA

Tel Aviv-Yafo, Tel Aviv District, Israel (On-site)
3dCR

Research Engineer

Crusoe

Tel Aviv-Yafo, Tel Aviv District, Israel (On-site)
2wCE

Inference Compiler and Frontend Engineer – Dubai

Cerebras

Dubai, Dubai, United Arab Emirates (On-site)
3wCO

Software Engineer, Inference AI/ML

CoreWeave

Sunnyvale, California, United States (Hybrid)$92k – $135k Yearly
1wOP

Inference Technical Lead, Sora

OpenAI

San Francisco, California, United States (Hybrid)$380k – $380k Yearly
4wD-

Machine Learning Intern - Dynamic KV-Cache Modeling for Efficient LLM Inference

d-Matrix

Campbell, California, United States or Remote (California, United States)$30 – $59 Hourly
1wNV

Senior ML Framework Performance Engineer - AI for Science at Scale

NVIDIA

Santa Clara, California, United States (On-site)$184k – $287.5k Yearly