Attention Optimization jobs
Explore Attention Optimization roles on Inference Jobs and apply today.
141-160 of 304 jobs
Software Engineer, TensorRT Specialized Platforms - New College Grad 2025
NVIDIA
Santa Clara, California, United States (On-site)
$124k – $195.5k Yearly
Senior Developer Technology Engineer, CPU Performance
NVIDIA
Santa Clara, California, United States (Hybrid)
$152k – $287.5k Yearly
Machine Learning Research Intern
d-Matrix
Santa Clara, California, United States (Hybrid)
$30 – $59 Hourly
Principal GPU Memory Architect
NVIDIA
Santa Clara, California, United States (On-site)
$272k – $431.3k Yearly
Senior Research Scientist, Fundamental LLM Research for Knowledge, Reasoning, and Agents
NVIDIA
Santa Clara, California, United States (On-site)
$224k – $356.5k Yearly
Research Engineer - Pretraining
Anthropic
London, England, United Kingdom (Hybrid)
£260k – £630k Yearly
Research Engineer, Notifications
OpenAI
San Francisco, California, United States (Hybrid)
$325k – $590k Yearly
Senior Machine Learning Engineer, Quantized Inference
NVIDIA
Redmond, Washington, United States (On-site)
$152k – $287.5k Yearly
Senior Staff Machine Learning Engineer -Frameworks
d-Matrix
Santa Clara, California, United States (Hybrid)
$155k – $250k Yearly
Member of Technical Staff - Alignment Lead
Reflection AI
San Francisco, California, United States (On-site)
Senior AI Networking Exploration Architect
NVIDIA
Yokneam Ilit, Northern District, Israel (On-site)
Staff Machine Learning Research Engineer, Agent Post-training - Enterprise GenAI
Scale
San Francisco, California, United States (On-site)
$252k – $315k Yearly
Senior Compiler Engineer, AI Inference Performance
NVIDIA
Santa Clara, California, United States (On-site)
$152k – $241.5k Yearly
Member of Technical Staff - Pre-Training
Reflection AI
San Francisco, California, United States (On-site)