1. Home
  2. Jobs
  3. Attention Optimization

Attention Optimization jobs

Explore Attention Optimization roles on Inference Jobs and apply today.

141-160 of 304 jobs

NV5d

Software Engineer, TensorRT Specialized Platforms - New College Grad 2025

NVIDIA

Santa Clara, California, United States (On-site)

$124k – $195.5k Yearly

NV1w

Senior Developer Technology Engineer, CPU Performance

NVIDIA

Santa Clara, California, United States (Hybrid)

$152k – $287.5k Yearly

NV2w

Electronic Design Automation Intern - 2026

NVIDIA

新竹市, Hsinchu City, Taiwan (On-site)

D-2w

Machine Learning Research Intern

d-Matrix

Santa Clara, California, United States (Hybrid)

$30 – $59 Hourly

NE4w

Senior Applied AI Researcher (Agentic Search)

Nebius

Netherlands + 3 more (Remote)

NV2d

Principal GPU Memory Architect

NVIDIA

Santa Clara, California, United States (On-site)

$272k – $431.3k Yearly

NE1w

Senior ML Engineer (AI Research)

Nebius

Europe + 4 more (Remote)

NV6d

Senior Research Scientist, Fundamental LLM Research for Knowledge, Reasoning, and Agents

NVIDIA

Santa Clara, California, United States (On-site)

$224k – $356.5k Yearly

CE1w

Principal ML Investigator

Cerebras

Sunnyvale, California, United States (On-site)

AN2w

Research Engineer - Pretraining

Anthropic

London, England, United Kingdom (Hybrid)

£260k – £630k Yearly

OP2w

Research Engineer, Notifications

OpenAI

San Francisco, California, United States (Hybrid)

$325k – $590k Yearly

NV5d

Senior Machine Learning Engineer, Quantized Inference

NVIDIA

Redmond, Washington, United States (On-site)

$152k – $287.5k Yearly

D-2w

Senior Staff Machine Learning Engineer -Frameworks

d-Matrix

Santa Clara, California, United States (Hybrid)

$155k – $250k Yearly

RA2w

Member of Technical Staff - Alignment Lead

Reflection AI

San Francisco, California, United States (On-site)

NV1w

Senior AI Networking Exploration Architect 

NVIDIA

Yokneam Ilit, Northern District, Israel (On-site)

SC4w

Staff Machine Learning Research Engineer, Agent Post-training - Enterprise GenAI

Scale

San Francisco, California, United States (On-site)

$252k – $315k Yearly

PO5d

Member of Engineering (Pre-training / CUDA)

Poolside

Europe + 1 more (Remote)

NV2w

Deep Learning Performance Architect - Intern - 2026

NVIDIA

Shanghai, Shanghai, China (On-site)

NV5d

Senior Compiler Engineer, AI Inference Performance

NVIDIA

Santa Clara, California, United States (On-site)

$152k – $241.5k Yearly

RA2w

Member of Technical Staff - Pre-Training

Reflection AI

San Francisco, California, United States (On-site)