FP8/INT8 Quantization Jobs
Browse 14 FP8/INT8 Quantization jobs on Inference Jobs.
14 jobs
3w ago
NV
Senior Deep Learning Algorithms Engineer - BioNeMo
NVIDIA
Ho Chi Minh City, Ho Chi Minh City, Vietnam (On-site)
2w ago
GD
4w ago
TM
Research Engineer, Infrastructure, Numerics
Thinking Machines Lab
San Francisco, California, United States (On-site)$350K – $475K Yearly
4w ago
RU
Member of Technical Staff, Research Engineer (GPU Performance)
Runway
United States (Remote)$270K – $370K Yearly
2w ago
OP
Research-Hardware Codesign Engineer
OpenAI
San Francisco, California, United States (Hybrid)$185K – $455K Yearly
22h ago
TA
Forward Deployed Engineer (Inference & Post-Training)
Together AI
San Francisco, California, United States (On-site)$270K – $300K Yearly
2w ago
BA
Software Engineer - Model Performance
Baseten
San Francisco, California, US or Remote (Worldwide)$180K – $360K Yearly
2w ago
BA
GPU Kernel Engineer
Baseten
San Francisco, California, US or Remote (United States)$180K – $360K Yearly
3w ago
NV
Senior Performance Engineer - LLM Inference Frameworks
NVIDIA
Yokne'am, Northern District, Israel (Hybrid)
4w ago
NV
4w ago
AI
Embedded AI Engineer – Android Automotive (On-Device Intelligence)
Applied Intuition
Sunnyvale, California, United States (On-site)$150K – $250K Yearly
1w ago
NV
Senior Software Engineer, Deep Learning Inference
NVIDIA
Tel Aviv-Yafo, Tel Aviv District, Israel (On-site)$150K – $200K Yearly
4w ago
TA
LLM Inference Frameworks and Optimization Engineer
Together AI
San Francisco, California, United States (On-site)$160K – $230K Yearly