Image/Video/Audio-to-Text Modalities Jobs
Browse 28 Image/Video/Audio-to-Text Modalities jobs on Inference Jobs.
28 jobs
5d ago
OP
Software Engineer, Inference - Multi Modal
OpenAI
San Francisco, California, United States (On-site)$295K – $555K Yearly
2d ago
DE
Research Engineer, Multimodal Generative AI (Image/Video)
DeepMind
Kirkland, Washington, United States (On-site)$166K – $244K Yearly
2w ago
PR
AI Trainer - Advanced Video and Image Annotation (US & Canada)
Prolific
United States + 1 more (Remote)Up to $25 Hourly
5d ago
OP
Engineering Manager, Multimodal (API)
OpenAI
San Francisco, California, United States (On-site)$293K – $385K Yearly
2w ago
XA
Member of Technical Staff - Imagine Model
xAI
Palo Alto, California, United States (On-site)$180K – $440K Yearly
6d ago
GD
Research Engineer, Human Understanding
Google DeepMind
Los Angeles, California, United States (On-site)$174K – $252K Yearly
5d ago
CO
Senior Member of Technical Staff, Multimodal AI
Cohere
San Francisco, California, US or Remote (Worldwide)
2w ago
TM
Research, Audio Expertise
Thinking Machines Lab
San Francisco, California, United States (On-site)$350K – $475K Yearly
19h ago
NV
2w ago
AN
Research Engineer, Audio
Anthropic
San Francisco, California, United States (Hybrid)$350K – $500K Yearly
2w ago
XA
Member of Technical Staff - Multimodal Understanding
xAI
Palo Alto, California, United States (On-site)$180K – $440K Yearly
6d ago
GD
2d ago
CA
Senior Applied Researcher, Audio Understanding
Cartesia
San Francisco, California, United States (On-site)$200K – $350K Yearly
2w ago
TM
Research, Vision Expertise
Thinking Machines Lab
San Francisco, California, United States (On-site)$350K – $475K Yearly