Image/Video/Audio-to-Text Modalities Jobs

Browse 28 Image/Video/Audio-to-Text Modalities jobs on Inference Jobs.

5d ago

Engineering Manager, Multimodal (API)

OpenAI

San Francisco, California, United States (On-site) | $293K – $385K Yearly
6d ago

Research Engineer, Human Understanding

Google DeepMind

Los Angeles, California, United States (On-site) | $174K – $252K Yearly
2w ago

Research, Audio Expertise

Thinking Machines Lab

San Francisco, California, United States (On-site) | $350K – $475K Yearly
2w ago

Research Engineer, Audio

Anthropic

San Francisco, California, United States (Hybrid) | $350K – $500K Yearly
2w ago

Member of Technical Staff - Multimodal Understanding

xAI

Palo Alto, California, United States (On-site) | $180K – $440K Yearly
6d ago

Research Scientist, Gemini Safety

Google DeepMind

Mountain View, California, United States (On-site)
2d ago

Senior Applied Researcher, Audio Understanding

Cartesia

San Francisco, California, United States (On-site) | $200K – $350K Yearly
2w ago

Figure

2w ago

Research, Vision Expertise

Thinking Machines Lab

San Francisco, California, United States (On-site) | $350K – $475K Yearly