Palo Alto, California

Multimodal Speech Engineer, AI Companion

Multimodal Speech Engineer, AI Companion Team | Speech & Multimodal Interfaces
Location: Palo Alto, CA (on-site)

We build humanoid robots that work alongside people to solve labor shortages and create abundance.

The Role
As a Multimodal Speech Engineer on the AI Companion Team, you will lead the development of a real-time conversational speech model that integrates multiple modalities, including vision, spatial audio, and body language. You will collaborate with cross-functional teams to align NEO’s speech with its physical embodiment and personality. This role is central to shaping how users interact with our humanoid robot in intuitive, engaging ways.

Requirements
