Palo Alto, California
Multimodal Speech Engineer, AI Companion
Multimodal Speech Engineer, AI Companion Team | Speech & Multimodal Interfaces
Location: Palo Alto, CA (on-site)
We build humanoid robots that work alongside people to solve labor shortages and create abundance.
The Role
As a Multimodal Speech Engineer on the AI Companion Team, you will lead the development of a real-time conversational speech model that integrates multiple modalities including vision, spatial audio, and body language. You will collaborate with cross-functional teams to align NEO’s speech with its physical embodiment and personality. This is a key role in shaping how users interact with our humanoid robot in intuitive, engaging ways.