LinkedIn

YouTube

Instagram

TikTok

Multimodal Speech Engineer, AI Companion Team | Speech &amp; Multimodal Interfaces Location: Palo Alto, CA (on-site)We build humanoid robots that work alongside people to solve labor shortages and create abundance.The Role As a Multimodal Speech Engineer on the AI Companion Team, you will lead the development of a real-time conversational speech model that integrates multiple modalities including vision, spatial audio, and body language. You will collaborate with cross-functional teams to align NEO’s speech with its physical embodiment and personality. This is a key role in shaping how users interact with our humanoid robot in intuitive, engaging ways.

Palo Alto, CA

Consider an interaction where you walk up to a person, hand them an object, and then ask them to describe what you’ve given them. How fast can a person do this? How would you get NEO to listen, react, and respond at 1X human speed? How could you make this as fast and intelligent as possible? 

Based on the very little you know about us, what do you think are our most complex technical challenges?

What is your annual salary expectations in USD?  

Where did you find out about us? Please indicate person and place.

This job is located in Palo Alto, California. Are you already based in California or will you be relocating? 

Would you be able to work without visa sponsorship now and in the future? 

If offered a job, how quickly would you be able to start? Please specify if you have any notice period at current work. 

I grant permission for 1X to contact me directly regarding potential future job opportunities.

By submitting this application, I acknowledge that I have reviewed the Privacy Policy and consent to 1X storing my personal information for the purpose of processing my job application.

Multimodal Speech Engineer, AI Companion Team | Speech &amp; Multimodal InterfacesLocation: Palo Alto, CA (on-site)We build humanoid robots that work alongside people to solve labor shortages and crea

Palo Alto, California

Multimodal Speech Engineer, AI Companion

Requirements