Our Vision

Our mission is to use AI to make video internationalization efficient, expressive, and effortless, so that anyone, anywhere, can share their voice across languages and cultures.

“Empowering every voice to be heard, across every language.”

Technology Spotlight

Multi-modal Recognition

We combine speech and subtitle signals to achieve more accurate and robust transcription. This dual-channel recognition reduces errors and ensures better alignment with the original content, especially in noisy or complex audio scenes.

Emotional Speech

Our voice synthesis models generate expressive, emotionally rich speech that enhances storytelling and viewer engagement. By capturing tone, rhythm, and nuance, we make AI voices feel more human and relatable.

Length-aware Translation and Customization

We optimize translations not just for accuracy but also for timing and pacing, both of which are crucial for keeping the translated voice aligned with the video. Users can further customize tone, length, and phrasing to suit different content needs or audience preferences.

Controlled Video Generation

We enable structured, template-driven video generation with controllable visual elements and transitions. This gives creators both creative freedom and production consistency, reducing manual effort while ensuring high-quality output.

Meet the Team

Jay Wang

Founder & CEO

Ph.D., Twitter, Kuaishou, 19+ years in ML. Author, builder, visionary.

Shengli Li

Founding Architect

20+ years in distributed systems, search & recommendation engine architecture.

Ting Zhang

GTM & Partnerships

Business development strategist focused on B2B partnerships across the entertainment, education, and media industries.

Rafi Ahmed Patel

Founding ML Engineer

MSc UCL. Specializes in TTS, CV, and translation systems.

Ronel Solomon

Founding ML Engineer

MS in Data Science. Expert in generative AI for video and animation.