Our Vision
Our mission is to leverage AI technology to make video internationalization efficient, expressive, and effortless—empowering anyone, anywhere, to share their voice across languages and cultures.
Technology Spotlight

Multi-modal Recognition
We combine speech and subtitle signals to achieve more accurate and robust transcription. This dual-channel recognition reduces errors and ensures better alignment with the original content, especially in noisy or complex audio scenes.
Emotional Speech
Our voice synthesis models generate expressive, emotionally rich speech that enhances storytelling and viewer engagement. By capturing tone, rhythm, and nuance, we make AI voices feel more human and relatable.
Length-aware Translation and Customization
We optimize translations not just for accuracy, but also for timing and pacing—crucial for video and voice alignment. Users can further customize tone, length, and phrasing to suit different content needs or audience preferences.
Controlled Video Generation
We enable structured, template-driven video generation with controllable visual elements and transitions. This gives creators both creative freedom and production consistency, reducing manual effort while ensuring high-quality output.
Meet the Team

Shengli Li
Founding Architect
20+ years in distributed systems, search & recommendation engine architecture.

Ting Zhang
GTM & Partnerships
Biz dev strategist on B2B partnerships across entertainment, education, and media industries.