Helping news, media, brands and institutions leverage our world-class content and cutting-edge services to drive value to their audiences and business.
Small and fast: only 123M parameters. High-quality voice cloning: state-of-the-art performance in speaker similarity, intelligibility, and naturalness. Multi-lingual: support Chinese and English.
Official pytorch code release of "DEEPTalk: Dynamic Emotion Embedding for Probabilistic Speech-Driven 3D Face Animation" @misc{kim2024deeptalkdynamicemotionembedding ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results