The digital characters are available in half bodies or full bodies, and the service is available in both Chinese and English. Some aspects, like background and tone, are customizable. The videos avoid the flat intonation and single speech rhythm that plagues traditional acoustic models by using an in-house small-sample timbre customization technology that relies on deep learning acoustic models and neural network vocoders. […] Tencent offers five styles for its digital humans: 3D realistic, 3D semi-realistic, 3D cartoon, 2D real person, and 2D cartoon. Customized Q&As can be created for the digital human, turning them into a type of deepfaked chatbot.
Categories: Leben (Life aka misc)Technology