ESTsoft said on the 25th that it has advanced the dubbing quality of its artificial intelligence (AI) dubbing subscription service "Perso AI" with a focus on emotional expression and expanded the supported languages.
With this update, Perso AI's AI dubbing service has improved overall in emotional expression, intonation, and utterance timing, enabling dubbing closer to real voices. It supports producing natural and immersive results in various situations such as lines with large emotional swings, emphatic expressions, whispers, and laughter.
In addition, the supported dubbing languages have been expanded to 33, and it strengthened the global content production environment based on recognition of 100 languages. Generation speed has also been reduced compared with before, significantly improving production efficiency.
To improve user experience, a new "VoiceTone card-style selector" was also added. It helps users intuitively choose a voice style that fits the content type—such as education, marketing, and entertainment—improving work convenience.
Perso AI's AI dubbing is an integrated service that automatically handles the entire process when a video is uploaded, from ▲ audio separation ▲ script extraction and context-based translation ▲ emotion-reflecting speech synthesis ▲ frame-level lip-sync ▲ to final video output.
ESTsoft is continuing collaboration with AI voice technology corporations such as ElevenLabs to strengthen global service quality and is expanding its utility in global markets including Germany, Spain, Brazil, and Russia.
Kwon Taek-sun, ESTsoft chief technology officer (CTO), said, "This update is the result of directly reflecting our voice technology capabilities in dubbing quality," and added, "We will continue to enhance the sophistication of vocal expression and multilingual stability so AI dubbing becomes a default tool for content creators."