(Wednesday, 31 May 2023. New York City, NY) – One month after making its ground-breaking AI speech-to-speech translator generally available, KUDO, the world leader in real-time multilingual solutions, prepares for the rollout of its AI Engine v.1.5. This update is set to increase the accuracy of KUDO AI’s speech translation capabilities by up to 30%.
KUDO AI (patent pending) is a speech-to-speech translation solution that offers multilingual audio and captioning. This technology lets users listen to speakers in their preferred language, eliminating the need to rely solely on subtitles. A standout feature of the KUDO AI Speech Translator is its capacity to operate in real time, supporting near-simultaneous and uninterrupted translation for a frictionless user experience. This feature has been specifically designed and optimized for live translation of speeches, lectures, and presentations, among others.
Underpinning this capability is our simultaneity module, based on advanced machine learning and NLP techniques, which processes and analyzes the structure of the speech as it unfolds, making informed decisions with each new word spoken. “This task is intricate due to the necessity of maintaining a delicate balance: creating short and compact segments of text to facilitate real-time translation, while also considering that machine translation often delivers better results with longer texts and more context”, says Claudio Fantinuoli, CTO and designer of KUDO AI. Achieving the right balance is crucial to enhancing translation accuracy and elevating the overall user experience.
Given the importance of this balance, considerable R&D effort for version 1.5 has been invested in refining the simultaneity module. The latest version shows a significant accuracy improvement: a 30.42% increase over the previous iteration, measured on English.
Quality is measured by having linguists manually evaluate the output of the simultaneity module on a balanced corpus of speeches in each language. The corpus represents the prototypical inputs for which the application has been designed and comprises several categories of speech: technical presentations, political speeches, and lectures, as well as more challenging ones such as casual talks, speeches rich in disfluencies, and speeches by non-native speakers. This evaluation is performed at each iteration and is used internally to assess improvements to the engine.
Isabel Canovas, Linguistic Analyst, adds: “A roughly 30% reduction of errors in the machine learning-based simultaneity module corresponds to a similar uplift in the overall translation experience”. In this scenario, it is not just the accuracy and precision with which the original message is conveyed in translation that sees a positive impact, but also, and foremost, the grammatical and syntactical naturalness of the translated output.
This update to the AI Engine reflects KUDO’s ongoing plans to advance the sophistication and use cases of their Speech Translator. Currently, KUDO AI is designed for one-directional meetings and events where presenters speak one or more languages to an audience: All Hands meetings, L&D, webinars, global events, etc. Initial adoption of the solution has shown that the need for language access in these use cases remains high.
KUDO is the world leader in providing real-time multilingual solutions that enable people to communicate effortlessly in any language, on any platform. Their network of 12,000 professional language interpreters, combined with their ground-breaking Speech Translator, empowers organizations of all sizes to collaborate more efficiently, with greater inclusivity, and on an international scale. KUDO Inc. is a New York-based technology start-up founded and managed by language and conferencing industry insiders seeking to create a world in which everyone has the power to understand and be understood in their own language. More info at www.kudoway.com.