How Alexa learned to speak with an Irish accent

In the last five years, speech synthesis technology has moved to all-neural models that allow the different elements of speech (prosody, accent, language, and speaker identity, or voice) to be controlled independently. It is the technology that allowed the Amazon Text-to-Speech group to teach the feminine-sounding, English-language Alexa voice to speak perfectly accented U.S. Spanish and the …

INTERSPEECH: Where speech recognition and synthesis converge

As the start of this year’s Interspeech approaches, “generative AI” has become a watchword in both the machine learning community and the popular press, where it generally refers to models that synthesize text or images. Text-to-speech (TTS) models, which are an important research area at Interspeech, have in some sense always been “generative”. …