INTERSPEECH: Where speech recognition and synthesis converge

INTERSPEECH: Where speech recognition and synthesis converge

As the start of this year’s Interspeesch is approaching, “Generative AI” has become a guard word in both the machine learning community and the popular press, where it generally refers to models that synthesize text or images. TTS) Models (TTS-to-Tale), which is an important research area at Interspeech, has in some sense always been “generative”. … Read more