Amazon scientists' work from Interspeech 2022
Learn more about the work Amazon researchers presented at Interspeech 2022—the world's largest and most comprehensive conference on the science and technology of spoken-language processing.
Amazon researchers had more than 40 papers accepted, on topics ranging from automatic speech recognition and text-to-speech to acoustic watermarking and automatic dubbing.
Senior applied scientist Penny Karanasou was an area and session chair for Interspeech 2022. Across her career, she has worked on speech recognition, language understanding, and text-to-speech. Find out why cross-pollination of speech-related fields intrigues her and how the conference program reflected that.
Alexa AI senior principal scientist Andreas Stolcke highlighted some speech-related papers, focusing on end-to-end models and fairness. He also wrote about the techniques Amazon scientists are using, like toggling neural blocks on and off, adding multiple CNN front ends to RNN-T models, and adversarial reweighting.
Alexa AI senior principal scientist Gokhan Tur selected papers that covered a wide range of topics in spoken-language understanding—like learning from noisy data, using phonetic embeddings to improve entity resolution, and quantization-aware training.
Senior applied scientist Antonio Bonafonte wrote about work on transferring prosody, accent, and speaker identity in text-to-speech, and about the new ways scientists have used tools like normalizing flows and variational autoencoders.
Get a monthly digest of the latest news, research papers, conferences, and career opportunities at Amazon, by signing up for our newsletter.