SSW11: The 11th ISCA Speech Synthesis Workshop https://www.hotelnautis.hu/en Gárdony (by Lake Velence), Hungary, August 26-28, 2021 |
Conference website | https://ssw11.hte.hu/en/ |
Abstract registration deadline | April 23, 2021 |
Submission deadline | May 3, 2021 |
Notification of acceptance | June 14, 2021 |
Camera-ready final paper submission | June 28, 2021 |
Early-bird registration ends | July 12, 2021 |
Late registration ends | August 16, 2021 |
Major challenges call for major meetings: the Speech Synthesis Workshops (SSWs), which are held every three years under the auspices of ISCA's SynSIG. In 2019 it was decided to have an SSW every two years, since the technology is advancing faster these days. SSWs provide a unique occasion for people in the speech synthesis area to meet each other. They contribute to establishing a feeling that we are all participating in a joint effort towards intelligible, natural, and expressive synthetic speech.
Submission Guidelines
All papers must be original and not simultaneously submitted to another journal or conference. Papers in all areas of speech synthesis technology are encouraged to be submitted.
Call for Demos
We are planning to have a demo session to showcase new developments in speech synthesis. If you have some demonstrations of your work that does not really fit in a regular oral or poster presentation, please let us know.
List of Topics
- Including but not limited to:
- Grapheme-to-phoneme conversion for synthesis
- Text processing for speech synthesis (text normalization, syntactic and semantic analysis, intent detection)
- Segmental-level and/or concatenative synthesis
- Signal processing/statistical model for synthesis
- Speech synthesis paradigms and methods; articulatory synthesis, articulation-to-speech synthesis, parametric synthesis etc.
- Prosody modeling, transfer and generation
- Expression, emotion and personality generation
- Voice conversion and modification, morphing (parallel and non-parallel)
- Concept-to-speech conversion speech synthesis in dialog systems
- Avatars and talking faces
- Cross-lingual and multilingual aspects for synthesis (e.g. automatic language switching)
- Applications of synthesis technologies to communication disorders
- TTS for embedded devices and computational issues
- Tools and data for speech synthesis
- Quality assessment/evaluation metrics in synthesis
- End-to-end text-to-speech synthesis
- Direct speech waveform modelling and generation
- Neural vocoding for speech synthesis
- Speech synthesis using non-ideal data ('found', user-contributed, etc.)
- Natural language generation for speech synthesis
- Special topic: Speech uniqueness and deep learning (generating diverse and natural speech)
Committees
Scientific Committee
Nagaraj Adiga | University of Crete |
Gerard Bailly | GIPSA-Lab |
Pallavi Baljekar | |
Roberto Barra-Chicote | |
Timo Baumann | University of Hamburg |
Antonio Bonafonte | Universitat Politècnica de Catalunya |
Robert Clark | |
Erica Cooper | National Institute of Informatics |
Tamás Gabor Csapo | Budapest University of Technology and Economic |
Daniel Erro | Cirrus Logic |
Raul Fernandez | IBM |
Philip N. Garner | Idiap Research Institute |
Balint Gyires-Toth | Budapest University of Technology and Economic |
Gustav Eje Henter | KTH Royal Institute of Technology |
Esther Klabbers | ReadSpeaker |
Zhen-Hua Ling | University of Science and Technology of China |
Damien Lolive | IRISA/ Université Rennes 1 |
Sébastien Le Maguer | Trinity College Dublin / Adapt Centre |
Jindrich Matousek | University of West Bohemia |
Thomas Merritt | Amazon |
Bernd Möbius | Saarland University |
Eva Navas | University of the Basque Country |
Géza Németh | Budapest University of Technology and Economic |
Yamato Ohtani | AI Inc., |
Michael Pucher | Acoustics Research Institute |
Francesc Alias Pujol | La Salle - Universitat Ramon Llull |
Tuomo Raitio | Apple Inc |
Manuel Sam Ribeiro | The University of Edinburgh |
Srikanth Ronanki | Amazon |
Andrew Rosenberg | |
Milan Secujski | University of Novi Sad |
Adriana Stan | Technical University of Cluj-Napoca |
Eva Szekely | KTH Royal Institute of Technology |
Tomoki Toda | Nagoya University |
Markus Toman | Neuratec |
Jaime Lorenzo Trueba | Universidad Politecnica de Madrid |
Pirros Tsiakoulis | Samsung Electronics |
Junichi Yamagishi | The University of Edinburgh |
Csaba Zainkó | Budapest University of Technology and Economic |
Heiga Zen | |
Yi Zhao | National Institute of Informatics |
Organizing Committee
Géza Németh Chairman BME TMIT, Hungary |
Junichi Yamagishi National Institute of Informatics Japan,University of Edinburgh, UK |
Sébastien Le Maguer ADAPT Centre/TCD, Ireland |
Esther Klabbers Readspeaker, Netherlands |
|
Mátyás Bartalis BME TMIT, Hungary |
Tamás Gábor Csapó BME TMIT, Hungary |
Bálint Gyires-Tóth BME TMIT, Hungary |
Gábor Olaszy BME TMIT, Hungary |
Csaba Zainkó BME TMIT, Hungary |
Keynotes
-
Thomas Drugman, Amazon, Germany
-
Expressive Neural TTS
-
Thomas Drugman is a Science Manager in Amazon TTS Research team. He received his PhD in 2011 from the University of Mons, winning the IBM Belgium award for “Best Thesis in Computer Science”. His PhD thesis studied the use of glottal source analysis in Speech Processing. He then made a 3-year post-doc on speech/audio analysis for two biomedical applications: trachea-esophageal speech reconstruction and cough detection in chronic respiratory diseases. In 2014, he joined Amazon as a Scientist in the Alexa ASR team. He then transferred to the TTS team in 2016, where he is Science Manager since 2017. He has contributed in making Amazon’s Neural TTS more natural and expressive, notably by enriching Alexa’s experience with different speaking styles: emotions, newscaster, whispering, etc. His current research interests lie in improving the naturalness and flow of longer synthetic speech interactions. He has about 125 publications in the field of Speech Processing. He got the Interspeech Best Student Paper awards in 2009 and 2014 (as supervisor). He is also member of the IEEE Speech and Language Technical Committee since 2019.
- István Winkler, Research Centre for Natural Sciences, Hungary,
- Early Development of Infantile Communication by Sound
- István Winkler, PhD, DSc, electrical engineer, psychologist. He received his PhD in 1993 at the University of Helsinki, studying auditory sensory memory by electroencephalographic measures. He defended his Doctor of Science thesis in 2005 at the Hungarian Academy of Sciences on auditory deviance detection. His current fields of interest are predictive processing in the auditory deviance detection, auditory scene analysis, communication by sound, and the development of these functions in infancy. During his career, he has authored/coauthored over 250 publications, which received over 11000 references. Currently he is the director of the Institute of Cognitive Neuroscience and Psychology, Research Centre for Natural Sciences, Budapest, Hungary and the head of the Sound and Speech Perception research group (http://www.ttk.hu/kpi/en/sound-and-speech-perception/).
- Lior Wolf, Facebook AI Research and Tel Aviv University, Israel
- Deep Audio Conversion Technologies and Their Applications in Speech, Singing, and Music
- Lior Wolf is a research scientist at Facebook AI Research and a full professor in the School of Computer Science at Tel-Aviv University, Israel. He conducted postdoctoral research at prof. Poggio's lab at the Massachusetts Institute of Technology and received his PhD degree from the Hebrew University, under the supervision of Prof. Shashua. He is an ERC grantee and has won the ICCV 2001 and ICCV 2019 honorable mention, and the best paper awards at ECCV 2000 and ICANN 2016. His research focuses on computer vision, audio synthesis, and deep learning.
Venue
VITAL HOTEL NAUTIS **** wellness and conference hotel in Gárdony, the capital of Lake Velence directly on the lakeshore, next to the port and the beach.Lake Velence is the third largest freshwater lake in Hungary. It is situated halfway between Lake Balaton and Budapest near the highway M7 connecting the Hungarian capital and Lake Balaton. The lake which is always different in appearance captivates the visitors in every season. In summer it offers opportunities to bathe, provides an ideal place for surfing and waterski lovers and attracts with numerous programmes taking place near the lakeshore.
Contact
All questions about submissions should be emailed to ssw11@hte.hu