CFP

SSW11: The 11th ISCA Speech Synthesis Workshop

https://www.hotelnautis.hu/en

Gárdony (by Lake Velence), Hungary, August 26-28, 2021

Conference website	https://ssw11.hte.hu/en/
Abstract registration deadline	April 23, 2021
Submission deadline	May 3, 2021
Notification of acceptance	June 14, 2021
Camera-ready final paper submission	June 28, 2021
Early-bird registration ends	July 12, 2021
Late registration ends	August 16, 2021

Topics: speech synthesis, voice conversion, neural vocoding, cross-lingual and multilingual

Major challenges call for major meetings: the Speech Synthesis Workshops (SSWs), held under the auspices of ISCA's SynSIG, traditionally took place every three years. In 2019 it was decided to hold an SSW every two years, since the technology is now advancing faster. SSWs provide a unique occasion for people working in speech synthesis to meet each other. They help establish the feeling that we are all participating in a joint effort towards intelligible, natural, and expressive synthetic speech.

Submission Guidelines

All papers must be original and must not be simultaneously submitted to another journal or conference. Submissions in all areas of speech synthesis technology are encouraged.

Call for Demos

We are planning a demo session to showcase new developments in speech synthesis. If you have a demonstration of your work that does not fit a regular oral or poster presentation, please let us know.

List of Topics

Including but not limited to:
Grapheme-to-phoneme conversion for synthesis
Text processing for speech synthesis (text normalization, syntactic and semantic analysis, intent detection)
Segmental-level and/or concatenative synthesis
Signal processing/statistical model for synthesis
Speech synthesis paradigms and methods: articulatory synthesis, articulation-to-speech synthesis, parametric synthesis, etc.
Prosody modeling, transfer and generation
Expression, emotion and personality generation
Voice conversion and modification, morphing (parallel and non-parallel)
Concept-to-speech conversion, speech synthesis in dialog systems
Avatars and talking faces
Cross-lingual and multilingual aspects for synthesis (e.g. automatic language switching)
Applications of synthesis technologies to communication disorders
TTS for embedded devices and computational issues
Tools and data for speech synthesis
Quality assessment/evaluation metrics in synthesis
End-to-end text-to-speech synthesis
Direct speech waveform modelling and generation
Neural vocoding for speech synthesis
Speech synthesis using non-ideal data ('found', user-contributed, etc.)
Natural language generation for speech synthesis
Special topic: Speech uniqueness and deep learning (generating diverse and natural speech)

Committees

Scientific Committee

Nagaraj Adiga	University of Crete
Gerard Bailly	GIPSA-Lab
Pallavi Baljekar	Google
Roberto Barra-Chicote
Timo Baumann	University of Hamburg
Antonio Bonafonte	Universitat Politècnica de Catalunya
Robert Clark	Google
Erica Cooper	National Institute of Informatics
Tamás Gábor Csapó	Budapest University of Technology and Economics
Daniel Erro	Cirrus Logic
Raul Fernandez	IBM
Philip N. Garner	Idiap Research Institute
Bálint Gyires-Tóth	Budapest University of Technology and Economics
Gustav Eje Henter	KTH Royal Institute of Technology
Esther Klabbers	ReadSpeaker
Zhen-Hua Ling	University of Science and Technology of China
Damien Lolive	IRISA / Université Rennes 1
Sébastien Le Maguer	Trinity College Dublin / Adapt Centre
Jindrich Matousek	University of West Bohemia
Thomas Merritt	Amazon
Bernd Möbius	Saarland University
Eva Navas	University of the Basque Country
Géza Németh	Budapest University of Technology and Economics
Yamato Ohtani	AI Inc.
Michael Pucher	Acoustics Research Institute
Francesc Alias Pujol	La Salle - Universitat Ramon Llull
Tuomo Raitio	Apple Inc
Manuel Sam Ribeiro	The University of Edinburgh
Srikanth Ronanki	Amazon
Andrew Rosenberg	Google
Milan Secujski	University of Novi Sad
Adriana Stan	Technical University of Cluj-Napoca
Eva Szekely	KTH Royal Institute of Technology
Tomoki Toda	Nagoya University
Markus Toman	Neuratec
Jaime Lorenzo Trueba	Universidad Politecnica de Madrid
Pirros Tsiakoulis	Samsung Electronics
Junichi Yamagishi	The University of Edinburgh
Csaba Zainkó	Budapest University of Technology and Economics
Heiga Zen	Google
Yi Zhao	National Institute of Informatics

Organizing Committee

Géza Németh

Chairman

BME TMIT, Hungary

Junichi Yamagishi

National Institute of Informatics, Japan / University of Edinburgh, UK

Sébastien Le Maguer

ADAPT Centre/TCD, Ireland

Esther Klabbers

ReadSpeaker, Netherlands

Mátyás Bartalis

BME TMIT, Hungary

Tamás Gábor Csapó

BME TMIT, Hungary

Bálint Gyires-Tóth

BME TMIT, Hungary

Gábor Olaszy

BME TMIT, Hungary

Csaba Zainkó

BME TMIT, Hungary

Keynotes

Thomas Drugman, Amazon, Germany

Expressive Neural TTS

Thomas Drugman is a Science Manager in the Amazon TTS Research team. He received his PhD in 2011 from the University of Mons, winning the IBM Belgium award for “Best Thesis in Computer Science”. His PhD thesis studied the use of glottal source analysis in speech processing. He then completed a three-year postdoc on speech and audio analysis for two biomedical applications: tracheoesophageal speech reconstruction and cough detection in chronic respiratory diseases. In 2014, he joined Amazon as a Scientist in the Alexa ASR team. He transferred to the TTS team in 2016, where he has been Science Manager since 2017. He has contributed to making Amazon’s Neural TTS more natural and expressive, notably by enriching Alexa’s experience with different speaking styles: emotions, newscaster, whispering, etc. His current research interests lie in improving the naturalness and flow of longer synthetic speech interactions. He has about 125 publications in the field of speech processing. He received the Interspeech Best Student Paper awards in 2009 and 2014 (as supervisor). He has also been a member of the IEEE Speech and Language Technical Committee since 2019.

István Winkler, Research Centre for Natural Sciences, Hungary

Early Development of Infantile Communication by Sound

István Winkler, PhD, DSc, is an electrical engineer and psychologist. He received his PhD in 1993 at the University of Helsinki, studying auditory sensory memory by electroencephalographic measures. He defended his Doctor of Science thesis in 2005 at the Hungarian Academy of Sciences on auditory deviance detection. His current fields of interest are predictive processing in auditory deviance detection, auditory scene analysis, communication by sound, and the development of these functions in infancy. During his career, he has authored or coauthored over 250 publications, which have received over 11,000 citations. He is currently the director of the Institute of Cognitive Neuroscience and Psychology, Research Centre for Natural Sciences, Budapest, Hungary, and the head of the Sound and Speech Perception research group (http://www.ttk.hu/kpi/en/sound-and-speech-perception/).

Lior Wolf, Facebook AI Research and Tel Aviv University, Israel

Deep Audio Conversion Technologies and Their Applications in Speech, Singing, and Music

Lior Wolf is a research scientist at Facebook AI Research and a full professor in the School of Computer Science at Tel Aviv University, Israel. He conducted postdoctoral research at Prof. Poggio's lab at the Massachusetts Institute of Technology and received his PhD degree from the Hebrew University, under the supervision of Prof. Shashua. He is an ERC grantee and has won honorable mentions at ICCV 2001 and ICCV 2019, and best paper awards at ECCV 2000 and ICANN 2016. His research focuses on computer vision, audio synthesis, and deep learning.

Venue

The VITAL HOTEL NAUTIS **** is a wellness and conference hotel in Gárdony, the capital of Lake Velence, located directly on the lakeshore next to the port and the beach. Lake Velence is the third largest freshwater lake in Hungary. It is situated halfway between Lake Balaton and Budapest, near the M7 highway connecting the Hungarian capital and Lake Balaton. The lake, which is always different in appearance, captivates visitors in every season. In summer it offers opportunities for bathing, is an ideal place for surfing and waterskiing, and hosts numerous events near the lakeshore.

Contact

All questions about submissions should be emailed to ssw11@hte.hu