Days: Thursday, September 14th Friday, September 15th
View this program: with abstractssession overviewtalk overview
Opening addresses from:
Louis Cotgrove (Organising Committee)
Cornelia Ruhe (Dean of the School of Humanities, University of Mannheim)
Angelika Wöllstein (Deputy Director of the IDS)
09:30 | Individual linguistic variability in social media (abstract) |
11:00 | Ellipsis Points in Messaging Interactions and on Wikipedia Talk Pages (abstract) |
11:30 | A Multivariate Register Perspective on Reddit: Exploring Lexicogrammatical Variation in Online Communities (abstract) |
12:00 | “Also ehrlich” – From adjectival use to interactive discourse marker (abstract) |
11:00 | Collecting Health Memes for a Subcorpus of Peer Health Discourse (abstract) |
11:30 | Collecting and de-identifying half a million WhatsApp messages (abstract) |
12:00 | Little Big Data: Karelian Twitter Corpus (abstract) |
14:00 | Not an expert, but not a fan either. A corpus-based study of negative self-identification as epistemic index in web forum interaction. (abstract) |
14:30 | Balancing expert and peer-student identities in online discussion forums (abstract) |
15:00 | Studying Socially Unacceptable Discourse Classification (SUD) through different eyes: "Are we on the same page ?" (abstract) |
14:00 | Scientific communication on social media: Analysing Twitter for knowledge recontextualisation (abstract) |
14:30 | The recontextualization of expert knowledge: intertextual patterns in digital science dissemination (abstract) |
15:00 | Can I Publish my Social Media Corpus? Legal Considerations for Data Publication (abstract) |
Structural linguistic characteristics of podcasts as an emerging register of computer-mediated communication (abstract) |
Ellipsis of the subject pronoun ich (‘I’) in German WhatsApp chats: A usage-based approach (abstract) |
ChrisTof: A Novel Corpus of Christian Online Forums (abstract) |
Negotiating knowledge in cooperative learning scenarios: a multimodal approach to practices of computer-mediated and face-to-face communication in the university classroom (abstract) |
Deontic Authority in Computer-mediated Communication Between University Teachers and Students: A Comparative Study of German and Chinese (abstract) |
CoDEC-M: the multi-lingual Manosphere subcorpus of the Corpus of Digital Extremism and Conspiracies (abstract) |
Zooming in on emerging norms: Preliminary findings from a cross-linguistic investigation of backchanneling (abstract) |
Semantic Prosody Evolution of the Word "女权[Feminist]" in Chinese Social Media (abstract) |
What kind of socio-emotional support can families dealing with autism receive in social media? Sentiment and attitudinal analyses of comments on Bilibili videos from 2019-2022 (abstract) |
Tracing Perceptions of Black History by Comparison of Two Corpora (abstract) |
Building corpora of Russian fake and genuine news for linguistic analysis (abstract) |
Building a Parallel Discourse-annotated Multimedia Corpus (abstract) |
Linguistic features, device affordances, and contextual factors: A mixed-methods, two-corpora approach (abstract) |
View this program: with abstractssession overviewtalk overview
Speakers: Alexander König and Egon Stemle
09:30 | Anonymization of Persons in Videos of Authentic Social Interaction: Machine Learning Model Selection and Parameter Optimization. (abstract) |
10:00 | A Pipeline for the Large-Scale Acoustic Analysis of Streamed Content (abstract) |
09:30 | Digital Corpus Linguistic Analysis of the Language on disability and inclusion in social media – in a German corpus of 2,559 Tweets on #disability and #inclusion between 1st of December – 31st of December 2020 (abstract) |
10:00 | The representation of the ‘Jew’ as enemy in French public Telegram channels within the identitarian-conspiratorial milieu (abstract) |
09:30 | IDA - Incel Data Archive: a multimodal comparable corpus for exploring extremist dynamics in online interaction. (abstract) |
10:00 | “Don’t be afraid of Greeklish”: Adolescent students’ transliteration practices (abstract) |
11:00 | Workflows and Methods for Creating Structured Corpora of Multimodal Interaction (abstract) |
11:30 | Multimodal Intertextual Practices in Video Film Reviews (abstract) |
12:00 | Acquiring, Analyzing, and Understanding Multimodal TikTok Short Video Data: The Case of Online Sex Worker Visibility Management (abstract) |
11:00 | Phonetic Metaphor of Chinese Emojis: An Approach of Neologism Formation (abstract) |
11:30 | "megageil", "mega geil", and "voll mega": Intensification in YouTube comments (abstract) |
11:00 | The Reply Function in WhatsApp Chat Communication (abstract) |
11:30 | Specific behaviours in Wikipedia talk pages: some insights from extreme cases (abstract) |
12:00 | A Corpus study on the negotiation of pronominal address on talk pages of the German, French, and Italian Wikipedia (abstract) |
14:00 | MMWAH! Compiling a Corpus of Multilingual / Multimodal WhatsApp Discussions by Swedish-speaking Young Adults in Finland (abstract) |
14:30 | MigrTwit Corpora. (Im)migration Tweets of French Politics. (abstract) |
14:00 | “Hebrew level: Bibist.”: Online Hebrew language corrections as a tool for “civilized” bashing (abstract) |
14:30 | Towards a more inclusive approach of digital literacy: social media writing at an older age (abstract) |
The workshop gives a basic introduction to methodological and technological aspects of working with multimedia data as part of CMC corpora. We will give an overview of the most important tools for manual transcription of audio or video, discuss the role of automatic methods (ASR, Automatic Speech Recognition) and look at standardisation in the TEI framework, including integration with solutions for written CMC data. Researchers of all stages are welcome, no special equipment is required to follow the workshop.