Tags:coherence relations, discourse connectives, lexicon, speech vs. writing and Ukrainian language
Abstract:
We introduce a new lexicon of discourse connectives for the Ukrainian language. Discourse connectives like ‘because’, ‘therefore’ are grammatical elements which link clauses and sentences semantically and play a crucial role in discourse structure. They have shown to be useful for many tasks in natural language processing from argumentation mining to authorship analysis.
We introduce a semi-automatic method for inventorizing discourse connectives in underresourced languages, by leveraging existing lexicons from other languages. As a result, we provide the first computer-readable lexicon of 129 Ukrainian discourse connectives. We provide syntactic as well as semantic information for these items. Finally, we carry out a small pilot study using the lexicon for discourse level corpus annotation, and report on the distribution of connectives in Ukrainian in two different types of media.
A Computational Lexicon of Ukrainian Discourse Connectives