Tags:(COALS), Bilingual Terminology Extraction, comparable/parallel corpus, Machine Learning and Natural language processing
Abstract:
In this work we provide a new approach for Automatic Bilingual Terminology Extraction involving Arabic and French languages. This approach aims to build a bilingual list of terms for a specific field “Algerian Culture”. It is based on building a semantic space for context understanding, adopting COALS (Correlated Occurrence Analogue to Lexical Semantics) Semantic model. We use “Wikipedia” to build a comparable corpus on “Algerian heritage & culture” and a translation tool for term alignment.