Categories and Genres in CHET and CECheT

This paper addresses one of the concerns that corpus compilers face when gathering text samples. In particular, it explores how to classify such samples into wider categories in the case of the Corpus of English Chemistry Texts (CECheT), one of the subcorpora of the Coruña Corpus of English Scientific Writing. To this end, the authors have reviewed the literature to identify (and try to resolve) the terminological confusion surrounding labels such as genre, text-type and textual category. These labels have generally been related to either the form or the function of the text. In this paper, the idea of “communicative format” is used to bring together form and function, as they are seen as intermingled in texts at all levels.

