Building Corpora From Newspapers, Prose, And Poetic Works
Keywords:
corpus creation
Abstract
The creation of corpora from varied text genres – such as newspaper publications, prose, and poetry – provides unique insights into linguistic patterns and cultural trends. Each genre presents distinct challenges and requires its own methods of corpus construction, for example to ensure representativeness and to handle genre-specific linguistic features. This article explores these challenges and examines the work of prominent researchers such as Tony McEnery, David Hoover, and Michael Stubbs, who have contributed to the development of corpora from newspapers, prose, and poetry. Through these corpora, scholars can study language use in different registers and literary forms, expanding our understanding of linguistic and stylistic diversity.