20,281,810 Words 328 Texts 31 Authors
Below you will find information on how to obtain integrated corpora and also have the opportunity to download specialised corpora. In our recent research, we have needed a corpus of DH Lawrence's novels and one for Modernist prose that contains the word "colourless/colorless".
Be sure to visit the website often for new additions to our specialised offerings and sign up to receive our newsletter. As always, feel free to email us with questions.
Our aim is to provide researchers with an open accessible corpus of Modernist Literature. As such, we offer a main Corpus of Modernist Literature, composed of three main sub-corpora:
To access these larger files, please complete our subcription form.
We also offer smaller sub-corpora of Prose Fiction, Shorter Prose, and Poetry organised per authors, as well as Specialist Corpora directly accessible on this website. These smaller sub-corpora have been tagged for Part of Speech (POS) and Semantic Domains (SEM).
Because more and more text become available each year in the public domain, we will periodically update our corpora. Subscribe to receive announcements about those updates or information on workshops or publications.
In our recent research, we have had the need for a corpus of DH Lawrence's novels and Modernist prose containing the word "colourless/colorless". In this section, you will be able to access the corpora in various formats. Be sure to check back often for new additions and sign up to receive our newsletter. As always, feel free to email us with questions.
Workshops and tutorials will be available soon -
watch this space!
Until then, we offer a case study using the Colourless Corpus below to showcase how corpora can be used for thematic analysis. Click below for the full text.
The initial investigation for this project involved 368 prose texts from 31 authors in The Modernist Literature Project. As the focus of our research was on the single lexeme "colourless", we selected four authors with the highest frequency of the lexeme: Henry James, Edith Wharton, Joseph Conrad, and Rudyard Kipling. The aim of our analysis is to show how colourless is employed in Modernist prose and is not purely based on quantitative analysis. A zipped file containing the 85 examined texts is provided below and the paper .
Below you will find the ten major novels of DH Lawrence that are contained in the DHL corpus and the 85 novels that comprise the Literary Reference corpus. A list of the 95 texts and authors is available below and contains type, token and TTR information. Also available for download are the Wmatrix parts-of-speech and semantic domain tagsets. Please reference as:
McClure, S. (2021) Oppositional Language as Thematic Signals in the Novels of DH Lawrence: A Corpus-Based Examination.
PhD Dissertation, University of Liverpool.
Copyright © 2022 The Modernist Literature Project - All Rights Reserved.
Please reference The Modernist Literature Project as follows:
McClure, S. and Pager-McClymont, K. (2022). The Modernist Literature Project. ModernistLiteratureProject.org.
Powered by GoDaddy Website Builder