---------------------------------- German Literary History (LitHist) ---------------------------------- The German Literary Corpus included on the HistCorp platform is the German Literary History section of the Universal Dependencies treebank, containing texts from different genres and different authors of the German literary history. The current version holds two texts by Friedrich Schlegel (1772--1829) and the text 'Blüthenstaub' by Novalis (1772--1801). HistCorp inclusion date ------------------------ November 20, 2020 Website -------- https://github.com/UniversalDependencies/UD_German-LIT/blob/master/README.md Licence -------- Creative Commons BY-NC-SA 4.0 (https://creativecommons.org/licenses/by-nc-sa/4.0/) The HistCorp files ------------------- On the HistCorp page, the German texts from 'The German Literary Corpus' are provided in a plain text format, a tokenised format and a linguistically annotated CoNLL-U format. The linguistically annotated files ('anno') contain information on part-of-speech tags, lemma, morphology and syntax (expressed as dependency relations), following the same CoNLL-U format as on the Universal Dependencies site from which the files were extracted, except that metadata has been added in a TEI-compatible format at the top of each file. The metadata information was mainly extracted from the metadata stated in the README file on the German Literary History section of the Universal Dependencies site (https://github.com/UniversalDependencies/UD_German-LIT/blob/master/README.md). The plain text files ('txt') contain one sentence on each line. The sentences were automatically extracted from the CoNLL-U files. In the tokenised files ('tok'), the texts are split into one token on each line. The tokenised files were automatically created, by extracting the first and second columns only (word id and word form) from the CoNLL-U files. Size: 40,545 tokens. Genre: literature. Time Period: late 18th century to early 19th century.