------------------------
Coptic Scriptorium
------------------------

The HistCorp texts from the Coptic Scriptorium corpus have been harvested by Cosimo Palma, from the website:
https://github.com/CopticScriptorium/corpora

HistCorp inclusion date
------------------------
February 8, 2023

Website
--------
https://copticscriptorium.org/

Contact information
--------------------
https://copticscriptorium.org/about

Licence
--------
https://creativecommons.org/licenses/by/4.0/

The HistCorp files
-------------------
The Coptic Scriptorium texts in HistCorp were extracted from the Coptic Scriptorium GitHub page (https://github.com/CopticScriptorium/corpora) by Cosimo Palma. They are provided in a plain text diplomatic format ('dipl'), in a plain text diplomatic format without spaces and diacritic signs ('dipl-clean'), and as a complete download of all the Coptic Scriptorium texts as presented on the GitHub page ('complete').

Furthermore, there is a downloadable folder containing documents intended to facilitate cryptology work ('util'). All texts contained in the other folders are merged into a single corpus, in which Coptic glyphs are replaced by Latin ones to facilitate computational tasks, since many cryptology software packages only work with Latin characters.

The list of 5-grams is structured as follows: ngram + log10 of ngram frequency (values ranging from 125 to 7)
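
As an illustration of how the 5-gram list might be used, here is a minimal Python sketch. It assumes, without confirmation from the files themselves, that each line holds one n-gram and its score separated by whitespace. The function names parse_ngram_lines and score_text and the floor parameter are hypothetical and are not part of the HistCorp distribution.

```python
from typing import Dict, Iterable


def parse_ngram_lines(lines: Iterable[str]) -> Dict[str, float]:
    """Parse lines of the assumed form '<ngram> <score>' into a dictionary.

    Assumption: the n-gram and its score are separated by whitespace.
    Blank, malformed, or non-numeric lines are skipped.
    """
    table: Dict[str, float] = {}
    for line in lines:
        parts = line.split()
        if len(parts) != 2:
            continue  # not an '<ngram> <score>' pair
        ngram, score = parts
        try:
            table[ngram] = float(score)
        except ValueError:
            continue  # e.g. a header line
    return table


def score_text(text: str, table: Dict[str, float], n: int = 5, floor: float = 0.0) -> float:
    """Sum the scores of all overlapping n-grams in a transliterated text.

    N-grams missing from the table contribute `floor` (a hypothetical default).
    """
    return sum(table.get(text[i:i + n], floor) for i in range(len(text) - n + 1))
```

To read a real file, the lines could be passed in from an open file handle, for example parse_ngram_lines(open(path, encoding="utf-8")), where path points to the 5-gram list in the 'util' folder. A higher total from score_text would then suggest that a candidate text resembles the merged, Latin-transliterated corpus more closely.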