Uppsala universitet  
 
 

Eva Pettersson

Researcher in Computational Linguistics

Uppsala University
Department of Linguistics and Philology
Box 635, SE-75126 Uppsala
Tel: +46 18 4711091
eva.pettersson@lingfil.uu.se

Research

Research interests: digital humanities, NLP for historical text, historical cryptology, corpus linguistics, automatic spell checking and grammar checking.

Tools and resources developed:

Publications

2024

  • Eva Pettersson, Lars Borin and Erik Lenas (2024)
    SWENER-1800: A Corpus for Named Entity Recognition in 19th Century Swedish
    In Proceedings of the Digital Humanities in the Nordic and Baltic Countries 8th Conference, Reykjavík, Iceland, May 27–31, 2024. [pdf]

  • Eva Pettersson, Lasse Mårtensson and Veturliði Óskarsson (2024)
    Quantifying Medieval Scribal Habits – The Case of Abbreviations in West Norse Manuscripts
    In Proceedings of the Digital Humanities in the Nordic and Baltic Countries 8th Conference, Reykjavík, Iceland, May 27–31, 2024. [pdf]

  • Crina Tudor and Eva Pettersson (2024)
    People and Places of the Past - Named Entity Recognition in Swedish Labour Movement Documents from Historical Sources
    In the 8th Joint SIGHUM Workshop on Computational Linguistics for Cultural Heritage, Social Sciences, Humanities and Literature. The 18th Conference of the European Chapter of the Association for Computational Linguistics (EACL), Malta. [pdf]

  • Olle Sköld, Raphaela Heil, Silke Neunsinger, Eva Pettersson, Örjan Simonson and Jonas Söderqvist (2024)
    The Labour’s Memory Project
    In Proceedings of the Digital Humanities in the Nordic and Baltic Countries 8th Conference, Reykjavík, Iceland, May 27–31, 2024.

2023

  • Ellinor Lindqvist, Eva Pettersson, and Joakim Nivre (2023)
    Low-Resource Techniques for Analysing the Rhetorical Structure of Swedish Historical Petitions
    In Workshop on Resources and Representations for Under-Resourced Languages and Domains, RESOURCEFUL-2023. 24th Nordic Conference on Computational Linguistics (NoDaLiDa), Tórshavn, Faroe Islands. [pdf]

2022

  • Eva Pettersson and Lars Borin (2022)
    Swedish Diachronic Corpus
    In Darja Fišer & Andreas Witt (eds.), CLARIN. The Infrastructure for Language Resources. Berlin: deGruyter. [book entry]

  • Ellinor Lindqvist, Eva Pettersson and Joakim Nivre (2022)
    To the Most Gracious Highness, from Your Humble Servant: Analysing Swedish 18th Century Petitions Using Text Classification
    In Proceedings of the 6th Joint SIGHUM Workshop on Computational Linguistics for Cultural Heritage, Social Sciences, Humanities and Literature, pages 53--64, Gyeongju, Republic of Korea. [pdf]

  • Maria-Elena Gambardelli, Eva Pettersson and Beáta Megyesi (2022)
    Identifying Cleartext in Historical Ciphers
    In Proceedings of the Workshop on Language Technologies for Historical and Ancient Languages. LT4HALA 2022. / [ed] Rachele Sprugnoli and Marco Passarotti. [pdf]

2020

  • Helena Hubková, Pavel Kral and Eva Pettersson (2020)
    Czech Historical Named Entity Corpus v 1.0
    In Proceedings of Proceedings of the 12th Language Resources and Evaluation Conference (LREC 2020), pp 4458–4465. [pdf]

  • Beáta Megyesi, Bernhard Esslinger, Alicia Fornés, Nils Kopal, Benedek Láng, George Lasry, Karl de Leeuw, Eva Pettersson, Arno Wacker & Michelle Waldispühl (2020)
    Decryption of historical manuscripts: the DECRYPT project
    In Cryptologia. [pdf]

2019

  • Eva Pettersson and Lars Borin (2019)
    Towards a Swedish diachronic corpus: Intended content, structure and format of version 1.0
    In Swe-Clarin Report Series, SCR-03-2019. [pdf]

  • Eva Pettersson and Beáta Megyesi (2019)
    Matching Keys and Encrypted Manuscripts
    In Proceedings of the 22nd Nordic Conference on Computational Linguistics (NoDaLiDa 2019), September 30–October 2, 2019, Turku, Finland, pages 253–261. [pdf]

  • Eva Pettersson and Lars Borin (2019)
    Swedish diachronic texts: Resources and user needs to consider in a Swedish diachronic corpus
    In Swe-Clarin Report Series, SCR-02-2019. [pdf]

  • Beáta Megyesi, Nils Blomqvist, and Eva Pettersson (2019)
    The DECODE Database: Collection of Historical Ciphers and Keys
    In Proceedings of the 2nd International Conference on Historical Cryptology. HistoCrypt 2019, June 23-25, 2019, Mons, Belgium. NEALT Proceedings Series 37, Linköping Electronic Press. [pdf]

  • Eva Pettersson and Lars Borin (2019)
    Characteristics of diachronic and historical corpora: Features to consider in a Swedish diachronic corpus
    In Swe-Clarin Report Series, SCR-01-2019. [pdf]

2018

  • Gongbo Tang, Fabienne Cap, Eva Pettersson, and Joakim Nivre (2018)
    An Evaluation of Neural Machine Translation Models on Historical Spelling Normalization
    In Proceedings of COLING 2018. [pdf]

  • Eva Pettersson and Beáta Megyesi (2018)
    The HistCorp Collection of Historical Corpora and Resources.
    In Proceedings of the Digital Humanities in the Nordic Countries 3rd Conference, Helsinki, Finland, March 7-9, 2018. [pdf]

2017

  • Erik Tjong Kim Sang et al. (2017)
    The CLIN27 Shared Task: Translating Historical Text to Contemporary Language for Improving Automatic Linguistic Annotation
    In Computational Linguistics in the Netherlands Journal volume 7, pages 53–64. [pdf]

  • Gerold Schneider, Eva Pettersson and Michael Percillier (2017)
    Comparing Rule-based and SMT-based Spelling Normalisation for English Historical Texts
    In Proceedings of the NoDaLiDa 2017 Workshop on Processing Historical Language. [pdf]

  • Sara Stymne, Eva Pettersson, Beáta Megyesi and Ann Palmér (2017)
    Annotating Errors in Student Texts: First Experiences and Experiments
    In Proceedings of the Joint 6th Workshop on NLP for Computer Assisted Language Learning and 2nd Workshop on NLP for Research on Language Acquisition at NoDaLiDa 2017. Linköping Electronic Conference Proceedings 134: 47–60. [pdf]

2016

  • Eva Pettersson (2016)
    Spelling Normalisation and Linguistic Analysis of Historical Text for Information Extraction
    Doctoral Thesis. [pdf]

  • Eva Pettersson, Jonas Lindström, Benny Jacobsson and Rosemarie Fiebranz
    HistSearch — Implementation and Evaluation of a Web-based Tool for Automatic Information Extraction from Historical Text
    In Proceedings of the 3rd HistoInformatics Workshop, Krakow, Poland, 11 July 2016. [pdf]

2015

  • Eva Pettersson, Beáta Megyesi and Joakim Nivre (2015)
    Ranking Relevant Verb Phrases Extracted from Historical Text
    In Proceedings of the 9th SIGHUM Workshop on Language Technology for Cultural Heritage, Social Sciences, and Humanities, pages 39–47, Beijing, China, July 30, 2015. [pdf]

  • Eva Pettersson and Joakim Nivre (2015)
  • Improving Verb Phrase Extraction from Historical Text by use of Verb Valency Frames
    In Proceedings of the 20th Nordic Conference of Computational Linguistics (NODALIDA 2015). [pdf]

2014

  • Eva Pettersson, Beáta Megyesi and Joakim Nivre (2014)
    Verb Phrase Extraction in a Historical Context
    In The First Swedish National SWE-CLARIN Workshop at the Swedish Language Technology Conference (SLTC) 2014. [pdf]

  • Eva Pettersson, Beáta Megyesi and Joakim Nivre (2014)
    A Multilingual Evaluation of Three Spelling Normalisation Methods for Historical Text
    In Proceedings of the 8th Workshop on Language Technology for Cultural Heritage, Social Sciences, and Humanities (LaTeCH) @ EACL 2014, pages 32–41,Gothenburg, Sweden, April 26 2014. [pdf]

2013

  • Eva Pettersson, Beáta Megyesi and Joakim Nivre (2013)
    Normalisation of Historical Text Using Context-Sensitive Weighted Levenshtein Distance and Compound Splitting
    In Proceedings of the 19th Nordic Conference of Computational Linguistics (NODALIDA 2013); Linköping Electronic Conference Proceedings 85. [pdf]

  • Eva Pettersson, Beáta Megyesi and Jörg Tiedemann (2013)
    An SMT Approach to Automatic Annotation of Historical Text
    In Proceedings of the Workshop on Computational Historical Linguistics at NODALIDA 2013. NEALT Proceedings Series 18; Linköping Electronic Conference Proceedings 87:54-69. [pdf]

2012

  • Eva Pettersson, Beáta Megyesi and Joakim Nivre (2012)
    Rule-Based Normalisation of Historical Text - A Diachronic Study
    In Proceedings of the First international Workshop on Language Technology for Historical Text(s). KONVENS, Vienna, Austria, September 2012. [pdf]

  • Eva Pettersson, Beáta Megyesi and Joakim Nivre (2012)
    Parsing the Past - Identification of Verb Constructions in Historical Text
    In Proceedings of the 6th EACL Workshop on Language Technology for Cultural Heritage, Social Sciences, and Humanities. Avignon, France, April 2012. [pdf]

2011

  • Eva Pettersson and Joakim Nivre (2011)
    Automatic Verb Extraction from Historical Swedish Texts
    In Proceedings of the 5th ACL-HLT Workshop on Language Technology for Cultural Heritage, Social Sciences, and Humanities. Portland, OR, USA, Juni 2011. [pdf]

Earlier Work

  • Eva Pettersson (2007)
    Tools for automatic annotation of Swedish and Turkish texts [pdf]

  • Eva Pettersson (2005)
    Pilotstudie om maskinöversättning inom ramen för Projekt Kursdatabas - Utveckling av språkliga resurser för ett vetenskapsområde samt utvärdering [pdf]

  • Weijnitz, Per, Eva Forsbom, Ebba Gustavii, Eva Pettersson and Jörg Tiedemann (2004)
    MT goes farming: Comparing two machine translation approaches on a new domain
    In Proceedings of the Fourth International Conference on Language Resources and Evaluation (LREC'04). Lisbon, Portugal, May 26–28, 2004. [pdf]

  • Weijnitz, Per, Anna Sågvall Hein, Eva Forsbom, Ebba Gustavii, Eva Pettersson and Jörg Tiedemann (2004)
    The machine translation system MATS - past, present & future
    In Proceedings of RASMAT'04 (Recent Advances in Scandinavian Machine Translation). Uppsala, Sweden, April 23, 2004. [pdf]

  • Ebba Gustavii och Eva Pettersson (2003), master's thesis
    A Swedish Grammar for Word Prediction [pdf]

  • Ebba Gustavii and Eva Pettersson (2003)
    Utveckling av ett svensk-engelskt lexikon för maskinöversättning inom jordbruksdomänen [pdf]

Teaching

  • Master Thesis in Language Technology, 2020-2024
  • Current Topics in Digital Philology, 2018–2024
  • Language Technology: Research and Development, 2017–2018, 2021
  • Språk, datorer och textbehandling, 2007, 2016–2019
  • Grundläggande textanalys, 2014–2017
  • Introduktion till språkteknologi, 2016
  • Språkgranskningsverktyg, 2008–2011
  • Maskinöversättning, 2006–2009
  • Maskinöversättning och Språkgranskning, 2004–2008
  • Datorbaserade verktyg i humanistisk forskning, 2007
  • Elementära verktyg för korpusbearbetning, 2007
  • Korpuslingvistik, 2006–2007
  • Grundläggande databehandling, 2006
  • Metoder och tillämpningar inom språkteknologin, 2005–2006
  • Datorlingvistisk lexikografi, 2004–2005
  • Machine Translation (level 2). GSLT-kurs (Graduate School of Language Technology), 2004