Office Hours

by appointment only

  • please send me an email and briefly describe the topic/problem
  • if you want to ask for supervision of a BA-/MA-thesis, please read the information on BA-/MA-thesis below first


Current Projects

  • SFB 1102 Project B1: Information Density and Scientific Literacy in English: Synchronic and Diachronic Perspectives
  • CLARIN-D: Common Language Resources and Technology Infrastructure
  • N-grams as phraseological units, their usage and function

Former Projects at the UdS

  • Registers in Contact: linguistic evolution of specialized scientific registers
  • EM-Multi: Multilingualism and Multiculturalism. Erasmus Mundus Exchange Programme with South East Asia for Master's students, PhDs, PostDocs, Staff

Former Projects at the IMS

  • KCL - Kompetenzzentrum Computerlinguistik
    Competence Center Computational Linguistics
  • TFB - Transferbereich: Automatische Exzerption
  • Automatic extraction of lexicographic information from large text corpora
  • DEREKO - German Reference Corpus (Deutsches Referenzkorpus)
  • ELSNET Syntax / Semantic Annotation Task

Research interests

  • linguistic properties of scientific writing, especially

    • the diachronic development
    • register variation
    • the relation between linguistic encoding and information density

  • empirical studies of (lexico-)grammatical phenomena in English and German
  • n-grams as recurrent patterns, their usage and function
  • combining traditional corpus linguistic methods with techniques from machine learning and data mining
  • corpus compilation and corpus annotation
  • multilingual corpora
  • chunking / shallow parsing

Courses

The course websites are accessible only from within the UdS network (eduroam does not suffice). For external access, use the VPN with your UdS login.


SoSe 2017

  • Practical lab sessions on Corpus Linguistics

WS 2017/2018

  • PS: You shall know a word by the company it keeps - Contextual phenomena in English

WS 2016/2017

SoSe 2016

WS 2015/2016

SoSe 2015

  • HS+Ü: Comparing Corpora (with Prof. Dr. Elke Teich)

WS 2014/2015

  • HS+Ü: Diachronic Corpus Linguistics - on the track of change (with Prof. Dr. Elke Teich)

SoSe 2014

  • HS+Ü: Corpus-based Linguistics and Translatology + Tutorial (with Prof. Dr. Elke Teich and Stefania Degaetano-Ortlieb)

WS 2013/2014

  • V: Sprache, Kommunikation, Translation im Englischen Bereich: Introduction to linguistics and translatology (English/German) (with Prof. Dr. Elke Teich)

SoSe 2013

  • PS: Computergestützte Phraseologie / Computational Phraseology

WS 2012/2013

  • PS: Computergestützte Phraseologie / Computational Phraseology

SoSe 2012

  • HS+Ü: Corpusbasierte Sprach- und Übersetzungswissenschaft (Corpus-based Linguistics and Translatology)

WS 2010/2011

  • Tutorial for V Introduction to Linguistics and Translatology (English/German)
  • Tutorial for HS Corpusbasierte Sprach- und Übersetzungswissenschaft (Corpus-based Linguistics and Translatology)

SoSe 2010

  • Tutorial for HS Computer Applications in Linguistics


Information on BA-/MA-thesis

  • I can only supervise corpus-based studies (qualitative or quantitative) on English (including contrastive studies with German and, to some extent, French, as well as translation-related studies)
  • You should have some prior knowledge of or experience in corpus linguistics, i.e., you should have attended a relevant course or have worked through our step-by-step online tutorial
  • Please contact me via email for an appointment. In this email you should address the following points:

    • planned topic of your thesis + abstract
    • prior corpus linguistic experience (courses, tools, methods)
    • corpus linguistic resources you plan to use or build - list of corpora and tools
    • list of references

Dr. Hannah Kermes

Research Associate (Wissenschaftliche Mitarbeiterin)


Universität des Saarlandes
Fachrichtung Sprachwissenschaft
und Sprachtechnologie (ehem. FR 4.6)
Geb.: A22 Raum: 1.07
D-66123 Saarbrücken

Tel.: +49 681 302-70077
Fax: +49 681 302-70072
E-Mail: h.kermes(at)