SS22: Crowdsourcing high quality annotations and experimental data

Seminar (in English)

taught by:  Dr. Merel Scholman
start date: 25.04.2022
time: Monday, from 10:15 to 11:45 a.m.
located in:The seminar will be held online via MS Teams
Link to MS Teams
sign-up:Interested students please join the group on Teams
before the start of the semester.
If you have any questions, feel free to send me an email
suited for:  B.Sc. in Computational Linguistics
B.Sc. in Computer Science
M.Sc. in Computer Science
M.Sc. in Language Science and Technology
more details:In LSF                            

Course description

Crowdsourcing observations from non-experts is one of the most common approaches to collecting data and annotations in NLP, and it is becoming increasingly popular in psycholinguistics. Crowdsourcing has been applied to a plethora of tasks, such as eliciting annotations of diverse phenomena ranging from discourse relations to image labelling, as well as obtaining experimental data such as reading times or word recognition.

Despite crowdsourcing having grown into a fundamental method for collecting data, its usage is largely guided by common practices and personal experience of researchers. This seminar has a focus on how methodology can shape our research results. We will discuss various principles and practices that have proven effective in generating high quality data for a large range of tasks.