  1. Lecturer: Maarten Marx, http://maartenmarx.nl
  2. Assistant(s):
  3. For times, see Datanose


All literature is freely and legally available on the web.


Each lecture will be accompanied by lecture notes and/or slides.
These notes are typically IPython notebooks or Markdown files. They contain pointers to the literature and define the basic requirements of what you are expected to know. They are excellent material for mastering the course and for knowing what to study for the exam.

Lecturenotes Folder
Slides Folder


Lectures may be accompanied by IPython notebooks, as indicated in the lecturenotes and the courseplan. You can view notebooks and slides using http://nbviewer.ipython.org, but if you want to run them you should download them and run them on your own machine.

IPython notebook Software

We strongly advise you to install the Anaconda Python Distribution. It contains almost all the modules and packages you will need in your studies, is available for all platforms, and has a simple installation procedure. More detailed installation instructions can be found here: http://docs.continuum.io/anaconda/install.html

NoteBook Folder


An important aim of the course is to get acquainted with off-the-shelf software with which you can create an IR system. We will look at two or three major, industrial-strength solutions.

Course Objectives

A schedule of all assignments and exams and their weight can be found in the CoursePlan.
Exams can be held in multiple formats, depending on the availability of rooms and technical services. We prefer digital exams, i.e., exams in which you type your answers on a computer. If this is not possible, you get the same questions as in the digital exam, but on paper.

2016: Partial exam 1 is held on paper; partial exam 2 and the resit are held digitally. Logistical reasons may force a deviation from this.

Whatever the format in which the exam is held, the questions and the answers to be given are exactly the same.

Mandatory attendance: We follow the OER (the Teaching and Examination Regulations) for the tutorials on Tuesday and Wednesday. In every tutorial, attendance is recorded about 15 minutes after the start and not afterwards.

For dates and locations, see Datanose.

Weighting of the assignments and exams

Component                              When     Weight
Midterm exam                           week 4   25%
End exam                               week 8   25%
Text classification group assignment   week 6   16%
Search engine group assignment         week 8   19%
5 weekly assignments (3% each)                  15%

Exams last two hours and are closed-book. Group assignments are done in groups of at most 3 persons. The average of all assignments and the average of the two exams must each be 4.5 or more.
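
To make the arithmetic concrete, here is a minimal sketch of how a final grade could be computed from the weights listed above. It is an illustration, not the official calculation: the function and key names are hypothetical, the five weekly assignments are entered as one averaged grade, and it assumes that "average" means the average weighted by the listed percentages. The CoursePlan remains authoritative.

```python
# Weights (percent of the final grade) taken from the overview above.
EXAM_WEIGHTS = {"midterm": 25, "end": 25}
ASSIGNMENT_WEIGHTS = {
    "text_classification": 16,
    "search_engine": 19,
    "weekly": 15,  # 5 weekly assignments at 3% each, entered as one averaged grade
}
THRESHOLD = 4.5  # minimum required for both the exam and the assignment average


def weighted_average(grades, weights):
    """Average of the grades named in `weights`, weighted by those weights."""
    total = sum(weights.values())
    return sum(grades[name] * w for name, w in weights.items()) / total


def course_result(grades):
    """Return (final, exam_avg, assignment_avg, meets_thresholds).

    Assumption: averages are weighted by the listed percentages.
    """
    exam_avg = weighted_average(grades, EXAM_WEIGHTS)
    assignment_avg = weighted_average(grades, ASSIGNMENT_WEIGHTS)
    final = weighted_average(grades, {**EXAM_WEIGHTS, **ASSIGNMENT_WEIGHTS})
    meets = exam_avg >= THRESHOLD and assignment_avg >= THRESHOLD
    return final, exam_avg, assignment_avg, meets


# Hypothetical grades, for illustration only.
example = {
    "midterm": 6.0,
    "end": 7.0,
    "text_classification": 8.0,
    "search_engine": 7.0,
    "weekly": 9.0,
}
final, exam_avg, assignment_avg, meets = course_result(example)
print(round(final, 2), round(exam_avg, 2), round(assignment_avg, 2), meets)
```

Note that the sketch reports the two threshold checks separately from the final grade, since a sufficient weighted total does not compensate for an exam or assignment average below 4.5.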


A resit of the entire course is possible for those who have not passed the course. The resit is individual and covers the material of the entire course. This includes both the theoretical material from the IR books and practical knowledge of the systems we have worked with.

The resit is only open to those who obtained an average of at least 4.5 for the assignments.

