This page provides a list of datasets published by the DMIR group.
For research purposes, we offer a dataset of the BibSonomy database in form of an SQL dump to interested people. See here for more details.
The Bib100 Evaluation Dataset contains 100 pairs of English words along with human-assigned relatedness judgments. More details can be found here.
German Novel Dataset
The German Novel Dataset (GND) is a dataset of 270 sentences extracted from German novels, labelled by crowdsourcing for polarity and Plutchik's eight basic emotion. See here for more information.