This page provides a list of datasets published by the DMIR group.


    For research purposes, we offer a dataset of the BibSonomy database in the form of an SQL dump. See here for more details.


    The Bib100 Evaluation Dataset contains 100 pairs of English words along with human-assigned relatedness judgments. More details can be found here.

    German Novel Dataset

    The German Novel Dataset (GND) is a dataset of 270 sentences extracted from German novels, labelled via crowdsourcing for polarity and Plutchik's eight basic emotions. See here for more information.


