DMIR Research Group


    This page provides a list of datasets published by the DMIR group.


    For research purposes, we offer a dataset of the BibSonomy database in form of an SQL dump to interested people. See here for more details.


    The Bib100 Evaluation Dataset contains 100 pairs of English words along with human-assigned relatedness judgments. More details can be found here.

    German Novel Dataset

    The German Novel Dataset (GND) is a dataset of 270 sentences extracted from German novels, labelled by crowdsourcing for polarity and Plutchik's eight basic emotion. See here for more information.

    Social Media

    Andreas Hotho
    DMIR Research Group
    Am Hubland
    97074 Würzburg

    Tel.: +49 931 31-86731
    Fax: +49 931 31-86732

    Suche Ansprechpartner

    Hubland Süd, Geb. M2