    DMIR Research Group

    Publications by Andreas Hotho

    These publications are hosted by BibSonomy.

    Conditional Random Fields For Local Adaptive Reference Extraction

    Toepfer, Martin; Kluegl, Peter; Hotho, Andreas; Puppe, Frank. In Proceedings of LWA2010 - Workshop-Woche: Lernen, Wissen & Adaptivitaet, Atzmüller, Martin; Benz, Dominik; Hotho, Andreas; Stumme, Gerd (Eds.), Kassel, Germany, 2010.

    The accurate extraction of bibliographic information from scientific publications is an active field of research. Machine learning and sequence labeling approaches such as Conditional Random Fields (CRF) are often applied to this reference extraction task, but they still suffer from the ambiguity of reference notation. Reference sections follow a predefined style guide and contain only homogeneous references. Therefore, other references of the same paper or journal often provide evidence of how the fields of a reference should be labeled. We propose a novel approach that exploits these similarities within a document. Our process model uses information from unlabeled documents directly during the extraction task in order to adapt automatically to the perceived style guide. This is implemented by changing the manifestation of the features for the applied CRF. The experimental results show considerable improvements compared to the common approach. We achieve an average F1 score of 96.7% and an instance accuracy of 85.4% on the test data set.
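The idea of adapting features to the style of the surrounding document can be illustrated with a small sketch. This is not the authors' implementation: the function names, the punctuation set, and the single `is_doc_separator` feature are hypothetical and chosen only for illustration. The sketch assumes that each document is a list of tokenized references, guesses the document's field separator from all of its (unlabeled) references, and then lets that guess change the value of a token feature that a CRF could consume.

```python
from collections import Counter

# Hypothetical set of punctuation tokens that may act as field separators.
PUNCTUATION = {".", ",", ";", ":"}


def dominant_separator(references):
    """Return the most frequent punctuation token across all references
    of one document, or None if no punctuation token occurs."""
    counts = Counter(
        tok for ref in references for tok in ref if tok in PUNCTUATION
    )
    return counts.most_common(1)[0][0] if counts else None


def token_features(references):
    """Build one feature dict per token. The 'is_doc_separator' feature
    depends on the whole document, not only on the token itself."""
    sep = dominant_separator(references)
    featurized = []
    for ref in references:
        feats = []
        for tok in ref:
            feats.append({
                "lower": tok.lower(),
                "is_digit": tok.isdigit(),
                "is_punct": tok in PUNCTUATION,
                # Locally adapted feature: same token, different value
                # depending on the style observed in this document.
                "is_doc_separator": sep is not None and tok == sep,
            })
        featurized.append(feats)
    return featurized


# Two toy documents with different (made-up) reference styles.
doc_a = [
    ["Smith", ",", "J", ".", "Title", "one", ".", "2009", "."],
    ["Doe", ",", "A", ".", "Title", "two", ".", "2010", "."],
]
doc_b = [
    ["Smith", "J", ";", "Title", "one", ";", "2009"],
    ["Doe", "A", ";", "Title", "two", ";", "2010"],
]

features_a = token_features(doc_a)
features_b = token_features(doc_b)
```

In this toy setup the comma in `doc_a` is punctuation but not the document's dominant separator, while the semicolon in `doc_b` is. A sequence labeler trained on such features would see the same surface token represented differently depending on the reference section it came from, which is the general effect the abstract describes.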
