FoLiA in practice: The infrastructure of a linguistic annotation format

Maarten van Gompel, Ko van der Sloot, Martin Reynaert, Antal van den Bosch

    Research output: Chapter in Book/Report/Conference proceeding › Chapter › Scientific › peer-review

    Abstract

    We present an overview of the software and data infrastructure for FoLiA, a Format for Linguistic Annotation developed within the scope of the CLARIN-NL project and other projects. FoLiA aims to provide a single unified file format accommodating a wide variety of linguistic annotation types, preventing the proliferation of different formats for different annotation types. FoLiA is being developed in a bottom-up and practice-driven fashion. We have invested mainly in the creation of a rich infrastructure of tools that enable developers and end-users to work with the format. This work will present the current state of this infrastructure.
    Original language: English
    Title of host publication: CLARIN-NL in the Low Countries
    Editors: Jan Odijk, Arjan van Hessen
    Place of Publication: London
    Publisher: Ubiquity Press, London
    Chapter: 6
    Pages: 71-81
    Number of pages: 10
    ISBN (Electronic): 9781911529255, 9781911529262
    ISBN (Print): 9781911529248
    DOIs: https://doi.org/10.5334/bbi.6
    Publication status: Published - 28 Dec 2017

    Keywords

    • processing tools
    • data model
    • linguistic annotation
    • Natural language processing

    Cite this

    van Gompel, M., van der Sloot, K., Reynaert, M., & van den Bosch, A. (2017). FoLiA in practice: The infrastructure of a linguistic annotation format. In J. Odijk & A. van Hessen (Eds.), CLARIN-NL in the Low Countries (pp. 71-81). Ubiquity Press, London. https://doi.org/10.5334/bbi.6