Abstract
We present an overview of the software and data infrastructure for FoLiA, a Format for Linguistic Annotation developed within the scope of the CLARIN-NL project and other projects. FoLiA aims to provide a single unified file format accommodating a wide variety of linguistic annotation types, preventing the proliferation of different formats for different annotation types. FoLiA is being developed in a bottom-up and practice-driven fashion. We have invested mainly in the creation of a rich infrastructure of tools that enable developers and end-users to work with the format. This work will present the current state of this infrastructure.
Original language | English |
---|---|
Title of host publication | CLARIN-NL in the Low Countries |
Editors | Jan Odijk, Arjan van Hessen |
Place of Publication | London |
Publisher | Ubiquity Press, London |
Chapter | 6 |
Pages | 71-81 |
Number of pages | 10 |
ISBN (Electronic) | 9781911529255, 9781911529262 |
ISBN (Print) | 9781911529248 |
DOIs | |
Publication status | Published - 28 Dec 2017 |
Keywords
- processing tools
- data model
- linguistic annotation
- Natrural language processing