Abstract
The OpenSoNaR-CGN project set out to develop WhiteLab 2.0 for the online exploitation of the SoNaR-500 and CGN corpora. Important changes in comparison to the first version of WhiteLab are the addition of audio support and support for multiple corpora. The web interface has been redeveloped and adapted to accommodate these changes. At the backend, WhiteLab 2.0 comes with a new data importer and plugin for Neo4j, while also remaining compatible with BlackLab. Although performance of the new backend is not yet up to par with BlackLab, the investment in new technology that will likely be further developed is expected to make the application more future-proof and a great addition to the set of tools available to the humanities.
Original language | English |
---|---|
Title of host publication | CLARIN-NL in the Low Countries |
Editors | Jan Odijk, Arjan van Hessen |
Place of Publication | London |
Publisher | Ubiquity Press, London |
Chapter | 19 |
Pages | 231-243 |
Number of pages | 12 |
ISBN (Electronic) | 9781911529255 |
ISBN (Print) | 9781911529248 |
DOIs | |
Publication status | Published - 28 Dec 2017 |
Keywords
- Computer Sciences
- Computers and the Humanities
- Language and literature
- Linguistics
- Online corpora
- Dutch
- written language
- BlackLab