Abstract
This article presents a set of standardised corpora of poetry comprising over 330,000 poems in ten languages (Czech, English, French, German, Hungarian, Italian, Portuguese, Russian, Slovenian, and Spanish). Each corpus has been deduplicated, enriched with Universal Dependencies, provided with additional metadata, and converted into a unified JSON structure.
| Original language | English |
|---|---|
| Number of pages | 17 |
| Journal | Research Data Journal for the Humanities and Social Sciences |
| DOIs | |
| Publication status | Published - Sept 2024 |
Keywords
- poetry
- computational poetry
- corpus linguistics
- digital humanities
Title
PoeTree: Poetry Treebanks in Czech, English, French, German, Hungarian, Italian, Portuguese, Russian, Slovenian and Spanish