Abstract
We propose a method for automatically identifying rhetorical relations. We use supervised machine learning but exploit cue phrases to automatically extract and label training data. Our models draw on a variety of linguistic cues to distinguish between the relations. We show that these feature-rich models outperform the previously suggested bigram models by more than 20%, at least for small training sets. Our approach is therefore better suited to deal with relations for which it is difficult to automatically label a lot of training data because they are rarely signalled by unambiguous cue phrases (e.g., "continuation").
Original language | English |
---|---|
Title of host publication | Proceedings of Recent Advances in Natural Language Processing (RANLP-05) |
Place of Publication | Borovets, Bulgaria |
Publisher | Unknown Publisher |
Pages | 532-539 |
Number of pages | 8 |
Publication status | Published - 2005 |