Combining rule-based and machine learning methods for efficient information extraction from enforcement decisions

Research output: Chapter in Book/Report/Conference proceedingChapterScientificpeer-review

Abstract

This paper presents an effective and efficient approach for automatic extraction of key features from enforcement decisions, such as their legal basis and their legal effect, by strategically applying a Large Language Model (LLM) on top of rule-based methods. Initially, rule-based methods identify candidate sentences within these decisions containing these features, after which these sentences are analyzed by GPT-3.5 to extract the features. This approach is efficient as it reduces the input and number of resources needed for effective and context aware information extraction. Furthermore, other features that have not been subject to a rule-based selection first can be extracted by an LLM from the same set of candidate sentences when they exist in close proximity of each other.
Original languageEnglish
Title of host publicationJURIX 2024
Subtitle of host publicationThe thirty-seventh annual conference, Brno, Czech Republic, 11-13 December 2024
EditorsJaromir Savelka, Jakub Harasta, Tereza Novotna, Jakub Misek
PublisherIOS Press
Pages321-326
Number of pages6
Volume395
ISBN (Electronic)9781643685625
DOIs
Publication statusPublished - 5 Dec 2024
Event37th International Conference on Legal Knowledge and Information Systems​ - Faculty of Law, Masaryk University, Brno, Czech Republic
Duration: 11 Dec 202413 Dec 2024
https://jurix2024.law.muni.cz/

Publication series

NameFrontiers in Artificial Intelligence and Applications

Conference

Conference37th International Conference on Legal Knowledge and Information Systems​
Country/TerritoryCzech Republic
CityBrno
Period11/12/2413/12/24
Internet address

Keywords

  • information extraction
  • enforcement decisions
  • rule-based methods
  • machine learning methods
  • large language model
  • text segmentation

Fingerprint

Dive into the research topics of 'Combining rule-based and machine learning methods for efficient information extraction from enforcement decisions'. Together they form a unique fingerprint.

Cite this