In several data-centric application domains, the need arises to extract valuable information from unstructured text documents. The recent paradigm of Ontology Mediated Information Extraction (OMIE) faces this problem by taking into account the knowledge expressed by a domain ontology, and reasoning over it to improve the quality of extracted data. MASTRO SYSTEM-T is a novel tool for OMIE, developed by Sapienza University and IBM Almaden Research. In this work, we demonstrate its usage for information extraction over real-world financial text documents from the U.S. EDGAR system.
2020, Proceedings of the ISWC 2020 Demos and Industry Tracks, Pages 256-261 (volume: 2721)
Ontology Mediated Information Extraction with MASTRO SYSTEM-T (04b Atto di convegno in volume)
Lembo Domenico, Li Yunyao, Popa Lucian, Qian Kun, Scafoglieri Federico
Gruppo di ricerca: Artificial Intelligence and Knowledge Representation, Gruppo di ricerca: Data Management and Semantic Technologies