View article

[PDF] from psu.edu

The HiLeX System for Semantic Information Extraction

Authors

Marco Manna, Ermelinda Oro, Massimo Ruffolo, Mario Alviano, Nicola Leone

Publication date

2012

Journal

Transactions on Large-Scale Data- and Knowledge-Centered Systems V

Pages

91-125

Publisher

Springer Berlin / Heidelberg

Description

The explosive growth and popularity of the Web has resulted in a huge amount of digital information sources on the Internet. Unfortunately, such sources only manage data, rather than the knowledge they carry. Recognizing, extracting, and structuring relevant information according to their semantics is a crucial task. Several approaches in the field of Information Extraction (IE) have been proposed to support the translation of semi-structured/unstructured documents into structured data or knowledge. Most of them have a high precision but, since they are mainly syntactic, they often have a low recall, are dependent on the document format, and ignore the semantics of information they extract. In this paper, we describe a new approach for semantic information extraction that could represent the basis for automatically extracting highly structured data from unstructured web sources without any undesirable trade …

Total citations

Cited by 19

201120122013201420156 1 3 4 5

Scholar articles

The HL ε X System for Semantic Information Extraction

M Manna, E Oro, M Ruffolo, M Alviano, N Leone - Transactions on Large-Scale Data-and Knowledge …, 2012