Authors
Mrinmaya Sachan, Avinava Dubey, Eduard H Hovy, Tom M Mitchell, Dan Roth, Eric P Xing
Publication date
2020/1
Journal
Computational Linguistics
Volume
45
Issue
4
Pages
627-665
Publisher
MIT Press
Description
To ensure readability, text is often written and presented with due formatting. These text formatting devices help the writer to effectively convey the narrative. At the same time, these help the readers pick up the structure of the discourse and comprehend the conveyed information. There have been a number of linguistic theories on discourse structure of text. However, these theories only consider unformatted text. Multimedia text contains rich formatting features that can be leveraged for various NLP tasks. In this article, we study some of these discourse features in multimedia text and what communicative function they fulfill in the context. As a case study, we use these features to harvest structured subject knowledge of geometry from textbooks. We conclude that the discourse and text layout features provide information that is complementary to lexical semantic information. Finally, we show that the harvested …
Total citations
20192020202120222023202417585
Scholar articles
M Sachan, A Dubey, EH Hovy, TM Mitchell, D Roth… - Computational Linguistics, 2020