1 |
Learning a Domain Ontology from Hierarchically Structured Texts
|
|
|
|
In: http://www.gelbukh.com/CV/Publications/2005/ICML-2005-ontology.pdf (2005)
|
|
Abstract:
Any scientific or technical document is organized hierarchically: some sections of the text (such as the abstract or conclusions) summarize the contents of the main text; sections have titles describing their contents in general words; chapter titles describe the contents of a set of sections; book title describes the contexts of all chapters, etc. Moreover, whole collections of scientific documents are usually organized hierarchically: e.g., papers are organized in journals, conferences, etc., which in turn have their own titles. We exploit this hierarchical structure to learn a lexical ontology, in which subordination relationships roughly mirror those between the texts and titles in which these words occur: words occurring in more general titles subordinate the words occurring in the texts described by these titles. 1.
|
|
URL: http://www.gelbukh.com/CV/Publications/2005/ICML-2005-ontology.pdf http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.153.238
|
|
BASE
|
|
Hide details
|
|
|
|