
A hybrid approach to domain-independent taxonomy learning

Els Lefever (UGent)
(2016) APPLIED ONTOLOGY. 11(3). p.255-278
Abstract
Creating domain ontologies is usually performed by teams of knowledge engineers and domain experts, and is considered to be a time-consuming and difficult task. As a result, scientists have started to develop automatic approaches to ontology learning and population. For the proposed research, we focus on the central subtask of ontology learning, being the hypernym detection task, where the system has to detect hierarchical semantic relationships, i.e. hypernym–hyponym relationships, between domain-specific terms, resulting in a domain-specific taxonomy. We propose in this paper a hybrid approach to automatic taxonomy learning, which combines a data-driven and a knowledge-based component. The data-driven component is composed of a lexico-syntactic pattern-based module, a morpho-syntactic analyzer and a distributional model, whereas the knowledge-based component extracts structured semantic information from the Linked Open Data cloud (DBpedia) and WordNet. The proposed methodology has been applied to three different knowledge domains, viz. food, equipment and science. A thorough quantitative and qualitative evaluation has shown promising results for all considered test domains. In addition, the results show a clear contribution of all different modules to the automatic taxonomy learning task. Although there is still room for improvement for all different modules, our approach outperforms state-of-the-art systems that participated in the SemEval “Taxonomy Extraction Evaluation” task when it comes to comparing the automatically constructed taxonomy against a manually verified gold standard taxonomy. As all modules are run automatically, the system provides a flexible and domain-independent approach to automatic taxonomy learning and could be an important step in solving the knowledge acquisition bottleneck in ontology learning.
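To make the idea of a lexico-syntactic pattern-based module more concrete, the following is a minimal sketch of a single "such as" pattern that yields hyponym-hypernym pairs from raw text. It is an illustration only and not the implementation described in the paper: the pattern, the single-word restriction on terms, and the names `SUCH_AS` and `extract_hypernym_pairs` are all assumptions made for this example.

```python
import re

# Hypothetical pattern: "<hypernym> such as <hyponym>, <hyponym> and <hyponym>".
# Terms are limited to single words to keep the sketch small; a real system
# would work on multi-word, domain-specific terms.
SUCH_AS = re.compile(r"\b(\w+) such as ((?:\w+, )*\w+(?: (?:and|or) \w+)?)")


def extract_hypernym_pairs(text):
    """Return (hyponym, hypernym) pairs matched by the 'such as' pattern."""
    pairs = []
    for match in SUCH_AS.finditer(text):
        hypernym = match.group(1)
        # Split the enumeration on its separators to get each hyponym.
        hyponyms = re.split(r", | and | or ", match.group(2))
        pairs.extend((hyponym, hypernym) for hyponym in hyponyms)
    return pairs


if __name__ == "__main__":
    sentence = "The menu lists desserts such as cake, pudding and sorbet."
    for hyponym, hypernym in extract_hypernym_pairs(sentence):
        print(f"{hyponym} -> {hypernym}")
```

A single surface pattern like this is precise but finds few relations, which is one plausible reason to combine it with morpho-syntactic, distributional and knowledge-based evidence as the abstract describes.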
Keywords
hypernym detection, Taxonomy learning

Citation

MLA
Lefever, Els. “A Hybrid Approach to Domain-Independent Taxonomy Learning.” APPLIED ONTOLOGY, vol. 11, no. 3, IOS PRESS, 2016, pp. 255–78, doi:10.3233/AO-160170.
APA
Lefever, E. (2016). A hybrid approach to domain-independent taxonomy learning. APPLIED ONTOLOGY, 11(3), 255–278. https://doi.org/10.3233/AO-160170
Chicago author-date
Lefever, Els. 2016. “A Hybrid Approach to Domain-Independent Taxonomy Learning.” APPLIED ONTOLOGY 11 (3): 255–78. https://doi.org/10.3233/AO-160170.
Chicago author-date (all authors)
Lefever, Els. 2016. “A Hybrid Approach to Domain-Independent Taxonomy Learning.” APPLIED ONTOLOGY 11 (3): 255–278. doi:10.3233/AO-160170.
Vancouver
1.
Lefever E. A hybrid approach to domain-independent taxonomy learning. APPLIED ONTOLOGY. 2016;11(3):255–78.
IEEE
[1]
E. Lefever, “A hybrid approach to domain-independent taxonomy learning,” APPLIED ONTOLOGY, vol. 11, no. 3, pp. 255–278, 2016.
BibTeX
@article{8123723,
  abstract     = {{Creating domain ontologies is usually performed by teams of knowledge engineers and domain experts, and is considered to be a time-consuming and difficult task. As a result, scientists have started to develop automatic approaches to ontology learning and population. For the proposed research, we focus on the central subtask of ontology learning, being the hypernym detection task, where the system has to detect hierarchical semantic relationships, i.e. hypernym–hyponym relationships, between domain-specific terms, resulting in a domain-specific taxonomy. We propose in this paper a hybrid approach to automatic taxonomy learning, which combines a data-driven and a knowledge-based component. The data-driven component is composed of a lexico-syntactic pattern-based module, a morpho-syntactic analyzer and a distributional model, whereas the knowledge-based component extracts structured semantic information from the Linked Open Data cloud (DBpedia) and WordNet. The proposed methodology has been applied to three different knowledge domains, viz. food, equipment and science. A thorough quantitative and qualitative evaluation has shown promising results for all considered test domains. In addition, the results show a clear contribution of all different modules to the automatic taxonomy learning task. Although there is still room for improvement for all different modules, our approach outperforms state-of-the-art systems that participated in the SemEval “Taxonomy Extraction Evaluation” task when it comes to comparing the automatically constructed taxonomy against a manually verified gold standard taxonomy. As all modules are run automatically, the system provides a flexible and domain-independent approach to automatic taxonomy learning and could be an important step in solving the knowledge acquisition bottleneck in ontology learning.}},
  author       = {{Lefever, Els}},
  issn         = {{1570-5838}},
  journal      = {{APPLIED ONTOLOGY}},
  keywords     = {{hypernym detection,Taxonomy learning}},
  language     = {{eng}},
  number       = {{3}},
  pages        = {{255--278}},
  publisher    = {{IOS PRESS}},
  title        = {{A hybrid approach to domain-independent taxonomy learning}},
  url          = {{http://doi.org/10.3233/AO-160170}},
  volume       = {{11}},
  year         = {{2016}},
}
