Ontology-based Automatic Semantic Annotation for Named Entity Disambiguation

J.-H. Choi and Y.-T. Park (Korea)


Ontology, named entity, disambiguation, and HMM


The vision of Semantic Web can be realized when there are masses of machine-processable semantic metadata. Manual construction of metadata is not feasible, methods for automated semantic annotation have been developed. Semantic annotation is the process to identify the most appropriate semantic tagging to entities. The key challenge in automatic semantic annotation is resolving ambiguities in identifying semantic tagging. We propose the OntoNEO (Ontology-based semantic Named Entity disambiguatiOn), which is a new named entity disambiguation algorithm, based on Hidden Markov Model (HMM) and semantic contexts. OntoNEO employs HMM to represent probabilistic model of a named entities among various sentences. For each named entity in ontology, we build a probabilistic model using HMM from corpus of documents. The sequence of named entities are generated using HMM. The probabilistic models of the named entity have been created considering the sequence, and then it used to resolve ambiguity of a named entity for semantic tagging from the perspective of sequential probability. We generate appropriate semantic contexts from a text to understand the meaning of a semantically ambiguous named entity and solve the problem of ambiguities during semantic annotation by searching the optimized Markov Model corresponding to the sequence of words included in the semantic contexts. Based on our OntoNEO, ubiquitous applications can reason about named entity free of contradictions. Compared to SemTag algorithm, our system has an improved performance by about 18%.

Important Links:

Go Back