MAISA - Maintenance of semantic annotations

Abstract : Semantic annotations are often used in a wide range of applications ranging from information retrieval to decision support. Annotations are produced through the association of concept labels from Knowledge Organization System (KOS), i.e. ontology, thesaurus, dictionaries, with pieces of digital information, e.g. images or texts. Annotations enable machines to interpret, link, and use a vast amount of data. However, the dynamic nature of KOS may affect annotations each time a new version of a KOS is released. New concepts can be added, obsolete ones removed and the definition of existing concepts may be refined through the modification of their labels/properties. As a result, many annotations can lose their relevance, thus hindering the intended use and exploitation of annotated data. To solve this problem, methods to maintain the annotations up-to-date are required. In this thesis we propose a framework called MAISA to tackle the problem of adapting outdated annotations when the KOS utilized to create them change. We distinguish two different cases. In the first one we consider that annotations are directly modifiable. In this case, we proposed a rule-based approach implementing information derived from the evolution of KOS as well as external knowledge from the Web. In the second case, we consider that the annotations are not modifiable. The goal is then to keep the annotated documents searchable even if the annotations are produced with a given KOS version but the user used another version to query them. In this case, we designed a knowledge graph that represent a KOS and its successive evolution and propose a method to extract the history of a concept and add the gained label to the initial query allowing to deal with annotation evolution. We experimentally evaluated MAISA on realistic cases-studies built from four well-known biomedical KOS: ICD-9-CM, MeSH, NCIt and SNOMED CT. We show that the proposed maintenance method allow to maintain semantic annotations using standard metrics.
Complete list of metadatas

Cited literature [134 references]  Display  Hide  Download

https://tel.archives-ouvertes.fr/tel-02288589
Contributor : Abes Star <>
Submitted on : Sunday, September 15, 2019 - 1:01:28 AM
Last modification on : Wednesday, September 18, 2019 - 9:31:29 AM

File

73421_CARDOSO_2018_archivage.p...
Version validated by the jury (STAR)

Identifiers

  • HAL Id : tel-02288589, version 1

Citation

Silvio Domingos Cardoso. MAISA - Maintenance of semantic annotations. Artificial Intelligence [cs.AI]. Université Paris-Saclay, 2018. English. ⟨NNT : 2018SACLS338⟩. ⟨tel-02288589⟩

Share

Metrics

Record views

165

Files downloads

94