Mapping between the OBO and OWL ontology languages
MetadataShow full item record
Background: Ontologies are commonly used in biomedicine to organize concepts to describe domains such as anatomies, environments, experiment, taxonomies etc. NCBO BioPortal currently hosts about 180 different biomedical ontologies. These ontologies have been mainly expressed in either the Open Biomedical Ontology (OBO) format or the Web Ontology Language (OWL). OBO emerged from the Gene Ontology, and supports most of the biomedical ontology content. In comparison, OWL is a Semantic Web language, and is supported by the World Wide Web consortium together with integral query languages, rule languages and distributed infrastructure for information interchange. These features are highly desirable for the OBO content as well. A convenient method for leveraging these features for OBO ontologies is by transforming OBO ontologies to OWL. Results: We have developed a methodology for translating OBO ontologies to OWL using the organization of the Semantic Web itself to guide the work. The approach reveals that the constructs of OBO can be grouped together to form a similar layer cake. Thus we were able to decompose the problem into two parts. Most OBO constructs have easy and obvious equivalence to a construct in OWL. A small subset of OBO constructs requires deeper consideration. We have defined transformations for all constructs in an effort to foster a standard common mapping between OBO and OWL. Our mapping produces OWL-DL, a Description Logics based subset of OWL with desirable computational properties for efficiency and correctness. Our Java implementation of the mapping is part of the official Gene Ontology project source. Conclusions: Our transformation system provides a lossless roundtrip mapping for OBO ontologies, i.e. an OBO ontology may be translated to OWL and back without loss of knowledge. In addition, it provides a roadmap for bridging the gap between the two ontology languages in order to enable the use of ontology content in a language independent manner.
Syed Hamid Tirmizi, Juan Sequeda, and Daniel P. Miranker are with the Department of Computer Science, The University of Texas at Austin, Austin, Texas 78701, USA -- Stuart Aitken is with the Artificial Intelligence Applications Institute, The University of Edinburgh, Edinburgh EH8 9LE, UK and the Informatics Life-Sciences Institute, The University of Edinburgh, Edinburgh EH8 9LE, UK -- Dilvan A. Moreira is with the Department of Computer Science, Mathematics and Computing Institute, University of São Paulo, São Carlos, São Paulo, Brazil -- Chris Mungall is with the Lawrence Berkeley National Laboratory, Berkeley, California 94720, USA -- NIgam H. Shah is with the Center for Biomedical Informatics Research, School of Medicine, Stanford University, Stanford, California 94305, USA -- Daniel P. Miranker is with the Institute for Cell and Molecular Biology, The University of Texas at Austin, Austin, Texas 78701, USA