Sujatha Ph D Abstract

Realizing the Semantic Web using XML and OWL ontologies

Ph D Thesis, Sujatha R Upadhyaya, July 2007

In the Semantic Web, the emphasis lies on associating semantics with data.

This process, often known as the semantic annotation of data, is the starting

point for making the web applications inter-operate. Semantic interoperability

falls in place, if the data on the web is described using standard vocabulary.

Ontologies come into this picture as the sources of such standard

vocabulary. The other task expected to be accomplished with ontologies

in the context of the Semantic Web is that, they must form the logical

reasoning support for deriving inferences from data.

As of today, we have few standard domain ontologies that can be employed

for annotating data on the web. Keeping this mind, this thesis examines

the methodologies for building domain ontologies, for employing them in

semantic annotation of data and for building systems that can reason with

ontologies. We accomplish each of these tasks by building appropriate

tools that help web applications to migrate to the Semantic Web and bene-

fit from logical reasoning. Our methodologies are particularly applicable to

the structured web content that appears as XML Databases and not to the

kind of data that appears as plain text. We represent ontologies in OWL,

Web Ontology Language, the only recommended standard is for ontology

representation.

To begin with, this thesis investigates the means for building ontologies

to provide standard vocabulary for describing the data on the web. For

marking an advancement in attempts to build ontologies from structured

data, this thesis proposes a method of building ontologies from Entity

Relationship models and Extended Entity Relation models. It discusses, also

the implementation of a tool named ERONTO that builds OWL ontologies

from E/R and EE/R models. The result is particularly significant as E/R

and EE/R form the richer source of domain knowledge compared to relational

schema and data, which were used in previous attempts.

Employing ontologies effectively for annotating one’s data, is rather difficult

because of its complexity of representation. In this direction, this thesis explores

the methods of making ontologies easily employable by XML users.

We proposed a procedure for presenting the vocabulary in OWL ontologies

in XML Schema. With this procedure, we built a tool called OntoSchema.

The output schema of OntoSchema is particularly helpful in choosing the

right term in the right context for describing one’s data. With this effort,

we studied the expressiveness of two language standards namely OWL and

XML schema and suggested the limit of expressiveness of XML Schema.

Reasoners form the inference engines in the Semantic Web. Querying with

a reasoner could be time intensive especially with large ontologies. This

thesis examines the possibilities making quick inferences by storing the

ontology in a suitable form. We built a data structure called Knowledge

Table, which intuitively stores the constraints on properties as applicable

to concept descriptions. With Knowledge Table, typical reasoning tasks are

reduced to table look-ups. It provides a mechanism for interpreting the semantics

of OWL constructs. However, we restrict the expressiveness of the

language by specifying certain rules. We demonstrate the use of Knowledge

table for identifying the instances of ontology concepts, represented as XML

data. We also demonstrate its use in enhancing ontologies by adding new

concepts. By employing the knowledge table in instance identification and

ontology enhancement we illustrate how to perform the general reasoning

tasks such as, checking subsumption, disjointness, satisfiability and finding

the most specific concept of an individual.