Semantic Web technologies and SKOS

Oreste Signore, <oreste.signore@w3.org>


SW logo ATHENA WP4 SKOS Workshop
Rome, ICCU, 16-17 July 2009

Slides: http://www.w3c.it/talks/2009/athena/
Pdf version: [ single page] [ two pages per sheet]

Cover page Weblab Logo Valid XHTML 1.0! Valid CSS!

Acknowledgements

Contents

Cultural heritage applications: some issues

Data integration: the rough picture

esportazione dei dati semplificata

The Rough Structure of Data Integration

  1. Map the various data onto an abstract data representation
    • make the data independent of its internal representation…
  2. Merge the resulting representations
  3. Start making queries on the whole!
    • queries that could not have been done on the individual data sets

Data integration: role of technologies

integrazione dei dati con indicazione delle tecnologie

A Simplified Bookstore Data (Dataset "A")

Table Book
ID Author Title Publisher Year
ISBN 0-00-651409-X id_xyz The Glass Palace id_qpr 2000
Table Author
ID Name Home page
id_xyz Amitav Ghosh http://www.amitavghosh.com/
Table Publisher
ID Publisher Name City
id_qpr Harper Collins London

1st Step: Export Your Data as a Set of Relations

The previous table in an RDF format

Some Notes on the Exporting the Data

Another Bookstore (dataset "F")

Table Livre
ID Titre Auteur Traducteur Original
ISBN 2020386682 Le Palais des miroirs i_abc i_qrs ISBN 0-00-651409-X
Table Auteur
ID Nom
i_abc Amitav Ghosh
i_qrs Christiane Besse

2nd Step: Export Your Second Set of Data

The French data in RDF

3rd Step: Start Merging Your Data

The French and English data side by side

3rd Step: Start Merging Your Data (cont.)

The merged data with nodes with identical URI-s pointed out

3rd Step: Merge Identical Resources

The merged data with one of the nodes merged with common URI

Start Making Queries…

The merged data with one of the nodes merged with common URI

However, More Can Be Achieved…

3rd Step Revisited: Use the Extra Knowledge

The merged data with extra nodes identified as a result of identifying same as properties

Start Making Richer Queries!

The merged data with one of the nodes merged with common URI

Is That Surprising?

What Did We Do?

It Could Become Even More Powerful

So Where is the Semantic Web?

The Semantic Web provides technologies to make such integration possible!
(hopefully you get a full picture at the end of the tutorial…)

The Semantic Web “layer cake"”

immagine dei livelli del semantic web
Semantic Web is ...
  • a metadata based infrastructure for reasoning on the Web
  • an extension, not a replacement of the current web
Metadata
  • machine understandable” information
  • shared vocabularies (ontologies)
  • a shared data model
Technological standards
  • RDF, OWL, SKOS,…

…just a technical aspect

Basic RDF

Resource Description Framework (RDF) is for Semantic Web what HTML has been for the Web

immagine dei livelli del semantic web immagine dei livelli del semantic web

RDF Triples

RDF Triples (cont.)

(<http://…isbn 6682>, <http://…/original>, <http://…isbn 409X>)

And that's it! (simple, isn't it? winking smiley)

RDF Triples (cont.)

A Simple RDF Example (in RDF/XML)

A Simple RDF Graph with full URI-s
<rdf:Description rdf:about="http://…/isbn/2020386682">
    <f:titre xml:lang="fr">Le palais des mirroirs</f:titre>
    <f:original rdf:resource="http://…/isbn/000651409X"/>
</rdf:Description>

(Note: namespaces are used to simplify the URI-s)

A Simple RDF Example (in Turtle)

A Simple RDF Graph with full URI-s
<http://…/isbn/2020386682>
    f:titre "Le palais des mirroirs"@fr;
    f:original <http://…/isbn/000651409X>.

Few Words About Syntax

URI-s Play a Fundamental Role

RDFSchema

immagine dei livelli del semantic web immagine dei livelli del semantic web

Why we need RDF Schema?

Why we need RDF Schema? (cont.)

Classes, Resources, …

A slide showing an inferred property

Classes, Resources, … (cont.)

Let's return to our example…

Classes, Resources in RDF(S)

A slide showing the book with its own Schema and the RDFS entitites, all merged

Schema Example in RDF/XML

Inferred properties

A slide showing an inferred property

Properties

Some Predefined Classes (Collections, Containers)

Collections (in RDF/XML and Turtle)

<rdf:Description rdf:about="#Inventory">
            <a:consistsOf rdf:parseType="Collection">
                <rdf:Description rdf:about="http://.../isbn/000651409X"/>
                <rdf:Description rdf:about="http://.../isbn/XXXX"/>
                <rdf:Description rdf:about="http://.../isbn/YYYY"/>
            </axsvg:consistsOf>
        </rdf:Description>
        
       :Inventory axsvg:consistsOf (<http://.../isbn/000651409X> <http://.../isbn/XXXX> …).
       
Example for RDF Lists, our own abbreviated form

Ontologies (OWL)

SW logo immagine dei livelli del semantic web

Ontologies

There is a need to support ontologies on the Semantic Web

What is an ontology?

Jim Hendler

A set of knowledge terms, including the vocabulary, the semantic interconnections and some simple rules of inference and logic for some particular topic

Studer et al. (1998)

An ontology is a formal, explicit specification of a shared conceptualisation.
A 'conceptualisation' refers to an abstract model of some phenomenon in the world by having identified the relevant concepts of that phenomenon.
'Explicit' means that the type of concepts used, and the constraints on their use are explicitly defined. For example, in medical domains, the concepts are diseases and symptoms, the relations between them are causal and a constraint is that a disease cannot cause itself.
'Formal' refers to the fact that the ontology should be machine readable, which excludes natural language.
'Shared' reflects the notion that an ontology captures consensual knowledge, that is, it is not private to some individual, but accepted by a group.

Many definitions, but…

consensus among the ontology community

Ontologies are on the Web

OWL: three sublanguages

OWL logo
OWL Lite
supports those users primarily needing a classification hierarchy and simple constraints. Provides a quick migration path for thesauri and other taxonomies. Owl Lite also has a lower formal complexity than OWL DL
OWL DL
supports those users who want the maximum expressiveness while retaining computational completeness (all conclusions are guaranteed to be computable) and decidability (all computations will finish in finite time)
OWL Full
for users who want maximum expressiveness and the syntactic freedom of RDF with no computational guarantees
Scarcely probable it will be entirely supported by software tools implementing reasoning.

Classes in OWL

OWL Resource and Thing

Object and Datatype Property

owl:ObjectProperty
  • relate objects to other objects
owl:DatatypeProperty
  • relate objects to datatype values (e.g. phoneNumber, name, birthDate, etc.)
  • its range are typed literals
  • no predefined types
  • we can use XML Schema data types (layered Semantic Web architecture)

Property Characterization

The top level OWL property definitions

Property Characterization (cont.)

owl:minCardinality
owl:maxCardinality
owl:SymmetricProperty
owl:TransitiveProperty
owl:FunctionalProperty
  • has at most one value for each object (e.g. birthDate, name)
owl:InverseFunctionalProperty
  • two different objects cannot have the same value (e.g. isTheSocialSecurityNumberFor etc.)

Property Characterization: an example

example of symmetric property

From: Antoine Isaac (with Guus Schreiber): Publishing Vocabularies on the Web. NETTAB 2007 workshop on A Semantic Web for Bioinformatics: Goals, Tools, Systems, Applications. Pisa, Italy, June 14, 2007.
[ Slides]

Property Characterization: another example

example of inverse property

From: Antoine Isaac (with Guus Schreiber): Publishing Vocabularies on the Web. NETTAB 2007 workshop on A Semantic Web for Bioinformatics: Goals, Tools, Systems, Applications. Pisa, Italy, June 14, 2007.
[ Slides]

An example in OWL

The statement
The painting of the Sistine Chapel was carried out by Michelangelo Buonarroti
Abstracting from the statement
  • The painting of the Sistine Chapel (the subject) is an (instance of) activity
  • carried out by is a predicate
  • Michelangelo Buonarroti is an (instance of) Person
In OWL (conceptually)
the paintingOfSistineChapel (E7.Activity) was carried_out_by (P14F) MichelangeloBuonarroti (E21.Person)
In OWL (graphically)
In OWL (XML Syntax)

Thesauri and SKOS

Thesaurus vs ontology

(from: Fausto Giunchiglia and Ilya Zaihrayeu: LIGHTWEIGHT ONTOLOGIES - October 2007 - Technical Report DIT-07-071)

Thesaurus
  • can be represented as a graph
  • nodes = thesaurus terms
  • edges = semantic associations

Faceted thesauri

Faceted thesauri belong to the family of KOS, which has been used by the library community in modelling for purposes associated with information retrieval applications. They provide a semantic structure at a suitable granularity for the general problem of search and retrieval. In such applications, where a fuzzy notion of “aboutness“ is the basis for indexing or classifying a document, as opposed to an assertion of fact, the lightweight semantics of faceted thesauri and related KOS may be more suited than the formal semantics provided by AI ontologies, designed for precisely modelling the objects of interest in a domain. The SKOS standard representation, combined with other developments in standard identifiers and service protocols, now affords the combination of formal syntax and informal semantics, in Semantic Web applications and online applications generally. This offers a cost effective approach for annotation, search and browsing oriented applications that don't require first order logic.

(Douglas Tudhope & Ceri Binding: Faceted Thesauri, Axiomathes (2008) 18:211–222 DOI DOI 10.1007/s10516-008-9031-6)

From thesauri to ontologies

Simple Knowledge Organization System (SKOS)

Example: Entries in a Glossary (1)

“Assertion”
“(i) Any expression which is claimed to be true. (ii) The act of claiming something to be true.”
“Class”
“A general concept, category or classification. Something used primarily to classify or categorize other things.”
“Resource”
“(i) An entity; anything in the universe. (ii) As a class name: the class of everything; the most inclusive category possible.”

(from the RDF Semantics Glossary)

Example: Entries in a Glossary (2)

A slide for a simple SKOS glossary

Example: Thesaurus (1)

Term
Economic cooperation
Used For
Economic co-operation
Broader terms
Economic policy
Narrower terms
Economic integration, European economic cooperation, …
Related terms
Interdependence
Scope Note
Includes cooperative measures in banking, trade, …

(from UK Archival Thesaurus)

Example: Thesaurus (2)

A slide for a simple SKOS thesaurus

SKOS example: multilingual labels for concepts

example of symmetric property

From: Antoine Isaac (with Guus Schreiber): Publishing Vocabularies on the Web. NETTAB 2007 workshop on A Semantic Web for Bioinformatics: Goals, Tools, Systems, Applications. Pisa, Italy, June 14, 2007.
[ Slides]

SKOS example: collections

example of symmetric property

From: Antoine Isaac (with Guus Schreiber): Publishing Vocabularies on the Web. NETTAB 2007 workshop on A Semantic Web for Bioinformatics: Goals, Tools, Systems, Applications. Pisa, Italy, June 14, 2007.
[ Slides]

Why Having SKOS and OWL?

Conclusion

SKOS: “One small step for man. One giant leap for mankind.”?
(Photo Credits)

Thanks for your attention

Questions?

... and answers


If it isn't on the Web it doesn't exist ...

... you will find on the office site (http://www.w3c.it/)
the slides (http://www.w3c.it/talks/2009/athena/)