Rome Kres Hackathon

From STLab

(Difference between revisions)
Jump to: navigation, search
Current revision (07:05, 15 April 2011) (view source)
m (Text replace - '150.146.88.63' to 'wit.istc.cnr.it')
 
(20 intermediate revisions not shown.)
Line 1: Line 1:
Link to the work flow: http://stlab.istc.cnr.it/documents/events/kres-hackathon/rome-kres-hackaton-2.png
Link to the work flow: http://stlab.istc.cnr.it/documents/events/kres-hackathon/rome-kres-hackaton-2.png
-
[[Image:Board.jpg|380px]]
 
-
==1)Detect dbpedia entities==
 
-
===Load enahcnements :: /store===
+
Link to the photogallery: [[KReSHackathonRomeGallery]]
 +
<center>[[Image:Board.jpg|240px]]</center>
 +
 
 +
LOD resources endpoint is : http://wit.istc.cnr.it:8893/sparql
 +
 
 +
Stanbol instance at : http://wit.istc.cnr.it:9292/
 +
 
 +
== Detect dbpedia entities ==
 +
 
 +
===Load enahcnements :: <stanbol>/store===
# Create a file with my example text:  
# Create a file with my example text:  
#* movie.txt
#* movie.txt
#* "Una nuova missione per l'agente Ethan Hunt ( Tom Cruise ) che, questa volta, dovrà  cercare di fermare la diffusione di Chimera, un virus geneticamente modificato che produce effetti letali sul sistema nervoso dell'essere umano, capace di mettere in ginocchio il mondo. Al fianco di Ethan il genio dell'informatica Luther Stickell ( Ving Rhames ) e l'affascinante ladra internazionale Nyah Nordoff-Hall (Thandie Newton). Il nemico da combattere, invece, è il perfido Sean Ambrose ( Dougray Scott ) il cui piano è di vendere su scala mondiale al miglior offerente l'antidoto al virus."  
#* "Una nuova missione per l'agente Ethan Hunt ( Tom Cruise ) che, questa volta, dovrà  cercare di fermare la diffusione di Chimera, un virus geneticamente modificato che produce effetti letali sul sistema nervoso dell'essere umano, capace di mettere in ginocchio il mondo. Al fianco di Ethan il genio dell'informatica Luther Stickell ( Ving Rhames ) e l'affascinante ladra internazionale Nyah Nordoff-Hall (Thandie Newton). Il nemico da combattere, invece, è il perfido Sean Ambrose ( Dougray Scott ) il cui piano è di vendere su scala mondiale al miglior offerente l'antidoto al virus."  
-
# curl -i -X PUT -H "Content-Type:text/plain" -T movie1.txt http://150.146.88.63:9292/store/content/movie1.txt
+
# curl -i -X PUT -H "Content-Type:text/plain" -T movie1.txt http://wit.istc.cnr.it:9292/store/content/movie1.txt
-
# Result: http://150.146.88.63:9292/store/page/movie1.txt
+
# Result: http://wit.istc.cnr.it:9292/store/page/movie1.txt
-
===Select entity references :: /sparql===
+
===Select entity references :: <lod-resource>/sparql===
-
  select ?z from <prova_dataset> where{
+
  prefix rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>
 +
prefix rdfs: <http://www.w3.org/2000/01/rdf-schema#>
 +
prefix owl: <http://www.w3.org/2002/07/owl#>
 +
prefix xsd: <http://www.w3.org/2001/XMLSchema#>  
   
   
-
  ?x <http://fise.iks-project.eu/ontology/entity-type>  <http://dbpedia.org/ontology/Actor> .
+
  select distinct
 +
  ?z
 +
WHERE
 +
{
 +
    ?x <http://fise.iks-project.eu/ontology/extracted-from>  <http://wit.istc.cnr.it:9292/store/content/movie1.txt> .
 +
    ?x <http://fise.iks-project.eu/ontology/entity-reference> ?z
 +
}
-
?x <http://fise.iks-project.eu/ontology/entity-reference> ?z}
+
OR
-
curl -L -v -H "Accept:text/rdf+nt" http://150.146.88.63:8893/sparql?query=select+%3Fz+from%3Cprova_dataset%3E+where%7B%0D%0A%0D%0A%3Fx+%3Chttp%3A%2F%2Ffise.iks-project.eu%2Fontology%2Fentity-type%3E+%3Chttp%3A%2F%2Fdbpedia.org%2Fontology%2FActor%3E+.%0D%0A%0D%0A%3Fx+%3Chttp%3A%2F%2Ffise.iks-project.eu%2Fontology%2Fentity-reference%3E+%3Fz%7D
+
prefix rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>
 +
prefix rdfs: <http://www.w3.org/2000/01/rdf-schema#>
 +
prefix owl: <http://www.w3.org/2002/07/owl#>
 +
prefix xsd: <http://www.w3.org/2001/XMLSchema#>
 +
construct {
 +
  ?z rdf:type owl:Thing
 +
}
 +
WHERE
 +
{
 +
    ?x <http://fise.iks-project.eu/ontology/extracted-from>  <http://wit.istc.cnr.it:9292/store/content/movie1.txt> .
 +
    ?x <http://fise.iks-project.eu/ontology/entity-reference> ?z
 +
}
-
OUTPUT:
+
===Select owl:sameAs from each LOD source===
-
<http://dbpedia.org/resource/Tom_Cruise>
+
====Linked MDB====
 +
PREFIX oddlinker: <http://data.linkedmdb.org/resource/oddlinker/>
 +
CONSTRUCT {
 +
  ?Subject ?type ?source
 +
}
 +
FROM <lmdb_dataset>
 +
 +
WHERE
 +
{
 +
?x oddlinker:link_source ?source .
 +
?x oddlinker:link_type ?type .
 +
?x oddlinker:link_target ?Subject .
 +
FILTER ( ?type = "owl:sameAs" || ?type = "rdfs:SeeAlso") .
 +
FILTER (
 +
    ?Subject = <http://sws.geonames.org/2118968/> ||
 +
    ?Subject = <http://dbpedia.org/resource/Tom_Cruise> ||
 +
    ?Subject = <http://sws.geonames.org/556951/> ||
 +
    ?Subject = <http://dbpedia.org/resource/Al_Pacino> ||
 +
    ?Subject = <http://sws.geonames.org/106281/> ||
 +
    ?Subject = <http://dbpedia.org/resource/Al_Gore>
 +
  )
 +
}
 +
LIMIT 10000
-
4)QUERY FOR GET RESOURCE with SAME_AS property
+
And the result is:
-
 
+
-
PREFIX oddlinker: <http://data.linkedmdb.org/resource/oddlinker/>
+
-
SELECT ?source ?type
+
{| class=smwtable
 +
! Subject
 +
! type
 +
! source
 +
|-
 +
| http://dbpedia.org/resource/Al_Gore
 +
| owl:sameAs
 +
| http://data.linkedmdb.org/resource/actor/12737
 +
|-
 +
| http://dbpedia.org/resource/Al_Pacino
 +
| owl:sameAs
 +
| http://data.linkedmdb.org/resource/actor/12748
 +
|-
 +
| http://dbpedia.org/resource/Tom_Cruise
 +
| owl:sameAs
 +
| http://data.linkedmdb.org/resource/actor/28241
 +
|}
-
FROM <lmdb_dataset>
+
===Compute owl:sameAs with the reasoner :: /enrich===
-
WHERE{
+
==Fetch the information from LOD sources==
-
?x oddlinker:link_source ?source .
+
-
?x oddlinker:link_type ?type .
+
-
?x oddlinker:link_target <http://dbpedia.org/resource/Tom_Cruise> .
+
-
FILTER ( ?type = "owl:sameAs" || ?type = "rdfs:SeeAlso")
+
-
}
+
-
ORDER BY ?source
+
SELECT *
-
LIMIT 10000
+
FROM <lmdb_dataset>
 +
WHERE{
 +
  {<http://dbpedia.org/resource/Al_Gore> ?p ?o}
 +
  UNION
 +
  {?s ?pp <http://dbpedia.org/resource/Al_Gore>}
 +
  UNION
 +
  {<http://dbpedia.org/resource/Al_Pacino> ?p ?o}
 +
  UNION
 +
  {?s ?pp <http://dbpedia.org/resource/Al_Pacino>}
 +
  UNION
 +
  {<http://dbpedia.org/resource/Tom_Cruise> ?p ?o}
 +
  UNION
 +
  {?s ?pp <http://dbpedia.org/resource/Tom_Cruise>}
 +
  UNION
 +
  {<http://data.linkedmdb.org/resource/actor/12737> ?p ?o}
 +
  UNION
 +
  {?s ?pp <http://data.linkedmdb.org/resource/actor/12748>}
 +
  UNION
 +
  {<http://data.linkedmdb.org/resource/actor/12737> ?p ?o}
 +
  UNION
 +
  {?s ?pp <http://data.linkedmdb.org/resource/actor/12748>}
 +
  UNION
 +
  {?s ?pp <http://data.linkedmdb.org/resource/actor/28241>}
 +
  UNION
 +
  {<http://data.linkedmdb.org/resource/actor/28241> ?p ?o}
 +
}
-
CURL:
+
== Select the Recipe and do /refactor ==
-
curl -L -v -H "Accept:text/rdf+nt" http://150.146.88.63:8893/sparql?query=PREFIX+oddlinker%3A+%3Chttp%3A%2F%2Fdata.linkedmdb.org%2Fresource%2Foddlinker%2F%3E%0D%0A%0D%0ASELECT+%3Fsource+%3Ftype%0D%0A%0D%0AFROM+%3Clmdb_dataset%3E%0D%0A%0D%0AWHERE%7B+%3Fx+oddlinker%3Alink_source+%3Fsource+.+%3Fx+oddlinker%3Alink_type+%3Ftype+.+%3Fx+oddlinker%3Alink_target+%3Chttp%3A%2F%2Fdbpedia.org%2Fresource%2FTom_Cruise%3E+.+FILTER+%28+%3Ftype+%3D+%22owl%3AsameAs%22+%7C%7C+%3Ftype+%3D+%22rdfs%3ASeeAlso%22%29+%7D%0D%0A%0D%0AORDER+BY+%3Fsource+LIMIT+10000+&format=text%2Fhtml
+
-
OUTPUT:
+
<source lang="xml">
-
http://data.linkedmdb.org/resource/actor/28241 <--risultato
+
google = <http://rdf.data-vocabulary.org/#> .
-
<http://data.linkedmdb.org/resource/actor/29671> <--ID con più dati.
+
mdb = <http://data.linkedmdb.org/resource/movie/> .
 +
dcterms = <http://purl.org/dc/terms/> .
 +
foaf = <http://xmlns.com/foaf/0.1/> .  
-
5)QUERY SPARQL TO GET ALL INFORMATION FOR actor
+
!film2product[
 +
is(mdb:Film, ?film)
 +
->
 +
is(google:Product, ?film)
 +
] .
-
SELECT *
+
!actor2person[
 +
is(mdb:Actor, ?actor)
 +
->
 +
is(google:Person, ?actor) .
 +
values(google:title, ?actor, "Actor")
 +
] .
-
FROM <lmdb_dataset>
+
actorname[
 +
is(google:Person, ?p) .
 +
values(mdb:actor_name, ?p, ?name) .
 +
->
 +
values(google:name, ?p, ?name)
 +
] .
-
WHERE{{<http://data.linkedmdb.org/resource/actor/29671> ?p ?o} UNION {?s ?pp <http://data.linkedmdb.org/resource/actor/29671>}}
+
actorCharacter[
 +
is(google:Person, ?p) .  
 +
has(mdb:performance, ?p, ?performance) .
 +
has(mdb:performance_character, ?performance, ?character) .  
 +
values(performance_film, ?performance, ?film) .
 +
->
 +
values(google:role, ?character) .
 +
values(google:affiliation, ?film)
 +
] .
-
CURL:
+
actorPage[
-
curl -L -v -H "Accept:text/rdf+nt" http://150.146.88.63:8893/sparql?query=SELECT+*%0D%0A%0D%0AFROM+%3Clmdb_dataset%3E%0D%0A%0D%0AWHERE%7B%7B%3Chttp%3A%2F%2Fdata.linkedmdb.org%2Fresource%2Factor%2F29671%3E+%3Fp+%3Fo%7D+UNION+%7B%3Fs+%3Fpp+%3Chttp%3A%2F%2Fdata.linkedmdb.org%2Fresource%2Factor%2F29671%3E%7D%7D&format=text%2Fhtml
+
is(google:Person, ?p) .  
 +
has(foaf:page, ?p, ?page) .  
 +
let(?pageString, str(?page))
 +
->
 +
values(google:role, ?pageString)
 +
]
 +
</source>

Current revision

Link to the work flow: http://stlab.istc.cnr.it/documents/events/kres-hackathon/rome-kres-hackaton-2.png

Link to the photogallery: KReSHackathonRomeGallery

LOD resources endpoint is : http://wit.istc.cnr.it:8893/sparql

Stanbol instance at : http://wit.istc.cnr.it:9292/

Contents

Detect dbpedia entities

Load enahcnements :: <stanbol>/store

  1. Create a file with my example text:
    • movie.txt
    • "Una nuova missione per l'agente Ethan Hunt ( Tom Cruise ) che, questa volta, dovrà cercare di fermare la diffusione di Chimera, un virus geneticamente modificato che produce effetti letali sul sistema nervoso dell'essere umano, capace di mettere in ginocchio il mondo. Al fianco di Ethan il genio dell'informatica Luther Stickell ( Ving Rhames ) e l'affascinante ladra internazionale Nyah Nordoff-Hall (Thandie Newton). Il nemico da combattere, invece, è il perfido Sean Ambrose ( Dougray Scott ) il cui piano è di vendere su scala mondiale al miglior offerente l'antidoto al virus."
  2. curl -i -X PUT -H "Content-Type:text/plain" -T movie1.txt http://wit.istc.cnr.it:9292/store/content/movie1.txt
  3. Result: http://wit.istc.cnr.it:9292/store/page/movie1.txt

Select entity references :: <lod-resource>/sparql

prefix rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#> 
prefix rdfs: <http://www.w3.org/2000/01/rdf-schema#> 
prefix owl: <http://www.w3.org/2002/07/owl#> 
prefix xsd: <http://www.w3.org/2001/XMLSchema#> 

select distinct 
  ?z 
WHERE
{ 
    ?x <http://fise.iks-project.eu/ontology/extracted-from>  <http://wit.istc.cnr.it:9292/store/content/movie1.txt> .
    ?x <http://fise.iks-project.eu/ontology/entity-reference> ?z
}

OR

prefix rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#> 
prefix rdfs: <http://www.w3.org/2000/01/rdf-schema#> 
prefix owl: <http://www.w3.org/2002/07/owl#> 
prefix xsd: <http://www.w3.org/2001/XMLSchema#> 
construct {
 ?z rdf:type owl:Thing
}
WHERE
{ 
   ?x <http://fise.iks-project.eu/ontology/extracted-from>  <http://wit.istc.cnr.it:9292/store/content/movie1.txt> .
   ?x <http://fise.iks-project.eu/ontology/entity-reference> ?z
}

Select owl:sameAs from each LOD source

Linked MDB

PREFIX oddlinker: <http://data.linkedmdb.org/resource/oddlinker/>
CONSTRUCT {
 ?Subject ?type ?source
}
FROM <lmdb_dataset>

WHERE
{
?x oddlinker:link_source ?source .
?x oddlinker:link_type ?type .
?x oddlinker:link_target ?Subject .
FILTER ( ?type = "owl:sameAs" || ?type = "rdfs:SeeAlso") .
FILTER ( 
    ?Subject = <http://sws.geonames.org/2118968/> ||
    ?Subject = <http://dbpedia.org/resource/Tom_Cruise> ||
    ?Subject = <http://sws.geonames.org/556951/> ||
    ?Subject = <http://dbpedia.org/resource/Al_Pacino> ||
    ?Subject = <http://sws.geonames.org/106281/> ||
    ?Subject = <http://dbpedia.org/resource/Al_Gore>
 )
}
LIMIT 10000

And the result is:

Subject type source
http://dbpedia.org/resource/Al_Gore owl:sameAs http://data.linkedmdb.org/resource/actor/12737
http://dbpedia.org/resource/Al_Pacino owl:sameAs http://data.linkedmdb.org/resource/actor/12748
http://dbpedia.org/resource/Tom_Cruise owl:sameAs http://data.linkedmdb.org/resource/actor/28241

Compute owl:sameAs with the reasoner :: /enrich

Fetch the information from LOD sources

SELECT *
FROM <lmdb_dataset>
WHERE{
 {<http://dbpedia.org/resource/Al_Gore> ?p ?o} 
 UNION 
 {?s ?pp <http://dbpedia.org/resource/Al_Gore>}
 UNION
 {<http://dbpedia.org/resource/Al_Pacino> ?p ?o} 
 UNION 
 {?s ?pp <http://dbpedia.org/resource/Al_Pacino>}
 UNION
 {<http://dbpedia.org/resource/Tom_Cruise> ?p ?o} 
 UNION 
 {?s ?pp <http://dbpedia.org/resource/Tom_Cruise>}
 UNION
 {<http://data.linkedmdb.org/resource/actor/12737> ?p ?o} 
 UNION 
 {?s ?pp <http://data.linkedmdb.org/resource/actor/12748>}
 UNION
 {<http://data.linkedmdb.org/resource/actor/12737> ?p ?o} 
 UNION 
 {?s ?pp <http://data.linkedmdb.org/resource/actor/12748>}
 UNION 
 {?s ?pp <http://data.linkedmdb.org/resource/actor/28241>}
 UNION
 {<http://data.linkedmdb.org/resource/actor/28241> ?p ?o} 
}

Select the Recipe and do /refactor

google = <http://rdf.data-vocabulary.org/#> . 
mdb = <http://data.linkedmdb.org/resource/movie/> . 
dcterms = <http://purl.org/dc/terms/> . 
foaf = <http://xmlns.com/foaf/0.1/> . 
 
!film2product[
	is(mdb:Film, ?film)
		->
	is(google:Product, ?film)
] . 
 
!actor2person[
	is(mdb:Actor, ?actor)
		->
	is(google:Person, ?actor) . 
	values(google:title, ?actor, "Actor")
] . 
 
actorname[
	is(google:Person, ?p) . 
	values(mdb:actor_name, ?p, ?name) . 
		->
	values(google:name, ?p, ?name)
] . 
 
actorCharacter[
	is(google:Person, ?p) . 
	has(mdb:performance, ?p, ?performance) .
	has(mdb:performance_character, ?performance, ?character) . 
	values(performance_film, ?performance, ?film) .
		->
	values(google:role, ?character) . 
	values(google:affiliation, ?film)
] . 
 
actorPage[
	is(google:Person, ?p) . 
	has(foaf:page, ?p, ?page) . 
	let(?pageString, str(?page))
		->
	values(google:role, ?pageString)
]
Personal tools