LOD:ontology statistics

From STLab

(Difference between revisions)
Jump to: navigation, search
(Balthasar)
Current revision (14:04, 22 December 2010) (view source)
m (Knowledge pattern implementations in LOD)
 
(28 intermediate revisions not shown.)
Line 10: Line 10:
* #triples
* #triples
* #classes
* #classes
-
* t/c ratio: triples / classes ratio, gives an idea about how much annotations a class generally has.
 
* #props: amount of unique properties
* #props: amount of unique properties
* #mappings: alignments with other LOD corpora. Should include total nr mappings, but preferably also nr mappings to specific corpora.
* #mappings: alignments with other LOD corpora. Should include total nr mappings, but preferably also nr mappings to specific corpora.
Line 20: Line 19:
== World knowledge ==
== World knowledge ==
-
{| border="1" cellspacing="5" cellpadding="5" align="left"
+
{|class="smwtable"
! ontology
! ontology
! #triples
! #triples
! #classes
! #classes
-
! t/c ratio
 
! #props
! #props
! #mappings
! #mappings
Line 33: Line 31:
| 1 billion
| 1 billion
| 3.4 million (of which 1.5 million are classified in a consistent Ontology)
| 3.4 million (of which 1.5 million are classified in a consistent Ontology)
-
| 294.1
 
| ?
| ?
| 4,887,000
| 4,887,000
Line 44: Line 41:
|  
|  
|  
|  
-
|  
+
| N/A
| constantly
| constantly
-
| (N/A)
 
|-  
|-  
| [[http://sw.opencyc.org/ opencyc]]
| [[http://sw.opencyc.org/ opencyc]]
|  
|  
| 56,780
| 56,780
-
|
 
|  
|  
| 73,132
| 73,132
Line 60: Line 55:
| 2008
| 2008
|-  
|-  
-
|
 
-
|
 
-
|
 
-
|
 
-
|
 
-
|
 
-
|
 
-
|
 
-
|-
 
-
|
 
|  
|  
|  
|  
Line 82: Line 67:
-
{| border="1" cellspacing="5" cellpadding="5" align="center"
+
{|class="smwtable"
! ontology
! ontology
! #triples
! #triples
! #classes
! #classes
-
! t/c ratio
 
! #props
! #props
! #mappings
! #mappings
Line 93: Line 77:
|-  
|-  
| foaf
| foaf
-
|
 
|  
|  
|  
|  
Line 101: Line 84:
|  
|  
|-  
|-  
-
|  
+
| NY-Times people
-
|  
+
| 103,496
-
|  
+
| 4,979
-
|  
+
| 19
-
|  
+
| 6,094
-
|
+
* 3027 to dbpedia
-
|  
+
* 3066 to freebase
-
|  
+
| ?
 +
| ?
|-  
|-  
-
|
 
|  
|  
|  
|  
Line 119: Line 102:
|  
|  
|}
|}
-
 
== Places ==
== Places ==
-
{| border="1" cellspacing="5" cellpadding="5" align="center"
+
{|class="smwtable"
! ontology
! ontology
! #triples
! #triples
! #classes
! #classes
-
! t/c ratio
 
! #props
! #props
! #mappings
! #mappings
Line 133: Line 114:
! update freq
! update freq
|-  
|-  
-
| foaf
 
-
|
 
-
|
 
-
|
 
-
|
 
-
|
 
-
|
 
-
|
 
-
|-
 
-
|
 
-
|
 
-
|
 
-
|
 
-
|
 
-
|
 
-
|
 
-
|
 
-
|-
 
-
|
 
|  
|  
|  
|  
Line 163: Line 125:
== Lexical ==
== Lexical ==
-
{| border="1" cellspacing="5" cellpadding="5" align="center"
+
{|class="smwtable"
! ontology
! ontology
! #triples
! #triples
! #classes
! #classes
-
! t/c ratio
 
! #props
! #props
! #mappings
! #mappings
Line 173: Line 134:
! update freq
! update freq
|-  
|-  
-
| foaf
+
| WordNet
-
|
+
|  
|  
|  
|  
Line 182: Line 142:
|  
|  
|-  
|-  
-
|
 
-
|
 
-
|
 
-
|
 
-
|
 
-
|
 
-
|
 
-
|
 
-
|-
 
-
|
 
|  
|  
|  
|  
Line 204: Line 154:
-
{| border="1" cellspacing="5" cellpadding="5" align="center"
+
{|class="smwtable"
! ontology
! ontology
! domain
! domain
! #triples
! #triples
! #classes
! #classes
-
! t/c ratio
 
! #props
! #props
! #mappings
! #mappings
Line 215: Line 164:
! update freq
! update freq
|-  
|-  
-
| LinkedMDB
+
| [http://wiki.linkedmdb.org/Main/Statistics LinkedMDB]
| movies
| movies
 +
| 2,366,572
 +
| 334,930
 +
| 110 ([[linkedMDB properties]] or [http://eculture2.cs.vu.nl:53025/browse/list_predicates?graph=file:///data/schopman/cliopatria/ClioPatria/_linkedData/linkedMDB/linkedmdb-clean.nt in ClioPatria])
|  
|  
-
|
+
* freebase: 502,988
-
|
+
** 197,271 are performances, eg [http://www.freebase.com/view/guid/9202a8c04000641f800000000111c1fa] and [http://www.freebase.com/view/guid/9202a8c04000641f800000000111c1cc]
-
|
+
** 4,529 are music contributions, eg [http://www.freebase.com/view/guid/9202a8c04000641f8000000013155697] and [http://www.freebase.com/view/guid/9202a8c04000641f8000000013106569]
-
|
+
** 45259 are film cuts, eg [http://www.freebase.com/view/guid/9202a8c04000641f80000000011223b0] and [http://www.freebase.com/view/m/0k48qj]
-
|  
+
** writers, eg [http://www.freebase.com/view/guid/9202a8c04000641f800000000057788c] and [http://www.freebase.com/view/guid/9202a8c04000641f8000000000562261]
-
|  
+
** etc ...
 +
* dbpedia: 91,062
 +
* yago: 60,708
 +
* imdb: 18,404 (using false rottentomatoes links)
 +
| 2008
 +
| ?
|-  
|-  
-
|
 
-
|
 
-
|
 
-
|
 
-
|
 
-
|
 
-
|
 
-
|
 
-
|
 
-
|-
 
-
|
 
|  
|  
|  
|  
Line 246: Line 192:
|}
|}
 +
= Knowledge pattern implementations in LOD =
-
= TVA example =
+
== The Music ontology (model) ==
 +
Partly developed by BBC, it is widely used, in part or in full, by a vast amount of music and recording-related Linked Data Resources. Specifications available at [http://musicontology.com/], RDF code at [http://motools.sourceforge.net/doc/musicontology.rdfs] and [http://motools.sourceforge.net/doc/musicontology.n3].
-
<?xml version="1.0" encoding="UTF-8"?>
+
Some elementary knowledge pattern classes can be identified as follows (<tt>mo</tt> is prefix for <tt>http://purl.org/ontology/mo/</tt>):
-
<tva:TVAMain xml:lang=""
+
* Roles of objects
-
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
+
** As is most common, these are hardwired in object properties, e.g. <tt>mo:interpreter</tt> and its subproperties,and also <tt>mo:producer</tt>, <tt>mo:publisher</tt>, etc.
-
xmlns:mpeg7="urn:tva:mpeg7:2005"
+
** Some are available as OWL classes, e.g. <tt>mo:Listener equivalentTo mo:hasListened some mo:Performance</tt>
-
xmlns:tva="urn:tva:metadata:2007"
+
** Other role relationships are embedded in event participation models (see below)
-
xmlns:ug="http://www.few.vu.nl/~schopman/notube/uitzendingGemist.ttl#"
+
* Event and Participation
-
publicationTime="2010-06-08T09:30:47.0Z"
+
** Specialises the [http://purl.org/NET/c4dm/event.owl Event Ontology].
-
originID="NIC-ID">
+
** Participation in Event subclasses is inherently time-indexed and with a role e.g. <tt>mo:Performance mo:performer foaf:Person</tt>. See usage in DBTune for details.
 +
* About
 +
** <tt>dc:subject</tt> and <tt>foaf:isPrimaryTopicOf</tt> are available from the respective imported ontologies, but not specialised.
 +
* Aggregation
 +
** Reuses <tt>event:sub_event</tt> (formerly <tt>event:hasSubEvent</tt>) in order to define hierarchies by partitioning complex events by time, space and involvement (see DBTune usage).
-
<tva:ProgramDescription>
+
== DBTune ==
-
<tva:ProgramInformationTable>
+
DBTune embeds several datasets on musical resources, mostly authored by BBC UK, that use the Music Ontology as a reference schema.
-
<tva:ProgramInformation programId="http://www.uitzendinggemist.nl/index.php/aflevering?aflID=11447702">
+
-
<tva:BasicDescription>
+
-
<tva:Title>Koefnoen</tva:Title>
+
-
<tva:Synopsis>Humoristisch programma waarin persiflages, sketches, quickies en nepdocumentaires elkaar snel afwisselen. Paul Groot, Owen Schumacher, Plien van Bennekom, Bianca Krijgsman, Marjan Luif en Sander van Opzeeland schetsen hun kijk op het leven en de actualiteiten. Daarin worden zij bijgestaan door diverse gastacteurs.</tva:Synopsis>
+
-
+
-
<!-- <tva:Keyword>key</tva:Keyword> -->
+
-
+
-
<!-- genre -->
+
-
<tva:Genre href="ug:amusement"/>
+
-
<tva:Genre href="ug:comedy"/>
+
-
<tva:Genre href="ug:kunst_cultuur"/>
+
-
+
-
<!-- parental rating guidance -->
+
-
<tva:ParentalGuidance>
+
-
<mpeg7:ParentalRating href="urn:dvb:metadata:cs:ParentalGuidanceCS:2007:1.1.2">
+
-
<mpeg7:Name>P</mpeg7:Name>
+
-
<mpeg7:Definition>Suitable for All Ages.</mpeg7:Definition>
+
-
</mpeg7:ParentalRating>
+
-
</tva:ParentalGuidance>
+
-
+
-
<!-- credits -->
+
-
<tva:CreditsList>
+
-
<tva:CreditsItem role="http://dbpedia.org/ontology/starring">
+
-
<tva:PersonName>
+
-
<mpeg7:FamilyName>Schumacher</mpeg7:FamilyName>
+
-
<mpeg7:GivenName>Owen</mpeg7:GivenName>
+
-
</tva:PersonName>
+
-
</tva:CreditsItem>
+
-
<tva:CreditsItem role="http://dbpedia.org/ontology/starring">
+
-
<tva:PersonName>
+
-
<mpeg7:FamilyName>Groot</mpeg7:FamilyName>
+
-
<mpeg7:GivenName>Paul</mpeg7:GivenName>
+
-
</tva:PersonName>
+
-
</tva:CreditsItem>
+
-
</tva:CreditsList>
+
-
+
-
</tva:BasicDescription>
+
-
</tva:ProgramInformation>
+
-
</tva:ProgramInformationTable>
+
-
</tva:ProgramDescription>
+
-
</tva:TVAMain>
+
-
= minutes 13-9-2010 =
+
=== John Peel Sessions ===
-
== Guus ==
+
'''Homepage''': [http://dbtune.org/bbc/peel/ http://dbtune.org/bbc/peel/] (query interface + RDF dumps. SPARQL endpoint doesn't seem to be working.)
-
patterns discussion in Rome, 23 Sep
+
A DBTune RDF catalog of Radio One live performances for the John Peel Show. As this dataset is not concerned with describing musical artists per se, but only their radio performances, we do not expect topic patterns to be instantiated, but a strong take on events as described by the Music Ontology.
-
sample pattern: loccation -> subject class (e.g. a fish species) -<=> location linked to similar subject class
+
-
pattern from E-culture: two artists woorking on the same style
+
-
or related style
+
-
dimensions for these patterns: space, time, people
+
-
and subject
+
-
knowledge vs. navigational pattern
+
-
intersting results from paterns that link different knowledge structures??!
+
-
e.g. partof hierarchy of locations linked to subvlass hierarchy of subject types
+
-
influence of levelat a certain level the generality gets too high (or specifificity may be too low: only interesting for subject geeks) of generality in hierarchy:
+
-
navigational pattern is a (partial path through a knowledge pattern/graph
+
-
[if I understood it correctly]
+
-
We will take example useful for News and SocialWeb use case in NoTube
+
-
wieght of relations is present in navigational patterns not in knowledge patterns
+
-
navigational patterns are local, but can we generalize over these as well.? Well, this is a background hypothesis to explora
+
-
can we use the level in the hierarch: say low = long tail?
+
-
only if we have meta-info about the structure of the hierarchy
+
-
see analysis by Brockmuller 2003
+
-
?Grouping terms supply attributes or classification dimensions to the terms grouped in
+
-
the subhierarchy below them, but not to other grouping terms;
+
-
- Natural categories are at the basic level of search and in principle divide general from
+
-
specific terms;
+
-
- Abstract classes are more general than the basic level, and
+
-
- Domain-specific types are more specific.?
+
-
== Balthasar ==
+
* Linked Data Alignments
-
knowledge pattern = general pattern that models knowledge
+
** only with DBPedia. For RDF dumps, alignments are stored in separate modules that do not import the main dataset (!).
-
navigaton pattern = prototype path in LOD
+
** DBPedia alignments are <tt>owl:sameAs</tt> relations that hold for both <tt>foaf:Person</tt> and <tt>mo:MusicArtist</tt> individuals (if e.g. a MusicArtist is a single person such as Elton John). Although it is generally no good, it is not so dangerous, so long as DBPedia only contains triples that relate to the real person and we don't represent the musical artist as an ensemble. The risk would be to have a foaf:Person that is comprised of other mo:MusicArtists or foaf:Persons.
 +
* Event Patterns:
 +
** mo:MusicArtist mo:performed mo:Performance
 +
(e.g. http://dbtune.org/bbc/peel/artist/280 ("Bratmobile", a http://purl.org/ontology/mo/MusicArtist) http://purl.org/ontology/mo/performed http://dbtune.org/bbc/peel/session/483 ("Performance 483 in Maida Vale 4", a http://purl.org/ontology/mo/Performance))
 +
* Role patterns:
 +
**for single musicians, we have participations with role:
 +
http://dbtune.org/bbc/peel/perf_ins/2955cda0c6177f466b6e049686d225ab (Allison Wolfe's performance on vocals in the above show, a http://purl.org/ontology/mo/Performance)
 +
mo:performer http://dbtune.org/bbc/peel/artist/2955cda0c6177f466b6e049686d225ab (foaf:name="Allison Wolfe", a foaf:Person)
 +
mo:instrument = "Vocals" (note: the usage of this object property is ambiguous, e.g. uses untyped literals for Elton John's performances)
 +
* Topic patterns:
 +
** No usage of the dc:subject and foaf:isPrimaryTopicOf properties imported by the music ontology
 +
* Aggregation patterns
 +
** Achieved in Performances by means of hierarchies
 +
(e.g.
 +
http://dbtune.org/bbc/peel/session/483 (Bratmobile Peel Session 483)
 +
event:hasSubEvent http://dbtune.org/bbc/peel/perf_ins/2955cda0c6177f466b6e049686d225ab (Allison Wolfe's vocal performance)
 +
mo:performer
 +
event:hasSubEvent http://dbtune.org/bbc/peel/perf_work/8323 (performance of "Make me Miss America" at that show)
 +
event:usesWork http://dbtune.org/bbc/peel/work/8323 (song "Make me Miss America", serql:directType mo:MusicalWork)
 +
)
-
types of patterns:  
+
=== Jamendo ===
-
between...
+
'''Homepage''': [http://dbtune.org/jamendo/ http://dbtune.org/jamendo/] (query interface + RDF dumps). SPARQL endpoint at [http://dbtune.org/jamendo/sparql/ http://dbtune.org/jamendo/sparql/] seems broken, use [http://dbtune.org/jamendo/store/user/query http://dbtune.org/jamendo/store/user/query] instead
-
* places (geonames)
+
-
* people (IMDB people)
+
-
* animals (geospecies)
+
-
* topics (genres in TVA, IMDB, )
+
-
The UP can be used as heuristics in pathfinding... eg if the user has a biological background, the path towards geospecies is chosen, but when the user has IT background another direction may be chosen.
+
A DBTune RDF port of independent musical artists and their releases. Can be useful if we wish to exploit either geographical aggregation relations or the implicit semantics of genre-related tags.
-
Roles can also be important. In general, people are interested in 'actors', but the user profile might specify the user is interested in correlations in the 'director' role.
+
* Linked Data Alignments
 +
** with MusicBrainz via <tt>owl:sameAs</tt> (e.g. from <tt>mo:MusicArtist</tt>). This is generally safe. However, many link targets are resolved into Zitgist, so they mostly work as hyperlinks.
 +
** with Geonames via foaf:based_near. This should have no side effects.
 +
* Event Patterns:
 +
** none. Album recording events are not modelled.
 +
* Role patterns:
 +
** none. Artists are never represented as ensembles and the authorship relation with released is expressed via <tt>foaf:made</tt>.
 +
* Topic patterns:
 +
** Record tags are ported to RDF via the <tt>tag:taggedWithTag</tt> property from the Tag ontology at [http://www.holygoat.co.uk/owl/redwood/0.1/tags/]. Tag values usually denote very specific genres (entities in Jamendo).
 +
* Aggregation patterns
 +
** Only implicit meronomy inherited from Geonames via foaf:based_near links.
 +
 
 +
=== MusicBrainz (plain) ===
 +
'''Homepage''': [http://musicbrainz.org http://musicbrainz.org] (with search interface). This is NOT the DBTune representation.
 +
Dataset access point: [http://wiki.musicbrainz.org/RDF http://wiki.musicbrainz.org/RDF] (no SPARQL endpoint found)
 +
 
 +
An open-content and release-centered musical knowledge base. It mainly uses internal vocabularies plus Dublin Corea and Amazon (http://www.amazon.com/gp/aws/landing.html) for release info.
 +
 
 +
* Linked Data Alignments
 +
** none found. MusicBrainz is self-contained
 +
* Event Patterns:
 +
** Album releases, by anonymous instantiation of <tt>mm:ReleaseDate</tt> (with <tt>dc:date</tt> and <tt>mm:country</tt> values)
 +
* Role patterns:
 +
** Authorship-related roles are expressed via <tt>ar:Producer</tt> and <tt>ar:Composer</tt> properties.
 +
** Membership is time-indexed via ar:MemberofBand nodes (if subject is of mm:artistType tmm:TypeGroup) and ar:SupportingMusician and ar:InstrumentalSupportingMusician nodes. Note that artists can be of mm:artistType tmm:TypeGroup and mm:artistType mm:TypePerson
 +
** Instruments used are plain attribute lists wrapped into ar:attributeList nodes.
 +
* Topic patterns:
 +
** None found
 +
* Aggregation patterns
 +
** Mainly wraps RDF aggregations (eg. rdf:Bag) for artist-related lists such as album tracklists, album releases in countries etc.
 +
 
 +
=== Last.fm (RDFize) ===
 +
'''Homepage''': [http://lastfm.rdfize.com http://lastfm.rdfize.com] (faceted search interface). This is NOT the DBTune representation.
 +
No SPARQL endpoint found.
 +
 
 +
A datasource providing an event-centered edge on Last.fm data. Vocabularies used include the Music Ontology, the Event Ontology by Yves Raimond, FOAF, Dublin Core, W3C Geo and Vcard.
 +
 
 +
* Linked Data Alignments
 +
** TBD
 +
* Event Patterns:
 +
** Lots of interesting stuff. TBD
 +
* Role patterns:
 +
** TBD
 +
* Topic patterns:
 +
** TBD
 +
* Aggregation patterns
 +
** TBD
 +
 
 +
== YAGO ==
 +
{|class="smwtable"
 +
! endpoint
 +
! cross-dataset
 +
references
 +
! equivalence
 +
relations
 +
! roles
 +
! event and
 +
participation
 +
! "about" relations
 +
! aggregation
 +
hierarchies
 +
|-
 +
| [http://www.mpi-inf.mpg.de/yago-naga/yago/demo.html] (no SPARQL)
 +
|
 +
* {Wikipedia English page URL} :describes
 +
* :hasImdb {Entry code on IMDB}
 +
|
 +
* ''means'' (weak equivalence)
 +
|
 +
|
 +
* [implicit] hasWonPrize :
 +
** usage of the same property for identifying the prize and its object
 +
** NOT contextualized in the actual prize assignment event!
 +
** :Roberto_Benigni :hasWonPrize :Life_Is_Beautiful
 +
** :Roberto_Benigni :hasWonPrize :Academy_Award_for_Best_Actor
 +
 
 +
* :show subclassOf :social_event
 +
* :social_event subclassOf :event
 +
* :event subclassOf :psychologicalFeature
 +
* :Ghosts_of_Mars rdf:type [all of the above!]
 +
 
 +
* :Italy :establishedOnDate :1861-03-17
 +
|
 +
| ''No geographical meronomy!''
 +
|}
-
Discussion: what kind of restrictions should a pattern contain. EG: both node tyes & specific properties, or: only node types (being indifferent to the properties)
+
== Rough notes ==
 +
* Freebase has separate individuals for movies and respective soundtracks, but does not appear to be linking them with RDF.
 +
* Freebase also has separate individuals for real and fictional characters (may come in useful for avoiding inconsistencies, but be wary of equivalences!).

Current revision

Contents

Setup

On this page we list useful statistics of ontologies (vocabularies, schemas, etc). This information about ontologies can be used to compare them and decide on which corpora to use for what.

Sketch for discovering patterns in LOD
Sketch for discovering patterns in LOD

Proposals

If you have any proposals for different statistics, or vocabularies to include, please state them here.

Statistics / measures explained

  • #triples
  • #classes
  • #props: amount of unique properties
  • #mappings: alignments with other LOD corpora. Should include total nr mappings, but preferably also nr mappings to specific corpora.
  • last update: date of the last update
  • update freq: nr of times the ontology is updated

Statistics

World knowledge

ontology #triples #classes #props #mappings last update update freq
dbpedia 1 billion 3.4 million (of which 1.5 million are classified in a consistent Ontology)  ? 4,887,000 2010-4-28 3 times per year
freebase N/A constantly
[opencyc] 56,780 73,132
  • 41039 to DBpedia
  • 20886 to Umbel
  • 11207 to WordNet 2.0
2008

People

ontology #triples #classes #props #mappings last update update freq
foaf
NY-Times people 103,496 4,979 19 6,094
  • 3027 to dbpedia
  • 3066 to freebase
 ?  ?

Places

ontology #triples #classes #props #mappings last update update freq

Lexical

ontology #triples #classes #props #mappings last update update freq
WordNet

Misc ontologies

ontology domain #triples #classes #props #mappings last update update freq
LinkedMDB movies 2,366,572 334,930 110 (linkedMDB properties or in ClioPatria)
  • freebase: 502,988
    • 197,271 are performances, eg [1] and [2]
    • 4,529 are music contributions, eg [3] and [4]
    • 45259 are film cuts, eg [5] and [6]
    • writers, eg [7] and [8]
    • etc ...
  • dbpedia: 91,062
  • yago: 60,708
  • imdb: 18,404 (using false rottentomatoes links)
2008  ?

Knowledge pattern implementations in LOD

The Music ontology (model)

Partly developed by BBC, it is widely used, in part or in full, by a vast amount of music and recording-related Linked Data Resources. Specifications available at [9], RDF code at [10] and [11].

Some elementary knowledge pattern classes can be identified as follows (mo is prefix for http://purl.org/ontology/mo/):

  • Roles of objects
    • As is most common, these are hardwired in object properties, e.g. mo:interpreter and its subproperties,and also mo:producer, mo:publisher, etc.
    • Some are available as OWL classes, e.g. mo:Listener equivalentTo mo:hasListened some mo:Performance
    • Other role relationships are embedded in event participation models (see below)
  • Event and Participation
    • Specialises the Event Ontology.
    • Participation in Event subclasses is inherently time-indexed and with a role e.g. mo:Performance mo:performer foaf:Person. See usage in DBTune for details.
  • About
    • dc:subject and foaf:isPrimaryTopicOf are available from the respective imported ontologies, but not specialised.
  • Aggregation
    • Reuses event:sub_event (formerly event:hasSubEvent) in order to define hierarchies by partitioning complex events by time, space and involvement (see DBTune usage).

DBTune

DBTune embeds several datasets on musical resources, mostly authored by BBC UK, that use the Music Ontology as a reference schema.

John Peel Sessions

Homepage: http://dbtune.org/bbc/peel/ (query interface + RDF dumps. SPARQL endpoint doesn't seem to be working.)

A DBTune RDF catalog of Radio One live performances for the John Peel Show. As this dataset is not concerned with describing musical artists per se, but only their radio performances, we do not expect topic patterns to be instantiated, but a strong take on events as described by the Music Ontology.

  • Linked Data Alignments
    • only with DBPedia. For RDF dumps, alignments are stored in separate modules that do not import the main dataset (!).
    • DBPedia alignments are owl:sameAs relations that hold for both foaf:Person and mo:MusicArtist individuals (if e.g. a MusicArtist is a single person such as Elton John). Although it is generally no good, it is not so dangerous, so long as DBPedia only contains triples that relate to the real person and we don't represent the musical artist as an ensemble. The risk would be to have a foaf:Person that is comprised of other mo:MusicArtists or foaf:Persons.
  • Event Patterns:
    • mo:MusicArtist mo:performed mo:Performance

(e.g. http://dbtune.org/bbc/peel/artist/280 ("Bratmobile", a http://purl.org/ontology/mo/MusicArtist) http://purl.org/ontology/mo/performed http://dbtune.org/bbc/peel/session/483 ("Performance 483 in Maida Vale 4", a http://purl.org/ontology/mo/Performance))

  • Role patterns:
    • for single musicians, we have participations with role:

http://dbtune.org/bbc/peel/perf_ins/2955cda0c6177f466b6e049686d225ab (Allison Wolfe's performance on vocals in the above show, a http://purl.org/ontology/mo/Performance) mo:performer http://dbtune.org/bbc/peel/artist/2955cda0c6177f466b6e049686d225ab (foaf:name="Allison Wolfe", a foaf:Person) mo:instrument = "Vocals" (note: the usage of this object property is ambiguous, e.g. uses untyped literals for Elton John's performances)

  • Topic patterns:
    • No usage of the dc:subject and foaf:isPrimaryTopicOf properties imported by the music ontology
  • Aggregation patterns
    • Achieved in Performances by means of hierarchies

(e.g. http://dbtune.org/bbc/peel/session/483 (Bratmobile Peel Session 483) event:hasSubEvent http://dbtune.org/bbc/peel/perf_ins/2955cda0c6177f466b6e049686d225ab (Allison Wolfe's vocal performance) mo:performer event:hasSubEvent http://dbtune.org/bbc/peel/perf_work/8323 (performance of "Make me Miss America" at that show) event:usesWork http://dbtune.org/bbc/peel/work/8323 (song "Make me Miss America", serql:directType mo:MusicalWork) )

Jamendo

Homepage: http://dbtune.org/jamendo/ (query interface + RDF dumps). SPARQL endpoint at http://dbtune.org/jamendo/sparql/ seems broken, use http://dbtune.org/jamendo/store/user/query instead

A DBTune RDF port of independent musical artists and their releases. Can be useful if we wish to exploit either geographical aggregation relations or the implicit semantics of genre-related tags.

  • Linked Data Alignments
    • with MusicBrainz via owl:sameAs (e.g. from mo:MusicArtist). This is generally safe. However, many link targets are resolved into Zitgist, so they mostly work as hyperlinks.
    • with Geonames via foaf:based_near. This should have no side effects.
  • Event Patterns:
    • none. Album recording events are not modelled.
  • Role patterns:
    • none. Artists are never represented as ensembles and the authorship relation with released is expressed via foaf:made.
  • Topic patterns:
    • Record tags are ported to RDF via the tag:taggedWithTag property from the Tag ontology at [12]. Tag values usually denote very specific genres (entities in Jamendo).
  • Aggregation patterns
    • Only implicit meronomy inherited from Geonames via foaf:based_near links.

MusicBrainz (plain)

Homepage: http://musicbrainz.org (with search interface). This is NOT the DBTune representation. Dataset access point: http://wiki.musicbrainz.org/RDF (no SPARQL endpoint found)

An open-content and release-centered musical knowledge base. It mainly uses internal vocabularies plus Dublin Corea and Amazon (http://www.amazon.com/gp/aws/landing.html) for release info.

  • Linked Data Alignments
    • none found. MusicBrainz is self-contained
  • Event Patterns:
    • Album releases, by anonymous instantiation of mm:ReleaseDate (with dc:date and mm:country values)
  • Role patterns:
    • Authorship-related roles are expressed via ar:Producer and ar:Composer properties.
    • Membership is time-indexed via ar:MemberofBand nodes (if subject is of mm:artistType tmm:TypeGroup) and ar:SupportingMusician and ar:InstrumentalSupportingMusician nodes. Note that artists can be of mm:artistType tmm:TypeGroup and mm:artistType mm:TypePerson
    • Instruments used are plain attribute lists wrapped into ar:attributeList nodes.
  • Topic patterns:
    • None found
  • Aggregation patterns
    • Mainly wraps RDF aggregations (eg. rdf:Bag) for artist-related lists such as album tracklists, album releases in countries etc.

Last.fm (RDFize)

Homepage: http://lastfm.rdfize.com (faceted search interface). This is NOT the DBTune representation. No SPARQL endpoint found.

A datasource providing an event-centered edge on Last.fm data. Vocabularies used include the Music Ontology, the Event Ontology by Yves Raimond, FOAF, Dublin Core, W3C Geo and Vcard.

  • Linked Data Alignments
    • TBD
  • Event Patterns:
    • Lots of interesting stuff. TBD
  • Role patterns:
    • TBD
  • Topic patterns:
    • TBD
  • Aggregation patterns
    • TBD

YAGO

endpoint cross-dataset

references

equivalence

relations

roles event and

participation

"about" relations aggregation

hierarchies

[13] (no SPARQL)
  • {Wikipedia English page URL} :describes
  •  :hasImdb {Entry code on IMDB}
  • means (weak equivalence)
  • [implicit] hasWonPrize :
    • usage of the same property for identifying the prize and its object
    • NOT contextualized in the actual prize assignment event!
    •  :Roberto_Benigni :hasWonPrize :Life_Is_Beautiful
    •  :Roberto_Benigni :hasWonPrize :Academy_Award_for_Best_Actor
  •  :show subclassOf :social_event
  •  :social_event subclassOf :event
  •  :event subclassOf :psychologicalFeature
  •  :Ghosts_of_Mars rdf:type [all of the above!]
  •  :Italy :establishedOnDate :1861-03-17
No geographical meronomy!

Rough notes

  • Freebase has separate individuals for movies and respective soundtracks, but does not appear to be linking them with RDF.
  • Freebase also has separate individuals for real and fictional characters (may come in useful for avoiding inconsistencies, but be wary of equivalences!).
Personal tools