DBpedia is a community effort to extract structured information from Wikipedia and to make this information available on the Web. DBpedia allows you to ask sophisticated queries against Wikipedia, and to link other data sets on the Web to Wikipedia data. We hope this will make it easier for the amazing amount of information in Wikipedia to be used in new and interesting ways, and that it might inspire new mechanisms for navigating, linking and improving the encyclopaedia itself.
DBpedia 3.5.1 available on Amazon EC2
As the Amazon Web Services are getting used a lot for cloud computing, we have started to provide current snapshots of the DBpedia dataset for this environment.
We provide the DBpedia dataset for Amazon Web Services in two ways:
1. Source files for being mounted: http://developer.amazonwebservices.com/connect/entry.jspa?externalID=2319
2. Virtuoso SPARQL store for being instanciated: http://www.openlinksw.com/dataspace/dav/wiki/Main/VirtAWSDBpedia351C
DBpedia 3.5.1 released
Hi all,
we are happy to announce the release of DBpedia 3.5.1.
This is primarily a bugfix release, which is based on Wikipedia dumps dating from March 2010. Thanks to the great community feedback about the previous DBpedia release, we were able to resolve the reported issues as well as to improve template to ontology mappings.
The new [...]
DBpedia 3.5 released
Hi all,
we are happy to announce the release of DBpedia 3.5. The new release is based on Wikipedia dumps dating from March 2010. Compared to the 3.4 release, we were able to increase the quality of the DBpedia knowledge base by employing a new data extraction framework which applies various data cleansing heuristics as well [...]
The DBpedia Knowledge Base
Knowledge bases are playing an increasingly important role in enhancing the intelligence of Web and enterprise search and in supporting information integration. Today, most knowledge bases cover only specific domains, are created by relatively small groups of knowledge engineers, and are very cost intensive to keep up-to-date as domains change. At the same time, Wikipedia has grown into one of the central knowledge sources of mankind, maintained by thousands of contributors. The DBpedia project leverages this gigantic source of knowledge by extracting structured information from Wikipedia and by making this information accessible on the Web under the terms of the
Creative Commons Attribution-ShareAlike 3.0 License and the
GNU Free Documentation License.
The DBpedia knowledge base currently describes more than 3.4 million things, out of which 1.5 million are classified in a consistent Ontology, including 312,000 persons, 413,000 places, 94,000 music albums, 49,000 films, 15,000 video games, 140,000 organizations, 146,000 species and 4,600 diseases. The DBpedia data set features labels and abstracts for these 3.2 million things in up to 92 different languages; 841,000 links to images and 5,081,000 links to external web pages; 9,393,000 external links into other RDF datasets, 565,000 Wikipedia categories, and 75,000 YAGO categories. The DBpedia knowledge base altogether consists of over 1 billion pieces of information (RDF triples) out of which 257 million were extracted from the English edition of Wikipedia and 766 million were extracted from other language editions.
The DBpedia knowledge base has several advantages over existing knowledge bases: it covers many domains; it represents real community agreement; it automatically evolves as Wikipedia changes, and it is truly multilingual. The DBpedia knowledge base allows you to ask quite surprising queries against Wikipedia, for instance “Give me all cities in New Jersey with more than 10,000 inhabitants” or “Give me all Italian musicians from the 18th century”. Altogether, the use cases of the DBpedia knowledge base are widespread and range from enterprise knowledge management, over Web search to revolutionizing Wikipedia search.
Nucleus for the Web of Data
Within the
W3C Linking Open Data (LOD) community effort, an increasing number of data providers have started to publish and interlink data on the Web according to Tim Berners-Lee’s
Linked Data principles. The resulting Web of Data currently consists of several billion RDF triples and covers domains such as geographic information, people, companies, online communities, films, music, books and scientific publications. In addition to publishing and interlinking datasets, there is also ongoing work on Linked Data browsers, Linked Data crawlers, Web of Data search engines and other applications that consume Linked Data from the Web.
The DBpedia knowledge base is served as Linked Data on the Web. As DBpedia defines Linked Data URIs for millions of concepts, various data providers have started to set RDF links from their data sets to DBpedia, making DBpedia one of the central interlinking-hubs of the ermerging Web of Data.
Wiki Contents
This Wiki provides information about the DBpedia community project:
- Datasets gives an overview about the DBpedia knowledge base.
- Ontology gives an overview about the DBpedia ontology.
- Online Access describes how the data set can be accessed via a SPARQL endpoint and as Linked Data.
- Downloads provides the DBpedia data sets for download.
- Interlinking describes how the DBpedia data set is interlinked with various other datasets on the Web.
- Use Cases lists different use cases for the DBpedia data set.
- Extraction Framework describes the DBpedia information extraction framework.
- Data Provision Architecture paints a picture of the software and protocols used to serve DBpedia on the Web.
- Community explains how the DBpedia community collaborates and how people can contribute to the DBpedia effort.
DBpedia Mapping Wiki containing the mappings used by the DBpedia extraction.
- Credits lists the people and institutions that have contributed to DBpedia so far.
- Next steps describes ideas and future plans for the DBpedia project.

For a recent overview paper about DBpedia, please refer to:
- Christian Bizer, Jens Lehmann, Georgi Kobilarov, Sören Auer, Christian Becker, Richard Cyganiak, Sebastian Hellmann:
DBpedia – A Crystallization Point for the Web of Data. Journal of Web Semantics: Science, Services and Agents on the World Wide Web, Issue 7, Pages 154–165, 2009.
- Further papers about DBpedia are found at Publications
Information
Last Modification:
2010-04-28 19:02:41 by Ted Thibodeau Jr
