Open data

Things and Stuff Wiki - An organically evolving personal wiki knowledge base. An on-the-fly taxonomy containing a patchwork trail of topic outlines, descriptions, notes, stubs and breadcrumbs, with links to sites, systems, software, manuals, organisations, people, articles, guides, slides, papers, books, comments, videos, screencasts, webcasts, scratchpads and more. Content is orientated towards mostly free/libre/open, mostly Linux. Quality and age varies drastically. Sometimes old things are first, sometimes last. Use the Table of Contents menu to navigate long pages. Zoom in if text is too small. Dead link? Wayback Machine. I probably need to fix the theme CSS after an update. See also libreav.org. Chat to msg me (not checking tho atm). e

General

data, noun

facts and statistics collected together for reference or analysis: there is very little data available
- the quantities, characters, or symbols on which operations are performed by a computer, which may be stored and transmitted in the form of electrical signals and recorded on magnetic, optical, or mechanical recording media.
- Philosophy things known or assumed as facts, making the basis of reasoning or calculation.

Forbes: A Very Short History Of Data Science

Data, Information, Knowledge, and Wisdom - some abstractions

Learning

Open Data

Semantic Web

Linked Open Data

http://en.wikipedia.org/wiki/Linked_data

W3C: Linking Open Data - The Open Data Movement aims at making data freely available to everyone. There are already various interesting open data sets available on the Web. Examples include Wikipedia, Wikibooks, Geonames, MusicBrainz, WordNet, the DBLP bibliography and many more which are published under Creative Commons or Talis licenses. The goal of the W3C SWEO Linking Open Data community project is to extend the Web with a data commons by publishing various open data sets as RDF on the Web and by setting RDF links between data items from different data sources. RDF links enable you to navigate from a data item within one data source to related data items within other sources using a Semantic Web browser. RDF links can also be followed by the crawlers of Semantic Web search engines, which may provide sophisticated search and query capabilities over crawled data. As query results are structured data and not just links to HTML pages, they can be used within other applications.
- http://www.w3.org/TR/ld-glossary/

Linked Data is about using the Web to connect related data that wasn't previously linked, or using the Web to lower the barriers to linking data currently linked using other methods. More specifically, Wikipedia defines Linked Data as "a term used to describe a recommended best practice for exposing, sharing, and connecting pieces of data, information, and knowledge on the Semantic Web using URIs and RDF." This site exists to provide a home for, or pointers to, resources from across the Linked Data community.

datavisualization.ch: Introduction to Linked Open Data for Visualization Creators
How to Publish Linked Data on the Web
http://www.slideshare.net/DLFCLIR/intro-to-lod

Learning Linked Data
Learn Linked Data - Helping you get to grips with RDF, SPARQL & linked data.
http://linkeddatabook.com/

LOD cloud diagram shows datasets that have been published in Linked Data format, by contributors to the Linking Open Data community project and other individuals and organisations. It is based on metadata collected and curated by contributors to the Data Hub. Clicking the image will take you to an image map, where each dataset is a hyperlink to its homepage.
- https://upload.wikimedia.org/wikipedia/commons/3/34/LOD_Cloud_Diagram_as_of_September_2011.png
- http://datahub.io/group/lodcloud

LodLive project provides a demonstration of the use of Linked Data standards (RDF, SPARQL) to browse RDF resources. The application aims to spread linked data principles using a simple and friendly interface with reusable techniques.

http://www.w3.org/DesignIssues/ReadWriteLinkedData.html

http://rww.io/ - webid
- http://www.w3.org/community/rww/2013/08/15/distributed-microblogging-with-rww-io-and-tabulator/

http://code.google.com/p/linked-data-api/

http://aksw.org/Projects/LODStats.html

http://lists.w3.org/Archives/Public/public-lod/2011Sep/0091.html

http://hypernotation.org/

RFD

Quick Intro to RDF

RDF is a general method to decompose any type of knowledge into small pieces, with some rules about the semantics, or meaning, of those pieces. The point is to have a method so simple that it can express any fact, and yet so structured that computer applications can do useful things with it.

The Concept of Triples

The basic unit of RDF is a statement called a triple. One can think of a triple as a type of sentence that states a single "fact" about a resource. RDF allows you to define statements about things (or resources), in the form of subject-predicate-object expressions (known as RDF-triples due to the 3 constituent parts).

http://www.w3.org/TR/rdf-schema/ - RDF Vocabulary Description Language 1.0: RDF Schema
- https://en.wikipedia.org/wiki/RDF_Schema

http://www.w3.org/TR/rdf-mt/ - semantics

Different RDF Formats - March 11 2013

The different forms for representing the RDF data are:

RDF/XML
Notation-3 (N3)
Turtle - a simplified, RDF-only subset of N3.
N-Triple
RDFa
TRiX
TRiG
JSON-LD

RDF/XML

http://www.w3.org/TR/rdf-syntax-grammar/

Here's some RDF XML:

<rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"

xmlns:ns="http://www.example.org/#">

 <ns:Person rdf:about="http://www.example.org/#john">
   <ns:hasMother rdf:resource="http://www.example.org/#susan" />
   <ns:hasFather>
     <rdf:Description rdf:about="http://www.example.org/#richard">
       <ns:hasBrother rdf:resource="http://www.example.org/#luke" />
     </rdf:Description>
   </ns:hasFather>
 </ns:Person>
</rdf:RDF>

HTTP Vocabulary in RDF 1.0
TriG - RDF Dataset Language. A concrete syntax for RDF as defined in the RDF Concepts and Abstract Syntax ([rdf11-concepts]). TriG is an extension of Turtle ([turtle]), extended to support representing a complete RDF Dataset.

http://www.w3.org/wiki/CustomRdfDialects

N3

Here's some N3 RDF:

@prefix : <http://www.example.org/> .
:john    a           :Person .
:john    :hasMother  :susan .
:john    :hasFather  :richard .
:richard :hasBrother :luke .

Turtle

https://en.wikipedia.org/wiki/Turtle_(syntax)

N-Triple

https://en.wikipedia.org/wiki/N-Triples

RDFa

2004. RDFa 1.1 reached recommendation status in June 2012.

RDFa is an extension to HTML5 that helps you markup things like People, Places, Events, Recipes and Reviews. Search Engines and Web Services use this markup to generate better search listings and give you better visibility on the Web, so that people can find your website more easily.
- https://en.wikipedia.org/wiki/RDFa

HTML+RDFa 1.1 - Support for RDFa in HTML4 and HTML5 [1]

JSON-LD

http://en.wikipedia.org/wiki/JSON-LD - RDF-like
- http://manu.sporny.org/2013/json-ld-is-the-bees-knees/

JSON-LD was created by people that have been directly involved in the Linked Data, lowercase semantic web, uppercase Semantic Web, Microformats, Microdata, and RDFa work. It has proven to be useful to them. There are a number of very large technology companies that have adopted JSON-LD, further underscoring its utility.

http://decentralyze.com/2010/06/04/from-json-to-rdf-in-six-easy-steps-with-jron/ - older system

http://rdfs.org/ - SIOC, ResumeRDF, SCOT

http://www.w3.org/2012/pyRdfa/extract?uri=http%3A%2F%2Fschema.org%2FNewsArticle&format=turtle&rdfagraph=output&vocab_expansion=false&rdfa_lite=false&embedded_rdf=true&space_preserve=true&vocab_cache=true&vocab_cache_report=false&vocab_cache_refresh=false - rdfs in rdfa

SIOC

SIOC initiative (Semantically-Interlinked Online Communities) aims to enable the integration of online community information. SIOC provides a Semantic Web ontology for representing rich data from the Social Web in RDF. It has recently achieved significant adoption through its usage in a variety of commercial and open-source software applications, and is commonly used in conjunction with the FOAF vocabulary for expressing personal profile and social networking information. By becoming a standard way for expressing user-generated content from such sites, SIOC enables new kinds of usage scenarios for online community site data, and allows innovative semantic applications to be built on top of the existing Social Web. The SIOC ontology was recently published as a W3C Member Submission, which was submitted by 16 organisations.
- http://rdfs.org/sioc/spec/
- http://johnbreslin.com/blog/2006/09/07/creating-connections-between-discussion-clouds-with-sioc/

http://sioc-project.org/node/158

ResumeRDF

http://rdfs.org/resume-rdf/

SCOT

SCOT is an acronym for Social Semantic Cloud of Tags. The name was chosen to emphasise the goal of providing a consistent framework for expressing social tagging at a semantic level in machine-understandable way. The SCOT ontology provides a model for expressing the main concepts and properties required to describe information for tagging activities (e.g., users, tags, resources, etc.) on the Semantic Web. This document contains a detailed description of the SCOT Ontology.

Data Cube

RDF Data Cube Vocabulary - There are many situations where it would be useful to be able to publish multi-dimensional data, such as statistics, on the web in such a way that it can be linked to related data sets and concepts. The Data Cube vocabulary provides a means to do this using the W3C RDF (Resource Description Framework) standard. The model underpinning the Data Cube vocabulary is compatible with the cube model that underlies SDMX (Statistical Data and Metadata eXchange), an ISO standard for exchanging and sharing statistical data and metadata among organizations. The Data Cube vocabulary is a core foundation which supports extension vocabularies to enable publication of other aspects of statistical data flows or other multi-dimensional data sets.

http://www.w3.org/community/ontolex/

http://www.w3.org/community/microxml/

Other

http://semanticweb.org/wiki/VoID

https://github.com/mhausenblas/schema-org-rdf

http://commontag.org/Home

http://musicontology.com/

http://www.faw.jku.at/wwoess/webs/webs.html#Topics

http://birte2013.cs.aau.dk/files/Matei.pdf
- http://semanticweb.org/wiki/OLAP_of_Linked_Data

Tools

http://rdf-translator.appspot.com/

http://jibbering.com/rdf-parser/

http://semweb.salzburgresearch.at/apps/rdf-gravity/

https://github.com/alangrafu/visualRDF
- http://graves.cl/visualRDF/?url=http://graves.cl/visualRDF/

http://rdface.aksw.org/
- http://wiki.aksw.org/Projects/RDFaCE

W3C: RDFImportersAndAdapters
W3C: ConverterToRdf converts application data from an application-specific format into RDF for use with RDF tools and integration with other data. Converters may be part of a one-time migration effort, or part of a running system which provides a semantic web view of a given application.

Redland is a set of free software C libraries that provide support for the Resource Description Framework (RDF).
- Raptor is a free software / Open Source C library that provides a set of parsers and serializers that generate Resource Description Framework (RDF) triples by parsing syntaxes or serialize the triples into a syntax. The supported parsing syntaxes are RDF/XML, N-Quads, N-Triples, TRiG, Turtle, RDFa 1.0 and 1.1, RSS tag soup including all versions of RSS, Atom 1.0 and 0.3, GRDDL and microformats for HTML, XHTML and XML. The serializing syntaxes are RDF/XML (regular, and abbreviated), Atom 1.0, GraphViz, JSON, N-Quads, N-Triples, RSS 1.0 and XMP.

rdfstore-js is a pure Javascript implementation of a RDF graph store with support for the SPARQL query and data manipulation language. node.js

http://www.visualdataweb.org/tools.php

http://code.google.com/p/cumulusrdf/

http://graphite.ecs.soton.ac.uk/
- http://graphite.ecs.soton.ac.uk/browser/
- https://github.com/semsol/arc2

other

http://en.wikipedia.org/wiki/RDF_query_language

http://sw.deri.org/2007/07/sitemapextension/

http://rhizomik.net/html/redefer/rdf2svg-form/

https://en.wikipedia.org/wiki/XDI

[http://people.csail.mit.edu/emax/papers/atomate-www2010-camera.pdf Atomate It! End-user Context-Sensitive Automation using

Heterogeneous Information Sources on the Web

http://www.slideshare.net/otaviofff/semantic-web-services-a-restful-approach]

OWL

http://en.wikipedia.org/wiki/Web_Ontology_Language

http://www.w3.org/TR/owl2-overview/

http://www.w3.org/TR/owl2-mapping-to-rdf/

https://en.wikipedia.org/wiki/Synonym_ring

http://www.pr-owl.org/

http://ontorule-project.eu/parrot/parrot

SPARQL

http://en.wikipedia.org/wiki/SPARQL

http://stackoverflow.com/questions/7163639/how-to-use-json-output-from-external-sparql-request-directly-from-browser

http://data-gov.tw.rpi.edu/ws/sparqlproxy.php

http://data.semanticweb.org/snorql/

http://sparallax.deri.ie/

http://revyu.com/

http://www.w3.org/TR/sparql11-http-rdf-update/

LOV

http://lov.okfn.org/dataset/lov/

GRDDL

GRDDL is a mechanism for Gleaning Resource Descriptions from Dialects of Languages. It is a technique for obtaining RDF data from XML documents and in particular XHTML pages. Authors may explicitly associate documents with transformation algorithms, typically represented in XSLT, using a link element in the head of the document. Alternatively, the information needed to obtain the transformation may be held in an associated metadata profile document or namespace document.
- https://en.wikipedia.org/wiki/GRDDL

VOAF

http://lov.okfn.org/vocab/voaf/v2.3/index.html

Web Observatory

Python

https://pypi.python.org/pypi/django-4store

http://www.slideshare.net/alchueyr/getting-the-most-out-of-sparql-with-python

JavaScript

Sgvizler is a javascript which renders the result of SPARQL SELECT queries into charts or html elements. It is cool stuff (ivan_herman, timbl).
- http://sgvizler.googlecode.com/svn/release/0.4/example/index.html

http://www.w3.org/2011/rdf-wg/wiki/JSON-Serialization-Examples

REST

See WebDev#API

http://watson.kmi.open.ac.uk/REST_API.html

http://answers.semanticweb.com/questions/2763/the-relation-of-linked-datasemantic-web-to-rest

http://www.w3.org/TR/ldp/
- http://www.w3.org/2012/ldp/
- http://www.w3.org/2012/ldp/charter.html - Linked Data Platform (LDP) Working Group is to produce a W3C Recommendation for HTTP-based (RESTful) application integration patterns using read/write Linked Data. This work will benefit both small-scale in-browser applications (WebApps) and large-scale Enterprise Application Integration (EAI) efforts. It will complement SPARQL and will be compatible with standards for publishing Linked Data, bringing the data integration features of RDF to RESTful, data-oriented software development.
- http://lists.w3.org/Archives/Public/public-ldp-wg
- http://www.w3.org/Submission/ldbp/

Validation

http://www.w3.org/2001/sw/wiki/SWValidators

Search

http://en.wikipedia.org/wiki/Swoogle

Other

Data sources

http://www.w3.org/wiki/TaskForces/CommunityProjects/LinkingOpenData/DataSets

http://www.reddit.com/r/datasets/

http://semwebquality.org/

Hubs / platforms

CKAN is a fully-featured, mature, open source data portal and data management solution. CKAN provides a streamlined way to make your data discoverable and presentable. Each dataset is given its own page with a rich collection of metadata, making it a valuable and easily searchable resource.
- http://ckan.org/features/visualise/

UK

NCVO UK Civil Society Almanac Datastore

http://www.data-archive.ac.uk/find/hasset-thesaurus/skos-hasset This new resource is an outcome of the Jisc-funded SKOS-HASSET project, led by staff at the UK Data Archive at the University of Essex, which owns and manages HASSET. Like dictionaries, thesauri describe the changing world around them; this is why the UK Data Archive continues work to ensure HASSET is up to date. Simple Knowledge Organisation System(SKOS) makes the thesaurus machine-readable. It is the version of Resource Description Framework (RDF) specific to classification resources. It encodes these products in a standardised way to make their structures comparable and to facilitate interaction.

Government

Data.gov.uk is a key part of the Government's work on Transparency which is being lead by the Transparency Board. Data.gov.uk implementation is being led by the Transparency and Open Data team in the Cabinet Office, working across government departments to ensure that data is released in a timely and accessible way. This work is being supported by Sir Tim-Berners Lee & Professor Nigel Shadbolt. There are a number of technical partners involved in the project to date. These include the Comprehensive Knowledge Archive Network (CKAN): CKAN runs the catalogue at data.gov.uk/data as well as a growing number of open data registries around the world. It is a project created by the Open Knowledge Foundation to make it easy to find, share and reuse open content and data. The CKAN software provides a web interface, programmer's API, feeds notifying of changes, and a browsable history of all changes. The API is documented here: http://data.gov.uk/data/api.
- http://data.gov.uk/linked-data

data.gov.uk: Who is doing what? - This page lists the domains which publish and maintain linked data and short term projects developing the government use of linked data. Most sectors have one or more SPARQL endpoints, which enable you to perform searches across the data; you can access these interactively on this site.

London Datastore has been created by the Greater London Authority (GLA) as an innovation towards freeing London’s data. We want citizens to be able access the data that the GLA and other public sector organisations hold, and to use that data however they see fit – free of charge. The GLA is committed to influencing and cajoling other public sector organisations into releasing their data here too.

http://www.datagm.org.uk/ - manchester

Open Data Communities - Open Access to Local Data. This site is the UK Department for Communities and Local Government's official Linked Open Data site. It provides a selection of statistics on a variety of themes including Local Government finance, housing and homelessness, wellbeing, deprivation, and the department's business plan as well as supporting geographical data. All of the data is available as fully browsable and queryable Linked Data, and the majority is free to re-use under the Open Government Licence.

http://api.opencorporates.com/documentation/Home

Education

http://ukdataservice.ac.uk/

http://linkedup-project.eu/

http://ldfocus.blogs.edina.ac.uk/

http://datashare.is.ed.ac.uk/

BBC

http://www.bbc.co.uk/developer/technology/apis.html

http://www.johngoodwin.me.uk/mashups/6radio.html

http://api.bbcnews.appengine.co.uk/

http://www.programmableweb.com/api/bbc/mashups

Other

http://www.nestoria.co.uk/help/api

Scotland

https://sites.google.com/site/scottishlinkeddataswig/

National

A Digital Ambition for Scotland - October 22 2010
Scotland's Digital Future: A Strategy for Scotland - Strategy setting out how the Scottish Government will ensure Scotland takes full advantage of digital technology.

"Action 2.4 We will develop proposals with partners for releasing more government information and data for use by the public. Initial proposals to be developed and implementation to begin by end of July 2011. We invite suggestions for areas where the greater availability of public data could lead to new services or innovative applications " - March 3 2011

http://www.govcampscotland.com/conversation/digital-future

Scottish Index of Multiple Deprivation - Data Sources and Suitability

http://bigdata.holyrood.com/agenda
- http://www.youtube.com/watch?v=5el8G6Xo1Lc

Health

http://www.isdscotland.org/Products-and-Services/eDRIS/DSLS-Consultation/

ALISS stands for Access to Local Information to Support Self Management. It’s a wide-ranging project taking a number of approaches to making it easier to find local self management support.