Open data
Jump to navigation
Jump to search
General
See also Data, Semantic, Database, WebDev#API, Free/open,
- https://en.wikipedia.org/wiki/Data_publishing - also data publication) is the act of releasing research data in published form for use by others. It is a practice consisting in preparing certain data or data set(s) for public use thus to make them available to everyone to use as they wish. This practice is an integral part of the open science movement. There is a large and multidisciplinary consensus on the benefits resulting from this practice
- The ODI - Open Data Institute, works with companies and governments to build an open, trustworthy data ecosystem, where people can make better decisions using data and manage any harmful impacts.
- https://en.wikipedia.org/wiki/Open_Data_Institute - a non-profit private company limited by guarantee, based in the United Kingdom. Founded by Sir Tim Berners-Lee and Sir Nigel Shadbolt in 2012, the ODI's mission is to connect, equip and inspire people around the world to innovate with data.
- Frictionless Data - a progressive open-source framework for building data infrastructure – data management, data integration, data flows, etc. It includes various data standards and provides software to work with data. The software is based on a suite of data standards that have been designed to make it easy to describe data structure and content so that data is more interoperable, easier to understand, and quicker to use. There are several aspects to the Frictionless software, including two high-level data frameworks (for Python and JavaScript), 10 low-level libraries for other languages, like R, and also visual interfaces and applications. You can read more about how to use the software (and find documentation) on the projects page.
- frictionless-ci | Frictionless Repository - Continuous Data Validation: With Frictionless Repository you can ensure the quality of your data. This Github Action will report any problems with your data like bad header or missing cells.
Data sources
- The Our World in Data-Grapher - The Our World in Data Grapher is the open-source tool to store and visualize data developed by the Our World in Data team.As every other tool developed and used at Our World in Data, the Grapher is also open source and free to use on any other web publication. You can find all the code in the Github repository, published under the MIT license.
- LC for Robots | Library of Congress - Explore the many ways the Library of Congress provides machine-readable access to its digital collections.
Hubs / platforms
CKAN
- CKAN is a fully-featured, mature, open source data portal and data management solution. CKAN provides a streamlined way to make your data discoverable and presentable. Each dataset is given its own page with a rich collection of metadata, making it a valuable and easily searchable resource.
Socrata
- Socrata - In support of its commitment to the open data community and to the proliferation of open data standards, Socrata is proud to bring you the "Socrata Open Data Server, Community Edition." Community Edition is a freely-available, open source product that shares the core of our open data platform. Read here about the motivations behind the Socrata Open Data Server, the architecture of the system we are building, and how to contribute.
- Socrata - The Socrata Open Data API allows you to programmatically access a wealth of open data resources from governments, non-profits, and NGOs around the world. Click the link below and try a live example right now.
- https://en.wikipedia.org/wiki/Socrata - business-to-government software company, that sells an "open data platform" whose goal was to help "civic developers build apps more efficiently." In July 2014, Socrata launched the Open Data Network, a machine learning-powered initiative aimed at promoting data-centered collaboration between the public and private sectors. This network provides governments with access to various types of data, including crime data, transit data, 311 service request data, and expenditure data.[9] The San Francisco administration later incorporated the open data network into its operations.
- Swagger UI - Service for Data Access This service implements the IVOA SODA-1.0 service specification. To use this service, the caller must use a dataset identifier found through some means (for example, querying the ALMA TAP ObsCore Service). The SODA service provides a drill-down mechanism to access the data files and associated resources.
SODA API
- SODA API - SODA Foundation - Provides the standardization for Data / Storage Management APIs. Currently we support block and file APIs for key features of data management (provisioning, migration, fileshare, etc). Working to add the storage management APIs. This is the key external interface to platforms, which can do a seamless integration with heterogeneous storage backends. Users can develop SODA North-Bound Plugins (SODA NBP) under SODA NBP project to connect any platform or application solutions to SODA API from north for all storage/data requirements. We envision this to be the reference implementation of SODA Data Standard API Specification, which we plan to work with our industry partners and standards bodies. At that stage, this layer will upgraded to support Block, File and Object APIs across the Edge, Core and Cloud.
- SODA API Specification :: Documentation for SODA Project - Standards for Data and Storage are an umbrella API Standards comprising of a collection of multiple data and storage API specifications released by the SODA Foundation (under Linux Foundation). It provides unified RESTful API interfaces with standardized data models for data and storage across the edge, core(on-prem), and cloud. It will consolidate, update, or develop API definitions to provide unified, extensible, and open industry standards collaborating with partners, vendors, and standard associations. It will have the universal application with needed customization as per the country, region and other legal needs where it is used/deployed Overall Scope SODA API Standards for Data and Storage aim to put together a set of specifications that would be: Unified | Open | Vendor-neutral | Platform agnostic | Environment aware | Extensible This document provides the latest versions of all API specifications under SODA API Standards for Data and Storage. Audience Main audience members are (not limited to) SODA API Standards Team, API Specification Software implementors, Platform&Vendors who want to utilize SODA API for their solutions.
- https://github.com/sodafoundation/soda
- https://github.com/sodafoundation/api - SODA Terra Project API module : is an open source implementation of SODA API connecting storage to platforms like Kubernetes, OpenStack, and VMware
- SODA Foundation - an open source project under Linux Foundation that aims to establish an open, unified, and autonomous data management framework for data mobility from the edge, to core, to cloud. SODA brings together industry leaders to collaborate on building a common framework to promote standardization and best practices for data storage, data protection, data governance, data analytics, etc. to support IoT, big data, machine learning, and other applications. We are fostering collaboration and innovation across vendors, system integrators, cloud service providers, standards organizations, and consortiums across different industries, to provide quality end-to-end solutions to end users.
- SODA Foundation Documentation - an open source project under Linux Foundation that aims to foster an ecosystem of open source data management and storage software for data autonomy. SODA Foundation offers a neutral forum for cross-projects collaboration and integration and provides end users quality end-to-end solutions.
- https://github.com/sodafoundation/delfin - the SODA Infrastructure Manager project is an an open source project to provide unified, intelligent and scalable resource management, alert and performance monitoring. It will cover the resource management of all the storage backends & other infrastructures under SODA deployment. It will also provide the alert management and metric data(performance/health) for monitoring and further analysis. It will provide a scalable framework where more and more backends as well as client exporters can be added. This will enable to add more storage and infrastructure backends and also support different management clients for monitoring and health prediction. It provides unified APIs to access, export and connect with clients as well as a set of interfaces for various driver addition.
UK
- http://www.data-archive.ac.uk/find/hasset-thesaurus/skos-hasset This new resource is an outcome of the Jisc-funded SKOS-HASSET project, led by staff at the UK Data Archive at the University of Essex, which owns and manages HASSET. Like dictionaries, thesauri describe the changing world around them; this is why the UK Data Archive continues work to ensure HASSET is up to date. Simple Knowledge Organisation System(SKOS) makes the thesaurus machine-readable. It is the version of Resource Description Framework (RDF) specific to classification resources. It encodes these products in a standardised way to make their structures comparable and to facilitate interaction.
Government
- Data.gov.uk is a key part of the Government's work on Transparency which is being lead by the Transparency Board. Data.gov.uk implementation is being led by the Transparency and Open Data team in the Cabinet Office, working across government departments to ensure that data is released in a timely and accessible way. This work is being supported by Sir Tim-Berners Lee & Professor Nigel Shadbolt. There are a number of technical partners involved in the project to date. These include the Comprehensive Knowledge Archive Network (CKAN): CKAN runs the catalogue at data.gov.uk/data as well as a growing number of open data registries around the world. It is a project created by the Open Knowledge Foundation to make it easy to find, share and reuse open content and data. The CKAN software provides a web interface, programmer's API, feeds notifying of changes, and a browsable history of all changes. The API is documented here: http://data.gov.uk/data/api.
- data.gov.uk: Who is doing what? - This page lists the domains which publish and maintain linked data and short term projects developing the government use of linked data. Most sectors have one or more SPARQL endpoints, which enable you to perform searches across the data; you can access these interactively on this site.
- National Institute for Health Research - Clinical Research Network: App Centre
- https://www.odp.nihr.ac.uk/ODP_QlikView%20Reporting%20User%20Guide%20v0.4.pdf
- London Datastore has been created by the Greater London Authority (GLA) as an innovation towards freeing London’s data. We want citizens to be able access the data that the GLA and other public sector organisations hold, and to use that data however they see fit – free of charge. The GLA is committed to influencing and cajoling other public sector organisations into releasing their data here too.
- http://www.datagm.org.uk/ - manchester
- Open Data Communities - Open Access to Local Data. This site is the UK Department for Communities and Local Government's official Linked Open Data site. It provides a selection of statistics on a variety of themes including Local Government finance, housing and homelessness, wellbeing, deprivation, and the department's business plan as well as supporting geographical data. All of the data is available as fully browsable and queryable Linked Data, and the majority is free to re-use under the Open Government Licence.
Education
BBC
- http://www.bbc.co.uk/blogs/internet/posts/News-Linked-Data-Ontology
- http://www.infoq.com/presentations/bbc-data-platform-api
Other
Scotland
National
- A Digital Ambition for Scotland - October 22 2010
- Scotland's Digital Future: A Strategy for Scotland - Strategy setting out how the Scottish Government will ensure Scotland takes full advantage of digital technology.
"Action 2.4 We will develop proposals with partners for releasing more government information and data for use by the public. Initial proposals to be developed and implementation to begin by end of July 2011. We invite suggestions for areas where the greater availability of public data could lead to new services or innovative applications " - March 3 2011
- http://www.scotland.gov.uk/Topics/Government/sustainabilityperformance/Data
- http://cofog01.data.scotland.gov.uk/ - linked data
- Scottish Index of Multiple Deprivation - Data Sources and Suitability
Health
- ALISS stands for Access to Local Information to Support Self Management. It’s a wide-ranging project taking a number of approaches to making it easier to find local self management support.
Local
Ireland
Europe
- http://open-data.europa.eu/
- http://ec.europa.eu/information_society/policy/psi/open_data_portal/index_en.htm
- http://publicdata.eu/
- http://lod2.okfn.org/eu-data-catalogues/
- http://latc-project.eu/datasets
- http://www.oecd.org/statistics/
- http://www.eea.europa.eu/data-and-maps
USA
- Bulk Data | GovInfo - select collections available in bulk in a machine-readable format (i.e. XML) via our Bulk Data Repository. The top level directory for select collections, such as the Federal Register and Code of Federal Regulations, also includes a Resources directory that contains the XML schema, XSL stylesheet, and user guide.
- https://github.com/usgpo/bulk-data - User Guides for XML on the govinfo Bulk Data Repository
Gloal
- http://www.mpi-inf.mpg.de/yago-naga/yago/ - YAGO2s is a huge semantic knowledge base, derived from Wikipedia WordNet and GeoNames. Currently, YAGO2s has knowledge of more than 10 million entities (like persons, organizations, cities, etc.) and contains more than 120 million facts about these entities.
UN
WordNet
- http://wordnet.princeton.edu/
- http://wordnet.princeton.edu/wordnet/related-projects/
- https://en.wikipedia.org/wiki/WordNet - english language semantic relations
- http://globalwordnet.org/
Crowdsourced
DBpedia
WikiData
Geo
Other
- http://echoprint.me/data_download - music id
internet of things;
Wikxhibit
- Wikxhibit - Author interactive applications of Wikidata and other sources of data on the web
Commercial
- http://www.whoownsscotland.org.uk/ - has to cover land registry cost?
Development
- Observable - Discover insights faster and communicate more effectively with interactive notebooks for data analysis, visualization, and exploration.
JavaScript
APIs
See WebDev#API
- Swagger Specification - an API description format for REST APIs. An OpenAPI file allows you to describe your entire API, including:
- Improve CSVs and API descriptions with these Open Standards Board recommendations - Technology in government - [4]
- https://github.com/zalando/connexion - a framework that automagically handles HTTP requests based on OpenAPI Specification (formerly known as Swagger Spec) of your API described in YAML format. Connexion allows you to write an OpenAPI specification, then maps the endpoints to your Python functions; this makes it unique, as many tools generate the specification based on your Python code. You can describe your REST API in as much detail as you want; then Connexion guarantees that it will work as you specified.