Database

General

Relational

WP: Relational_database_management_system - a database management system (DBMS) that is based on the relational model as invented by E. F. Codd, of IBM's San Jose Research Laboratory. In 2017, many of the databases in widespread use are based on the relational database model.

WP: Edgar_F._Codd

WP: Relation_(database)

https://hpi.de/naumann/projects/rdbms_genealogy.html [3]

WP: Codd%27s_theorem

WP: Relational_calculus - consists of two calculi, the tuple relational calculus and the domain relational calculus, that are part of the relational model for databases and provide a declarative way to specify database queries. This is in contrast to the relational algebra, which is also part of the relational model but provides a more procedural way of specifying queries.

WP: Tuple_relational_calculus

WP: Domain_relational_calculus

WP: Relational_algebra - first created by E.F. Codd while at IBM, is a family of algebras with a well-founded semantics used for modelling the data stored in relational databases, and defining queries on it. The main application of relational algebra is providing a theoretical foundation for relational databases, particularly query languages for such databases, chief among which is SQL.
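To make the algebra a bit more concrete, here is a minimal sketch of three of its operators (selection, projection, natural join) over relations modelled as lists of dicts. The function names and the sample employees/departments data are illustrative assumptions, not taken from any library or from the linked article.

```python
# Relations are modelled as lists of dicts (one dict per tuple); all names are illustrative.

def select(relation, predicate):
    """Selection (sigma): keep the tuples that satisfy the predicate."""
    return [row for row in relation if predicate(row)]

def project(relation, attributes):
    """Projection (pi): keep only the named attributes, dropping duplicate tuples."""
    seen = []
    for row in relation:
        reduced = {a: row[a] for a in attributes}
        if reduced not in seen:
            seen.append(reduced)
    return seen

def natural_join(left, right):
    """Natural join: combine tuples that agree on all shared attribute names."""
    result = []
    for l in left:
        for r in right:
            shared = set(l) & set(r)
            if all(l[a] == r[a] for a in shared):
                result.append({**l, **r})
    return result

employees = [
    {"name": "Ada", "dept_id": 1},
    {"name": "Edgar", "dept_id": 2},
    {"name": "Grace", "dept_id": 1},
]
departments = [
    {"dept_id": 1, "dept": "Research"},
    {"dept_id": 2, "dept": "Storage"},
]

# Roughly: SELECT name FROM employees NATURAL JOIN departments WHERE dept = 'Research';
research_names = project(
    select(natural_join(employees, departments), lambda row: row["dept"] == "Research"),
    ["name"],
)
print(research_names)
```

A real query planner would reorder these operators (for example, pushing the selection below the join); the sketch just composes them in the order written.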

WP: Relational_model
- https://www.youtube.com/watch?v=nc1yivH1Yac

WP: Database_normalization

WP: First_normal_form

etc.

WP: Boyce%E2%80%93Codd_normal_form

WP: Entity-relationship_model

WP: Unique_key

Google I/O 2012 - SQL vs NoSQL: Battle of the Backends

http://programmers.stackexchange.com/questions/190482/why-use-a-database-instead-of-just-saving-your-data-to-disk

Jailer - a tool for database subsetting, schema and data browsing. It exports consistent, referentially intact row-sets from relational databases. It removes obsolete data without violating integrity. It is DBMS agnostic (by using JDBC), platform independent, and generates DbUnit datasets, hierarchically structured XML, and topologically sorted SQL-DML.
- https://github.com/Wisser/Jailer

http://dbdsgnr.appspot.com/

http://www.cubrid.org/

http://wozniak.ca/what-orms-have-taught-me-just-learn-sql [4]

WP: Two-phase_locking - In databases and transaction processing, two-phase locking (2PL) is a concurrency control method that guarantees serializability. It is also the name of the resulting set of database transaction schedules (histories). The protocol uses locks, applied by a transaction to data, which may block (interpreted as signals to stop) other transactions from accessing the same data during the transaction's life. By the 2PL protocol, locks are applied and removed in two phases: Expanding phase: locks are acquired and no locks are released. Shrinking phase: locks are released and no locks are acquired.
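The two phases can be sketched as a toy transaction object that refuses to take a new lock once it has released one. This is only an illustration of the rule, assuming an in-process lock table built from threading.Lock; the class name and the balances example are hypothetical, and real databases add deadlock handling, shared/exclusive modes and more.

```python
import threading

class TwoPhaseTransaction:
    """Toy transaction enforcing the 2PL rule: no lock may be acquired after any release."""

    def __init__(self, lock_table):
        self.lock_table = lock_table   # shared dict: item name -> threading.Lock
        self.held = []                 # items currently locked by this transaction
        self.shrinking = False         # becomes True at the first release

    def acquire(self, item):
        # Expanding phase only: acquiring after a release would break 2PL.
        if self.shrinking:
            raise RuntimeError("2PL violation: cannot acquire after releasing")
        self.lock_table[item].acquire()   # blocks while another transaction holds it
        self.held.append(item)

    def release(self, item):
        # The first release switches the transaction into its shrinking phase.
        self.shrinking = True
        self.held.remove(item)
        self.lock_table[item].release()

    def release_all(self):
        for item in list(self.held):
            self.release(item)

locks = {"a": threading.Lock(), "b": threading.Lock()}
balances = {"a": 100, "b": 0}

txn = TwoPhaseTransaction(locks)
txn.acquire("a")            # expanding phase
txn.acquire("b")
balances["a"] -= 30         # work done while both locks are held
balances["b"] += 30
txn.release("a")            # shrinking phase begins
try:
    txn.acquire("a")        # not allowed any more
    violation_detected = False
except RuntimeError:
    violation_detected = True
txn.release_all()
```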

Concurrency Freaks: 50 years later, is Two-Phase Locking the best we can do?

Object

WP: Object_database

WP: Object-relational_database

Media

WP: Multimedia_database - a collection of related multimedia data. The multimedia data include one or more primary media data types such as text, images, graphic objects (including drawings, sketches and illustrations), animation sequences, audio and video. A Multimedia Database Management System (MMDBMS) is a framework that manages different types of data potentially represented in a wide diversity of formats on a wide array of media sources. It provides support for multimedia data types and facilitates the creation, storage, access, query and control of a multimedia database.

Vector

What is vector search? - Algolia Blog | Algolia - Vector search is a way to find related objects that have similar characteristics using machine learning models that detect semantic relationships between objects in an index. Solutions for vector search and recommendation are becoming more and more common. If you want to add a natural language text search on your site, create image search, or build a powerful recommendation system, you’ll want to look into using vectors.
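As a rough illustration of the idea, the sketch below does a brute-force nearest-neighbour search by cosine similarity over a tiny hand-made index. The three-dimensional "embeddings" are made-up numbers standing in for model output, and the function names are hypothetical; real vector databases use approximate indexes rather than scanning every vector.

```python
import math

def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)

def search(index, query, k=2):
    """Return the keys of the k vectors most similar to the query (brute force)."""
    scored = [(cosine_similarity(vec, query), key) for key, vec in index.items()]
    scored.sort(reverse=True)
    return [key for _, key in scored[:k]]

# Made-up embeddings; in practice these would come from an ML model.
index = {
    "cat": [0.9, 0.1, 0.0],
    "kitten": [0.8, 0.2, 0.1],
    "car": [0.0, 0.1, 0.9],
}
nearest = search(index, [1.0, 0.0, 0.0])
print(nearest)
```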

A 101 overview of vector databases, vector embeddings, and indexing | by Mostafa Ibrahim | The Techlife | Medium

Vector databases (Part 4): Analyzing the trade-offs · The Data Quarry

https://github.com/m1guelpf/tinyvector - a tiny embedding database in pure Rust

https://github.com/typesense/typesense - Open Source alternative to Algolia + Pinecone and an Easier-to-Use alternative to ElasticSearch ⚡ 🔍 ✨ Fast, typo tolerant, in-memory fuzzy Search Engine for building delightful search experiences

Embedded

WP: Embedded_database - a database management system which is tightly integrated with an application software; it is embedded in the application.
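SQLite is a common example of this: the engine is a library linked into the application, so there is no server process to connect to. A minimal sketch using Python's standard sqlite3 module with an in-memory database (the notes table is just an illustrative assumption):

```python
import sqlite3

# The database lives inside this process; ":memory:" keeps it off disk entirely.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE notes (id INTEGER PRIMARY KEY, body TEXT NOT NULL)")

# Using the connection as a context manager commits on success, rolls back on error.
with conn:
    conn.executemany("INSERT INTO notes (body) VALUES (?)", [("first",), ("second",)])

rows = conn.execute("SELECT id, body FROM notes ORDER BY id").fetchall()
print(rows)
conn.close()
```

Passing a file path instead of ":memory:" would persist the data in a single file next to the application.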

Chroma

Chroma - the open-source embedding database. The fastest way to build Python or JavaScript LLM apps with memory!
- https://github.com/chroma-core/chroma

SpacetimeDB

SpacetimeDB - You can think of SpacetimeDB as a database that is also a server. It is a relational database system that lets you upload your application logic directly into the database by way of very fancy stored procedures called "modules". Instead of deploying a web or game server that sits in between your clients and your database, your clients connect directly to the database and execute your application logic inside the database itself. You can write all of your permission and authorization logic right inside your module just as you would in a normal server. This means that you can write your entire application in a single language, Rust, and deploy it as a single binary. No more microservices, no more containers, no more Kubernetes, no more Docker, no more VMs, no more DevOps, no more infrastructure, no more ops, no more servers.
- https://github.com/clockworklabs/SpacetimeDB

Tarantool

Tarantool - middleware for data

https://github.com/tarantool/tarantool - an in-memory computing platform consisting of a database and an application server.

https://github.com/tarantool/awesome-tarantool - A curated list of delightful Tarantool modules, connectors and other resources

SQL

ugh

WP: SQL

https://github.com/enochtangg/quick-SQL-cheatsheet - A quick reminder of all SQL queries and examples on how to use them.

SQL: One of the most valuable skills - [5]

Modern SQL: A lot has changed since SQL-92

YouTube: SQL (Video versions) - Khan Academy

http://blog.sqlizer.io/posts/sql-43/ [6]

WP: Information_schema - an ANSI-standard set of read-only views that provide information about all of the tables, views, columns, and procedures in a database. It can be used as a source of the information that some databases make available through non-standard commands, such as:
the SHOW command of MySQL
the DESCRIBE command of Oracle's SQL*Plus
the \d command in psql (PostgreSQL's default command-line program)
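SQLite does not implement information_schema, but it has its own non-standard equivalents in the same spirit as the commands listed above: the sqlite_master catalog table and PRAGMA table_info. A small sketch using Python's sqlite3 module, with hypothetical users/posts tables:

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE users (id INTEGER PRIMARY KEY, name TEXT NOT NULL)")
conn.execute("CREATE TABLE posts (id INTEGER PRIMARY KEY, user_id INTEGER, title TEXT)")

# List the tables, similar in purpose to querying information_schema.tables.
tables = [
    row[0]
    for row in conn.execute(
        "SELECT name FROM sqlite_master WHERE type = 'table' ORDER BY name"
    )
]

# List column names and declared types, similar in purpose to information_schema.columns.
# Each PRAGMA table_info row is (cid, name, type, notnull, dflt_value, pk).
columns = [(row[1], row[2]) for row in conn.execute("PRAGMA table_info(users)")]

print(tables)
print(columns)
conn.close()
```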

WP: Relation_(database)

WP: Select_(SQL)

SELECT * FROM table_name;

WP: Insert_(SQL)

WP: Update_(SQL)

WP: Delete_(SQL)

https://news.ycombinator.com/item?id=16772276

http://www.udel.edu/evelyn/SQL-Class2/SQLclass2_Join.html

WP: Set_operations_(SQL)

WP: View_(SQL)

WP: Database_transaction

https://blog.jooq.org/2016/12/09/a-beginners-guide-to-the-true-order-of-sql-operations/

http://tech.pro/tutorial/1555/10-easy-steps-to-a-complete-understanding-of-sql [7]

http://sqlfiddle.com/

http://www.codinghorror.com/blog/2007/10/a-visual-explanation-of-sql-joins.html

https://vimeo.com/56639635 - Michael 'Monty' Widenius - Author of the MySQL Server and MariaDB fork

http://htsql.org/

SQL-DK – a batch/terminal client for relational databases

https://news.ycombinator.com/item?id=12671667

https://osquery.io/ [8]

SQL 3d engine (interactive preview) / Observable

Emoji in SQL - SELECT 🗣 FROM 👤

https://github.com/xo/usql - Universal command-line interface for SQL databases [9]

https://github.com/forbesmyester/esqlate - Build minimum viable admin panels quickly with just SQL

https://github.com/lerocha/chinook-database - Sample database for SQL Server, Oracle, MySQL, PostgreSQL, SQLite, DB2

MySQL

Wikipedia:MySQL
- InnoDB is the default storage engine for MySQL

MariaDB is compatible with MySQL; you probably want to use that, if not Postgres.

https://linux.die.net/man/1/mysql

https://wiki.archlinux.org/index.php/MySQL

Gist: MySQL Command Line Cheatsheet

MySQL Reference Manuals
- 4.3.1. mysqld — The MySQL Server

5.1.2. Server Command Options

Developer Zone

MySQL Forge

YouTube: MySQL - A series covering working with MySQL including managing databases, tables and data.

YouTube: Mysql Database - playlist

Connecting

mysql -p
  # connect as the default user (on Unix, your current login name), prompt for password

mysql -u username -ppassword -h nonlocalhost dbname
  # specify user, password and nonlocalhost address, USE dbname

https://dev.mysql.com/doc/refman/5.7/en/mysql-command-options.html

Admin

MySQL Administrator Best Practices

http://www.ssw.com.au/ssw/Standards/Rules/RulesToBetterSQLServerDatabases.aspx

mysqladmin - a client for performing administrative operations. You can use it to check the server's configuration and current status, to create and drop databases, and more.

YouTube: 15 Mysql Database MySQL Admin

http://serverfault.com/questions/9948/what-is-the-debian-sys-maint-mysql-user-and-more

SHOW engines;

SHOW processlist;

SHOW variables;

Database management

SHOW databases;

USE [db name];

CREATE DATABASE [dbname];

CREATE DATABASE IF NOT EXISTS [dbname];

mysql -u username -p -e "CREATE DATABASE dbname CHARACTER SET utf8 COLLATE utf8_general_ci";
  # drupal 7

select database();
  # show which database is in use

DROP database dbname;
  # delete the database dbname, including all of its tables

User management

CREATE USER 'jeffrey'@'localhost' IDENTIFIED BY 'newpassword';

SELECT User FROM mysql.user;
  # show all database users

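USE mysql;
SET PASSWORD FOR 'user-name-here'@'host-name-here' = PASSWORD('new-password-here');
  # PASSWORD() was removed in MySQL 8.0; there, use: ALTER USER 'user-name-here'@'host-name-here' IDENTIFIED BY 'new-password-here';

UPDATE mysql.user SET Password=PASSWORD('new-password-here') WHERE User='user-name-here' AND Host='host-name-here';
  # older servers only: from MySQL 5.7 the Password column is replaced by authentication_string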

Passwords

6.3.5. Assigning Account Passwords

SET PASSWORD FOR 'user'@'localhost' = PASSWORD('mypass');

mysqladmin -u user_name -h host_name password "newpwd"

Resetting a forgotten MySQL root password, starting with --skip-grant-tables

UPDATE mysql.user SET Password=PASSWORD('NewPassword') WHERE User='root'; FLUSH PRIVILEGES;

Permissions

http://dev.mysql.com/doc/refman/5.8/en/default-privileges.html

SHOW GRANTS;
  # show permissions for current db user

SHOW GRANTS FOR user@localhost;
  # show permissions for a user

SELECT CONCAT("SHOW GRANTS FOR '",user,"'@'",host,"';") FROM mysql.user WHERE host!='localhost';
 # Create command list for showing user grants [10]

GRANT all on [dbname].* TO '[username]'; [11]

GRANT SELECT, INSERT, UPDATE, DELETE, CREATE, DROP, INDEX, ALTER, CREATE TEMPORARY TABLES ON databasename.* TO 'username'@'localhost' IDENTIFIED BY 'password';
  # drupal 7

create database wikidb;
grant index, create, select, insert, update, delete, alter, lock tables on wikidb.* to 'wikiuser'@'localhost' identified by 'password';
#mediawiki??

https://dev.mysql.com/doc/refman/5.7/en/revoke.html

REVOKE ALL PRIVILEGES, GRANT OPTION FROM user [, user] ...

FLUSH privileges;
  # Reloads the privileges from the grant tables in the mysql database.

The server caches information in memory as a result of GRANT, CREATE USER, CREATE SERVER, and INSTALL PLUGIN statements. This memory is not released by the corresponding REVOKE, DROP USER, DROP SERVER, and UNINSTALL PLUGIN statements, so for a server that executes many instances of the statements that cause caching, there will be an increase in memory use. This cached memory can be freed with FLUSH PRIVILEGES.

Table management

SHOW tables;

SELECT * from mysql.user;
  # return ascii table with rows from user table from mysql database

SELECT * from mysql.user\G
  # return vertical row information from user table from mysql database

pager less -SFX
  # info will now be returned via less, use arrow keys to navigate large tables, q to quit this mode

nopager
  # reset output from pager to stdout

SELECT table_schema "Data Base Name", sum( data_length + index_length ) / 1024 / 1024 "Data Base Size in MB" FROM information_schema.TABLES GROUP BY table_schema ;

13.8.1. DESCRIBE provides information about the columns in a table.

http://dev.mysql.com/doc/refman/5.0/en/lock-tables.html

Troubleshooting

5.3. MySQL Server Logs
- 5.3.1. The Error Log

4.6.7. mysqlbinlog — Utility for Processing Binary Log Files

6.2.7. Causes of Access-Denied Errors
Remote Clients Cannot Connect - #mysql wiki

Backup and restore

http://dev.mysql.com/doc/refman/5.0/en/backup-and-recovery.html

https://mariadb.com/kb/en/library/backup-restore-and-import-clients/

http://www.mysqldumper.net/

http://zmanda.com/backup-mysql.html

mysql < database.sql
  # if database file was saved with CREATE DATABASE

mysql databasename < database.sql
  # specify database name to import database file

mysqldump

mysqldump

mysqldump [options] db_name [tbl_name ...]

mysqldump -u [username] -p -A -R -E --triggers --single-transaction > full_backup.sql
  # full database backup [12]
  # -A For all databases (you can also use --all-databases)
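  # -R For all routines (stored procedures & functions); triggers are covered by --triggers
  # -E For all events
  # --single-transaction Dump from a consistent snapshot without locking the tables, i.e. without interrupting any connection (R/W); this only applies to transactional tables such as InnoDB.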

mysqldump -u root -ppassword dbname | mysql -u root -ppassword --host=remote-server -C dbname
  # copy direct to new database instance, direct db connection
  # REMEMBER - zsh: put a space before the command to keep it out of history (needs setopt HIST_IGNORE_SPACE)

mysqldump -u username -ppassword dbname | ssh user@remote.box.com mysql -u username -ppassword dbname
  # copy direct to new database instance, via ssh connection
  # REMEMBER - zsh: put a space before the command to keep it out of history (needs setopt HIST_IGNORE_SPACE)
  # fails for large DBs

mysqldump -u username -ppassword dbname | gzip -c | ssh USERNAME@YOUR_TO_HOST 'cat > ~/dump.sql.gz'
  # gzip across the wire to a compressed file on the other end
  # REMEMBER - zsh: put a space before the command to keep it out of history (needs setopt HIST_IGNORE_SPACE)

http://stackoverflow.com/questions/104612/run-mysqldump-without-locking-tables

http://www.cyberciti.biz/faq/linux-unix-mysqldump-got-error1044-access-denied/

http://www.mysqlperformanceblog.com/2010/11/08/an-argument-for-not-using-mysqldump-in-production/

https://github.com/sadreck/mysqldbsplit - This script breaks down a mysqldump file into one-file-per-table. [13]
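The general idea of splitting a dump per table can be sketched in a few lines of Python. This is not how mysqldbsplit itself works (I have not checked its internals); it is a minimal sketch that assumes the dump contains the "-- Table structure for table `name`" comment lines that mysqldump writes by default, and the split_dump function and sample dump are hypothetical.

```python
import re

# mysqldump (with comments enabled) introduces each table with a line like this.
TABLE_MARKER = re.compile(r"^-- Table structure for table `(?P<name>[^`]+)`")

def split_dump(dump_text):
    """Return {table_name: dump_section}; lines before the first marker are skipped."""
    sections = {}
    current = None
    for line in dump_text.splitlines(keepends=True):
        match = TABLE_MARKER.match(line)
        if match:
            current = match.group("name")
            sections[current] = []
        if current is not None:
            sections[current].append(line)
    return {name: "".join(lines) for name, lines in sections.items()}

sample_dump = """-- MySQL dump (header lines before the first table are skipped)
-- Table structure for table `users`
CREATE TABLE `users` (`id` int);
INSERT INTO `users` VALUES (1);
-- Table structure for table `posts`
CREATE TABLE `posts` (`id` int);
"""

parts = split_dump(sample_dump)
for name, sql in parts.items():
    print(name, len(sql))
```

Writing each section to its own file (for example name + ".sql") is left out; a real tool would also need to carry over the session settings from the dump header.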

mysqlbackup

Install the mysql-client package to access.

https://dev.mysql.com/doc/mysql-enterprise-backup/3.6/en/mysqlbackup.backup.html

For a non-busy server:

mysqlbackup --port=3306 --protocol=tcp --user=root --password --backup-dir=/home/user/backupdir backup-and-apply-log

mysqlhotcopy

http://dev.mysql.com/doc/refman/5.0/en/mysqlhotcopy.html - currently deprecated.

Xtrabackup

Percona XtraBackup - Documentation - an open-source hot backup utility for MySQL-based servers that doesn’t lock your database during the backup. It can back up data from InnoDB, XtraDB, and MyISAM tables on MySQL 5.1, 5.5, 5.6 and 5.7 servers, as well as Percona Server with XtraDB.

https://wiki.archlinux.org/index.php/Xtrabackup

Replication

http://dev.mysql.com/doc/refman/5.0/en/replication.html

Tools

https://github.com/dbcli/mycli - A command line client for MySQL that can do auto-completion and syntax highlighting.

dbdeploy - a Database Change Management tool.
- http://davedevelopment.co.uk/2008/04/14/how-to-simple-database-migrations-with-phing-and-dbdeploy.html
- https://github.com/tanin47/php_db_migrate
- https://bitbucket.org/stepancheg/mysql-diff/wiki/Home

dBug - "PHP version of ColdFusion’s cfdump. Outputs colored and structured tabular variable information. Variable types supported are: Arrays, Classes/Objects, Database and XML Resources."

anywhereindb - Sometimes we need to find a small string in a big database, such as where a configuration value is saved or where Jon's date of birth is stored. This code searches all the tables, rows and columns in a MySQL database. The code is written in PHP. For faster results, it only searches varchar fields.

http://sourceforge.net/projects/ajaxmytop/ - monitoring

http://dbpatterns.com/

https://github.com/victorstanciu/dbv/

http://www.percona.com/software/percona-toolkit

https://github.com/leftnode/dbmigrator

https://www.dbvis.com/

https://github.com/facebookincubator/OnlineSchemaChange - a tool for making schema changes for MySQL tables in a non-blocking way

dbdiagram.io - A free, simple database relationship diagrams design tool to draw ER diagrams by just writing code. Designed for developers and data analysts. [14]

Scripts

http://jetpackweb.com/blog/2009/07/20/bash-script-to-create-mysql-database-and-user/
- createdb.sh

Search Replace DB - This script was made to aid the process of migrating PHP and MySQL based websites. It has additional features for WordPress but works for most other similar CMSes.
- http://interconnectit.com/124/search-and-replace-for-wordpress-databases/

./searchreplacedb2cli.php --host localhost --user root --database test --pass "pass" \
  --charset utf-8 --search "findMe" --replace "replaceMe" --dry-run

http://ankane.github.io/groupdate.sql/

Performance

http://use-the-index-luke.com/welcome

Native clients

SQL Workbench/J - a free, DBMS-independent, cross-platform SQL query tool. It is written in Java and should run on any operating system that provides a Java Runtime Environment. Its main focus is on running SQL scripts (either interactively or as a batch) and export/import features. Graphical query building or more advanced DBA tasks are not the focus and are not planned

SQuirreL SQL Client

SQuirreL SQL Client - a graphical SQL client written in Java that will allow you to view the structure of a JDBC compliant database, browse the data in tables, issue SQL commands etc.

HeidiSQL

HeidiSQL - a useful and reliable tool designed for web developers using the popular MySQL server, Microsoft SQL databases and PostgreSQL. It enables you to browse and edit data, create and edit tables, views, procedures, triggers and scheduled events. Also, you can export structure and data either to SQL file, clipboard or to other servers. [15]

DBeaver

DBeaver - Free multi-platform database tool for developers, SQL programmers, database administrators and analysts. Supports all popular databases: MySQL, PostgreSQL, MariaDB, SQLite, Oracle, DB2, SQL Server, Sybase, MS Access, Teradata, Firebird, Derby, etc.

Web clients

phpMyAdmin

phpMyAdmin
- docs
- wiki

Chive

Chive

SQL Buddy

SQL Buddy - Web based MySQL administration

Adminer

Adminer - single php file
- WP: Adminer

wget http://www.adminer.org/latest-mysql-en.php -O adminer.php
wget http://www.adminer.org/latest-en.php -O adminer.php

OmniDB

OmniDB - Open Source Web Tool For Database Management
- https://github.com/OmniDB/OmniDB

MariaDB

MariaDB Knowledge Base
- Using mysqlbinlog

Fork of MySQL, drop-in replacement.

https://kb.askmonty.org/en/mariadb-vs-mysql-compatibility/

Percona Server

Percona Server for MySQL
- https://github.com/percona/percona-server

WP: Percona_Server_for_MySQL

PostgreSQL

http://www.postgresql.org/
- WP: PostgreSQL

10 Things I Hate About PostgreSQL - Rick Branson - Medium - [16]

https://wiki.postgresql.org/wiki/Don%27t_Do_This [17]

https://github.com/dbcli/pgcli - Postgres CLI with autocompletion and syntax highlighting

LiteCLI - a user-friendly CommandLine client for SQLite database. It is based on the popular pgcli and mycli projects. LiteCLI is written in python using the wonderful prompt-toolkit library. It is cross-platform compatible and it is tested on Linux, MacOS and Windows. [18]
- https://github.com/dbcli/litecli

http://pgmodeler.com.br/

http://darthdeus.github.io/blog/2013/08/19/postgresql-basics-by-example/

https://wiki.postgresql.org/wiki/Community_Guide_to_PostgreSQL_GUI_Tools

https://www.postgresql.org/download/products/1-administrationdevelopment-tools/

http://dailytechnology.net/2013/08/03/redshift-what-you-need-to-know/

http://www.postgresqlstudio.org/

https://github.com/begriffs/postgrest

https://functionwhatwhat.com/json-in-postgresql/

https://news.ycombinator.com/item?id=10960344

https://github.com/okbob/pspg

pgModeler - PostgreSQL Database Modeler [19]

https://github.com/mgartner/pg_flame - A flamegraph generator for Postgres EXPLAIN ANALYZE output.

https://github.com/aquametalabs/aquameta - Web development platform built entirely in PostgreSQL

https://news.ycombinator.com/item?id=22472175

Ingres

WP: Ingres_(database)

SQLite

SQLite
- Categorical Index Of SQLite Documents

YouTube: D. Richard Hipp - SQLite (The Databaseology Lectures - CMU Fall 2015)

https://phiresky.github.io/blog/2021/hosting-sqlite-databases-on-github-pages [20]

sqlpkg - SQLite Package Registry - the (unofficial) SQLite package registry

https://github.com/nalgeon/sqlpkg-cli#readme - manages SQLite extensions, just like pip does with Python packages or brew does with macOS programs. It works primarily with the SQLite package registry, but is not limited to it. You can install SQLite extensions from GitHub repositories or other websites. All you need is a package spec file (more on that later).

Vulcan - Convergent, Replicated SQLite. Multi-writer and CRDT support for SQLite. Develop distributed & collaborative applications that sync & react to changing state. Vulcan augments SQLite, giving it the power of eventual consistency and multi-writer replication. It's like Git, for your data.
- https://github.com/vlcn-io/cr-sqlit

SQLiteStudio

SQLiteStudio - A free, open source, multi-platform SQLite database manager written in C++, with use of Qt framework.
- https://github.com/pawelsalawa/sqlitestudio

DB Browser for SQLite

DB Browser for SQLite - a high quality, visual, open source tool to create, design, and edit database files compatible with SQLite. It is for users and developers wanting to create databases, search, and edit data. It uses a familiar spreadsheet-like interface, and you don't need to learn complicated SQL commands.
- https://github.com/sqlitebrowser/sqlitebrowser

sql_with_qt

https://github.com/katecpp/sql_with_qt - This repository is a small example of how to set up sqlite database with Qt and perform some basic queries.

sqlite-tui

https://github.com/mathaou/sqlite-tui - A TUI for viewing sqlite databases

sqlite-manager

https://github.com/lunu-bounir/sqlite-manager - a browser extension to read, manipulate, plot and write SQLite databases

sqlite-schema-diagram

https://gitlab.com/Screwtapello/sqlite-schema-diagram - A properly normalised database can wind up with a lot of small tables connected by a complex network of foreign key references. Like a real-world city, it's pretty easy to find your way around once you're familiar, but when you first arrive it really helps to have a map.

Lots of database management tools include some kind of schema diagram view, either automatically generated or manually editable so you can get the layout just right. But it's usually part of a much bigger suite of tools, and sometimes I don't want to install a tool, I just want to get a basic overview quickly. [21]

CockroachDB

CockroachDB - Architected for the cloud, CockroachDB delivers resilient, consistent, distributed SQL at your scale
- https://github.com/cockroachdb/cockroach

HyperSQL / HSQL

HyperSQL Documentation - a relational database engine written in Java. Version 2.7 offers many features and adheres closely to the latest SQL and JDBC 4 standards.
- https://sourceforge.net/projects/hsqldb/
- https://github.com/ryenus/hsqldb

dBASE

WP: dBase - also stylized dBASE, was one of the first database management systems for microcomputers and the most successful in its day. The dBase system includes the core database engine, a query system, a forms engine, and a programming language that ties all of these components together.

WP: XBase - the generic term for all programming languages that derive from the original dBASE (Ashton-Tate) programming language and database formats. These are sometimes informally known as dBASE "clones". While there was a non-commercial predecessor to the Ashton-Tate product (Vulcan, written by Wayne Ratliff), most clones are based on Ashton-Tate's 1986 dBASE III+ release; scripts written in the dBASE III+ dialect are most likely to run on all the clones.

WP: XBase++ - an object oriented programming language which has multiple inheritance and polymorphism. It is based on the XBase language dialect and conventions. It is 100% Clipper compatible language supporting multiple inheritance, polymorphism, object oriented programming. It supports the xBase data types, including Codeblocks. With Xbase++ it is possible to generate applications for Windows NT, 95, 98, Me, 2000, XP, VISTA and Windows 7, 8, 10.

The Oasis Clipper Source. Over 300,000,000,000 bytes served! - the largest file archive for CA-Clipper and xBase on the web! The Oasis is created specifically for Clipper programmers to satisfy their need for demo's, utilities, Clipper source code, patches and libraries. The Oasis evolved from the FIDO filebone, additions from FIDO message bases, messages from the internet comp.lang.clipper newsgroup, my personal source code and utility donations and other programmers donations directly into the site. The Oasis always welcomes new submissions. If you have any Clipper or Xbase++ related material, please feel free to send it in for everyone to use. This can be your original work donated to public domain, or other public domain, freeware or shareware works. Of course, The Oasis won't knowingly distribute any commercial or copyrighted but not shareable works, so don't send them!

WP: Clipper_(programming_language) - an xBase compiler that implements a variant of the xBase computer programming language. It is used to create or extend software programs that originally operated primarily under MS-DOS. Although it is a powerful general-purpose programming language, it was primarily used to create database/business programs. One major dBase feature not implemented in Clipper is the dot-prompt (. prompt) interactive command set, which was an important part of the original dBase implementation.

Harbour - the open/free software implementation of a cross-platform, multi-threading, object-oriented, scriptable programming language, backwards compatible with xBase languages. Harbour consists of a compiler and runtime libraries with multiple UI, database and I/O backends, its own build system and a collection of libraries and bindings for popular APIs. With Harbour, you can build apps running on GNU/Linux, Windows, macOS, iOS, Android, *BSD, *nix, and more

NoSQL

WP: NoSQL
- http://blog.mongodb.org/post/119945109/why-schemaless

http://blog.mongohq.com/schema-less-is-usually-a-lie/

https://news.ycombinator.com/item?id=13015841

https://www.reddit.com/r/programming/comments/60mcms/we_should_stick_with_the_old_and_boring_sql/

dbm

WP: Dbm

Redis

CouchDB

http://wiki.apache.org/couchdb/FrontPage

PouchDB - written to help web developers build applications that work as well offline as they do online. Applications save data locally, so the user can use all the features of an app even while offline, and synchronise the data between clients so they have up-to-date data wherever they go.
- https://github.com/daleharvey/pouchdb
- https://github.com/nick-thompson/pouchdb-server

Couchbase

http://www.couchbase.com/couchbase-server/overview
- WP: Couchbase_Server

http://stackoverflow.com/questions/5578608/difference-between-couchdb-and-couchbase

MongoDB

http://www.mongodb.org/

https://www.mongohq.com/home

https://github.com/louischatriot/nedb
- http://blog.mongodb.org/post/55693224724/nedb-a-lightweight-javascript-database-using-mongodbs

http://nyeggen.com/blog/2013/10/18/the-genius-and-folly-of-mongodb/ [22]

JavaScript

https://github.com/louischatriot/nedb

MDBM

http://yahooeng.tumblr.com/post/104861108931/mdbm-high-speed-database [23]

Riak

Riak - product line of distributed databases is built on a set of core services providing a highly reliable, scalable distributed systems framework. Riak KV is a distributed NoSQL database. Riak TS is built on the same core foundation as Riak KV and is highly optimized for IoT and time series data. Riak also integrates with Riak S2 to optimize large object storage, and integrates with other data services including Apache Spark, Redis Caching, Apache Solr, and Apache Mesos.
- https://github.com/basho/riak

Other

http://sphia.org/

https://github.com/UWSysLab/tapir [24]

http://deepstream.io/

Kinto - a minimalist JSON storage service with synchronisation and sharing abilities.
- https://github.com/Kinto/kinto/ [25]

https://github.com/kallaballa/Janosh - A JSON document database with a shell interface and Lua scripting support. Janosh is written in C++11. It is used in the ScreenInvader project.

https://github.com/pingcap/tikv [26]

replikativ - an open, scalable and distributive infrastructure for a data-driven community of applications. It can serve as a storage backend for your applications and make your application state always accessible on all your endpoints. For our applications it radically simplifies frontend development by streaming state changes directly into our reactive UI pipelines.

Fauna - the only Mission-Critical NoSQL Database that guarantees data correctness without operational complexity from the team that scaled Twitter.
- https://github.com/fauna

https://github.com/fastio/pedis - NoSQL data store using the SEASTAR framework, compatible with Redis

ArangoDB - From the ground up, ArangoDB is designed as a native multi-model database, supporting key/value, document and graph models. This means you can model your data and application in a very flexible way. ArangoDB can operate as a highly scalable database cluster for all data models. An ArangoDB cluster can be configured to serve various types of loads and runs on container orchestration systems like Kubernetes & DC/OS.
- https://github.com/arangodb/arangodb/

to sort

DBMS Musings: NewSQL database systems are failing to guarantee consistency, and I blame Spanner

ingestr - a command-line application that allows ingesting or copying data from any source into any destination database.
- https://github.com/bruin-data/ingestr

http://radar.oreilly.com/2012/02/nosql-non-relational-database.html - http://news.ycombinator.com/item?id=3610844

http://developers.memsql.com/

http://phppgadmin.sourceforge.net/doku.php

WP: Datalog

http://highscalability.com/blog/2012/7/9/data-replication-in-nosql-databases.html

http://www.youtube.com/watch?v=Cym4TZwTCNU

http://labs.codernity.com/codernitydb/

http://www.rethinkdb.com/
- https://github.com/rethinkdb/rethinkdb

http://unqlite.org/

https://github.com/zbase

Trousseau - a GPG-encrypted key-value store designed to be a simple, safe and reliable place for your data. It stores data in a single multi-recipient encrypted file and supports import/export with both local and remote storage sources (S3 and SSH so far).

https://news.ycombinator.com/item?id=6859767

http://probcomp.csail.mit.edu/bayesdb/

https://news.ycombinator.com/item?id=6935709

https://speakerdeck.com/bkeepers/git-the-nosql-database [29]

http://www.nuodb.com/

https://news.ycombinator.com/item?id=8729420

gpu;

http://opentsdb.net/

http://kinto.readthedocs.org/en/latest/overview.html [31]

http://gun.js.org/

https://github.com/fiatjaf/summadb

http://rethinkdb.com/

https://github.com/nolanlawson/socket-pouch

http://probcomp.csail.mit.edu/bayesdb/ [32]

https://www.reindex.io/

https://news.ycombinator.com/item?id=11001619

https://github.com/forward3d/uphold [33]

https://crate.io
- https://github.com/crate/crate

https://news.ycombinator.com/item?id=18789332

https://github.com/nocodb/nocodb

RocksDB - A persistent key-value store
- https://github.com/facebook/rocksdb
- WP: RocksDB

https://github.com/amirouche/hoply - a generic n-tuple store that can be used to create a triplestore or a quadstore or whatever.

WP: Bitmap_index - a special kind of database index that uses bitmaps. Bitmap indexes have traditionally been considered to work well for low-cardinality columns, which have a modest number of distinct values, either absolutely, or relative to the number of records that contain the data. The extreme case of low cardinality is Boolean data (e.g., does a resident in a city have internet access?), which has two values, True and False. Bitmap indexes use bit arrays (commonly called bitmaps) and answer queries by performing bitwise logical operations on these bitmaps. Bitmap indexes have a significant space and performance advantage over other structures for query of such data. Their drawback is they are less efficient than the traditional B-tree indexes for columns whose data is frequently updated: consequently, they are more often employed in read-only systems that are specialized for fast query - e.g., data warehouses, and generally unsuitable for online transaction processing applications. Some researchers argue that bitmap indexes are also useful for moderate or even high-cardinality data (e.g., unique-valued data) which is accessed in a read-only manner, and queries access multiple bitmap-indexed columns using the AND, OR or XOR operators extensively. Bitmap indexes are also useful in data warehousing applications for joining a large fact table to smaller dimension tables such as those arranged in a star schema.
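
A minimal sketch of the bitwise-query idea described above, using plain Python integers as bit arrays. The sample rows and the helper names (build_bitmap_index, matching_rows) are made up for illustration; real engines use compressed bitmaps rather than raw integers.

```python
from collections import defaultdict


def build_bitmap_index(rows, column):
    """Map each distinct value of `column` to an int used as a bit array.

    Bit i of a bitmap is set when rows[i][column] equals that value.
    """
    index = defaultdict(int)
    for i, row in enumerate(rows):
        index[row[column]] |= 1 << i
    return dict(index)


def matching_rows(bitmap):
    """Return the row positions whose bit is set in `bitmap`."""
    return [i for i in range(bitmap.bit_length()) if (bitmap >> i) & 1]


# Hypothetical sample data, echoing the internet-access example above.
residents = [
    {"name": "Ann", "internet": True, "district": "north"},
    {"name": "Bob", "internet": False, "district": "north"},
    {"name": "Cat", "internet": True, "district": "south"},
    {"name": "Dan", "internet": True, "district": "north"},
]

internet = build_bitmap_index(residents, "internet")
district = build_bitmap_index(residents, "district")

# One bitwise AND answers "internet = True AND district = north".
both = internet[True] & district["north"]
north_online = [residents[i]["name"] for i in matching_rows(both)]
```

Each extra predicate costs one more AND/OR over the bitmaps, which is why the structure suits read-mostly, low-cardinality columns.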

A primer on Roaring bitmaps: what they are and how they work

Roaring Bitmaps

Judy Arrays - a C library that provides a state-of-the-art core technology that implements a sparse dynamic array. Judy arrays are declared simply with a null pointer. A Judy array consumes memory only when it is populated, yet can grow to take advantage of all available memory if desired. Judy's key benefits are scalability, high performance, and memory efficiency. A Judy array is extensible and can scale up to a very large number of elements, bounded only by machine memory. Since Judy is designed as an unbounded array, the size of a Judy array is not pre-allocated but grows and shrinks dynamically with the array population. Judy combines scalability with ease of use. The Judy API is accessed with simple insert, retrieve, and delete calls that do not require extensive programming. Tuning and configuring are not required (in fact not even possible). In addition, sort, search, count, and sequential access capabilities are built into Judy's design. Judy can be used whenever a developer needs dynamically sized arrays, associative arrays or a simple-to-use interface that requires no rework for expansion or contraction. Judy can replace many common data structures, such as arrays, sparse arrays, hash tables, B-trees, binary trees, linear lists, skiplists, other sort and search algorithms, and counting functions.

Virtuoso Universal Server

WP: Virtuoso_Universal_Server - a middleware and database engine hybrid that combines the functionality of a traditional Relational database management system (RDBMS), Object-relational database (ORDBMS), virtual database, RDF, XML, free-text, web application server and file server functionality in a single system. Rather than have dedicated servers for each of the aforementioned functionality realms, Virtuoso is a "universal server"; it enables a single multithreaded server process that implements multiple protocols. The free and open source edition of Virtuoso Universal Server is also known as OpenLink Virtuoso. The software has been developed by OpenLink Software with Kingsley Uyi Idehen and Orri Erling as the chief software architects.
- http://virtuoso.openlinksw.com/

Dolt

https://github.com/liquidata-inc/dolt - a relational database, i.e. it has tables, and you can execute SQL queries against those tables. It also has version control primitives that operate at the level of the table cell. Thus Dolt is a database that supports fine-grained, value-wise version control, where all changes to data and schema are stored in a commit log. It is inspired by RDBMS and Git, and attempts to blend concepts from both in a manner that allows users to better manage, distribute, and collaborate on data.

DoltHub - public data

XTDB

XTDB - a general purpose database with graph-oriented bitemporal indexes. Datalog, SQL & EQL queries are supported, and Java, HTTP & Clojure APIs are provided. XTDB follows an unbundled architectural approach, which means that it is assembled from decoupled components through the use of an immutable log and document store at the core of its design. A range of storage options are available for embedded usage and cloud native scaling. Bitemporal indexing of schemaless documents enables broad possibilities for creating layered extensions on top, such as to add additional transaction, query, and schema capabilities. In addition to SQL, XTDB supplies a Datalog query interface that can be used to express complex joins and recursive graph traversals.
- https://github.com/xtdb/xtdb
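
A rough sketch of what "bitemporal" means in practice: every version carries a valid time (when the fact holds) and a transaction time (when it was recorded), and reads can pin either. This is not XTDB's API; the BitemporalStore class, its methods and the sample data are hypothetical, and valid-time end bounds are left out to keep it short.

```python
class BitemporalStore:
    """Toy append-only store keyed by document id (illustrative only)."""

    def __init__(self):
        self._log = []  # entries: (tx_time, valid_from, doc_id, doc)
        self._tx = 0    # monotonically increasing transaction counter

    def put(self, doc_id, doc, valid_from):
        """Record that `doc` holds from `valid_from`; returns the tx time."""
        self._tx += 1
        self._log.append((self._tx, valid_from, doc_id, doc))
        return self._tx

    def get(self, doc_id, valid_at, as_of_tx=None):
        """Version valid at `valid_at`, as known at transaction `as_of_tx`."""
        if as_of_tx is None:
            as_of_tx = self._tx
        candidates = [
            (valid_from, tx, doc)
            for tx, valid_from, d, doc in self._log
            if d == doc_id and tx <= as_of_tx and valid_from <= valid_at
        ]
        if not candidates:
            return None
        # Latest valid_from wins; later transactions correct earlier ones.
        return max(candidates, key=lambda c: (c[0], c[1]))[2]


store = BitemporalStore()
store.put("price", {"gbp": 10}, valid_from=1)  # tx 1
store.put("price", {"gbp": 12}, valid_from=5)  # tx 2
store.put("price", {"gbp": 11}, valid_from=5)  # tx 3: a later correction

current = store.get("price", valid_at=6)
before_fix = store.get("price", valid_at=6, as_of_tx=2)
earlier = store.get("price", valid_at=3)
```

Nothing is overwritten: the correction in tx 3 changes what a default read returns, while a read pinned to tx 2 still sees the uncorrected value.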

SurrealDB

SurrealDB - an end-to-end cloud-native database designed for modern applications, including web, mobile, serverless, Jamstack, backend, and traditional applications. With SurrealDB, you can simplify your database and API infrastructure, reduce development time, and build secure, performant apps quickly and cost-effectively.
- https://github.com/surrealdb/surrealdb

Time series

WP: Time_series_database - a software system that is optimized for handling time series data, arrays of numbers indexed by time (a datetime or a datetime range). In some fields these time series are called profiles, curves, or traces. Ideally, repositories of time series are natively implemented using specialized database algorithms. However, it is possible to store time series as binary large objects (BLOBs) in a relational database or by using a VLDB approach coupled with a pure star schema. Efficiency is often improved if time is treated as a discrete quantity rather than as a continuous mathematical dimension.
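
A small sketch of the "time as a discrete quantity" point: timestamps are rounded down to fixed-width buckets so points can be grouped and aggregated by integer key. The one-minute resolution, the function names and the sample readings are assumptions for illustration.

```python
from collections import defaultdict

BUCKET_SECONDS = 60  # assumed resolution: one slot per minute


def bucket(ts):
    """Round a Unix timestamp (seconds) down to the start of its bucket."""
    whole = int(ts)
    return whole - whole % BUCKET_SECONDS


def downsample(points):
    """Average (timestamp, value) pairs per bucket, sorted by time."""
    groups = defaultdict(list)
    for ts, value in points:
        groups[bucket(ts)].append(value)
    return [(b, sum(vs) / len(vs)) for b, vs in sorted(groups.items())]


# Hypothetical sensor readings: (seconds, value).
readings = [(0, 1.0), (30, 3.0), (61, 10.0), (119.5, 20.0)]
series = downsample(readings)
```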

InfluxDB

InfluxDB - an open source time series database with no external dependencies. It's useful for recording metrics, events, and performing analytics.
- https://github.com/influxdata/influxdb

Graph

See also Semantic and Semantic#Triplestore

WP: Graph_database - a database that uses graph structures for semantic queries with nodes, edges, and properties to represent and store data. A key concept of the system is the graph (or edge or relationship). The graph relates the data items in the store to a collection of nodes and edges, the edges representing the relationships between the nodes. The relationships allow data in the store to be linked together directly and, in many cases, retrieved with one operation. Graph databases hold the relationships between data as a priority. Querying relationships within a graph database is fast because they are perpetually stored within the database itself. Relationships can be intuitively visualized using graph databases, making them useful for heavily inter-connected data.
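
A toy in-memory version of the model described above: nodes with properties, labelled edges stored next to their source node, and a traversal that follows relationships directly instead of joining tables. The Graph class and the sample people are hypothetical and not tied to any particular graph database.

```python
from collections import deque


class Graph:
    def __init__(self):
        self.nodes = {}  # node id -> properties dict
        self.edges = {}  # node id -> list of (label, target id)

    def add_node(self, node_id, **props):
        self.nodes[node_id] = props
        self.edges.setdefault(node_id, [])

    def add_edge(self, source, label, target):
        self.edges[source].append((label, target))

    def neighbours(self, node_id, label):
        """Targets reachable in one hop over edges with `label`."""
        return [t for l, t in self.edges[node_id] if l == label]

    def reachable(self, start, label):
        """Breadth-first traversal following only edges with `label`."""
        seen, queue, order = {start}, deque([start]), []
        while queue:
            current = queue.popleft()
            for nxt in self.neighbours(current, label):
                if nxt not in seen:
                    seen.add(nxt)
                    order.append(nxt)
                    queue.append(nxt)
        return order


g = Graph()
for name in ("alice", "bob", "carol", "dave"):
    g.add_node(name, kind="person")
g.add_edge("alice", "knows", "bob")
g.add_edge("bob", "knows", "carol")
g.add_edge("alice", "blocks", "dave")

friends_of_friends = g.reachable("alice", "knows")
```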

How to Cook a Graph Database in a Night

Neo4j

Neo4j - The Leader in Graph Databases

Cayley

https://github.com/google/cayley - an open-source graph database inspired by the graph database behind Freebase and Google's Knowledge Graph. Its goal is to be a part of the developer's toolbox where Linked Data and graph-shaped data (semantic webs, social networks, etc) in general are concerned. [34]

Gaffer

https://github.com/gchq/Gaffer - A large-scale entity and relation database supporting aggregation of properties [35]

Grakn

Grakn - an intelligent database: a knowledge graph engine to organise complex networks of data and make them queryable, by performing knowledge engineering. Rooted in Knowledge Representation and Automated Reasoning, Grakn provides the knowledge foundation for cognitive and intelligent (e.g. AI) systems, by providing an intelligent language for modelling, transactions and analytics. Being a distributed database, Grakn is designed to scale over a network of computers through partitioning and replication. Under the hood, Grakn has built an expressive knowledge representation system based on hypergraph theory (a subfield in mathematics that generalises an edge to be a set of vertices) with a transactional query interface, Graql. Graql is Grakn’s reasoning (through OLTP) and analytics (through OLAP) declarative query language.
- https://github.com/graknlabs/grakn

TinkerPop

Apache TinkerPop - a graph computing framework for both graph databases (OLTP) and graph analytic systems (OLAP).

Linkurious

Linkurious - an on-premises graph visualization and analysis platform. Fraud, intelligence or cyber analysts use it to detect and investigate threats in large and complex datasets. [36]

SurrealDB

SurrealDB - The ultimate database for tomorrow's applications. With an SQL-style query language, real-time queries with highly-efficient related data retrieval, advanced security permissions for multi-tenant access, and support for performant analytical workloads, SurrealDB is the next generation serverless database.
- https://github.com/surrealdb - not libre

GraphQL

GraphQL - a query language for APIs and a runtime for fulfilling those queries with your existing data. GraphQL provides a complete and understandable description of the data in your API, gives clients the power to ask for exactly what they need and nothing more, makes it easier to evolve APIs over time, and enables powerful developer tools.
- https://github.com/graphql
- https://github.com/graphql/graphql-spec
- WP: GraphQL - an open-source data query and manipulation language for APIs, and a runtime for fulfilling queries with existing data. GraphQL was developed internally by Facebook in 2012 before being publicly released in 2015. On 7 November 2018, the GraphQL project was moved from Facebook to the newly-established GraphQL Foundation, hosted by the non-profit Linux Foundation.
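
A loose illustration of "ask for exactly what they need and nothing more": the client supplies a nested selection and gets back only those fields. This is not a GraphQL implementation (no parser, schema or resolvers); the select function, the selection format and the sample data are assumptions made up for this sketch.

```python
def select(data, selection):
    """Return only the fields named in `selection` from `data`.

    `selection` maps field name -> None (leaf) or a nested selection dict.
    Lists are projected item by item.
    """
    if isinstance(data, list):
        return [select(item, selection) for item in data]
    result = {}
    for field, sub in selection.items():
        value = data[field]
        result[field] = value if sub is None else select(value, sub)
    return result


# Hypothetical backing data held by the server.
user = {
    "name": "Ada",
    "email": "ada@example.org",
    "repos": [
        {"title": "engine", "stars": 12, "private": False},
        {"title": "notes", "stars": 3, "private": True},
    ],
}

# Roughly the shape of the query `{ name repos { title } }`.
wanted = {"name": None, "repos": {"title": None}}
response = select(user, wanted)
```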

https://news.ycombinator.com/item?id=25014582

Queries and Mutations

https://github.com/graphql/graphiql - An in-browser IDE for exploring GraphQL.

YouTube: GraphQL - playlist by Fun Fun Function

GRANDstack - A new paradigm for building APIs, GraphQL is a way of describing data and enabling clients to query it.

Neo4j and GraphQL

GitHub Developer Guide: GitHub GraphQL API v4

Urql - [37]

https://github.com/imolorhe/altair - A beautiful feature-rich GraphQL Client for all platforms

https://github.com/dgraph-io/dgraph - Native GraphQL Database with graph backend

https://github.com/surrealdb - A scalable, distributed, collaborative, document-graph database, for the realtime web

hypergraphql

https://github.com/hypergraphql/hypergraphql - a GraphQL interface for querying and serving linked data on the Web. It is designed to support federated querying and exposing data from multiple linked data services using GraphQL query language and schemas. The basic response format is JSON-LD, which extends the standard JSON with the JSON-LD context enabling semantic disambiguation of the contained data.

Weaving Linked Data Cloud with (Hyper)GraphQL | by Szymon Klarman | Medium - "The approach, implemented as HyperGraphQL, is simple: you only need to define a GraphQL schema and map it onto URIs of the vocabulary employed in your RDF graph. Under the hood HyperGraphQL performs a rather straightforward rewriting of GraphQL queries to SPARQL, delegates them to the SPARQL endpoint, and returns the responses as a JSON-LD objects."

GraphQL-LD

GraphQL-LD: Linked Data Querying with GraphQL - The Linked Open Data cloud has the potential of significantly enhancing and transforming end-user applications. For example, the use of URIs to identify things allows data joining between separate data sources. Most popular (Web) application frameworks, such as React and Angular have limited support for querying the Web of Linked Data, which leads to a high-entry barrier for Web application developers. Instead, these developers increasingly use the highly popular GraphQL query language for retrieving data from GraphQL APIs, because GraphQL is tightly integrated into these frameworks. In order to lower the barrier for developers towards Linked Data consumption, the Linked Open Data cloud needs to be queryable with GraphQL as well. In this article, we introduce GraphQL-LD, an approach that consists of a method for transforming GraphQL queries coupled with a JSON-LD context to SPARQL, and a method for converting SPARQL results to the GraphQL query-compatible response. We demonstrate this approach by implementing it into the Comunica framework. This approach brings us one step closer towards widespread Linked Data consumption for application development.
- https://github.com/rubensworks/GraphQL-LD.js

Replication

http://www.symmetricds.org/ - open source software for database and file synchronization with Multi-master replication, filtered synchronization, and transformation capabilities. It is designed to scale for a large number of nodes, work across low-bandwidth connections, and withstand periods of network outage. Data synchronization occurs asynchronously from a scheduled job, with data changes being sent over a push or pull operation.
- WP: SymmetricDS

Distributed

H-Store

H-Store - an experimental main-memory, parallel database management system that is optimized for on-line transaction processing (OLTP) applications. It is a highly distributed, row-store-based relational database that runs on a cluster of shared-nothing, main memory executor nodes. The H-Store project is a collaboration between MIT, Brown University, Carnegie Mellon University, Yale University, and Intel.
- WP: H-Store

HBase

https://hbase.apache.org/
- WP: Apache_HBase
- https://phoenix.incubator.apache.org/ - SQL layer

RethinkDB

http://www.rethinkdb.com/

FoundationDB

FoundationDB - gives you the power of ACID transactions in a distributed database. [38]

to sort

https://github.com/attic-labs/noms [39]

https://news.ycombinator.com/item?id=15862895

TiDB

https://github.com/pingcap/tidb - an open source distributed scalable Hybrid Transactional and Analytical Processing (HTAP) database built by PingCAP. Inspired by the design of Google F1 and Google Spanner, TiDB features infinite horizontal scalability, strong consistency, and high availability. The goal of TiDB is to serve as a one-stop solution for both OLTP (Online Transactional Processing) and OLAP (Online Analytical Processing).

tidis

https://github.com/yongman/tidis - a distributed NoSQL database, providing a Redis-protocol API (string, list, hash, set, sorted-set), written in Go. Tidis is like the TiDB layer, providing protocol transformation, powered by the TiKV distributed storage backend, which uses Raft for data replication and 2PC for distributed transactions. [40]

Scuttlebot

Scuttlebot - an open source peer-to-peer log store used as a database, identity provider, and messaging system. It features global replication, file synchronization, and end-to-end encryption.

to sort

https://news.ycombinator.com/item?id=25871605

GUN

GUN - a small, easy, and fast data sync and storage system that runs everywhere JavaScript does. The aim of GUN is to let you focus on the data that needs to be stored, loaded, and shared in your app without worrying about servers, network calls, databases, or tracking offline changes or concurrency conflicts. This lets you build cool apps fast.
- https://github.com/amark/gun - a realtime, distributed, offline-first, graph database engine. Doing 20M+ ops/sec in just ~9KB gzipped.
- https://gun.eco/distributed/matters.html

Titan

Titan - a scalable graph database optimized for storing and querying graphs containing hundreds of billions of vertices and edges distributed across a multi-machine cluster. Titan is a transactional database that can support thousands of concurrent users executing complex graph traversals in real time.

Social

http://thebigdb.com/

Other

http://highscalability.com/blog/2012/12/10/switch-your-databases-to-flash-storage-now-or-youre-doing-it.html

http://craigkerstiens.com/2012/11/30/sharding-your-database/

http://cr.yp.to/cdb.html [41]
- http://www.unixuser.org/~euske/doc/cdbinternals/

http://rocksdb.org/overview/

http://sqlmap.org/

https://github.com/cube2222/octosql - a query tool that allows you to join, analyse and transform data from multiple databases and file formats using SQL.

https://github.com/SemWebCentral/parliament - Standards-compliant triple store for RDF, OWL, and SPARQL

Datanymizer - an open-source, GDPR-compliant, privacy-preserving data anonymization tool flexible about how the anonymization takes place. Written in Rust.
- https://github.com/datanymizer/datanymizer

Introduction | Datanymizer

ORM

WP: Object-relational_mapping - object-relational mapping (ORM, O/RM, and O/R mapping tool) in computer science is a programming technique for converting data between incompatible type systems using object-oriented programming languages. This creates, in effect, a "virtual object database" that can be used from within the programming language. There are both free and commercial packages available that perform object-relational mapping, although some programmers opt to construct their own ORM tools.
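
A minimal sketch of the technique, in the "construct their own" spirit: a dataclass on the object side, a SQLite table on the relational side, and a small mapper converting between them. The Mapper class and Person type are hypothetical; real ORMs add typing, relations, migrations and much more.

```python
import sqlite3
from dataclasses import astuple, dataclass, fields


@dataclass
class Person:
    name: str
    age: int


class Mapper:
    """Maps one dataclass to one table named after the class."""

    def __init__(self, conn, cls):
        self.conn, self.cls = conn, cls
        self.table = cls.__name__.lower()
        self.columns = [f.name for f in fields(cls)]
        cols = ", ".join(self.columns)
        # SQLite accepts untyped columns, which keeps the sketch short.
        conn.execute(f"CREATE TABLE IF NOT EXISTS {self.table} ({cols})")

    def insert(self, obj):
        """Object -> row."""
        marks = ", ".join("?" for _ in self.columns)
        self.conn.execute(
            f"INSERT INTO {self.table} VALUES ({marks})", astuple(obj)
        )

    def where(self, **criteria):
        """Rows -> objects, filtered by equality on known columns."""
        unknown = set(criteria) - set(self.columns)
        if unknown:
            raise ValueError(f"unknown columns: {sorted(unknown)}")
        clause = " AND ".join(f"{k} = ?" for k in criteria) or "1"
        rows = self.conn.execute(
            f"SELECT {', '.join(self.columns)} FROM {self.table} "
            f"WHERE {clause}",
            tuple(criteria.values()),
        )
        return [self.cls(*row) for row in rows]


conn = sqlite3.connect(":memory:")
people = Mapper(conn, Person)
people.insert(Person("Ada", 36))
people.insert(Person("Linus", 21))

found = people.where(name="Ada")
everyone = people.where()
```

Values always travel as bound parameters; only column names checked against the dataclass fields are interpolated into the SQL text.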

Prisma

Prisma - provides a database-agnostic abstraction to be used from any programming language.
- https://github.com/prisma/prisma
