Audio
General
Mostly Linux, mostly free software.
See also Effects, Creative coding, Playback, Dataflow, Pure Data, Distros#Media, Acoustics
- http://stuff.mihozu.net/stuff/bookmarks - old but interesting
- https://librazik.tuxfamily.org/doc2/logiciels - French-language software documentation
- CDM - blog on creating digital music, motion, and more.
“Sound is when you mow your lawn, noise is when your neighbor mows their lawn, and music is when your neighbor mows your lawn.” [1]
Training
See Music#Training
- RealSimple Project - musical acoustics laboratory exercises integrating both hands-on laboratory experience and computer-based simulation.
Electrical
See also Electrical
- https://en.wikipedia.org/wiki/Audio_signal - a representation of sound, typically as an electrical voltage. Audio signals have frequencies in the audio frequency range of roughly 20 to 20,000 Hz (the limits of human hearing). Audio signals may be synthesized directly, or may originate at a transducer such as a microphone, musical instrument pickup, phonograph cartridge, or tape head. Loudspeakers or headphones convert an electrical audio signal into sound. Digital representations of audio signals exist in a variety of formats. An audio channel or audio track is an audio signal communications channel in a storage device, used in operations such as multi-track recording and sound reinforcement.
- https://en.wikipedia.org/wiki/Audio_power - the electrical power transferred from an audio amplifier to a loudspeaker, measured in watts. The electrical power delivered to the loudspeaker, together with its efficiency, determines the sound power generated (with the rest of the electrical power being converted to heat). Amplifiers are limited in the electrical energy they can output, while loudspeakers are limited in the electrical energy they can convert to sound energy without being damaged or distorting the audio signal. These limits, or power ratings, are important to consumers finding compatible products and comparing competitors.
Digital
- Advanced Aspects of Digital Audio - Collected for the inquisitive audio enthusiast
- https://github.com/MTG/conferences - This repository hosts a list of upcoming and past conference calls and journal calls for the wider music technology community.
- http://wiki.hydrogenaud.io/index.php?title=Topic_Index - software wiki with format etc information
- https://en.wikipedia.org/wiki/I²S - Inter-IC Sound, eye-squared-ess, is an electrical serial bus interface standard used for connecting digital audio devices together. It is used to communicate PCM audio data between integrated circuits in an electronic device. The I²S bus separates clock and serial data signals, resulting in simpler receivers than those required for asynchronous communications systems that need to recover the clock from the data stream. Alternatively I²S is spelled I2S (pronounced eye-two-ess) or IIS (pronounced eye-eye-ess). Despite the similar name, I²S is unrelated to the bidirectional I²C (IIC) bus.
- SRC Comparisons - "We have organized the testing of some of the objective parameters of SRC algorithms in the 96 kHz - 44.1 kHz conversion mode. This mode is considered "hard" because of its fractional resampling ratio. The set of test signals has been discussed among engineers from Weiss Engineering, Alexey Lukin and members of Glenn Meadows' Mastering Web-Board. The test files were available in a variety of resolutions (32-bit int, 32-bit float, 24-bit), and the best supported resolution has been used for each of the SRC algorithms tested. The resulting graphs have been drawn by a modified version of the RightMark Audio Analyzer (RMAA) and some specially developed analysis software."
- Sound on Sound: Phase Demystified - Phase interactions are well known for their ability to destructively interfere with recorded signals, but an understanding of the process can turn it into one of the most powerful creative tools available to you.
- PDF: The Quest For The Perfect Resampler - 2003.06.23, Laurent de Soras
- https://github.com/swesterfeld/pandaresampler - Fast factor 2 resampler for audio signals
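As a rough illustration of what sample-rate conversion has to do (not the algorithm any of the resamplers above use), a naive linear-interpolation SRC makes the fractional-ratio bookkeeping of a 96 kHz to 44.1 kHz conversion concrete. Real converters band-limit with a proper filter (e.g. windowed sinc); linear interpolation aliases audibly.

```python
import math

def resample_linear(samples, src_rate, dst_rate):
    """Naive linear-interpolation sample-rate converter (toy only)."""
    n_out = len(samples) * dst_rate // src_rate
    ratio = src_rate / dst_rate
    out = []
    for i in range(n_out):
        pos = i * ratio                 # fractional read position in input
        j = int(pos)
        frac = pos - j
        nxt = samples[j + 1] if j + 1 < len(samples) else samples[j]
        out.append((1.0 - frac) * samples[j] + frac * nxt)
    return out

# 10 ms of a 1 kHz sine at 96 kHz becomes 441 samples at 44.1 kHz
sine = [math.sin(2 * math.pi * 1000 * n / 96000) for n in range(960)]
converted = resample_linear(sine, 96000, 44100)
print(len(converted))  # 441
```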
Hardware
- http://blip.tv/ruinwesen/ruin-wesen-minicommand-pattern-euclid-3186495
- http://www.rebeltech.org/modules/stoicheia/
- http://cycling74.com/
See Sound on Sound, etc.
- Schematic Vault - This collection of pro audio schematics and reference materials has been amassed both from my private stock and from various internet resources. All materials have been formatted as multi-page pdf files for ease of use. Please feel free to email me (address on home page) with any material you'd care to add.
Studio
Wiring
- https://en.wikipedia.org/wiki/Phone_connector_(audio) - also known as phone jack, audio jack, headphone jack or quarter inch jack plug, is a family of electrical connectors typically used for analog audio signals. The phone connector was invented for use in telephone switchboards in the 19th century and is still widely used. The phone connector is cylindrical in shape, with a grooved tip to retain it. In its original audio configuration, it typically has two, three, four and, occasionally, five contacts. Three-contact versions are known as TRS connectors, where T stands for "tip", R stands for "ring" and S stands for "sleeve". Ring contacts are typically the same diameter as the sleeve, the long shank. Similarly, two-, four- and five- contact versions are called TS, TRRS and TRRRS connectors respectively. The outside diameter of the "sleeve" conductor is 1⁄4 inch (6.35 millimetres).
The "mini" connector has a diameter of 3.5 mm (0.14 in) and the "sub-mini" connector has a diameter of 2.5 mm (0.098 in).
- YouTube: Connecting your neutrik speakON Connector on wire - MrFlexy SMPS
- https://github.com/clacktronics/AudioJacks - KiCAD footprint library and 3D models for commonly used connectors used in synths and other audio equipment
- https://en.wikipedia.org/wiki/Balanced_audio - a method of interconnecting audio equipment using balanced lines. This type of connection is very important in sound recording and production because it allows the use of long cables while reducing susceptibility to external noise caused by electromagnetic interference. Balanced connections typically use shielded twisted-pair cable and three-conductor connectors. The connectors are usually 3-pin XLR or 1⁄4 inch (6.35 mm) TRS phone connectors. When used in this manner, each cable carries one channel, therefore stereo audio (for example) would require two of them.
- https://en.wikipedia.org/wiki/XLR_connector - a style of electrical connector, primarily found on professional audio, video, and stage lighting equipment. The connectors are circular in design and have between 3 and 7 pins. They are most commonly associated with balanced audio interconnection, including AES3 digital audio, but are also used for lighting control, low-voltage power supplies, and other applications. XLR connectors are available from a number of manufacturers and are covered by an international standard for dimensions, IEC 61076-2-103.[1] They are superficially similar to the older and smaller DIN connector range, but are not physically compatible with them.
- furutech PCOCC process - PCOCC Pure Copper by Ohno Continuous Casting
Patch bay
"Full-Normal: Each jack on the top-row is connected to the jack under it on the bottom-row. This allows the audio or video signal to “pass-through” the patchbay without using a patch cable. When we want to change the “normal” signal path we can use a patch cable to change the destination of the signal. Placing a patch cable into either row breaks the signal path. The signal follows the patch cable to where it is patched."
"Half-Normal: ...Placing a patch cable into the bottom-row breaks the signal path. Placing a patch cable into the top-row allows the signal to still go to the jack under it on the bottom-row (without breaking the normal) and also follows the patch cable."
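The two normalling rules quoted above can be modelled in a few lines of Python (the function and return values are invented for illustration):

```python
def top_signal_destinations(mode, top_patched, bottom_patched):
    """Where does the top-row (source) signal end up?

    Toy model of patchbay normalling:
      full-normal: a cable in EITHER row breaks the top->bottom normal.
      half-normal: only a cable in the BOTTOM row breaks it; a top-row
                   cable taps the signal while the normal stays intact.
    """
    assert mode in ("full", "half")
    dests = []
    if top_patched:
        dests.append("top-row patch cable")
    normal_broken = bottom_patched or (mode == "full" and top_patched)
    if not normal_broken:
        dests.append("bottom-row jack (normalled)")
    return dests

# Half-normal with a cable in the top row: signal goes both places
print(top_signal_destinations("half", top_patched=True, bottom_patched=False))
```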
Microphones
- https://en.wikipedia.org/wiki/Microphone - colloquially nicknamed mic or mike (/maɪk/), is a transducer that converts sound into an electrical signal. Microphones are used in many applications such as telephones, hearing aids, public address systems for concert halls and public events, motion picture production, live and recorded audio engineering, sound recording, two-way radios, megaphones, radio and television broadcasting, and in computers for recording voice, speech recognition, VoIP, and for non-acoustic purposes such as ultrasonic sensors or knock sensors. Several different types of microphone are in use, which employ different methods to convert the air pressure variations of a sound wave to an electrical signal. The most common are the dynamic microphone, which uses a coil of wire suspended in a magnetic field; the condenser microphone, which uses the vibrating diaphragm as a capacitor plate, and the piezoelectric microphone, which uses a crystal of piezoelectric material. Microphones typically need to be connected to a preamplifier before the signal can be recorded or reproduced.
Preamplifier
Direct injection
- https://en.wikipedia.org/wiki/DI_unit - an electronic device typically used in recording studios and in sound reinforcement systems to connect a high-output impedance, line level, unbalanced output signal to a low-impedance, microphone level, balanced input, usually via an XLR connector and XLR cable. DIs are frequently used to connect an electric guitar or electric bass to a mixing console's microphone input jack. The DI performs level matching, balancing, and either active buffering or passive impedance matching/impedance bridging to minimize unwanted noise, distortion, and ground loops. DI units are typically metal boxes with input and output jacks and, for more expensive units, “ground lift” and attenuator switches. DI units are also referred to as a DI box, direct box, or simply DI, with each letter pronounced, as in "Dee Eye." The term is variously claimed to stand for direct input, direct injection, direct induction or direct interface.
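Why impedance bridging matters can be seen with a simple voltage-divider calculation; the impedance figures below are illustrative round numbers, not taken from any particular device:

```python
import math

def loading_loss_db(source_z, input_z):
    """Level drop (dB) when a source with output impedance source_z
    drives an input with impedance input_z (simple voltage divider)."""
    ratio = input_z / (source_z + input_z)
    return 20 * math.log10(ratio)

# Illustrative figures: a passive guitar pickup (~10 kohm source)
# into a 1 Mohm bridging DI input vs. directly into a 600 ohm mic input.
print(round(loading_loss_db(10_000, 1_000_000), 2))  # about -0.09 dB
print(round(loading_loss_db(10_000, 600), 2))        # about -24.94 dB
```

The bridging input (input impedance much larger than the source's) barely loads the pickup, while the low-impedance input knocks roughly 25 dB off the level and would also dull the tone of a real pickup.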
Mixer
- RD/MPCTools - Extension Toolkit for Martin M-Series Software
- https://github.com/SpotlightKid/xair-remote - Tools for querying and controlling Behringer X-AIR and MIDAS M-AIR audio mixers
Amplifier
For an amplifier to sound twice as loud as a 10 watt RMS amp you need a 100 watt RMS amp, and for an amp to sound twice as loud as the 100 watt RMS amp you need a 1,000 watt RMS amp.
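In decibel terms: a tenfold power increase is +10 dB, the level change commonly taken as "twice as loud", while merely doubling the power only buys about +3 dB:

```python
import math

def power_gain_db(p_new, p_ref):
    """Power ratio expressed in decibels."""
    return 10 * math.log10(p_new / p_ref)

# 10 W -> 100 W -> 1000 W: each step is +10 dB ("twice as loud")
print(power_gain_db(100, 10))    # 10.0
print(power_gain_db(1000, 100))  # 10.0
print(power_gain_db(20, 10))     # ~3.01 dB: doubling power gains little
```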
- Class A - 100% of the input signal is used (conduction angle Θ = 360°). The active element remains conducting all of the time.
- Class B - 50% of the input signal is used (Θ = 180°); the active element carries current half of each cycle, and is turned off for the other half.
- Class AB - intermediate between classes A and B: the two active elements each conduct more than half of the time.
- Class C - Less than 50% of the input signal is used (conduction angle Θ < 180°).
- Class D - uses some form of pulse-width modulation to control the output devices; the conduction angle of each device is no longer related directly to the input signal but instead varies in pulse width. These are sometimes called "digital" amplifiers because the output device is switched fully on or off, and not carrying current proportional to the signal amplitude.
- https://en.wikipedia.org/wiki/Amplifier_figures_of_merit - numerical measures that characterize its properties and performance. Figures of merit can be given as a list of specifications that include properties such as gain, bandwidth, noise and linearity, among others listed in this article. Figures of merit are important for determining the suitability of a particular amplifier for an intended use.
- YouTube: Is equipment burn in real? - "all capacitors have to form, you put voltage on them and little microscopic holes are filled and it changes the equivalent series resistance and a number of other characteristics"
- https://joebennett.net/2014/10/13/guitarists-stop-hurting-the-audience-at-small-gigs/ - inc. cab placement
- IEEE Spectrum: NwAvGuy: The Audio Genius Who Vanished - [5] [6]
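The conduction angles in the class list above can be checked numerically with a toy simulation: an idealized element that conducts whenever the biased input sine is positive (the bias values are arbitrary, chosen only to land in each class's regime):

```python
import math

def conduction_fraction(bias, n=100_000):
    """Fraction of one sine cycle during which an ideal element conducts,
    given a normalized bias: it conducts when sin(t) + bias > 0."""
    conducting = sum(1 for i in range(n)
                     if math.sin(2 * math.pi * i / n) + bias > 0)
    return conducting / n

print(round(conduction_fraction(2.0), 2))   # class A:  1.0  (360 degrees)
print(round(conduction_fraction(0.0), 2))   # class B:  0.5  (180 degrees)
print(round(conduction_fraction(0.5), 2))   # class AB: ~0.67, between B and A
print(round(conduction_fraction(-0.5), 2))  # class C:  ~0.33, under 180 degrees
```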
Crossover
Speaker
See Speaker
Headphones
- https://github.com/jaakkopasanen/AutoEq - a project for equalizing headphone frequency responses automatically and it achieves this by parsing frequency response measurements and producing equalization settings which correct the headphone to a neutral sound. This project currently has over 2500 headphones covered in the results folder. See Usage for instructions how to use the results with different equalizer softwares and Results section for details about parameters and how the results were obtained. [7]
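AutoEq's parametric-EQ results are sets of (center frequency, Q, gain) bands. A sketch of the kind of peaking biquad such a band configures, using the well-known Audio EQ Cookbook formulas — this is not AutoEq's own code, just an illustration of what the numbers mean:

```python
import cmath, math

def peaking_biquad(fs, f0, q, gain_db):
    """Peaking-EQ biquad coefficients (Audio EQ Cookbook formulas)."""
    a = 10 ** (gain_db / 40)
    w0 = 2 * math.pi * f0 / fs
    alpha = math.sin(w0) / (2 * q)
    num = [1 + alpha * a, -2 * math.cos(w0), 1 - alpha * a]
    den = [1 + alpha / a, -2 * math.cos(w0), 1 - alpha / a]
    return num, den

def gain_at(num, den, fs, f):
    """Magnitude response in dB at frequency f."""
    z = cmath.exp(-2j * math.pi * f / fs)
    h = ((num[0] + num[1] * z + num[2] * z * z)
         / (den[0] + den[1] * z + den[2] * z * z))
    return 20 * math.log10(abs(h))

# A -6 dB cut at 3 kHz, Q = 1.4 (values chosen arbitrarily for the demo)
num, den = peaking_biquad(fs=48000, f0=3000, q=1.4, gain_db=-6.0)
print(round(gain_at(num, den, 48000, 3000), 2))  # -6.0 dB at the center
```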
Calibration
- https://github.com/hamsternz/audio_distortion - Measure the THD+Noise for your default audio device.
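A toy version of a distortion measurement — a single harmonic with no noise term, unlike the full THD+N the tool above measures — shows the basic idea: extract the fundamental and harmonic amplitudes and take their ratio:

```python
import math

def harmonic_amplitude(samples, fs, freq):
    """Amplitude of one frequency component via correlation with a
    quadrature pair (exact when the buffer holds whole cycles of freq)."""
    n = len(samples)
    c = sum(s * math.cos(2 * math.pi * freq * i / fs) for i, s in enumerate(samples))
    q = sum(s * math.sin(2 * math.pi * freq * i / fs) for i, s in enumerate(samples))
    return 2 * math.hypot(c, q) / n

fs, f0, n = 48000, 1000, 480  # exactly 10 cycles of 1 kHz
sig = [math.sin(2 * math.pi * f0 * i / fs)
       + 0.01 * math.sin(2 * math.pi * 2 * f0 * i / fs)  # -40 dB 2nd harmonic
       for i in range(n)]

thd = harmonic_amplitude(sig, fs, 2 * f0) / harmonic_amplitude(sig, fs, f0)
print(round(20 * math.log10(thd), 1))  # -40.0 dB
```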
Synthesizer
See Synthesis, Creative coding
Vocoder
- Elektor 10 Channel Vocoder - "I'm a synth & electronics passionate fan and live near Antwerp, Belgium. In the mid 1990's, really young and inexperienced, I decided to build the Elektor 10 channel vocoder as described in the Dutch Elektor magazine from the early 1980's."
Drum machine
- LXR Drum Synthesizer - The LXR is a full fledged digital drum machine with integrated sequencer. Its sound engine provides 6 different instruments, each with over 30 parameters to tweak. It can produce a wide variety of sounds, ranging from classic analogue emulations to crunchy digital mayhem.
- eXaDrums - Electronic drums for Linux. The goal of the project is to use a Raspberry Pi to make a drum module. As far as the software goes, it is written in C++ and uses Gtkmm to display a nice graphical user interface (GUI) on a Raspberry Pi official 7" touchscreen. The hardware consists of some accelerometers or piezos connected to an analog-to-digital converter (ADC).
- DrumKid - an "aleatoric" drum machine, meaning it uses random numbers to determine the rhythm and sound of a drum beat. It comes in a handheld, battery-powered unit, designed for live performance. Check out the video here: https://www.youtube.com/watch?v=pyN_HQfCtoQ
Sampler
- https://github.com/ryanjamesmcgill/Audio-Sampler-Selector - An embedded Linux system that plays back audio samples for musical performance. The development board used was a Texas Instruments (TI) BeagleBone Black based on an AM335x 1GHz ARM® Cortex-A8 processor. For high-quality audio, a TI PCM5102A DAC was used. The program leveraged the C++ library JUCE and the Linux sound server JACK.
- YouTube: SelectorDemo
Sound module
- https://en.wikipedia.org/wiki/Sound_module - an electronic musical instrument without a human-playable interface such as a piano-style musical keyboard. Sound modules have to be operated using an externally connected device, which is often a MIDI controller, of which the most common type is the musical keyboard (although wind controllers, guitar controllers and electronic drum pads are also used). Controllers are devices that provide the human-playable interface and which may or may not produce sounds of its own. Another common way of controlling a sound module is through a sequencer, which is computer hardware or software designed to record and play back control information for sound-generating hardware (e.g., a DJ may program a bassline and use the sound module to produce the sound). Connections between sound modules, controllers, and sequencers are generally made with MIDI (Musical Instrument Digital Interface), which is a standardized protocol designed for this purpose, which includes special ports (jacks) and cables.
DAC / ADC
- https://github.com/hideakitai/MCP4728 - Arduino library for MCP4728 quad channel, 12-bit voltage output Digital-to-Analog Convertor with non-volatile memory and I2C compatible Serial Interface
- https://github.com/skiselev/i2s_audio_phat - a Raspberry Pi Zero pHAT form-factor I2S audio interface board based on a Cirrus Logic (Wolfson) WM8731 audio codec. It provides line input, line output, headphones output, and includes an on-board microphone.
- https://github.com/onkelDead/tascam.lv2 - LV2 plugin to control the Tascam US-16x08 interface via a custom ALSA driver
- https://github.com/onkelDead/tascam-gtk - GTK+ based application to control Tascam US-16x08 DSP mixer
- https://github.com/geoffreybennett/alsa-scarlett-gui - a Gtk4 GUI for the ALSA controls presented by the Linux kernel Focusrite Scarlett Gen 2/3 Mixer Driver.
- https://github.com/mattogodoy/h6 - allows you to control your Zoom H6 recorder from your computer using an USB to TTL adapter. For this, you will need a few components to make a specific cable, but it's quite simple.
Sound chip/card
- https://github.com/Skidlz/YM3427 - Info/Code for Yamaha's YM3427 IC
- https://github.com/rhargreaves/mega-drive-midi-interface - Control the Yamaha YM2612 and SN76489 chips of the SEGA Mega Drive via MIDI
- https://github.com/bitluni/ULPSoundESP32 - sketches showing how to use the Ultra Low Power (ULP) coprocessor of the ESP32 to play music, relieving the main processor cores of this task. A lightweight task periodically refills the ULP's separate memory with instructions containing the samples. This can be useful for video games, where graphics can monopolize both cores.
- Analog Renaissance? The rebirth of the impossible chips – Electric Druid
- http://www.soundsemiconductor.com/downloads/ssi2130datasheet.pdf
- https://github.com/Wohlstand/ail32-sandbox - A sandbox over AIL32. Build was ported for a modern environment with GNU Make and OpenWatcom. For details, please read the read.me.utf8.txt file - an official document for AIL32.
- https://en.wikipedia.org/wiki/AC'97 - Audio Codec '97; also MC'97 for Modem Codec '97, is an audio codec standard developed by Intel Architecture Labs in 1997. The standard was used in motherboards, modems, and sound cards. Audio components integrated into chipsets consist of two component classes: an AC'97 digital controller (DC97), which is built into the southbridge of the chipset, and AC'97 audio and modem codecs, which are the analog components of the architecture. AC'97 defines a high-quality, 16- or 20-bit audio architecture with surround sound support for the PC. AC'97 supports a 96 kHz sampling rate at 20-bit stereo resolution and a 48 kHz sampling rate at 20-bit stereo resolution for multichannel recording and playback. AC97 defines a maximum of 6 channels of analog audio output.
- https://en.wikipedia.org/wiki/Intel_High_Definition_Audio - or HD Audio or HDA, is a specification for the audio sub-system of personal computers. It was released by Intel in 2004 as successor to their AC'97 PC audio standard. During development it had the codename "Azalia".
ISA
- OS/2 Museum: A Sound Card Before Its Time - [8]
- https://github.com/schlae/snark-barker - a 100% compatible clone of the famed SB 1.0 "Killer Card" sound card from 1989. It implements all the features, including the digital sound playback and recording, Ad Lib compatible synthesis, the joystick/MIDI port, and the CMS chips (which are actually Philips SAA1099 synthesizer devices). [9] [10]
- https://github.com/crazii/SBEMU - Sound blaster emulation with OPL3 for AC97. Supported Sound cards: Intel ICH / nForce, Intel High Definition Audio, VIA VT82C686, VT8233, SB Live/Audigy
PCI
- envy24control - alsa-utils
- https://sourceforge.net/projects/kenvy24 - VIA Envy24 based sound cards control utility, for the KDE environment
Wavetable
- https://en.wikipedia.org/wiki/Wavetable_synthesis#Confusion_with_sample-based_synthesis_(S&S)_and_Digital_Wave_Synthesis - In 1992, with the introduction of the Creative Labs Sound Blaster 16 the term "wavetable" started to be (incorrectly) applied as a marketing term to their sound card. However, these sound cards did not employ any form of wavetable synthesis, but rather PCM samples and FM synthesis.
- https://en.wikipedia.org/wiki/Creative_Wave_Blaster - was an add-on MIDI-synthesizer for Creative Sound Blaster 16 and Sound Blaster AWE32 family of PC soundcards. It was a sample-based synthesis General MIDI compliant synthesizer. For General MIDI scores, the Wave Blaster's wavetable-engine produced more realistic instrumental music than the SB16's onboard Yamaha-OPL3.
- YouTube: MIDI and Wavetable - PhilsComputerLab playlist
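To make the distinction above concrete: wavetable synthesis proper scans a small stored single-cycle table at a variable rate, rather than playing back long PCM samples. A minimal sketch (not how any of the cards above actually work):

```python
import math

class WavetableOsc:
    """Minimal single-cycle wavetable oscillator with linear interpolation."""

    def __init__(self, table, fs):
        self.table = table    # one cycle of the waveform
        self.fs = fs
        self.phase = 0.0      # fractional position within the table

    def next_sample(self, freq):
        n = len(self.table)
        i = int(self.phase)
        frac = self.phase - i
        a, b = self.table[i], self.table[(i + 1) % n]
        # advance by (freq * table_length / sample_rate) slots per sample
        self.phase = (self.phase + freq * n / self.fs) % n
        return a + frac * (b - a)

# One cycle of a sine in a 256-slot table, played back at 440 Hz
table = [math.sin(2 * math.pi * i / 256) for i in range(256)]
osc = WavetableOsc(table, fs=48000)
out = [osc.next_sample(440.0) for _ in range(48000 // 440)]  # ~one cycle
```

Swapping the table for a sawtooth or a hand-drawn shape changes the timbre without changing the playback code, which is the defining trait of the technique.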
USB
- https://github.com/eltortugo/audioxtreamer - A simple multichannel USB/FPGA PCM audio interface
DSP
- https://github.com/tanvach/prettygood_dsp - self contained, Arduino compatible board for applying audio DSP. The intended purpose is to equalize and apply bass boost to BMR VR off ear headphones, but can be adapted for any other light DSP tasks by SGTL5000 codec.
- https://github.com/AidaDSP/AidaDSP - an audio shield for Arduino
Computer
- https://github.com/szymonkaliski/LoopPI2 - 6-track audio looper working on Raspberry PI 3, made with ChucK.
- Raspberry Pi Music Server With Built-in Crossover and DSP : 12 Steps (with Pictures) - Instructables
- https://github.com/jrubinstein/raspiDSP - make a raspberry pi crossover and DSP and HDMI receiver
- https://github.com/dagargo/overwitch - an Overbridge 2 device client for JACK (JACK Audio Connection Kit). This project is based on the Overbridge USB reverse engineering done by Stefan Rehm in dtdump. The papers Controlling adaptive resampling and Using a DLL to filter time by Fons Adriaensen have been very helpful and inspiring, as well as his own implementation done in the zita resamplers found in the alsa tools project. At the moment, it provides support for all Overbridge 2 devices, which are Analog Four MKII, Analog Rytm MKII, Digitakt, Digitone, Digitone Keys, Analog Heat and Analog Heat MKII.
Music workstation
FireWire
- https://en.wikipedia.org/wiki/IEEE_1394 - an interface standard for a serial bus for high-speed communications and isochronous real-time data transfer. It was developed in the late 1980s and early 1990s by Apple, which called it FireWire. The 1394 interface is also known by the brands i.LINK (Sony), and Lynx (Texas Instruments). The copper cable it uses in its most common implementation can be up to 4.5 metres (15 ft) long. Power is also carried over this cable allowing devices with moderate power requirements to operate without a separate power supply. FireWire is also available in Cat 5 and optical fiber versions. The 1394 interface is comparable to USB, though USB requires a master controller and has greater market share.
- FFADO - aims to provide a generic, open-source solution for the support of FireWire-based audio devices on the Linux platform. It is the successor of the FreeBoB project.
AES3
- https://en.wikipedia.org/wiki/AES3 - (also known as AES/EBU) is a standard for the exchange of digital audio signals between professional audio devices. An AES3 signal can carry two channels of PCM audio over several transmission media including balanced lines, unbalanced lines, and optical fiber. AES3 was jointly developed by the Audio Engineering Society (AES) and the European Broadcasting Union (EBU). The standard was first published in 1985 and was revised in 1992 and 2003. AES3 has been incorporated into the International Electrotechnical Commission's standard IEC 60958, and is available in a consumer-grade variant known as S/PDIF.
ADAT Lightpipe
- https://en.wikipedia.org/wiki/ADAT_Lightpipe - officially the ADAT Optical Interface, is a standard for the transfer of digital audio between equipment. It was originally developed by Alesis but has since become widely accepted,[1] with many third party hardware manufacturers including Lightpipe interfaces on their equipment. The protocol has become so popular that the term "ADAT" is now often used to refer to the transfer standard rather than to the Alesis Digital Audio Tape itself.
MADI
- http://en.wikipedia.org/wiki/MADI - or Multichannel Audio Digital Interface or AES10, is an Audio Engineering Society (AES) standard electronic communications protocol that defines the data format and electrical characteristics of an interface that carries multiple channels of digital audio. The AES first documented the MADI standard in AES10-1991, and updated it in AES10-2003 and AES10-2008. The MADI standard includes a bit-level description and has features in common with the two-channel format of AES3. It supports serial digital transmission over coaxial cable or fibre-optic lines of 28, 56, or 64 channels; and sampling rates of up to 96 kHz with resolution of up to 24 bits per channel. Like AES3 or ADAT it is a Uni-directional interface (one sender and one receiver).
USB
- FOSDEM 2019: Linux and USB Audio Class 3 - The USB Audio Class 3.0 is a specification recently introduced by USB Implementers Forum. Ruslan is an author of UAC3 implementation in Linux, he will give an overview of improvements and changes in this USB Audio spec, and will share current Linux support status and challenges faced during ALSA drivers implementation.
- https://aur.archlinux.org/packages/snd-usb-audio-lowlatency-dkms/
- low latency tweaks to snd-usb-audio - LinuxMusicians
Bluetooth
- Headless A2DP Audio Streaming on Raspbian Stretch - how to set up Raspbian Stretch as a headless Bluetooth A2DP audio sink. This will allow your phone, laptop or other Bluetooth device to play audio wirelessly through a Raspberry Pi.
Pedal
Jesusonic
See also DAW#JSFX
- Jesusonic - A dynamic text mode live FX processor
- https://code.google.com/p/jsfxgen - a working proof-of-concept modular IDE for generating DSP code for the JesuSonic platform, written using the Flex and AIR SDKs. JesuSonic has a standalone version and is also part of Reaper as a plugin.
- https://github.com/asb2m10/jsusfx - Opensource Jesusonic FX implementation
- https://github.com/JoepVanlier/JSFX - A bundle of JSFX and scripts for reaper.
The OWL
- The OWL - an open source, programmable audio platform made for musicians, hackers and programmers alike. Users can program their own effects, or download ready-made patches from our growing online patch library. It is available both as a guitar fx pedal and a Eurorack synthesizer module. OWL stands for Open Ware Laboratory which refers to the fact that the entire project is open source in both hardware and software. Being open source is an important issue for us in terms of making all of the technology completely accessible to the end user.
- https://github.com/pingdynasty/OwlSim - Simulator for Open Ware Laboratory, a programmable audio effects pedal
to sort
- https://github.com/EleonoreMizo/pedalevite - a DIY multi-FX pedalboard for guitar, bass or any other electric instrument. It is based on a Raspberry Pi 4 and uses a custom audio board.
Controller
See also MIDI#Controllers
- https://github.com/brendan-byrne/Modi - Modular Controller System
- https://github.com/16n-faderbank/16n - Sixteen faders, a Teensy, MIDI over USB and jack, CV out, and I2C out.
Game controller
- https://github.com/grejppi/wmcv - wmcv is a Python module that lets you use the Wiimote controller as a CV controller with JACK.
Eye tracking
- http://paulbatchelor.github.io/proj/eyejam - an open-source eye-controlled music composition environment. This was developed during my summer internship with the Enable Group at Microsoft Research. The source code can be found on GitHub under the official project name Microsoft Hands-Free Sound Jam. EyeJam is cross-platform, with support for Windows, Mac, and Linux. Eye-control is only available on Windows; on the other platforms, eye control is simulated using the mouse cursor.
Instruments
- Chimaera - a poly-magneto-phonic-theremin (we had to come up with this new subcategory in the domain of electronic instruments, as the Chimaera did not fit anywhere else). Other terms that would describe it well could be: a general-purpose-continuous-music-controller, a multi-touch-less-ribbon-controller or a possible offspring of a mating experiment of a keyboard and violin. Think of it as an invisible string that is excitable by an arbitrary number of magnetic sources. Depending on where the magnetic sources are located on the string and depending on how strong (or how near) they are, the device outputs different event signals. These general-purpose event signals then can be used to e.g. drive a synthesizer, an effects processing unit or some other hardware.
- chair.audio - making digital instruments with analog interfaces. Our mission is to make sounds tangible. That's why we are developing instruments with haptic interfaces for electronic sound - both analog and software synthesis. Our Instruments have excitable surfaces that you can scratch, hit or bow. A very limited run of our developer edition will soon be available here.
- https://github.com/SammyIAm/Moppy2 - Musical flOPPY controller
- https://news.ycombinator.com/item?id=19595623 - roli seaboard
Wire
- YouTube: Wire Recording
Music roll
- https://en.wikipedia.org/wiki/Music_roll - a storage medium used to operate a mechanical musical instrument. They are used for the player piano, mechanical organ, electronic carillon and various types of orchestrion. The vast majority of music rolls are made of paper. Other materials that have been utilized include thin card (Imhof-system), thin sheet brass (Telektra-system), composite multi-layered electro-conductive aluminium and paper roll (Triste-system) and, in the modern era, thin plastic or PET film. The music data is stored by means of perforations. The mechanism of the instrument reads these as the roll unwinds, using a pneumatic, mechanical or electrical sensing device called a tracker bar, and the mechanism subsequently plays the instrument. After a roll is played, it is necessary for it to be rewound before it can be played again. This necessitates a break in a musical performance. To overcome this problem, some instruments were built with two player mechanisms allowing one roll to play while the other rewinds. A piano roll is a specific type of music roll, and is designed to operate an automatic piano like the player piano or the reproducing piano.
- https://en.wikipedia.org/wiki/Piano_roll - a music storage medium used to operate a player piano, piano player or reproducing piano. A piano roll is a continuous roll of paper with perforations (holes) punched into it. The perforations represent note control data. The roll moves over a reading system known as a 'tracker bar' and the playing cycle for each musical note is triggered when a perforation crosses the bar and is read. A rollography is a listing of piano rolls, especially made by a single performer, analogous to a discography.
Piano rolls were in continuous mass production from around 1896 to 2008, and are still available today, with QRS Music claiming to have 45,000 titles available with "new titles being added on a regular basis". Largely replacing piano rolls, which are no longer mass-produced today, MIDI files represent a modern way in which musical performance data can be stored. MIDI files accomplish digitally and electronically what piano rolls do mechanically. Software for editing a performance stored as MIDI data often has a feature to show the music in a piano roll representation.
- Midimusic eplayWin32 - Estey and Wurlitzer e-roll player for Hauptwerk, Miditzer, GrandOrgue & eplayOrgan. This graphical player will play Estey e-rolls on any Hauptwerk or Miditzer organ and Wurlitzer Band Organ e-rolls on eplayOrgan (Windows, iMac and Linux) It will automatically operate the manuals, pedals, stops, couplers and swell. As supplied this version plays the Hauptwerk St. Annes Moseley and Paramount 310 plus the Miditzer 160, 216 or 260 organs. It also plays Wurlitzer 125, 150 and 165 organs. Other Hauptwerk or Miditzer organs can be played by adding their data via the menus. It also plays my new eplayOrgan and most other organs which can be played from midi keyboards, including GrandOrgue, Viscount and jOrgan.
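The piano-roll representation used by MIDI editors, mentioned above, can be sketched as a grid of cells: the digital counterpart of holes punched in a paper roll. The data structure and event format here are invented for illustration, not any real editor's:

```python
def piano_roll(notes, n_pitches=128, n_steps=16):
    """Render (pitch, start_step, length) note events onto a boolean
    grid, one row per MIDI pitch, one column per time step."""
    grid = [[False] * n_steps for _ in range(n_pitches)]
    for pitch, start, length in notes:
        for step in range(start, min(start + length, n_steps)):
            grid[pitch][step] = True
    return grid

# C major triad (MIDI notes 60, 64, 67) held for 4 steps from step 0
grid = piano_roll([(60, 0, 4), (64, 0, 4), (67, 0, 4)])
print(sum(row.count(True) for row in grid))  # 12 cells "punched"
```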
Multitrack recording
- https://en.wikipedia.org/wiki/Multitrack_recording - also known as multitracking, double tracking, or tracking—is a method of sound recording developed in 1955 that allows for the separate recording of multiple sound sources or of sound sources recorded at different times to create a cohesive whole. Multitracking became possible in the mid-1950s when the idea of simultaneously recording different audio channels to separate discrete "tracks" on the same reel-to-reel tape was developed. A "track" was simply a different channel recorded to its own discrete area on the tape whereby their relative sequence of recorded events would be preserved, and playback would be simultaneous or synchronized.
Prior to the development of multitracking, the sound recording process required all of the singers, band instrumentalists, and/or orchestra accompanists to perform at the same time in the same space. Multitrack recording was a significant technical improvement as it allowed studio engineers to record all of the instruments and vocals for a piece of music separately. Multitracking allowed the engineer to adjust the levels and tone of each individual track, and if necessary, redo certain tracks or overdub parts of the track to correct errors or get a better "take." As well, different electronic effects such as reverb could be applied to specific tracks, such as the lead vocals, while not being applied to other tracks where this effect would not be desirable (e.g., on the electric bass). Multitrack recording was much more than a technical innovation; it also enabled record producers and artists to create new sounds that would be impossible to create outside of the studio, such as a lead singer adding many harmony vocals with their own voice to their own lead vocal part, an electric guitar player playing many harmony parts along with their own guitar solo, or even recording the drums and replaying the track backwards for an unusual effect.
In the 1980s and 1990s, computers provided means by which both sound recording and reproduction could be digitized, revolutionizing audio recording and distribution. In the 2000s, multitracking hardware and software for computers was of sufficient quality to be widely used for high-end audio recordings by both professional sound engineers and by bands recording without studios using widely available programs, which can be used on a high-end laptop computer. Though magnetic tape has not disappeared entirely as a recording medium, the advantages of non-linear editing (NLE) and recording have resulted in digital systems largely superseding tape. Even in the 2010s, with digital multitracking being the dominant technology, the original word "track" is still used by audio engineers.
Wifi recording
- Sparrow - a basic but versatile product, allowing the recording, live broadcasting and other operations common to stage performances and TV/movie recordings.
MP3
- https://en.wikipedia.org/wiki/MP3_player - an electronic device that can play MP3 digital audio files. It is a type of digital audio player, or portable media player. Most players play more than the MP3 file format, such as Windows Media Audio (WMA), Advanced Audio Coding (AAC), Vorbis, FLAC, Speex and Ogg.
- https://en.wikipedia.org/wiki/Portable_media_player - or digital audio player (DAP) is a portable consumer electronics device capable of storing and playing digital media such as audio, images, and video files. The data is typically stored on a CD, DVD, BD, flash memory, microdrive, or hard drive. Most portable media players are equipped with a 3.5 mm headphone jack, which users can plug headphones into, or connect to a boombox or hifi system. In contrast, analogue portable audio players play music from non-digital media that use analogue signal storage, such as cassette tapes or vinyl records. Often mobile digital audio players are marketed and sold as "portable MP3 players", even if they also support other file formats and media types. Increasing sales of smartphones and tablet computers have led to a decline in sales of portable media players, leading to most devices being phased out, though flagship devices like the Apple iPod and Sony Walkman are still in production. Portable DVD/BD players are still manufactured by brands across the world.
Rockbox
- Rockbox is a free replacement firmware for digital music players. It runs on a wide range of players.
gtkpod
- gtkpod - a graphical user interface for the Apple iPod for Unix-like systems, written using the GTK+ toolkit.
PA system
- https://en.wikipedia.org/wiki/Backline_(stage) - used in popular music and sound reinforcement system contexts to refer to electronic audio amplification equipment and speaker enclosures that are placed behind the band or the rhythm section on stage, including amplifiers and speaker cabinets for guitars, bass guitars and keyboards. In the US and Canada, the term has expanded to include many of the musical instruments that the rhythm section musicians play, including pianos, Hammond organs, drum kits and various percussion instruments such as congas and bongos.
Sound system
- https://en.wikipedia.org/wiki/Sound_reinforcement_system - the combination of microphones, signal processors, amplifiers, and loudspeakers in enclosures all controlled by a mixing console that makes live or pre-recorded sounds louder and may also distribute those sounds to a larger or more distant audience. In many situations, a sound reinforcement system is also used to enhance or alter the sound of the sources on the stage, typically by using electronic effects, such as reverb, as opposed to simply amplifying the sources unaltered.
A sound reinforcement system for a rock concert in a stadium may be very complex, including hundreds of microphones, complex live sound mixing and signal processing systems, tens of thousands of watts of amplifier power, and multiple loudspeaker arrays, all overseen by a team of audio engineers and technicians. On the other hand, a sound reinforcement system can be as simple as a small public address (PA) system, consisting of, for example, a single microphone connected to a 100 watt amplified loudspeaker for a singer-guitarist playing in a small coffeehouse. In both cases, these systems reinforce sound to make it louder or distribute it to a wider audience.
Some audio engineers and others in the professional audio industry disagree over whether these audio systems should be called sound reinforcement (SR) systems or PA systems. Distinguishing between the two terms by technology and capability is common, while others distinguish by intended use (e.g., SR systems are for live event support and PA systems are for reproduction of speech and recorded music in buildings and institutions). In some regions or markets, the distinction between the two terms is important, though the terms are considered interchangeable in many professional circles.
- soundsystem.world - worldwide map of original soundsystems
- AV Tools - Android app
- https://en.wikipedia.org/wiki/Sound_system_(Jamaican) - group of disc jockeys, engineers and MCs playing ska, rocksteady or reggae music. The sound system is an important part of Jamaican culture and history.
- https://en.wikipedia.org/wiki/Sound_system_(DJ) - a group of DJs and audio engineers contributing and working together as one, playing and producing music over a large PA system or sound reinforcement system, typically for a dance event or party.
Noise meter
- Noise Meter - "My wife and I have both been working from home during the COVID-19 pandemic, and have observed that often our children (3, 6, and 8) will be making noise quite unaware of how loud they’ve become. A quick search showed that there are a number of noise traffic lights on the market, but it occurred to me that I could make my own, and that it would be a great way of learning about the Arduino platform."
Linux
"there are many good reasons to consider Linux (for pro audio): low- or no-cost software; a high-performance sound system; great audio routing; powerful sound synthesis; high-quality score notation; it's extremely customisable; more of your CPU power may be used for audio processing; it avoids many patent/license restriction pitfalls; it avoids costly tie-ins to specific product ranges; software may be (legally) modified to suit your individual needs; modular software allows you to configure your software studio the way that you want; and it's driven by passion before profit." [13]
News and communities
- Linuxaudio.org is a not-for-profit consortium of libre software projects and artists, companies, institutions, organizations, and hardware vendors using Linux kernel-based systems and allied libre software for audio-related work, with an emphasis on professional tools for the music, production, recording, and broadcast industries.
- LinuxMusicians forum - mission: to facilitate discussion, learning, and discovery of music making on the Linux platform.
- Libre Music Production is a community-driven online resource, focused on promoting musical creation and composition using free and open source (FLOSS) software. By providing hands-on material submitted by the community, such as guides, tutorials, articles and news updates, we want to show not only that there is great FLOSS audio software out there, but also how to practically use that software to make music.
- http://opensourcemusician.com
- https://archive.org/details/osmpodcast
- #opensourcemusician
- https://www.reddit.com/r/linuxaudio - A subreddit dedicated towards music and audio related topics on the Linux platform.
Mailing lists
- https://lists.linuxaudio.org/listinfo - public mailing lists on lists.linuxaudio.org
- Index of /lad - Stanford archive, 4 more years
- Linux Audio Development - Audio Codecs - old, 2006 snapshot
- Jack-devel
- Nabble: Jack-devel
IRC
Freenode
- #lau - Linux Audio Users, slower traffic, related to the mailing list
- #lad - Linux Audio Developers programming chat, related to the mailing list
- #linuxmusicians - slower traffic, related to the forum
- #opensourcemusicians - related to the podcast, FLOSS on all platforms, chat can get quite general
- #linuxmao - Francophone, related to the site
- #audio4linux.de - Germanophone, related to the site
- #archlinux-proaudio - Arch Linux proaudio project and general discussion
- #kxstudio - Debian audio repo/distro
- #studioware - Slackware multimedia
- #proaudio-overlay - Gentoo audio
- #lv2 - open audio plugin format
- #jack - audio system
- #alsa - audio system
- #pulseaudio - audio system
- #audacity - sample editor
- #ardour - DAW
- #ingen - audio host
- #non - "DAW"
- #lmms - DAW
- #rosegarden - sequencer
- #zrythm - DAW
- #surgesynth - synth
- ##zynsubaddfx - synth
- #dataflow - Pure Data
- #lilypond - notation
- #laborejo - notation, sequencer, SF2
- #openal - 3d audio
- #vorbis - codec
- ##dsp - digital signal processing
- ##music-electronics
- #musicbrainz - music tagging
- #metabrainz - MusicBrainz dev
- #edmproduction - related to the subreddit
- #RedditAudio - mostly audio consumer electronics
- ##xenharmonic - microtonal
- ##audio
- #audiovisual
- ##radio
- ##hamradio
- ##rtlsdr - software defined radio
- ##electronics
- ##music - general music listening chat
- #music - general music listening chat
- #EDM - slow, general electronic dance music chat channel
- #Juce - framework
OFTC
- #debian-multimedia
Cons
- Sonoj Convention Archive - media and results from past events
Distros
See Distros#Audio/visual, Playback#Operating System
Software lists
- Planet Linux Audio - aggregated software update news
An in-depth look at programming sound using Linux — jannewmarch
- http://www.hitsquad.com/smm/linux/ - Linux software list
- KVR: Linux free software listing - by date updated
Setup
- https://github.com/redtide/archlinux-realtime-generic-setup - Common / generic configuration for an Archlinux RT enabled kernel
- https://github.com/usrmusicman/ArchStudioUtils - Useful Scripts For Archlinux Audio
- https://github.com/joao4linux/music-daw - The purpose of this project is to create a shell script that simplifies configuring Ubuntu or Mint to use the low-latency or real-time kernel, turning your computer into a digital audio workstation (DAW). [14]
- https://github.com/dynobot/Linux-Audio-Adjustments - Debian-based RPi tweaks for improved sound.
Real time
See also *nix#Real-Time
Audio systems
- http://en.wikipedia.org/wiki/Sound_server - A sound server is software that manages the use of and access to audio devices (usually a sound card). It commonly runs as a background process.
In a Unix-like operating system, a sound server mixes different data streams and sends out a single unified audio to an output device. The mixing is usually done by software, or by hardware if there is a supported sound card.
The "sound stack" can be visualized as follows, with programs in the upper layers calling elements in the lower layers:
- Applications (e.g. mp3 player, web video)
- Sound server (e.g. aRts, ESD, JACK, PulseAudio)
- Sound subsystem (described as kernel modules or drivers; e.g. OSS, ALSA)
- Operating system kernel (e.g. Linux, Unix)
- Linux Audio Survival kit - A brief guide to successfully setting up audio on Linux.
- https://github.com/raboof/realtimeconfigquickscan - scripts to inspect a linux installation and make suggestions for improving realtime/audio performance.
- https://github.com/linuxaudio/realtime-suggestions - A bash script, that suggests optimization options (while not stating the obvious) for Linux kernel realtime use. As these are just suggestions, they should be considered with a grain of salt: Configurations on Linux distributions can differ quite a lot. That being said: This script will not think for you!
- https://github.com/dynobot/Linux-Audio-Adjustments - Audio Tweaks for Debian Based RPi
- A Guide Through The Linux Sound API Jungle - a guide to the Linux audio jungle.
"If we were drawing the [internet] OSI model used to describe the networking framework that connects your machine to every other machine on the network, we'd find clear strata, each with its own domain of processes and functionality. There's very little overlap in layers, and you certainly don't find end-user processes in layer seven messing with the electrical impulses of the raw bitstreams in layer one.
"Yet this is exactly what can happen with the Linux audio framework. There isn't even a clearly defined bottom level, with several audio technologies messing around with the kernel and your hardware independently. Linux's audio architecture is more like the layers of the Earth's crust than the network model, with lower levels occasionally erupting on to the surface, causing confusion and distress, and upper layers moving to displace the underlying technology that was originally hidden."
"ALSA itself has a kernel level stack and a higher API for programmers to use, mixing drivers and hardware properties with the ability to play back surround sound or an MP3 codec. Most distributions stick PulseAudio and GStreamer on top[,] ... The deeper the layer, the closer to the hardware it is." [15]
- Arch Forum: OSS4 vs ALSA vs PulseAudio vs Jack
- https://github.com/hodefoting/atty - audio interface and driver for terminals
OSS
- Open Sound System is an audio subsystem that provides a cross platform API and device drivers for most consumer and professional audio devices for UNIX® and POSIX based operating systems, including Linux. Owing to its open architecture, applications developed on one supporting operating system platform can be easily recompiled on any other platform.
Old.
The API is designed to use the traditional Unix framework of open(), read(), write(), and ioctl(), via special devices. For instance, the default device for sound input and output is /dev/dsp. Examples using the shell:
cat /dev/random > /dev/dsp # plays white noise through the speaker
cat /dev/dsp > a.a # reads data from the microphone and copies it to file a.a
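Because OSS exposes the sound device as an ordinary file, the same interface can be driven from any language that can open files. A minimal Python sketch (the sample format, 8-bit unsigned mono at 8 kHz, is the classic /dev/dsp default; the device node may not exist on ALSA/PulseAudio-only systems, so the write is attempted opportunistically):

```python
import os

def square_wave(freq_hz=440, rate_hz=8000, seconds=1):
    """Generate 8-bit unsigned mono samples (the classic /dev/dsp default)."""
    samples = bytearray()
    for n in range(rate_hz * seconds):
        phase = (n * freq_hz / rate_hz) % 1.0
        # crude square wave centered on the unsigned midpoint 0x80
        samples.append(0xC0 if phase < 0.5 else 0x40)
    return bytes(samples)

data = square_wave()
print(len(data))  # 8000 samples = 1 second at 8 kHz

try:
    # /dev/dsp opens like an ordinary file; write() queues samples for playback
    fd = os.open("/dev/dsp", os.O_WRONLY)
    os.write(fd, data)
    os.close(fd)
except OSError:
    pass  # no OSS device on this system
```

Real code would first use ioctl() (SNDCTL_DSP_* requests) to negotiate format, rate and channels instead of relying on the defaults.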
- https://github.com/libfuse/osspd - OSS Proxy uses CUSE (extension of FUSE allowing character devices to be implemented in userspace) to implement OSS interface - /dev/dsp, /dev/adsp and /dev/mixer. From the POV of the applications, these devices are proper character devices and behave exactly the same way so it can be made quite versatile.
- https://github.com/hselasky/virtual_oss - an audio mixing application that multiplexes and demultiplexes a single OSS device into multiple customizable OSS-compatible devices using character devices from userspace. These devices can be used to record played-back audio and mix the individual channels in multiple ways.
ALSA
- Advanced Linux Sound Architecture (ALSA) provides audio and MIDI functionality to the Linux operating system.
"ALSA is responsible for translating your audio hardware's capabilities into a software API that the rest of your system uses to manipulate sound. It was designed to tackle many of the shortcomings of OSS (and most other sound drivers at the time), the most notable of which was that only one application could access the hardware at a time. This is why a software component in ALSA needs to manage audio requests and understand your hardware's capabilities.
"ALSA was designed to replace OSS. However, OSS isn't really dead, thanks to a compatibility layer in ALSA designed to enable older, OSS-only applications to run. It's easiest to think of ALSA as the device driver layer of the Linux sound system. Your audio hardware needs a corresponding kernel module, prefixed with snd_, and this needs to be loaded and running for anything to happen. This is why you need an ALSA kernel driver for any sound to be heard on your system, and why your laptop was mute for so long before someone thought of creating a driver for it. Fortunately, most distros will configure your devices and modules automatically." [16]
- http://lxr.free-electrons.com/source/Documentation/sound/alsa/ALSA-Configuration.txt
- http://en.wikipedia.org/wiki/Advanced_Linux_Sound_Architecture
- The Linux Kernel documentation: Linux Sound Subsystem Documentation
- The Linux Kernel documentation: Advanced Linux Sound Architecture - Driver Configuration guide
- The ALSA Driver API
- http://i.imgur.com/f66sf.png - ALSA job
- https://github.com/tiwai/salsa-lib - a small, light-weight, hot and spicy version of the ALSA library, mainly for embedded systems with limited resources. The library is designed to be source-level compatible with the ALSA library API for limited contents. Most function calls are inlined and access the hardware directly via system calls. Some components like the ALSA sequencer aren't supported, and most of all, the alsa-lib plugins and configurations are completely dropped. Thus, neither dmix nor format conversion is available with SALSA-lib.
Information
less /proc/asound/card0/pcm0p/sub0/hw_params # current hardware info
cat /proc/asound/cards # list audio hardware
cat /proc/asound/card0 # list card info
cat /proc/asound/devices # list audio devices
aplay -L # list all PCMs defined
modinfo soundcore # kernel sound module info
lsmod | grep snd
lspci -v | grep -i audio # show some kernel info
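For scripting, the /proc/asound/cards text format is easy to parse. A small sketch (the sample text below is illustrative, in the format the kernel uses; real output depends on your hardware):

```python
import re

def parse_cards(text):
    """Parse /proc/asound/cards-style text into (index, id, description) tuples."""
    cards = []
    for line in text.splitlines():
        # entry lines look like: " 0 [PCH            ]: HDA-Intel - HDA Intel PCH"
        m = re.match(r"\s*(\d+)\s+\[(\w+)\s*\]:\s*(.*)", line)
        if m:
            cards.append((int(m.group(1)), m.group(2), m.group(3)))
    return cards

# Illustrative sample in the /proc/asound/cards format (not real hardware)
sample = """ 0 [PCH            ]: HDA-Intel - HDA Intel PCH
                      HDA Intel PCH at 0xf7f10000 irq 31
 1 [Device         ]: USB-Audio - USB Audio Device
                      USB Audio Device at usb-0000:00:14.0-2
"""
print(parse_cards(sample))
```

On a real system, read the text with `open("/proc/asound/cards").read()` instead of the sample string.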
- alsacap - ALSA device capability lister. Scans soundcards known to ALSA for devices and subdevices, and displays the ranges of configuration parameters for the given ALSA device.
- QasConfig is a graphical browser for the configuration tree and can help to analyze and debug an ALSA setup.
- alsa-query.c - Print hardware capabilities of ALSA device
Configuration
ALSA settings are stored in the file 'asound.state'; its location can vary depending on the distribution.
- http://www.alsa-project.org/main/index.php/Asoundrc - Neither the user-side .asoundrc nor the asound.conf configuration files are required for ALSA to work properly. Most applications will work without them. These files are used to allow extra functionality, such as routing and sample-rate conversion, through the alsa-lib layer.
The keyword default is defined in the ALSA lib API and will always access hw:0,0 — the default device on the default soundcard. Specifying the !default name supersedes the one defined in the ALSA lib API.
pcm.NAME {
    type hw              # Kernel PCM
    card INT/STR         # Card name or number
    [device] INT         # Device number (default 0)
    [subdevice] INT      # Subdevice number, -1 first available (default -1)
    mmap_emulation BOOL  # enable mmap emulation for ro/wo devices
}
- ALSA project - the C library reference: PCM (digital audio) interface - ALSA uses a ring buffer to store outgoing (playback) and incoming (capture, record) samples. Two pointers are maintained to allow precise communication between application and device, pointing to the current sample processed by the hardware and the last sample processed by the application. Modern audio chips allow programming of the transfer time periods, meaning the stream of samples is divided into small chunks; the device notifies the application when the transfer of a chunk is complete.
- PCM (digital audio) plugins - these extend functionality and features of PCM devices. The plugins take care about various sample conversions, sample copying among channels and so on.
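The pointer arithmetic described above can be illustrated with a toy model (pure Python, no ALSA involved; the class and method names are invented for illustration): the hardware pointer advances one period at a time, and the space available to the application is whatever lies behind it.

```python
class RingBuffer:
    """Toy model of an ALSA-style ring buffer with hw/appl pointers (in frames)."""

    def __init__(self, buffer_size=1024, period_size=256):
        self.buffer_size = buffer_size
        self.period_size = period_size
        self.hw_ptr = 0    # frames consumed by the "hardware" so far
        self.appl_ptr = 0  # frames written by the application so far

    def avail(self):
        # free frames the application may fill without overwriting unplayed data
        return self.buffer_size - (self.appl_ptr - self.hw_ptr)

    def write(self, frames):
        # application queues samples; only as many as fit are accepted
        frames = min(frames, self.avail())
        self.appl_ptr += frames
        return frames

    def period_elapsed(self):
        # "hardware" consumed one period; real ALSA signals this with an interrupt
        self.hw_ptr = min(self.hw_ptr + self.period_size, self.appl_ptr)

rb = RingBuffer()
print(rb.write(1024))  # fill the whole buffer -> 1024
print(rb.avail())      # 0: buffer full, application must wait
rb.period_elapsed()
print(rb.avail())      # 256: one period was played, one period is free again
```

This is the essence of period-based wakeup: smaller periods mean lower latency but more frequent interrupts.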
Plugin: hw
This plugin communicates directly with the ALSA kernel driver. It is raw communication without any conversions. The emulation of mmap access can optionally be enabled, but expect worse latency in that case.
The nonblock option specifies whether the device is opened in a non-blocking manner. Note that the blocking behavior for read/write access won't be changed by this option; it influences only the blocking behavior when opening the device. If you would like to keep compatibility with older ALSA software, turn this option off.
Plugin: file
This plugin stores the contents of a PCM stream to a file or pipes the stream to a command, and can optionally use an existing file as an input data source (i.e., a "virtual mic").
- http://alsa.opensrc.org/Dmix - Mixing enables multiple applications to output sound at the same time. Most discrete sound cards support hardware mixing, which is enabled by default if available. Integrated motherboard sound cards (such as Intel HD Audio), usually do not support hardware mixing. On such cards, software mixing is done by an ALSA plugin called dmix. This feature is enabled automatically if hardware mixing is unavailable.
- http://alsa.opensrc.org/Dsnoop - the equivalent of the dmix plugin, but for recording sound. The dsnoop plugin allows several applications to record from the same device simultaneously.
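As a concrete illustration of enabling software mixing by hand, a commonly seen ~/.asoundrc sketch that routes the default PCM through dmix (the card address, ipc_key and rate below are assumptions; adjust them to your hardware):

```
pcm.!default {
    type plug
    slave.pcm "dmixed"
}

pcm.dmixed {
    type dmix
    ipc_key 1024          # any integer unique among dmix definitions
    slave {
        pcm "hw:0,0"      # first device on the first card
        rate 48000
    }
}
```

On most modern setups this is unnecessary, since dmix is enabled automatically when hardware mixing is unavailable.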
- Alsa Opensrc Org - These instructions apply to digital connections using either electrical coax or optical connections.
- Linux ALSA sound notes
- https://dl.dropboxusercontent.com/u/18371907/asoundrc - mega config suggestions/fixes/information
- JACK and Headphones - a virtual device in .asoundrc file that maps two channels of audio to all four channels on the soundcard. With that in place, we need to tell JACK to connect to our new virtual device "hpfix".
- https://github.com/hselasky/alsa-seq-server - Userspace ALSA MIDI sequencer server
- https://github.com/xTibor/aseqmatrix - A matrix-style patch bay for the ALSA sequencer interface.
- https://github.com/danieloneill/alsalist - Very basic tool to scan ALSA sequencer devices and list clients/sources in an easily parseable way, used (instead of a BASH mess of "aconnect -l" and a pile of pipes) to automate connecting a DTX400k kit USB MIDI data to the output of a USB MIDI adapter.
Libraries
- https://github.com/yobert/alsa - golang alsa client implementation
loopback
- http://linux.die.net/man/1/alsaloop - allows creating a PCM loopback between a PCM capture device and a PCM playback device; supports multiple soundcards, adaptive clock synchronization, and adaptive rate resampling using the samplerate library (if available in the system). Also, mixer controls can be redirected from one card to another (for example Master and PCM).
Tools
alsactl # advanced controls for ALSA soundcard driver
alsactl init # initiate basic configuration
alsactl store # store current configuration
- stativ / asoundconf - asoundconf-gtk, the GTK+ applet to allow you to select your default sound card.
- http://alsa.opensrc.org/Alsa-tools
- https://launchpad.net/ubuntu/xenial/+package/alsa-tools-gui
- echomixer - control tool for Echoaudio soundcards
- envy24control - control tool for Envy24 (ice1712) based soundcards
- hdajackretask - retask jacks on HDA Intel hardware
- hdspconf - GUI program to control the Hammerfall HDSP Alsa Settings.
- hdspmixer - tool to control the advanced routing features of the RME Hammerfall DSP.
- rmedigicontrol - control tool for RME Digi32 and RME Digi96 soundcards
speaker-test -c 2 # Using 16 octaves of pink noise, alsa-utils
- https://github.com/nedko/u7 - a program for controlling ALSA volume through Linux input device.
- https://github.com/nuc/Midi-Connector - aconnect wrapper & web ui, to be used on a Raspberry Pi
- https://github.com/mzero/amidiminder - ALSA utility to keep your MIDI devices connected
- https://github.com/x42/alsa_request_device - This tool sends a request to the session message bus to reserve an audio-device: Other applications which may currently use the device are asked to release it (which may or may not succeed depending on the given priority -p).
- https://github.com/gch1p/alsa-volume-monitor - a simple program written in C that listens to ALSA events and emits a DBus signal when something has been changed (e.g. volume). It was created for use with Awesome WM in volume indicator widgets.
- https://github.com/pascalhuerst/alsa2fifo - Simple tool, that reads audio samples from an alsa device and writes it into a fifo.
- https://github.com/alsa-project/alsa-tests - a collection of various test tools for the API conformance and functionality.
PulseAudio
- PulseAudio is a sound system for POSIX OSes, meaning that it is a proxy for your sound applications. It allows you to do advanced operations on your sound data as it passes between your application and your hardware. Things like transferring the audio to a different machine, changing the sample format or channel count and mixing several sounds into one are easily achieved using a sound server.
- How PulseAudio works - graphic
Configuration
user specific pulseaudio config;
~/.pulse/default.pa # to load modules and define defaults
~/.pulse/client.conf # to configure a client for the sound server
~/.pulse/daemon.conf # to define sample rates and buffers
To avoid .pulse-cookie in home folder, set the following in /etc/pulse/client.conf [20]
cookie-file = /tmp/pulse-cookie
By default, pulseaudio changes master and application volume at the same time. To disable this, edit /etc/pulse/daemon.conf or ~/.config/pulse/daemon.conf with:
flat-volumes = no
alternate-sample-rate = 44100 # in daemon.conf
echo "alternate-sample-rate = 44100" >> ~/.pulse/daemon.conf && echo "flat-volumes = no" >> ~/.pulse/daemon.conf
man pulse-cli-syntax # pulseaudio commandline help
pactl # control a running PulseAudio sound server
pacmd # reconfigure a PulseAudio sound server during runtime
pacmd list-cards
pacmd dump
pacmd unload-module module-udev-detect && pacmd load-module module-udev-detect # register a new device [21]
pacmd load-module module-null-sink sink_name=virtual # loopback device
- http://0pointer.de/lennart/projects/paprefs - PulseAudio Preferences (paprefs) is a simple GTK based configuration dialog for the PulseAudio sound server.
- https://pypi.org/project/pulsectl - high-level interface and ctypes-based bindings for PulseAudio (libpulse), mostly focused on mixer-like controls and introspection-related operations (as opposed to e.g. submitting sound samples to play, player-like client).
- https://github.com/flexibeast/pulseaudio-control - pulseaudio-control controls PulseAudio volumes from Emacs, via pactl.
- https://github.com/umlaeute/pa-systray - tiny systray icon to turn on/off pulseaudio
- https://github.com/christophgysin/pasystray - allows setting the default PulseAudio source/sink and moving streams on the fly between sources/sinks without restarting the client applications.
- https://github.com/Junker/mictray - a Lightweight application which lets you control the microphone state and volume from system tray
- https://github.com/miek/midi2pamixer - Control PulseAudio mixer with MIDI device
- https://github.com/rhaas80/pa_volume - a simple tool to set the remembered volume level of pulseaudio clients. It requires module-stream-restore to be loaded (which is usually the case) to function. When called without arguments it shows all the known clients (running and non-running) and their remembered volume level. To set the volume level, pass it the name of the client followed by the volume in percent.
Mixer
GUI
- https://github.com/lxde/pavucontrol-qt - A Pulseaudio mixer in Qt (port of pavucontrol)
- https://github.com/rafalcieslak/pavucontrol - fork with a compact UI
CLI
- https://github.com/graysky2/pulseaudio-ctl - Control pulseaudio volume from the shell or mapped to keyboard shortcuts. No need for alsa-utils. [22]
- https://github.com/cdemoulins/pamixer - like amixer but for pulseaudio. It can control the volume levels of the sinks.
- https://github.com/falconindy/ponymix - CLI volume control for PulseAudio
TUI
- https://github.com/GeorgeFilipkin/pulsemixer - cli and curses mixer for pulseaudio. horizontal level bars, mousewheel selects channel
- h/j/k/l, arrows - navigation
- H/L, Shift+Left/Shift+Right - change volume by 10
- 1/2/3/4/5/6/7/8/9/0 - set volume to 10%-100%
- m - mute/unmute
- Space - lock/unlock channels together
- Enter - context menu
- F1/F2/F3 - change modes
- Tab - go to next mode
- Mouse left click - select device or mode
- Mouse wheel - volume change
- q/Esc/^C - quit
- https://github.com/fulhax/ncpamixer - horizontal ncurses PulseAudio Mixer inspired by pavucontrol.
- https://github.com/patroclos/PAmix - horizontal ncurses pulseaudio mixer in C++, similar to pavucontrol
- https://github.com/mk-fg/pulseaudio-mixer-cli - Interactive python/ncurses UI to control volume of pulse streams, kinda like alsamixer, focused not on sink volume levels (which can actually be controlled via alsamixer, with alsa-pulse plugin), but rather on volume of individual streams, so you can tune down the music to hear the stuff from games, mumble, skype or browser.
- https://github.com/KenjiTakahashi/pacmixer - an alsamixer alike for PulseAudio. breaks PA connections for PNmixer and gives error. mousewheel = WTF!
- https://github.com/TheDarrenJoseph/purses - PulseAudio ncurses Audio Visualiser written in C
Web
- https://github.com/Siot/PaWebControl - PulseAudio Web Volume Control. Requirements: PHP web server, PulseAudio pactl command
Processing
pulseaudio-equalizer
- http://www.freedesktop.org/wiki/Software/PulseAudio/Documentation/User/Equalizer - A LADSPA-based multiband equalizer approach for getting better sound out of pulseaudio. This equalizer is clearly more potent than the (deprecated?) optional one from Pulseaudio.
- https://sourceforge.net/projects/qpaeq - qpaeq is an equalizer interface for pulseaudio
Loudness Equalizer
- https://github.com/gotbletu/shownotes/blob/master/pulseaudio-dynamic-range-compression.md - Loudness Equalizer aka Pulseaudio Dynamic Range Compression (LADSPA swh-plugins)
T5! Crossover Rack / Parametric Equalizer
- T5! - DIY DSP software projects, speaker crossovers and equalization in software. This is for several reasons: It is way easier to adjust filters in software than in hardware and you will be adjusting a lot while designing your own speakers... In my opinion software based filters (DSP) are good enough to perform similarly if not better than their analog counterparts. With a multi-channel DAC multi-amping is a no-brainer, too. Processing power is cheap nowadays, analog circuitry is not. Thus you will find some on this page.
- ladspa-t5-plugins - a collection of LADSPA audio processing plugins. They are used in the Pulseaudio Parametric Equalizer and Pulseaudio Crossover Rack.
- Pulseaudio Crossover Rack - a program to design and implement multi-way speaker crossovers using any Linux-powered computer with a multi-channel sound card and a running desktop environment which uses Pulseaudio as its sound backend. It also uses a set of LADSPA plugins, namely ladspa-t5-plugins, for the heavy lifting of DSP/audio processing. It's written in python3 and uses Qt as the windowing toolkit.
- Pulseaudio Parametric Equalizer - a Python GUI to insert a fully parametric three-band equalizer with high and low shelves into the PulseAudio audio server. I mainly wrote this inspired by the existing pulseaudio-equalizer project. I was in need of a fully parametric EQ for proper speaker response equalization, though, and so I wrote this application.
vamp-live-host
- https://github.com/chrisbaume/vamp-live-host - Host for vamp plugins which processes a live audio signal using pulseaudio.
micFX
- https://github.com/schelcc/micFX - Live microphone effects in linux using SoX and pulseaudio
prettyeq
- https://github.com/keur/prettyeq - a system-wide parametric equalizer for PulseAudio [23]
pulseaudio-webrtc-audio-processing
- https://github.com/freedesktop/pulseaudio-webrtc-audio-processing - meant to be a more Linux-packaging-friendly copy of the AudioProcessing module from the WebRTC project. The ideal case is that we make no changes to the code, to make tracking upstream code easy.
sound-of-interrupts
- https://github.com/matiaslina/sound-of-interrupts - A small program that makes sounds, via PulseAudio, based on how much interrupt activity the processor is handling
JDSP4Linux
- https://github.com/Audio4Linux/JDSP4Linux - An audio effect processor for PipeWire and PulseAudio clients
Other
pasuspender
pasuspender -- audacity # temporarily suspend pulseaudio and launch audacity, for when PA gets in the way and config yak shaving isn't an option
autopulse
- https://github.com/naftulikay/autopulse - Script for dynamically changing your default PulseAudio sink on hotplug events for USB peripherals, etc.
pamidicontrol
- https://github.com/solarnz/pamidicontrol - A utility to control the volume of PulseAudio streams / sinks / sources with a midi device
vu
- https://github.com/zezic/vu - Super-smooth VU meter for PulseAudio
papeaks
- https://github.com/futpib/papeaks - PulseAudio volume peaks as a text or binary output (for scripts to work with)
padsp
- https://linux.die.net/man/1/padsp - PulseAudio OSS Wrapper. starts the specified program and redirects its access to OSS compatible audio devices (/dev/dsp and auxiliary devices) to a PulseAudio sound server. padsp uses the $LD_PRELOAD environment variable that is interpreted by ld.so(8) and thus does not work for SUID binaries and statically built executables. Equivalent to using padsp is starting an application with $LD_PRELOAD set to libpulsedsp.so
cat /dev/urandom | padsp tee /dev/audio > /dev/null
apulse
- https://github.com/i-rinat/apulse - pulseaudio emulation for ALSA
pulseaudio-dlna
- https://github.com/masmu/pulseaudio-dlna - A lightweight streaming server which brings DLNA / UPNP and Chromecast support to PulseAudio and Linux
pulseaudio-raop2
- http://hfujita.github.io/pulseaudio-raop2 - Experimental RAOP2 (Apple AirPlay2) support for PulseAudio
noise-volume-daemon
- https://github.com/sonofevil/noise-volume-daemon - Bash script which dynamically fades in/out the volume of an audio-generating process (e.g. an ambient noise generator like anoise.py) in response to whether other processes are using audio. Requires Pulseaudio. Uses pipes that might be fragile. Can't tell the difference between paused and playing audio.
pagraphcontrol
- https://github.com/futpib/pagraphcontrol - PulseAudio Graph Control
PulseDroid
JACK
See also #JACK configuration
- JACK - a system for handling real-time, low-latency audio (and MIDI). It runs on GNU/Linux, Solaris, FreeBSD, OS X and Windows (and can be ported to other POSIX-conformant platforms). It can connect a number of different applications to an audio device, as well as allowing them to share audio between themselves. Its clients can run in their own processes (i.e. as normal applications), or they can run within the JACK server (i.e. as a "plugin"). JACK also has support for distributing audio processing across a network, both fast & reliable LANs as well as slower, less reliable WANs.
- PulseAudio and Jack - differences
- Sound Engineers Guide to Jackd - This page attempts to collect in one place various issues surrounding the design and abilities of Paul Davis's wonderful Jackd from the point of view of a technical user. It is not an introduction or a usage howto.
- PDF: Timing Measurements In JACK 2 - S. Letz, D. Fober, Y. Orlarey, Grame - Centre national de création musicale
- PDF: Surviving on Planet CCRMA, two Years Later and Still alive. - Fernando Lopez-Lezcano, 2005
- https://github.com/Barrett17/libjackcompat - an experimental media_kit compatibility layer and a rewrite of the jack API. This is done by emulating the jack client using a Haiku media_node as backend.
Clients
- https://github.com/jackaudio/example-clients
- https://github.com/jackaudio/example-clients/blob/master/inprocess.c - internal client, runs as part of jackd
- https://github.com/resinbeard/jacksandbox - a simple JACK client for learning and testing audio code.
- https://github.com/marcdinkum/jack_module - C++ wrapper for JACK audio, containing lock-free ringbuffers and some examples.
- https://github.com/fps/jack_wakeup - A small utility to sample wakeup times for a jackd client. Its usefulness is mostly limited to being run as the only client in a jack session, since only in that case is it guaranteed to be run as soon as possible after a period has started.
- https://github.com/fps/jack2_split - A program that facilitates parallelism in serial jack graphs by introducing latency. Only useful for jack2/jackdmp - it does nothing but add latency in jack1 setups.
- https://github.com/dagargo/jack-client-template - Template to create JACK clients
Libraries / headers
- https://github.com/jackaudio/headers - JACK API headers
- https://github.com/ventosus/jack_osc - a workaround for Jack to support routing sample-accurate OSC packets via Jack MIDI ports as discussed at LAC2014.
- https://github.com/x42/weakjack - small library abstracts the JACK Application Binary Interface for weak/runtime libjack linking.
- https://github.com/stetre/luajack - a Lua binding library for the JACK Audio Connection Kit. It runs on GNU/Linux and requires Lua (>=5.3) and JACK (API >= v0.124.1).
- https://gitlab.com/gabrbedd/jacksquat - A JACK mock library for use in unit testing
Metadata
- https://github.com/drobilla/jackey - Jack Metadata Property Definitions
- jack-property-listener.py - Listen to and print JACK client/port meta-data changes.
sndio
- sndio is a small audio and MIDI framework, part of the OpenBSD project. It provides a lightweight audio & MIDI server and a fully documented user-space API to access either the server or the hardware directly, in a uniform way. Sndio is designed to work for desktop applications, but pays special attention to the synchronization mechanisms and reliability required by music applications. Reliability through simplicity is part of the project goals.
PipeWire
CRAS
- CRAS: ChromeOS Audio Server - allows for sound to be routed dynamically to newly attached audio-capable monitors (DisplayPort and HDMI), USB webcam, USB speakers, bluetooth headsets, etc., and in a way that requires as little CPU as possible and that adds little or no latency.
dspd
- https://github.com/dspdaemon/dspd - A Linux sound daemon with minimal dependencies that implements several existing APIs and protocols
aRts
See #aRts_2
NAS
- The Network Audio System (NAS) - a network transparent, client/server audio transport system. It can be described as the audio equivalent of an X server. Enjoy!
Enlightened Sound Daemon
Integration
- JACK FAQ: How do I use PulseAudio and JACK?
- https://github.com/jackaudio/jackaudio.github.com/wiki/WalkThrough_User_PulseOnJack
- JACK FAQ: How do I route audio from Flash to JACK?
- JACK FAQ: Routing GStreamer audio via JACK
- Setting up Jack Audio for GStreamer, Flash, and VLC - First of all I need to say that I won't be mentioning Pulseaudio, so if that is what you're here for then you are at the wrong place, because I don't use Pulseaudio at all. Pulseaudio can be run on top of Jack, but doing so will increase CPU load (a very tiny amount on modern systems).
- https://github.com/brummer10/pajackconnect - Make JACK work with PulseAudio. This script is intended to be invoked via QjackCtl to start up and shut down JACK on a system running PulseAudio. It handles the necessary setup to make the two work together, so PulseAudio clients get transparently routed through JACK while the latter is running, or, if PulseAudio is suspended by pasuspender, it does nothing [24]
Windows
- https://wiki.jriver.com/index.php/WASAPI - Microsoft's most modern method for talking to audio devices. It is available in Windows Vista, Windows 7, and later versions of Windows. It allows delivering an unmodified bitstream to a sound device, and provides benefits similar to those provided by ASIO drivers. One of the other main benefits of WASAPI is that it provides applications with exclusive access to audio devices, bypassing the system mixer, default settings, and typically any effects provided by the audio driver. WASAPI is the recommended Audio Output Mode for Windows unless your audio device has a well-behaved ASIO driver; it effectively replaces all legacy output modes, including Kernel Streaming and Direct Sound.
- https://en.wikipedia.org/wiki/DirectSound - a deprecated software component of the Microsoft DirectX library for the Windows operating system. DirectSound provides a low-latency interface to sound card drivers written for Windows 95 through Windows XP and can handle the mixing and recording of multiple audio streams.
- https://en.wikipedia.org/wiki/DirectMusic - a deprecated component of the Microsoft DirectX API that allows music and sound effects to be composed and played and provides flexible interactive control over the way they are played. Architecturally, DirectMusic is a high-level set of objects, built on top of DirectSound, that allow the programmer to play sound and music without needing to get quite as low-level as DirectSound. DirectSound allows for the capture and playback of digital sound samples, whereas DirectMusic works with message-based musical data. Music can be synthesized either in hardware, in the Microsoft GS Wavetable SW Synth, or in a custom synthesizer.
- https://en.wikipedia.org/wiki/XAudio2 - a lower-level audio API for Microsoft Windows, Xbox 360 and Windows Phone 8, the successor to DirectSound on Windows and a supplement to the original XAudio on the Xbox 360. XAudio2 operates through the XAudio API on the Xbox 360, through DirectSound on Windows XP, and through the low-level audio mixer WASAPI on Windows Vista and higher.
- ASIO4ALL - Universal ASIO Driver For WDM Audio
- https://github.com/chriskohlhoff/asio - Asio C++ Library
- https://github.com/duncanthrax/scream - a virtual device driver for Windows that provides a discrete sound device. Audio played through this device is published on your local network as a PCM multicast stream. Receivers on the network can pick up the stream and play it through their own audio outputs. Receivers are available for Unix/Linux (interfacing with PulseAudio or ALSA) and for Windows. For the special scenario of a Windows guest on a QEMU host, @martinellimarco has contributed support for transferring audio via the IVSHMEM driver mechanism, similar to the GPU pass-through software "Looking Glass". See the section on IVSHMEM below. Scream is based on Microsoft's MSVAD audio driver sample code. The original code is licensed under MS-PL, as are my changes and additions.
- https://github.com/andreiw/adlib21 - Updated Windows 3.1/3.11 msadlib.drv for 21st century with support for OPL2LPT/OPL3LPT
JACK configuration
pasuspender -- jackd # temporarily suspend pulseaudio and start jack (needed for jack1 without PA patch)
- jackd(1) - the JACK audio server daemon, a low-latency audio server. Originally written for the GNU/Linux operating system, it also supports Mac OS X and various Unix platforms. JACK can connect a number of different client applications to an audio device and also to each other. Most clients are external, running in their own processes as normal applications. JACK also supports internal clients, which run within the jackd process using a loadable "plugin" interface.
jackd -R -P89 -njack-server -dalsa -dhw:0 -r48000 -p256 # start jackd: realtime (-R) at priority 89, server name "jack-server" (-n, must precede the backend option), ALSA backend on soundcard hw:0, 48 kHz sample rate, 256 frames per period
- List of JACK Frame & Period settings ideal for USB interface - LinuxMusicians - (Frames/Sample Rate) * Period = Theoretical (or Math-derived) Latency
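The formula above can be checked quickly from the shell. The numbers here (256 frames, 48 kHz, 2 periods) are just an illustrative USB-interface setting, not a recommendation:

```shell
# Theoretical latency = (frames / sample rate) * periods, shown in ms.
# 256 frames at 48000 Hz with 2 periods:
awk 'BEGIN { printf "%.2f ms\n", 256 / 48000 * 2 * 1000 }'
# prints: 10.67 ms
```

Halving the frame count (or the period count) halves the figure; real round-trip latency adds converter and driver overhead on top of this math-derived value.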
# jack2 package commands:
#   jack_alias jack_bufsize jack_control jack_cpu jack_cpu_load jack_disconnect
#   jack_evmon jack_freewheel jack_iodelay jack_latent_client jack_load
#   jack_metro jack_midi_dump jack_midi_latency_test jack_midiseq jack_midisine
#   jack_monitor_client jack_multiple_metro jack_net_master jack_net_slave
#   jack_netsource jack_rec jack_samplerate jack_server_control
#   jack_session_notify jack_showtime jack_simple_client
#   jack_simple_session_client jack_test jack_thru jack_transport jack_unload
#   jack_wait jack_zombie

jack_connect fluidsynth:l_00 system:playback_3
jack_connect fluidsynth:r_00 system:playback_4
jack_lsp # list jack ports
jack_lsp -c # list jack port connections (sinks indented)
- jack-play(1) — jack-tools — Debian stretch — Debian Manpages - a light-weight JACK sound file player. It creates as many output ports as there are channels in the input file. It will connect to ports mentioned in the environment variable JACK_PLAY_CONNECT_TO, which must include a %d pattern to indicate port number; otherwise it implements no connection logic, so use jack-plumbing(1) instead. Written by Rohan Drape.
- q=0: SRC_LINEAR
- q=1: SRC_ZERO_ORDER_HOLD
- q=2: SRC_SINC_FASTEST
- q=3: SRC_SINC_MEDIUM_QUALITY
- q=4: SRC_SINC_BEST_QUALITY
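A quick sketch of how the %d pattern in JACK_PLAY_CONNECT_TO expands per channel; the port names are assumed examples, and jack-play itself performs this substitution internally (printf just mimics it here for a stereo file):

```shell
# JACK_PLAY_CONNECT_TO must contain a %d, which jack-play replaces
# with the channel number when auto-connecting its output ports:
JACK_PLAY_CONNECT_TO="system:playback_%d"
for ch in 1 2; do
    printf "${JACK_PLAY_CONNECT_TO}\n" "$ch"
done
# prints:
# system:playback_1
# system:playback_2
```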
- jack_iodelay - creates one input and one output port, and then measures the latency (signal delay) between them. For this to work, the output port must be connected to its input port, directly or through external hardware. The measurement is accurate to better than 1 sample.
- https://gareus.org/oss/jackfreqd/start - heavily based on powernowd. Instead of taking CPU load as the parameter for deciding on the CPU frequency, jackfreqd uses JACK DSP load, and it only supports powernowd's aggressive mode. Optionally jackfreqd can also take CPU load into account, which comes in handy when the JACK daemon is temporarily unavailable or if frequency scaling should also be done for non-audio processes.
- https://github.com/anwyn/systemd.user - ALSA, Jack, Pulseaudio and Systemd User Sessions
Utilities
jack_control
D-Bus control via python2-dbus
jack_control start # start the jack server
jack_control stop # stop the jack server
jack_control status # check whether the jack server is started; return value is 0 if running and 1 otherwise
jack_control dg # current driver
jack_control dp # current driver parameters
jack_control dl # list available drivers
jack_control ds alsa # select alsa as the driver (backend)
jack_control sm # switch master to the currently selected driver
jack_control eps realtime True # set an engine parameter, such as realtime
jack_control dps period 256 # set the driver parameter period to 256
etc.:
- help - print this help text
- dpd <param> - get long description for driver parameter
- dps <param> <value> - set driver parameter
- dpr <param> - reset driver parameter to its default value
- asd <driver> - add slave driver
- rsd <driver> - remove slave driver
- il - get list of available internals
- ip <name> - get parameters of given internal
- ipd <name> <param> - get long description for internal parameter
- ips <name> <param> <value> - set internal parameter
- ipr <name> <param> - reset internal parameter to its default value
- iload <name> - load internal
- iunload <name> - unload internal
- ep - get engine parameters
- epd <param> - get long description for engine parameter
- eps <param> <value> - set engine parameter
- epr <param> - reset engine parameter to its default value
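Putting those subcommands together, a minimal bring-up sequence can be sketched as a script. The jc wrapper and its DRY_RUN switch are my own additions, not part of jack2; the parameter names (device, rate, period, realtime) are the jackdbus ones:

```shell
# Sketch: configure and start JACK via jack_control.
# With DRY_RUN=1 the commands are printed instead of executed;
# unset DRY_RUN to run them for real against jackdbus.
DRY_RUN=1
jc() { ${DRY_RUN:+echo} jack_control "$@"; }

jc ds alsa            # select the ALSA backend
jc dps device hw:0    # soundcard
jc dps rate 48000     # sample rate
jc dps period 256     # frames per period
jc eps realtime True  # enable realtime scheduling
jc start              # start the server
```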
jack-select
- https://github.com/SpotlightKid/jack-select - A systray application to quickly change the JACK-DBus configuration from QjackCtl presets.
jackman
- https://github.com/progwolff/jackman - Collection of scripts that help managing multiple audio interfaces with Jack
- https://github.com/progwolff/jackman_kcm - GUI for KDE Config Manager
jacksettings
- https://github.com/redtide/jacksettings - JACK settings using a jackd based systemd service
Connections
- https://linuxmusicians.com/viewtopic.php?p=95025#p95025 - jack_load audioadapter
alsa_in / alsa_out
- man: alsa_in, alsa_out - Jack clients that perform I/O with an alternate audio interface
alsa_in -j "Description" -d prefix:Name -q 1 2>&1 1> /dev/null & # send ALSA microphone input to a JACK input device
# -d = device name, e.g. hw:2
# -q = quality of resampler, 1-4
# -c = channels, automatic by default
# -r 48000 = sample rate, automatic by default
alsa_in # can automatically detect and open an available soundcard (what type? doesn't work for usb mic)
arecord -l
...
card 2: AK5370 [AK5370], device 0: USB Audio [USB Audio]
  Subdevices: 0/1
  Subdevice #0: subdevice #0

alsa_in -dhw:2 -jusb-mic
# or
alsa_in -dhw:AK5370 -j "USB Mic"
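The step above can be scripted: extract the card number from `arecord -l` output and build the alsa_in invocation from it. This is only a sketch; a captured sample line is inlined so the pipeline is reproducible, and the command is echoed rather than run:

```shell
# Sample line as printed by `arecord -l` for the USB mic above:
line='card 2: AK5370 [AK5370], device 0: USB Audio [USB Audio]'
# Pull the card number off the front of the line:
card=$(printf '%s\n' "$line" | sed -n 's/^card \([0-9]*\):.*/\1/p')
echo "alsa_in -dhw:$card -jusb-mic"  # the command you would then run
# prints: alsa_in -dhw:2 -jusb-mic
```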
alsa_out -j "Description" -d prefix:Name -q 1 2>&1 1> /dev/null & # used to send JACK output to an ALSA device, like a speaker or headphones
If you get "Capture open error: Device or resource busy", some other program has control of the playback interface.
To see what application has control of the interface:
fuser -u /dev/snd/pcmC0D0p # this is card 0, device 0, pcm playback
If it's pulseaudio, launch pavucontrol, go to the Configuration tab and select Off for the device(s).
- https://github.com/IARI/alsa_jack_gui - qt-based gui to manage alsa_jack bridges
Zita-ajbridge
- Zita-ajbridge - provides two applications, zita-a2j and zita-j2a. They allow using an ALSA device as a Jack client, to provide additional capture (a2j) or playback (j2a) channels. Functionally these are equivalent to the alsa_in and alsa_out clients that come with Jack, but they provide much better audio quality. The resampling ratio will typically be stable within 1 PPM and change only very smoothly. Delay will be stable as well, even under worst-case conditions, e.g. the Jack client running near the end of the cycle.
cat /proc/asound/cards
zita-a2j -dhw:3,0 -jwebcam
Jack Std-I/O
- jackstdio - jack-stdout writes JACK audio-sample data to buffered standard output. jack-stdin reads raw audio data from standard-input and writes it to a JACK audio port.
- https://github.com/x42/jack-stdio - unix pipe audio-data from and to JACK
Configuration GUI
QjackCtl
- QjackCtl - JACK Audio Connection Kit - Qt GUI Interface
QjackCtl holds its settings and configuration state per user, in a file located at $HOME/.config/rncbc.org/QjackCtl.conf. Normally there's no need to edit this file, as it is recreated and rewritten every time qjackctl is run.
- D-Bus control and Jack 2 D-Bus control
- Connection and JACK Session manager
- https://github.com/kmatheussen/qjackctl_macos - Scripts to build qjackctl for macos
- https://github.com/cybercatalyst/jackcontrol - older qt4 fork
Cadence
- Cadence - a set of tools useful for audio production. Cadence itself is also an application (the main one), which this page will document. There are other applications that are part of the Cadence suite, usually referred to as the "Cadence tools". They are: Catarina (simple patching), Catia (patching), Claudia (LADISH)
- Cadence - controls and monitors various Linux sound systems as well as audio-related system settings
cadence --minimized &
Studio Controls
- https://github.com/ovenwerks/studio-controls - A helper for setting up a system for audio work. Formerly known as Ubuntu Studio Controls.
Jack Control Panel
- https://gitlab.com/IGBC/jackctl - A no fuss solution to wrangling Pro-Audio on Linux:
Utils
- https://github.com/jackaudio/jack-example-tools - Official examples and tools from the JACK project
- rd: rju - jackd utilities:
- jack-data: jack audio data onto osc (May 2016)
- jack-dl: load dsp algorithms from shared libraries (October 2008)
- jack-level: jack cli level meter (April 2019)
- jack-lxvst: jack cli host for linux vst instruments (April 2016)
- jack-osc: jack <-> open sound control daemon (January 2004)
- jack-play: resampling soundfile playback (November 2003)
- jack-plumbing: plumbing daemon (July 2003) rju-plumbing.md - JACK Plumbing Daemon
- jack-record: soundfile recording (April 2004)
- jack-scope: plain X oscilloscope (January 2004)
- jack-transport: minimalist ncurses jack transport (November 2006)
- jack-udp: jack over udp client (November 2003)
- https://github.com/7890/jack_tools - alternative jack_* helpers
- https://github.com/Gimmeapill/xruncounter - Small linux tool written in C by Hermann Meyer (aka @brummer10) to measure jack xruns and evaluate the overall performance of a system for realtime audio.
- https://github.com/SpotlightKid/jack-audio-tools - A collection of utilities and tools for the JACK audio ecosystem
- https://github.com/be1/jackie - a small graphical jackd launcher
- https://github.com/falkTX/wineasio - provides an ASIO to JACK driver for WINE. ASIO is the most common Windows low-latency driver, so is commonly used in audio workstation programs.
- https://github.com/kmatheussen/KillJack - The Jack server sometimes crashes/freezes in ways which make users restart the computer in order to use Jack again. This program kills Jack unconditionally. Executables for Linux, MacOS, and Windows are provided. I strongly recommend that all programs using Jack include a program like this.
- https://github.com/guysherman/jack-passthrough - jack client with a known name that can be used to play nice with some apps that work with JACK in a weird way
- https://github.com/dagargo/overwitch - JACK client for Overbridge devices
Session management
- A brief survey of Linux audio session managers - January 2013
- LinuxMusicians: Re: Non-stuff in KXStudio - April 2013, a comparison of session managers
- LinuxMusicians: Re: jack_session - August 2010, LASH vs LADISH
- Linux Synth Notes: robust_session_management
- Linux Synth Notes: concurrent_patch_management
Use something NSM based! Argodejo or RaySession.
LASH Audio Session Handler (/ LADCCA)
- LASH - Trac - a session management system for JACK and ALSA audio applications on GNU/Linux. It is an implementation of a proposal that originated from this discussion. Its aim is to allow you to have many different audio programs running at once, to save their setup, close them down and then easily reload the setup at some other time. LASH doesn't deal with any kind of audio data itself; it just runs programs, deals with saving/loading (arbitrary) data and connects different kinds of virtual audio ports together (currently JACK and ALSA sequencer ports). It can also be used to move entire sessions between computers, or post sessions on the Internet for download.
- LASH - a session management system for GNU/Linux audio applications. It allows you to save and restore audio sessions consisting of multiple interconnected applications, restoring program state (i.e. loaded patches) and the connections between them.
Dead. Inflexible and underused.
- GLASHCtl - a simple applet for controlling the LASH Audio Session Handler. When you run it it will appear as a small LASH icon in your "notification area" or "system tray".
Formerly:
- LADCCA - session management system for JACK audio and ALSA MIDI applications on GNU/Linux. LADCCA's aim is to allow you to have many different audio programs running at once, to save their setup, close them down and then reload the setup at some other time. LADCCA doesn't deal with any kind of audio data itself; it just runs programs, deals with saving/loading data and connects different kinds of virtual audio ports together (currently JACK and ALSA sequencer ports). LADCCA's name has changed and development continues as the LASH project.
- Linux Audio Developer's Configuration and Connection API - Table of Contents - Audio application session management and configuration
Became LASH.
LADISH
- ladish - LADI Session Handler or simply ladish is a session management system for JACK applications on GNU/Linux using Dbus. Its aim is to allow you to have many different audio programs running at once, to save their setup, close them down and then easily reload the setup at some other time. ladish doesn't deal with any kind of audio or MIDI data itself; it just runs programs, deals with saving/loading (arbitrary) data and connects JACK ports together. It can also be used to move entire sessions between computers, or post sessions on the Internet for download.
- https://github.com/LADI/ladish - LADI Session Handler, a rewrite of LASH. A session management system for JACK applications on GNU/Linux. Its aim is to allow you to have many different audio programs running at once, to save their setup, close them down and then easily reload the setup at some other time. ladish doesn't deal with any kind of audio or MIDI data itself; it just runs programs, deals with saving/loading (arbitrary) data and connects JACK ports together. It can also be used to move entire sessions between computers, or post sessions on the Internet for download. ladish has a GUI frontend, gladish, based on lpatchage (LADI Patchage), and the ladish_control command-line app for headless operation. LADI Tools is a set of apps that interface with ladish, the JACK server and a2jmidid.
- https://github.com/alessio/ladish/
LADI Tools
- https://github.com/alessio/laditools - LADI Tools, forked from LADI/laditools, is a set of tools aiming to achieve the goals of the LADI project: to improve desktop integration and user workflow of a Linux audio system based on JACK and LADISH. These tools take advantage of the D-Bus interfaces of JACK2 and LADISH to ease the configuration and use of your software studio.
In the near future, it should also be possible to use laditools to control JACK through an OSC interface.
You will find in this suite:
- laditools - python module
- ladi-system-tray - a system tray icon that allows you to start, stop and monitor JACK, as well as start some JACK related apps (log viewer, connections...)
- wmladi - a controller as a Window Maker dockapp which uses a menu similar to ladi-system-tray's
- ladi-system-log - a JACK, LADISH and a2jmidid log viewer
- ladi-control-center - a GUI to setup JACK's and laditools' configuration
- ladi-player - compact front-end that allows users to start, stop and monitor a LADI system.
- g15ladi - a JACK monitor for g15 keyboards
Claudia
- Claudia - a LADISH frontend; it's just like Catia, but focused at session management through LADISH.
- jack2 (dbus)
- Claudia-Launcher is a multimedia application launcher with LADISH support. It searches for installed packages (not binaries) and displays the respective content as a launcher. The content is obtained through a hardcoded database, created and/or modified to suit the target distribution.
- https://repo.or.cz/klaudia.git - formerly
JACK Session
- originally jack 1 (not D-Bus)
- Saving a session will save the state of all 'JACK Session'-supported apps plus their JACK connections
- Opening a session will automatically launch those apps, restoring their state and JACK connections
- Supported apps can be told to save/load their state to/from a specific location
- https://github.com/torbenh3/pyjacksm - a simple sessionmanager for the jack-session protocol. jacksmtray should start the tray app, which controls the sessionmanager daemon.
- https://github.com/fps/js_wrap - A simple wrapper for non jack_session enabled apps whose state can be fully qualified by the cmdline to run them
New Session Manager (NSM)
- New Session Manager - a tool to assist music production by grouping standalone programs into sessions. Your workflow becomes easy to manage, robust and fast by leveraging the full potential of cooperative applications. It is a community version of the "NON Session Manager" and free in every sense of the word: free of cost, free to share and use, free of spyware or ads, free-and-open-source. You can create a session, or project, add programs to it and then use commands to save, start/stop, hide/show all programs at once, or individually. At a later date you can then re-open the session and continue where you left off. All files belonging to the session will be saved in the same directory.
- Non Session Manager - an [older] graphical interface to the NSM Daemon (nsmd). By default, running the command non-session-manager will start both the GUI and an instance of the daemon. NSM manages clients together in a session. NSM doesn't know or care what Window Manager or audio subsystem those clients use--nor should it. Specific clients must be written to persist these environmental factors, and added to sessions when required.
For saving and restoring the JACK connection graph, a simple headless client named jackpatch has been developed and included in the NSM distribution. Simply add jackpatch to your basic template session and all the sessions you base on it will have their JACK connection graphs automatically saved and restored.
- http://non.tuxfamily.org/wiki/nsm-proxy - a simple NSM client for wrapping non-NSM capable programs. It enables the use of programs supporting LADISH Level 0 and 1, and programs which accept their configuration via command-line arguments.
- https://github.com/vktec/njsm - bridges Non Session Manager and JACK Session. This allows programs that support JACK Session (say, jalv) to run inside nsm-proxy and have their data saved using njsm.
- https://github.com/rhetr/nsm-scripts - various scripts to supplement non session manager (NSM) usage.
- https://github.com/rhetr/nsm-git - makes git a little easier to use with non session manager sessions. creates a git repository in the current session and commits all untracked and unstaged files to it whenever save is pressed. nsm-git also reads the session.nsm file and deletes any saved applications that are not listed in the session. This program is meant to be executed within NSM.
- https://github.com/diovudau/pynsm2 - Non Session Manager client library in Python - Version2: No dependencies except Python3.
- lss/pynsm PyNSMClient - A New Session Manager Client-Library in one file.
- https://github.com/newlaurent62/rayZ-builder - Tool to create wizards and fill ray session templates
new-session-manager
- https://github.com/linuxaudio/new-session-manager - a tool to assist music production by grouping standalone programs into sessions. Your workflow becomes easy to manage, robust and fast by leveraging the full potential of cooperative applications. It is a community version of the "NON Session Manager" and free in every sense of the word: free of cost, free to share and use, free of spyware or ads, free-and-open-source.
RaySession
- https://github.com/Houston4444/RaySession - a GNU/Linux session manager for audio programs such as Ardour, Carla, QTractor, Non-Timeline, etc. It uses the same API as Non Session Manager, so programs compatible with NSM are also compatible with RaySession. As with Non Session Manager, the principle is to load audio programs together, then be able to save or close all documents together.
Argodejo
- Argodejo - a music production session manager. It is used to start your programs, remember their (JACK) interconnections and make your life easier in general. You can seamlessly change between two view modes to quickly start a few programs or have complete control and a detailed overview. Argodejo does not re-invent the wheel but instead uses the New-Session-Manager daemon and enhances it with some tricks of its own, which always remain 100% compatible with the original sessions. This is a proof-of-concept version. It aims to show that session management with NSM can be quick and convenient and make the user feel in control. Some functionality has not yet been implemented, most prominently anything related to NSM over network. There is always the possibility of breaking things when trying out corner cases and hacks. That said, for single-computer sessions with just one daemon and one GUI at a time, Argodejo should provide a good user experience.
Stagepatch
- https://github.com/ViktorNova/stagepatch - nsm-git fork. Persistent audio and MIDI patchbay daemon for NSM, which auto-connects remembered devices when they reappear. Based on aj-snapshot
Gonzo
- https://github.com/scgolang/gonzo - Command line nsm server. I started this project out of frustration with trying to write nsm clients.
- https://github.com/scgolang/nsm - Non session manager OSC protocol implemented in Go
MonoMultiJack
- MonoMultiJack - a program for managing, starting and stopping Jackd and music programs. Another feature is connecting and disconnecting JACK audio and MIDI ports, as well as ALSA MIDI ports. It is programmed in Mono, using GTK# as its GUI toolkit.
JackLinx
- JackLinx - Simple Session Manager for the Music Classroom.
Preselected applications.
chino
- chino - a 'special-purpose session manager', requiring customisation to cover one or more similar setups. Once customised, using it is dead simple. Perhaps it is best to not overstress the term "session management", instead describing chino as a framework and toolset to build and manage a meta-application consisting of the user's favorite modular Jack audio and Midi tools, each started and interconnected in predefined ways.
chino -n newproject       # start newproject in current directory
chino -o existingproject  # open existingproject
Broken? Depends on listlib, which depends on anch, which depends on tml, which depends on flex. None of these were in the AUR, and then tml didn't build.
Routing
Catia
- Catia - a simple JACK patchbay, with some neat features like A2J bridge support and JACK Transport. It was initially part of the Cadence project, but now lives on its own.
Patchage
- Patchage is a modular patch bay for audio and MIDI systems based on JACK and ALSA.
Patchmatrix
- https://github.com/OpenMusicKontrollers/patchmatrix - PatchMatrix gives the best user experience with JACK1, as it makes intensive use of JACK's metadata API, which JACK2 still lacks an implementation of.
QJackConnect
- QJackConnect - a Qt-based patchbay for the JACK Audio Connection Kit.
Patchichi
- https://github.com/Houston4444/Patchichi - an abstract JACK patchbay GUI for GNU/Linux systems, but it could be adapted to Mac and Windows with little effort. The software it most closely resembles is probably Catarina, from the Cadence suite.
rofi-jack
- https://github.com/madskjeldgaard/rofi-jack - Keyboard centric jack audio management using the rofi app launcher
jsweeper
- jsweeper will be a programmable port connection manager for ALSA sequencer and JACK audio and midi ports. Ports are laid out in a matrix so that connecting or disconnecting a port or a group of ports is just one mouse click or keypress.
njconnect
- njconnect - Curses Jack connection manager
CliConnect
- CliConnect is a minimal terminal based JACK connection manager. Why is that useful? For using over SSH mostly.
esjit
- esjit - a text-mode JACK audio connection manager. No longer in the AUR.
qpwgraph
- https://gitlab.freedesktop.org/rncbc/qpwgraph - PipeWire Graph Qt GUI Interface
Routing snapshots
aj-snapshot
- aj-snapshot - a small program that can be used to make snapshots of the connections made between JACK and/or ALSA clients. Because JACK can provide both audio and MIDI support to programs, aj-snapshot can store both types of connections for JACK. ALSA, on the other hand, only provides routing facilities for MIDI clients. You can also run aj-snapshot in daemon mode if you want to have your connections continually restored.
aj-snapshot filename       # make a snapshot
aj-snapshot -r filename    # restore a snapshot
aj-snapshot -d filename &  # run in daemon mode
- Robust Session Management - QJackCtl and Patchage for setup, diagnostics, and testing; aj-snapshot for management of JACK and ALSA MIDI connections; the D-Bus version of JACK2
autocable
- https://github.com/resinbeard/autocable - A tiny C application that loads a text file and routes Jack audio connections for you.
echo "connect system:capture_1 system:playback_1
> disconnect system:capture_2 system:playback_2" | ./autocable
./autocable yourdirectory/textfile.ac
- Beginning a GNU/Linux/JACK headless performance system - with autocable and qjackctl for visual demonstration
JMess
- https://github.com/jcacerec/jmess-jack - JMess - a utility to save your audio connections (mess). JMess can save an XML file with all the current JACK audio connections. This same file can be loaded to connect everything again. The XML file can also be edited. It also has the option to disconnect all the clients.
jmess -s filename.xml     # save
jmess -c filename.xml     # load
jmess -d -c filename.xml  # disconnect all, then load
jmess -d                  # disconnect all
jack_snapshot
- jack_snapshot - a little tool for storing/restoring JACK connection states. It does this by writing/reading the names of the connected ports to/from a simple text file. Herein also lies one weakness: some JACK clients don't use the same client name on each run, but dynamically assign one (like meterbridge). Most of them can be told to use a specific name, though, so this isn't really a problem; at least not for the author. Some pattern matching might be added in the future.
jack-plumbing
- jack-plumbing maintains a set of port connection rules and manages these as clients register ports with JACK. Port names are implicitly bounded regular expressions and support sub-expression patterns.
jack-matchmaker
- https://github.com/SpotlightKid/jack-matchmaker - a small command line utility that listens to JACK port registrations by clients and connects them when they match one of the port pattern pairs given on the command line at startup. jack-matchmaker never disconnects any ports. The port name patterns are specified as pairs of positional arguments or read from a file (see below) and are interpreted as Python regular expressions.
jack_autoconnect
- https://github.com/kripton/jack_autoconnect - Tiny application that reacts on port registrations by clients and connects them. The port names are interpreted as regular expressions and more than one pair can be defined upon calling.
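The pattern-pair matching that jack-matchmaker and jack_autoconnect perform can be sketched in a few lines. This is a simplified illustration with hypothetical port names, not either project's actual code; whether matching is anchored or uses search semantics differs per tool, and this sketch uses `re.search`:

```python
import re

def match_connections(outputs, inputs, pairs):
    """Return (output, input) port pairs whose names match a regex pattern pair."""
    conns = []
    for out_pat, in_pat in pairs:
        for o in outputs:
            if re.search(out_pat, o):
                for i in inputs:
                    if re.search(in_pat, i):
                        conns.append((o, i))
    return conns

# Simulated JACK port lists (hypothetical names).
outputs = ["system:capture_1", "mysynth:out_l"]
inputs = ["system:playback_1", "ardour:in_1"]
pairs = [(r"^mysynth:out_", r"^ardour:in_")]
print(match_connections(outputs, inputs, pairs))  # [('mysynth:out_l', 'ardour:in_1')]
```

A real tool would then issue the corresponding `jack_connect` calls for each returned pair.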
Jack Sanity
- Jack Sanity - A scriptable environment using JavaScript for controlling jackdbus clients.
jacklistener etc.
patchy
- https://github.com/dedelala/patchy - store and recall jack audio port connections. Written in Go.
ASTRUX
- ASTRUX - A setup creation tool for live-oriented musicians (under active development by Raphaël Mouneyres)
Timing
JACK Transport / Timebase
- JACK Transport Design - The JACK Audio Connection Kit provides simple transport interfaces for starting, stopping and repositioning a set of clients. This document describes the overall design of these interfaces, their detailed specifications are in <jack/transport.h>
Timebase master relates to the musical/metronomic information being delivered to other JACK clients (tempo, bar, beat, ticks, time-sig. etc.).
Transport modes relate to linear playback location sync and state (stopped vs. rolling). [25]
jack_transport> ?
  activate    Call jack_activate().
  deactivate  Call jack_deactivate().
  exit        Exit transport program.
  help        Display help text [<command>].
  locate      Locate to frame <position>.
  master      Become timebase master [<conditionally>].
  play        Start transport rolling.
  quit        Synonym for `exit'.
  release     Release timebase.
  stop        Stop transport.
  tempo       Set beat tempo <beats_per_min>.
  timeout     Set sync timeout in <seconds>.
  ?           Synonym for `help'.
echo play | jack_transport  # pass a command to execute
# tempo change doesn't work via this method
- timebase.py - Query and manipulate JACK transport state and provide timebase information using jackclient-python
- JackDirector - a Linux app that lets you control Jack Audio Connection Kit's transport play/pause using midi commands (noteon) and let you assign bpm changes and other commands to midi program changes. This program plays a metronome thru 2 audio outputs exposed in Jack.
- gjacktransport - a standalone application that provides access to the JACK transport mechanism via a dynamic graphical slider. In other words: this software allows seeking in audio/video media files when they are played along to JACK transport. Intended for audio engineers or A/V editors who work with Ardour, Ecasound, Hydrogen and/or Xjadeo. Additionally it provides 'gjackclock', a "Big Clock" display for JACK transport.
- cabestan is a small GTK+ program that interfaces with the jack audio connection kit to play, rewind, or fast forward the stream via the jack transport interface.
- jack-transport is a minimalist Jack transport control interface using ncurses. It displays the transport state and current time, and provides standard operating keys.
- QJackMMC - a Qt based program that can connect to a device or program that emits MIDI Machine Control (MMC) and allow it to drive JACK transport, which in turn can control other programs. JackCtlMMC is a slightly simpler command-line version of QJackMMC.
- https://gitlab.com/Largos/jack-transport-for-plasma - A Python application with a QML GUI for using with Jack Transport. The GUI integrates with Plasma and uses the theming.
- https://github.com/ycollet/qtmiditrans - A Jack midi filter which translates midi events into jack transport (stop / play)
- jack-osc - publishes the transport state of the local JACK server as OSC packets over a UDP connection. jack-osc allows any OSC enabled application to act as a JACK transport client, receiving sample accurate pulse stream timing data, and monitoring and initiating transport state change.
- InConcert - a MIDI-controlled application that allows a musician to control the tempo and synchronization of a MIDI sequence. It features a tap tempo to adjust the beat (and synchronize the beat) and the ability to skip beats or insert beats. It works by controlling the Jack Audio Connection Kit's transport. InConcert depends on Jack and ALSA, and therefore only runs on Linux.
Doesn't work??
- TapStart - measures a tempo you tap. It sends OSC messages with the tempo or delay to customizable hosts and paths, updates the JACK tempo on each click (= new averaged tempo), and can start the JACK transport after tapping a defined number of beats.
- jack-trans2midi - a utility that converts jack transport into midi clock messages
- https://github.com/harryhaaren/AutoMate - An automation editor, which uses (or will use) JACK MIDI output and JACK Transport to sync to the beat.
- jack-file - Jack transport-centric utilities for audio playback
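The tap-tempo averaging that tools like TapStart and InConcert perform is simple to state: average the intervals between taps and invert. A minimal sketch (not any project's actual algorithm, which may weight recent taps or discard outliers):

```python
def tapped_bpm(tap_times):
    """Average BPM from a list of tap timestamps in seconds.
    Returns None until at least two taps have been recorded."""
    if len(tap_times) < 2:
        return None
    intervals = [b - a for a, b in zip(tap_times, tap_times[1:])]
    return 60.0 / (sum(intervals) / len(intervals))

print(tapped_bpm([0.0, 0.5, 1.0, 1.5]))  # -> 120.0
```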
Ableton Link
- Ableton Link - a technology that keeps devices in time over a local network, so you can forget the hassle of setting up and focus on playing music. Link is now part of Live, and also comes as a built-in feature of other software and hardware for music making.
- https://github.com/Ableton/link - codebase for Ableton Link, a technology that synchronizes musical beat, tempo, and phase across multiple applications running on one or more devices. Applications on devices connected to a local network discover each other automatically and form a musical session in which each participant can perform independently: anyone can start or stop while still staying in time. Anyone can change the tempo, the others will follow. Anyone can join or leave without disrupting the session.
- https://github.com/ak5k/reablink - REAPER plug-in extension providing ReaScript bindings for Ableton Link session, and Ableton Link Test Plan compliant implementations for REAPER.
- https://github.com/rncbc/jack_link - a JACK transport timebase prototype bridge to Ableton Link.
- https://github.com/x37v/jack_transport_link - A service that bridges Ableton's Link to and from Jack Transport, allowing applications that use Jack Transport to synchronize their timing with other applications that support Link.
- https://github.com/falkTX/Hylia - Host transport library for Ableton Link
- https://github.com/Deep-Symmetry/carabiner - A loose connector for interacting with Ableton Link. Carabiner is a program that embeds the C++ Link library and listens for local TCP connections to allow other programs, like beat-link-trigger and Afterglow, to participate in some aspects of a Link session, even though they were not written using C++ compatible languages and runtimes.
- https://github.com/gonzaloflirt/link-python - Python wrapper for Ableton Link
- https://github.com/magdaddy/ableton-link-rs - Rust bindings for Ableton Link
- https://github.com/ianacaburian/AbletonLink_JuceSampler - Simple tutorial on how to build JUCE projects with tempo synchronization by Ableton Link
- https://github.com/libpd/abl_link - Ableton Link integration for Pure Data on desktop and Android.
- https://github.com/2bbb/node-abletonlink - node.js port of ableton Link with node-addon-api
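The core idea behind Link, keeping independent clients aligned on a shared beat grid, can be illustrated with a small phase calculation. This is a conceptual sketch, not the library's API: each peer knows the session tempo and its current (fractional) beat position, and can derive when the next whole beat falls:

```python
import math

def seconds_to_next_beat(beat_position, bpm):
    """Time in seconds until the next whole beat, given the current
    fractional position on the shared beat grid and the session tempo."""
    next_beat = math.floor(beat_position) + 1
    return (next_beat - beat_position) * 60.0 / bpm

print(seconds_to_next_beat(1.5, 120))  # -> 0.25
```

A client that wants to start "on the one" would schedule its start for such a boundary rather than starting immediately.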
Pioneer DJ Link
- https://github.com/g-zi/CDJ_Clock - the missing link between Pioneer's Pro DJ Link and Ableton Live. CDJ Clock generates MIDI beat clock from Pioneer's Pro DJ Link. With CDJ Clock, anything that understands MIDI beat clock can be synced to Pioneer CDJs.
- https://github.com/Deep-Symmetry/beat-link - A Java library for synchronizing with beats from Pioneer DJ Link equipment, and finding out details about the tracks that are playing.
- https://github.com/Deep-Symmetry/beat-link-trigger - Trigger events and automate shows in response to events on Pioneer CDJs
- https://github.com/Deep-Symmetry/open-beat-control - Provides a subset of beat-link features over Open Sound Control.
- https://github.com/Deep-Symmetry/beat-carabiner - A minimal tempo bridge between Pioneer Pro DJ Link and Ableton Link.
- https://github.com/Deep-Symmetry/dysentery - Exploring ways to participate in a Pioneer Pro DJ Link network.
Calculation
- Qrest is a musician toolkit aimed at helping composers, performers, recordists and mixers: find out the tempo of a musical piece, calculate delay times, calculate LFO frequencies (i.e., timing conversions)
- RASP - "RASP Aids Song Production" is a set of utilities for song production, supplementing functions missing in some DAWs. Features: Tap Tempo, Delay/Hz Calculator, Song Time Calculator, Note-to-Frequency Conversion, Simple Frequency Generator (v2), Metronome (v2)
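The timing conversions these calculators perform are simple arithmetic. A sketch of the two most common ones, tempo-synced delay time and a bar-length LFO (the 4/4 default is an assumption):

```python
def quarter_note_delay_ms(bpm):
    """Delay time of one quarter note in milliseconds."""
    return 60000.0 / bpm

def lfo_hz_per_bar(bpm, beats_per_bar=4):
    """LFO frequency whose cycle lasts exactly one bar."""
    return bpm / 60.0 / beats_per_bar

print(quarter_note_delay_ms(120))  # -> 500.0 (ms)
print(lfo_hz_per_bar(120))         # -> 0.5 (Hz, one cycle per 4/4 bar)
```

Halve or double the delay value for eighth and half notes, multiply by 1.5 for dotted values.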
Metronomes
kmetronome
- Drumstick Metronome (kmetronome) is a MIDI based metronome using the ALSA sequencer. Intended for musicians and music students, it is a tool to keep the rhythm while playing musical instruments.
- No decimal BPM, not MIDI driven
ametro
- ametro - a little, simple MIDI Metronome using the ALSA sequencer.
- https://github.com/rabramley/linux_midi_commands
- No decimal BPM
klick
- klick is an advanced command-line based metronome for JACK. It allows you to define complex tempo maps for entire songs or performances.
Connects to JACK transport but isn't driven by it? A BPM argument is required and doesn't change when a transport master runs.
- gtklick - a GTK frontend to klick. It's written in Python and communicates with klick via OSC.
klick -o 12345 60 &
gtklick -q osc.udp://localhost:12345
- https://github.com/jean-emmanuel/kleek - Simple klick cli wrapper to setup training patterns faster
- https://github.com/sonejostudios/klick2wav - a GUI for the export function of Klick.
Polygnome
- Polygnome - A polyrhythmic metronome in GTK+. Supports ALSA and JACK audio backends.
- https://gitlab.com/tmatth/polygnome
- Audio only, no MIDI
GTick
- GTick - an audio metronome application written for GNU/Linux and other UN*X-like operating systems, supporting different meters (even, 2/4, 3/4, 4/4 and more) and speeds ranging from 10 to 1000 bpm. It utilizes GTK+ and OSS (ALSA compatible).
- https://github.com/yoyonel/gtick
- Audio only, no MIDI
- https://github.com/Barabas5532/gtick-guitar - GNU gtick metronome with quick tempo increment control
Hubcap
- Hubcap - a fairly simple metronome *nix app with a tempo fader and both auditory and visual feedback on a beat.
- Audio only, no MIDI
Accelerando
- Accelerando - a musical metronome that can speed up, allowing you to practice your music at progressively faster tempos. For example, you could set it to play 60 beats per minute for 4 bars, then automatically speed up by 10 beats per minute, and so on. It runs on Unix.
- https://github.com/bcrowell/accelerando
- Audio only, no MIDI
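The progressive speed-up Accelerando describes is a step function of the bar number. A sketch using the example figures from the description above (60 BPM start, +10 BPM every 4 bars):

```python
def accelerando_bpm(bar, start_bpm=60, increment=10, bars_per_step=4):
    """Tempo for a given 1-based bar when the tempo steps up by
    `increment` BPM every `bars_per_step` bars."""
    return start_bpm + increment * ((bar - 1) // bars_per_step)

print([accelerando_bpm(b) for b in (1, 4, 5, 9)])  # -> [60, 60, 70, 80]
```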
jmetro
- https://github.com/jmage619/jmetro - A dumb linux based Jack metronome with Qt based UI
midiclick
- midiclick - generates a metronome click-track on MIDI channel 9
- http://www.pjb.com.au/midi/free/midiclick
- ALSA MIDI, no audio
ctronome
- ctronome - a very simple yet powerful ;) programmable console metronome software.
- OSS Audio only, no MIDI
Click Tracker
- Click Tracker - a program designed for composers, conductors and instrumentalists working with modern music. The main goal of the software is to prepare a click track of any score, no matter how complex it is. This software runs in Windows, OSX and Linux under the open source program Pure Data, and can be used either by conductors in concert, by musicians for practice purposes, by composers while composing.
clicktrack
- https://github.com/schollz/clicktrack - Generate a click track from pretty much any computer.
Metronome
- https://github.com/witte/Metronome - A simple metronome made with Qt and Juce
Visual metronomes
JVMetro
- JVMetro - provides a colorful, realtime visual indication of the passage of bars and beats on the Jack transport--without generating any sound of its own.
vimebac
- https://gitlab.com/smondet/vimebac - graphical metronome and instructions display that interfaces with JACK-midi applications. The display can be completely driven by MIDI events and it can also send MIDI events. It can also be self-driven and hence run without jackd although this is somewhat less interesting since it becomes just a visual metronome.
BeatViz
- https://github.com/kunstmusik/BeatViz - shows a 4x4 grid that represents additive groupings of beats. (Beat here meaning a single atomic tick, equal to a 16th note within the author's Csound Live Code system.) UDP controlled.
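Additive grouping means dividing a run of equal ticks into unequal accent groups (e.g. 3+3+2 instead of 4+4). A sketch of expanding such groupings into per-tick accent flags, in the spirit of what a grid display like BeatViz visualizes (this is an illustration, not the project's code):

```python
def accents_from_groups(groups):
    """Expand additive groupings (e.g. [3, 3, 2]) into per-tick accent
    flags: True on the first tick of each group, False elsewhere."""
    flags = []
    for g in groups:
        flags += [True] + [False] * (g - 1)
    return flags

print(accents_from_groups([3, 3, 2]))
# -> [True, False, False, True, False, False, True, False]
```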
Nevena's Metronome
- Nevena's Metronome - a metronome program with Qt GUI. Besides just being beautiful, and working under X11, Windows, and Mac, it has some advanced features, for example it can count to you or act as a stroboscope.
ticker
- https://github.com/medakk/ticker - An ncurses based Visual Metronome designed in C++
Web metronomes
- BestMetronome.com - web and mobile app metronomes
- Chrome Web Store: Dr. Beat - developed as a part of the HackTime (http://goo.gl/SscNs) project from GDG Chrome Korea. It's a metro style metronome app. It helps you to keep the beats.
Windows
- Open Metronome - Windows only. User definable BPM; Measure can be set to any length, with emphasis on any beat(s); Each beat can be one or more of over forty voices, with the supplied Samples covering the complete General MIDI percussion set, or custom samples; Visual indicator as well as audible output;
- based on http://www.weirdmetronome.com/
- http://bouncemetronome.com - Windows/Wine- $
Networked
See also Streaming#Audio
- https://en.wikipedia.org/wiki/Audio_over_Ethernet - the use of an Ethernet-based network to distribute real-time digital audio. AoE replaces bulky snake cables or audio-specific installed low-voltage wiring with standard network structured cabling in a facility. AoE provides a reliable backbone for any audio application, such as for large-scale sound reinforcement in stadiums, airports and convention centers, multiple studios or stages.
While AoE bears a resemblance to voice over IP (VoIP) and audio over IP (AoIP), AoE is intended for high-fidelity, low-latency professional audio. Because of the fidelity and latency constraints, AoE systems generally do not utilize audio data compression. AoE systems use a much higher bit rate (typically 1 Mbit/s per channel) and much lower latency (typically less than 10 milliseconds) than VoIP. AoE requires a high-performance network. Performance requirements may be met through use of a dedicated local area network (LAN) or virtual LAN (VLAN), overprovisioning or quality of service features. Some AoE systems use proprietary protocols (at the higher OSI layers) which create Ethernet frames that are transmitted directly onto the Ethernet (layer 2) for efficiency and reduced overhead. The word clock may be provided by broadcast packets.
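The "typically 1 Mbit/s per channel" figure follows directly from uncompressed PCM arithmetic; with common pro-audio parameters (48 kHz, 24-bit, an assumption for this sketch) the raw payload per channel is:

```python
def channel_bitrate_mbps(sample_rate=48000, bit_depth=24):
    """Raw per-channel bitrate of uncompressed PCM audio in Mbit/s
    (payload only, excluding packet and clocking overhead)."""
    return sample_rate * bit_depth / 1e6

print(channel_bitrate_mbps())  # -> 1.152
```

Packetization and transport overhead push the on-wire rate somewhat higher, which is why AoE systems assume a dedicated or well-provisioned network.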
See also Networking#ISDN
mLAN
- https://en.wikipedia.org/wiki/mLAN - short for Music Local Area Network, is a transport level protocol for synchronized transmission and management of multi-channel digital audio, video, control signals and multi-port MIDI over a network. The mLAN protocol was originally developed by Yamaha Corporation, and publicly introduced in January 2000. It was available under a royalty-free license to anyone interested in utilizing the technology. mLAN exploits several features of the IEEE 1394 (FireWire) standard such as isochronous transfer and intelligent connection management. There are two versions of the mLAN protocol. Version 1 requires S200 rate, while Version 2 requires S400 rate and supports synchronized streaming of digital audio at up to 24 bit word length and 192 kHz sample rate, MIDI and wordclock at a bitrate up to 400 Megabits per second. As of early 2008, mLAN appeared to have reached the end of its product life.
- mLAN Central -"mLAN FireWire Music Networking is the enabling technology for creating an intelligent, managed local area music network using FireWire. mLAN not only carries multi-channel digital audio and MIDI over 1394 FireWire, it includes the connection management so you can easily manage your entire network."
AES67
- https://en.wikipedia.org/wiki/AES67 - a technical standard for audio over IP and audio over ethernet interoperability. The standard was developed by the Audio Engineering Society and first published in September 2013. It is a layer 3 protocol suite based on existing standards and is designed to allow interoperability between various IP-based audio networking systems such as RAVENNA, Livewire, Q-LAN and Dante. It also provides interoperability with layer 2 technologies, like Audio Video Bridging (AVB). AES67 promises interoperability between previously competing networked audio systems and long-term network interoperation between systems. Since its publication, AES67 has been implemented independently by several manufacturers and adopted by many others.
- https://github.com/bondagit/aes67-linux-daemon - with configuration WebUI
- Merging Technologies - Alsa Ravenna Aes67 Driver - an ALSA Linux driver designed to provide high performance RAVENNA/AES67 support for the Linux ecosystem. Merging is strongly committed to fostering the adoption of AES67 networking capability by making available a simple-to-integrate AES67 ALSA Linux driver with all required functionalities for Linux-based OEMs to take advantage of this rapidly evolving market.
- https://github.com/voc/aes67-recorder - A Linux/GStreamer-Based AES67 Multitrack Audio Backup Solution
- VB-Audio Network - VBAN
- https://github.com/quiniouben/vban - VBAN protocol open-source implementation
AVB
- https://github.com/christophe-calmejane/Hive - a pro audio Avdecc (IEEE Std 1722.1) controller. Hive allows you to inspect, configure and connect AVB Entities on your network, specifically targeting AVnu Milan compatible devices (but not only).
JACK
Netjack
- Netjack - a realtime audio transport over a generic IP network, fully integrated into JACK. It syncs all clients to one soundcard, so there is no resampling and there are no glitches in the whole network. Packet loss is now also handled gracefully. By using the CELT codec, it is even possible for single packet losses to be masked by the packet loss concealment code.
- https://github.com/elcorto/jackpod - Control a realtime netjack2 connection between two machines
- LinuxMusicians: Setup Netjack2 with a crossover cable
JackTrip
- JackTrip - a Linux and Mac OS X-based system used for multi-machine network performance over the Internet. It supports any number of channels (as many as the computer/network can handle) of bidirectional, high quality, uncompressed audio signal streaming.
- https://github.com/noahbailey/jacktrip-docker - Container for JackTrip network audio server
- https://github.com/noiseorchestra/autonomous-noise-unit - Python scripts for running JackTrip on an RPi with OLED screen and rotary switch interface.
- https://github.com/noiseorchestra/jacktrip_pypatcher - Python scripts to autopatch a JackTrip hubserver
Zita-njbridge
- Zita-njbridge - command line JACK clients to transmit full quality multichannel audio over a local IP network, with adaptive resampling by the receiver(s). Zita-njbridge can be used for a one-to-one connection (using UDP) or in a one-to-many system (using multicast). Sender and receiver(s) can each have their own sample rate and period size, and no word clock sync between them is assumed. Up to 64 channels can be transmitted; receivers can select any combination of these. On a lightly loaded or dedicated network zita-njbridge can provide low latency (same as for an analog connection). Additional buffering can be specified in case there is significant network delay jitter. IPv6 is fully supported.
- https://github.com/rhetr/ipaudio - use jackd, zita-njbridge and systemd for network ip audio
- https://github.com/nettings/medianet - The medianet distribution is a derivative of Debian Linux/Raspberry Pi OS. It was created to turn Raspberry Pis into reliable embedded audio nodes, signal processors, and streaming endpoints. The audio system is built around the JACK Audio Connection Kit, complemented with mod-host to run LV2 plugins, zita-njbridge to provide clock-decoupled uncompressed network audio streaming, and many other open-source audio tools.
- https://github.com/gisogrimm/ov-client - Headless clients to share and receive spatial realtime audio on Linux, MacOS and Windows hosts using JACK, zita-njbridge and TASCAR
jack_audio_send / jack_audio_receive
- https://github.com/7890/jack_tools/tree/master/audio_rxtx - jack_audio_send & jack_audio_receive - JACK clients allowing transmission of uncompressed native JACK 32-bit float audio data on the network using UDP OSC messages.
MultiJACK
- https://github.com/ponderworthy/MultiJACK - a fully operational demo of a framework to increase the audio DSP power available to JACK within a single multicore motherboard, using multiple JACK processes in concert, connected via IP transport.
Compared to jack2??
FLACJACKet
- https://github.com/0xsx/FLACJACKet - a DLNA media server that broadcasts streams of audio routed to JACK input ports over the local network encoded in the FLAC format. It aims to provide reliable audio transmission while minimizing latency and taking advantage of FLAC features such as lossless compression and support for surround sound. It is Free and Open Source Software, released under the GNU General Public License.
TPF
- https://gitlab.zhdk.ch/TPF/tpf-server - Telematic performance format server software
trx
- https://github.com/nettings/trx-jack - fork of http://www.pogo.org.uk/~mark/trx.git (dead) by Mark Hills
jackcast
- https://github.com/zokrezyl/jackcast - simple tool to transmit Jack audio and Midi over the network
Spatify
- https://github.com/bgola/spatify - Audio spatialization over WebRTC and JACK Audio Connection Kit
Studio Link
- Studio Link - professional Audio-Over-IP
- https://github.com/Studio-Link/app - This repository contains the studio link - baresip modules and build environment
- https://github.com/Studio-Link/overlay-lv2 - Linux LV2 VoIP/AoIP Plugin
AudioGridder
- https://github.com/apohl79/audiogridder - DSP servers using general purpose networks and computers - allows you to offload DSP processing from your local computer to remote computers. This can come in handy when mixing complex projects, for instance. AudioGridder comes with a plugin and a server that enable VST3 and AudioUnit plugins to be hosted across the network. Simply run the server component on a remote machine and connect your DAW using the AudioGridder AU/VST3 plugin. You can add remote insert chains into your DAW's signal paths that way. The DSP code of the inserted plugins will be executed on the remote machine and the plugin UIs will be streamed over the wire. This allows for an experience very close to hosting the plugins directly in your DAW, but without using your local CPU.
Roc
- Roc - real-time audio streaming over the network
HBRMT
- https://en.wikipedia.org/wiki/High_bit_rate_media_transport - (HBRMT), formerly known as High Bit Rate Audio Video over IP (HBRAV-IP), is a proposed standard for data encapsulation and forward error correction (FEC) of high bit rate, contribution-oriented video/audio feed services, up to 3 Gbit/s over Ethernet networks. HBRMT is being developed by the SMPTE 32NF networking technology committee. HBRMT is designed to incorporate both SDI uncompressed and JPEG 2000 compressed video and audio formats.
waveOverUDP
- https://github.com/amurzeau/waveOverUDP - Stream audio over UDP with low latency (can be used for remote speakers)
Plugins
Formats
VST2 / VST3
paths:
~/.vst
/usr/lib/vst
/usr/local/lib/vst
~/.wine/drive_c/Program Files (x86)/VstPlugins
~/.wine/drive_c/Program Files/VstPlugins
- PDF: An Investigation into Music-Oriented Software-Based Audio Signal Processing, Including Development of a Real-time Audio Application Using C++ - Toby Newman
- https://github.com/Xaymar/vst2sdk - a completely "clean room" untainted reverse engineered "SDK" for the VST 2.x interface. It was reverse engineered from binaries where no license restricting the reverse engineering was attached, or where the legal system explicitly allowed reverse engineering for the purpose of interoperability.
- VST Preset Generator - writes preset files (fxp for program patch or fxb for bank patch) with randomized values. This is a tool for lazy or curious sound designers who want to experiment with randomness in their VST plugins.
- https://github.com/x42/lv2vst - LV2 - VST wrapper. Expose LV2 plugins as VST2 plugins to a VST plugin-host on Windows, OSX and Linux.
- https://github.com/falkTX/JackAss - a VST plugin that provides JACK-MIDI support for VST hosts. Simply load the plugin in your favourite host to get a JACK-MIDI port. Each new plugin instance creates a new MIDI port.
- https://github.com/webprofusion/OpenAudio - A list of open source VST/audio plugin projects. Please contribute more links or open source your own plugins.
- https://github.com/DropSnorz/OwlPlug - Audio plugin manager. Small tool to manage VST plugin folders on Windows and MacOS
- VST 2 discontinued - Announcements - Steinberg Forums
- Cannot use the GPL3 option - Developer / VST 3 SDK - Steinberg Forums
Creating
- HISE - a cross-platform open source audio application for building virtual instruments. It emphasizes sampling, but includes some basic synthesis features for making hybrid instruments, as well as audio effects. You can export the instruments as VST / AU / AAX plugins or as standalone applications for Windows / macOS or iOS.
- https://github.com/davidhealey/librewave_woodwinds - contains the HISE project, scripts, and image files for the Libre Wave Sofia Woodwinds virtual instrument.
- https://github.com/Tracktion/pluginval - a cross-platform plugin validator and tester application. It is designed to be used by both plugin and host developers to ensure stability and compatibility between plugins and hosts.
- Jamba - a set of helpers (classes, concepts, build files, etc…) built on top of the VST SDK to provide a lightweight framework to build a VST2/3 plugin. Jamba has been designed to help in building VST2/3 plugin, not to replace it: you are still writing a VST2/3 plugin, not a Jamba plugin.
- https://github.com/RustAudio/vst-rs - VST 2.4 API implementation in rust. Create plugins or hosts. Previously rust-vst on the RustDSP group.
VST3
- https://github.com/steinbergmedia/vst3sdk/releases - versioning
- VST 3 SDK: Introduction
- https://github.com/steinbergmedia/vst3_public_sdk - VST 3 Implementation Helper Classes And Examples
- https://github.com/skei/vst3_plugin.h - header-only vst3 plugin wrapper [27]
VST3 threads...
- VST3 and MIDI CC pitfall - Developer / VST 3 SDK - Steinberg Forums
- Add support for sending Midi CCs out of VST3 plugins - Feature Requests - JUCE
- Linux support -help wanted- · Issue #105 · iPlug2/iPlug2
- VST3 SDK C++ ABI vs Java, Rust, .Net, etc. libraries still rely on VST 2 API: solution? - Developer / VST 3 SDK - Steinberg Forums
- GNU GPL and audio plugins III. – License compatibility II. | Virtual Analogy
- VST 3 SDK Licensing FAQ - Developer / VST 3 SDK - Steinberg Forums
- Cannot use the GPL3 option - Developer / VST 3 SDK - Steinberg Forums
- https://forum.juce.com/t/vst-midi-message-note-on-with-velocity-0-doesnt-work-in-carla/50957/5
LV2
- LV2 - an open standard for audio plugins, used by hundreds of plugins and other projects. At its core, LV2 is a simple stable interface, accompanied by extensions which add functionality to support the needs of increasingly powerful audio software.
~/.lv2 /usr/local/lib/lv2 /usr/lib/lv2 # standard lv2 paths
lv2ls # list all lv2 plugins available
Specifications
- LV2 Specifications - All official LV2 specifications.
- LV2 - an interface for writing audio processors, or plugins, in C/C++ which can be dynamically loaded into many applications, or hosts. This core specification is simple and minimal, but is designed so that extensions can be defined to add more advanced features, making it possible to implement nearly any feature imaginable. API docs
- http://lv2plug.in/ns/ext/port-groups - Multi-channel groups of LV2 ports.
- LV2 MIDI - defines a data type for a MIDI message, midi:MidiEvent, which is normalised for fast and convenient real-time processing. MIDI is the Musical Instrument Digital Interface, a ubiquitous binary standard for controlling digital music devices. For plugins that process MIDI (or other situations where MIDI is sent via a generic transport) the main type defined here, midi:MidiEvent, can be mapped to an integer and used as the type of an LV2 Atom or Event.
- LV2 + midnam
- LV2 Units - This vocabulary defines a number of units for use in audio processing.
- LV2 UI - This extension is used to create User Interfaces (UIs) for LV2 plugins.
- LV2 1.0 released, what's next? - "LV2 is a successor of both LADSPA (audio effects) and DSSI (instruments) with some backwards compatibility. The scope of the API more or less equals to the sum of LADSPA and DSSI, not in the last place thanks to its modular design."
Creating
- Programming LV2 Plugins - a series of well-documented example plugins that demonstrate the various features of LV2. Starting with the most basic plugin possible, each adds new functionality and explains the features used from a high level perspective. API and vocabulary reference documentation explains details, but not the “big picture”. This book is intended to complement the reference documentation by providing good reference implementations of plugins, while also conveying a higher-level understanding of LV2.
- Programming LV2 Plugins - new book layout
- The LV2 Book - Rust Edition - a translation of the LV2 Book by David Robillard for the lv2rs library. As such, the examples in this book, as well as the READMEs and comments, are copied from the original, but the book itself has been altered to account for the differences between C and Rust.
- https://github.com/diovudau/lv2-workshop - Documentation and code for a workshop on creating LV2 plug-ins by OSAMC
- LV2 programming for the complete idiot - an LV2 plugin programming guide for the complete idiot using a set of C++ classes. If you are not a complete idiot, you may want to read the LV2 spec and figure it out for yourself.
- Lilv - a C library to make the use of LV2 plugins as simple as possible for applications. Lilv is the successor to SLV2, rewritten to be significantly faster and have minimal dependencies. It is stable, well-tested software (the included test suite covers over 90% of the code) in use by several applications.
- https://github.com/agraef/pure-lang/tree/master/pure-lilv - provides a Pure module for David Robillard's Lilv, a library for LV2 plugin host writers.
- https://github.com/atsushieno/lilv-sharp - an experimental Mono binding for Lilv.
- Suil is a lightweight C library for loading and wrapping LV2 plugin UIs.
- https://github.com/OpenMusicKontrollers/props.lv2 - Utility header for property based LV2 plugins
- https://github.com/OpenMusicKontrollers/timely.lv2 - Utility header for time-based LV2 plugins
- LVTK - C++ wrappers for LV2 Plugins
- https://github.com/Janonard/lv2rs - Idiomatic Rust library to create LV2-compatible plugins.
- https://github.com/x42/lv2toweb - create xhtml documentation for LV2 plugins
- LV2 Create - a GUI utility that lets you easily enter information about a plugin, without needing to know too many details about LV2 (certainly not about those godawful, over-engineered, developer/enduser hostile, inefficient, easily-broken TTL files. Terrible design for audio work). Then you click a button, and the utility creates the TTL files, and C skeleton code for the plugin. You just need to add your DSP code, and compile to create your plugin. It even generates the GNU Makefile for you.
- dkbuilder - from circuit to LV2 plugin
- dkbuilder: simulate a Poweramp - follow-up
- https://github.com/fps/lv2-ttl2c - A small python script to generate code from a LV2 plugin bundle manifest
- https://github.com/polyeffects/lv2_to_dict - dodgy hack script to convert an LV2 TTL to a python dictionary because TTL is harder for me to parse than json or dicts.
- https://github.com/lvtk/jlv2 - LV2 Related JUCE Modules
- https://github.com/maxmarsc/lv2-cpp-tools-gui-less - This repository contains a dead simple copy of the original lv2-c++-tools but only containing the source code for the lv2plugin library. Everything else has been removed.
Testing
- lv2lint - Check whether a given LV2 plugin is up to the specification
- Torture tester - a program to help with testing of LADSPA and LV2 plugins.
- https://github.com/moddevices/lv2bm - a benchmark tool for LV2 plugins, inspired by the lv2bench utility from lilv and by the Torture tester
- https://github.com/ventosus/alluis.lv2 - LV2 plugin to test various LV2 UI toolkits
- https://github.com/ventosus/customui.lv2 - Plugin to test custom LV2 UI
Bridge to
- NASPRO bridges - a collection of bridges to LV2 that, once installed, allow you to use plugins developed for other plugin standards in LV2 hosts. As of now, it contains two bridges: a LADSPA 1.1 bridge and a DSSI 1.0.0/1.1.0 bridge.
- https://github.com/x37v/pdlv2 - turns pure data patches into LV2 plugins
Bridge from
- https://github.com/falkTX/Shella - LV2 to VST2 wrapper
Offline processing
- lv2file - a simple program which you can use to apply effects to your audio files without much hassle.
- lv2proc - generates an output sound file by applying an LV2 effect plugin to an input sound file.
Events
- https://github.com/OpenMusicKontrollers/orbit.lv2 - An LV2 time event manipulation plugin bundle
- Beatbox - Creates MIDI events based on LV2 time position events (bars and beats), e.g. to drive a drum machine. Bars and beats can be disabled/enabled separately.
- Click - Synthesizes click tracks based on LV2 time position events (bars and beats). Bars and beats can be disabled/enabled separately.
- Looper - Loops arbitrary LV2 atom events on a ping-pong buffer. E.g. loops MIDI, OSC or anything else that can be packed into LV2 atoms with sample accuracy. Needs to be driven by LV2 time position events.
- Pacemaker - Creates LV2 time position events from scratch to drive other plugins.
- Quantum - Quantizes incoming events to whole beats.
- Subspace - Subdivide or multiply incoming time signals by whole fractions, e.g. to speed up time x2, x3, ... or slow it down to x1/2, x1/3, ...
- Timecapsule - Record/Playback of arbitrary LV2 atoms to/from memory. Record all incoming atom messages with sample accuracy and play them back later from memory. Stored atom event data is part of the plugin state and thus preserved across instantiations.
Presets
- https://gitlab.com/Jofemodo/preset2lv2 - A converter that takes a set of native presets and generates an LV2 bundle containing one or more banks. It supports several native formats and is easily extensible.
Hardware
- https://wiki.moddevices.com/wiki/Control_Chain - an open standard developed by MOD Devices that defines a communication protocol, electrical specifications, cables and connectors. It's used to connect external controllers (a.k.a. peripheral devices) such as expression pedals and foot-switches to MOD devices, for example the MOD Duo.
Distribution
Distro packages.
- https://github.com/patchstorage/patchstorage-lv2-uploader - Proof of concept utility for uploading LV2 plugins to patchstorage.com
SPA
SPA is designed to also support video, multiple buffers, dmabuf backed memory, buffers managed by the hardware. It avoids the RDF descriptions of LV2, keeping the .so self contained, using simple key/value properties to describe ports and nodes. It also favours using shared memory to exchange info between host and plugins, like clock information etc.
LADSPA
~/.ladspa /usr/local/lib/ladspa /usr/lib/ladspa
- https://github.com/kmatheussen/ladspavst - Make VST plugins appear as LADSPA Plugins.
- https://github.com/swh/LRDF - a library to make it easy to manipulate RDF files describing LADSPA plugins. It can also be used for general RDF manipulation. It can read RDF/XML and N3 files and export N3 files; it also has a light taxonomic inference capability. N.B. this is the descendant of the sourceforge.net/projects/lrdf project
DSSI
AU
- https://github.com/sfztools/sfzt_auwrapper - Custom edit of the Steinberg VST3→AU wrapper, with preference for static linking
CLAP
- https://github.com/free-audio/clap - Audio Plugin API
- https://github.com/free-audio/clap-host - an example to demonstrate how to create a CLAP host.
- https://github.com/jpcima/claptrap - Wrapper of CLAP plugins to other plugin standards
- https://github.com/free-audio/clap-info - A tool to show information about a CLAP plugin on the command line
- https://github.com/baconpaul/clap-c99-distortion - A simple C99-only example of a CLAP audio effect. Really, you don't want to use this musically, The DSP is painfully naive. But this shows a simple set of wave folder / distortions in a 3 param state saving pure C99 CLAP Audio Effect.
Hosts
Multiple
Carla
- Carla - an audio plugin host, with support for many audio drivers and plugin formats. It has some nice features like automation of parameters via MIDI CC (and send output back as MIDI too) and full OSC control. Carla currently supports LADSPA (including LRDF), DSSI, LV2, VST2/3 and AU plugin formats, plus GIG, SF2 and SFZ file support. It uses JACK as the default and preferred audio driver but also supports native drivers like ALSA, DirectSound or CoreAudio.
There are 4 types of engine processing:
- Single-client: (JACK driver only) - carla-jack-single
- Same as Multi-client, except that all JACK ports belong to a single master client.
- This is needed when a setup doesn't support multi-client JACK apps, such as LADISH.
- Multi-client: (JACK driver only) - carla-jack-multi
- Every single plugin is exposed as a new JACK client. Audio and MIDI ports are registered as needed.
- Rack: - carla-rack
- Plugins are processed in order, from top to bottom.
- Plugins with non-stereo audio channels are not supported, but a forced-stereo option is available for Mono ones.
- Patchbay: - carla-patchbay
- Modular patchbay mode, just like in JACK Multi-client and many other modular applications.
- Every plugin gets its own canvas group and ports allowing you to interconnect plugin audio and MIDI.
carla-single # usage: /usr/bin/carla-single [arch (optional)] [format] [filename/uri] [label (optional)] [uniqueId (optional)]
Possible archs:
- native (default) - linux32 - linux64 - win32 - win64
Possible formats:
- internal - ladspa - dssi - lv2 - vst|vst2 - gig - sf2 - sfz
Command-line launch examples:
/usr/bin/carla-single internal midisplit
/usr/bin/carla-single dssi /usr/lib/dssi/whysynth.so
/usr/bin/carla-single lv2 http://calf.sourceforge.net/plugins/Compressor
/usr/bin/carla-single native vst /usr/lib/vst/TAL-NoiseMaker.so
/usr/bin/carla-single win32 vst "~/.wine/drive_c/Program Files (x86)/VstPlugins/Kontakt 5.dll"
- Carla Backend: Modules - API
- https://github.com/progwolff/performer - Performer lets you manage all the songs in your setlist as individual Carla patches and loads each of them when you need it. Additionally Performer uses Okular or QWebEngine to display notes and chords of your songs.
Chibi
- https://github.com/falkTX/Chibi - a mini-host audio plugin loader, meant to load one plugin at a time as if it were a standalone application. The goals for the project: provide a quick way to start audio plugins; integrate with relevant session managers; allow it to be reused for other projects, so plugins can literally become standalones; provide the most useful tools from the host side without any extra plugin work; leverage and test Carla as much as possible (avoiding duplicate work and having a head start on features). Chibi is basically Carla's little sister. It runs Carla's backend behind the scenes and shares quite a few visual traits. Building on top of what Carla has already achieved, it adds only the missing pieces for a "mini-host" setup.
Ingen
- Ingen (formerly Om) - a modular audio processing system for GNU/Linux audio systems using the Jack audio server and LV2 or LADSPA plugins.
Jost
- Jost (dead) was the first open source multi-technology (native VST, LADSPA, DSSI) host on Linux. It mainly hosts a chain of plugins per instance, publishing jack, alsa and alsa_seq ports in order to be connected into your main stream flow. It still has some very good features that make it a first-class host.
VST
MrsWatson
- MrsWatson - a command-line audio plugin host. It takes an audio and/or MIDI file as input, and processes it through one or more audio plugins. Currently MrsWatson only supports VST 2.x plugins, but more formats are planned for the future. MrsWatson was designed primarily for three purposes: audio plugin development and testing, automated audio processing for servers or other applications, and unit testing audio plugins
dssi-vst
- dssi-vst - Run Windows VST plugins on Linux. DSSI doesn't support passing host tempo through to plugins.
FST
- FST - a program which uses Wine, Jack and Steinberg's VST Audio Plug-Ins SDK to enable the use of many VST audio plugins under GNU/Linux.
FeSTige
- FeSTige - a GUI for fst and dssi-vst, allowing you to run Windows VST plugins on Linux.
fsthost
- fsthost - FreeST standalone fork. linux VST host - hybrid using winelib. Runs as a Jack client for Audio/MIDI, and with GTK GUI. Supports 32 and 64 bit plugins. doesn't see JACK server..?
fsthost -g ~/.vst # build plugin db
export VST_PATH=~/VST:/usr/share/vst:/otherlocation
fsthost -g
fsthost_menu # Perl GTK menu to start up plugins
fsthost_ctrl # Perl GTK app for control via TCP socket
fsthost_list # simple application to show known plugins ( read about XML DB )
export FSTMENU_GTK=2 # or 3
Airwave
- Airwave - a WINE-based VST bridge, that allows for the use of Windows 32- and 64-bit VST 2.4 audio plugins with Linux VST hosts
- https://pastebin.com/aDNcdRjp - "Airwave is very nice, but adding more than a few plugins to it is awfully tedious. So I've taken the matter into my own hands and written a script to add a large number of plugins to Airwave (plus the ability to edit their names) as a batch process." [28]
vstserver
- https://github.com/kmatheussen/vstserver - an old vstlib server program
vst-bridge
- https://github.com/abique/vst-bridge - a bridge to run Windows VST plugins (both 32 and 64 bits) with Linux VST hosts.
LinVst
- https://github.com/osxmidi/LinVst - a Linux vst plugin that runs Windows 64 bit vst's. To use LinVst, the linvst.so file simply needs to be renamed to match the windows vst dll's filename.
- https://github.com/osxmidi/LinVst-X - runs vst plugins in a single Wine process so plugins that communicate with each other or plugins that can use shared samples between instances will be able to communicate with their other instances, usage is basically the same as LinVst except that the file to be renamed to the vst dll name is linvstx.so (rather than linvst.so for LinVst).
- https://github.com/osxmidi/LinVst3 - Linux Windows vst3 wrapper/bridge
- https://github.com/osxmidi/LinVst3-X - Windows vst3 Linux Wrapper - Extra
yabridge
- https://github.com/robbert-vdh/yabridge - Yet Another VST bridge, run Windows VST2 plugins under Linux
VSTForx
- VSTForx - a fully modular effect network creation tool which comes as a VST plugin. With VSTForx you are able to load any number of VST plugins and connect them any way you want. Additional modules allow you to manipulate such signal chains and offer a whole new way of mixing and producing. Windows/Mac.
L_Pa
- L_Pa Project - a collection of tools aimed at better integration and *performance* of Linux + Wine + jackd + pro-audio applications. L_Pa accomplishes this by setting up a proper mix of kernel and software with low-latency/pro-audio users specifically in mind. The main components so far are: a custom version of the (rt-)Linux kernel with a delta of patches for Linux pro-audio usage, and a custom version of Wine with a delta of patches (plus various other improvements and bug fixes) to ensure Wine is ready for pro-audio on the Linux platform. You'll want both WineASIO and FSThost; WineASIO provides an ASIO to JACK driver for Wine.
vsthost
- https://github.com/wtrsltnk/vsthost - Small code base containing a minimal vsthost
NetVST
- NetVST - Windows only
PluginRunner
- https://github.com/jatinchowdhury18/PluginRunner - A minimal command-line application for running audio through an audio plugin. Made with JUCE. Mostly only tested on Windows.
vstplugin
- https://github.com/Spacechild1/vstplugin - VST plugin support for Pd and SuperCollider (mirror of https://git.iem.at/pd/vstplugin)
UnityVSTHost
- https://github.com/Chris-TopherW/UnityVSTHost - VST plugin host for Unity engine. Supports 64bit VST2 plugins on Windows only. Does not support Midi input to plugins at this stage.
- https://github.com/Chris-TopherW/UnityVSTDll - the Dll implementation for the Unity VST host
Terra
- https://github.com/hotwatermorning/Terra - The yet another audio plugin hosting application. (alpha version)
DSSI
- ghostess - a rough start at a graphical DSSI host, based on jack-dssi-host, but capable of saving and restoring plugin configuration, as well as specifying MIDI channels and layering synths. ghostess includes three MIDI drivers: an ALSA sequencer MIDI driver, a (clumsy but functional) CoreMIDI driver (which allows ghostess to be used on Mac OS X), and a JACK MIDI driver for use with the MIDI transport in recent versions (>=0.105.0) of JACK. ghostess also comes with a universal DSSI GUI, which attempts to provide GUI services for any DSSI or LADSPA plugin, and may be used with any DSSI host.
LADSPA
- JACK Rack is an effects "rack" for the JACK low latency audio API. The rack can be filled with LADSPA effects plugins and can be controlled using the ALSA sequencer. It's phat; it turns your computer into an effects box.
- jackspa - A small utility which will host a LADSPA plugin, providing JACK ports for its audio inputs and outputs, and sliders in a gtkmm GUI for its control inputs. I find it useful for hosting plugins with odd port configurations (such as a vocoder or a ring modulator), and for testing plugins. This project is pretty hacky. I threw it together quickly because I needed it in a hurry, and as a result, it's fairly buggy, and the code is a mess. But, it does the job.
- ng-jackspa is a set of simple user interfaces that host a LADSPA plugin, providing JACK ports for its audio inputs and outputs, and dynamic setting of its control inputs. Additionally, the plugin controls can be exported to or controlled by control voltages on standard JACK audio ports.
- Soundtank hosts LADSPA plugins in "realtime objects" which embody the structure of the audio signal flow. RTObjects can be controlled in a completely customizable fashion using MIDI events sent through the ALSA sequencer interface.
- Stomper - a virtual pedalboard for guitar, using commonly-available audio plugins in a user-defined arrangement and MIDI for switching. It is intended for on-stage use and will be optimized as such.
LV2
Jalv
- Jalv - a simple but fully featured LV2 host for Jack. It runs LV2 plugins and exposes their ports as Jack ports, essentially making any LV2 plugin function as a Jack application.
- https://github.com/brummer10/jalv_select - little app to select lv2 plugs for run with jalv
jalv.qt5 http://drumkv1.sourceforge.net/lv2
LV2_PATH=/path/to/plugin.lv2 jalv.gtk URI
LV2_PATH=/path/to/plugin.lv2 lv2ls # to find the URI(s)
calvo
- https://github.com/ajboni/calvo - 🧑🏼🦲 A jalv based lv2 plugin rack for your terminal.
- calvo-cli-tools
- https://github.com/ajboni/calvo-cli-tools - python cli tools to manipulate JACK and get useful LV2 Information.
mod-host
- https://github.com/moddevices/mod-host - an LV2 host for JACK, controllable via socket or command line
- https://github.com/moddevices/mod-ui - the UI for the MOD software. It's a webserver that delivers an HTML5 interface and communicates with mod-host. It also communicates with the MOD hardware, but does not depend on it to run.
- PedalPi - PluginsManager - Pythonic management of LV2 audio plugins with mod-host.
Synthpod
- Synthpod - both LV2 host and plugin. It can be run as a standalone app and be used as a tool for live performances or general audio and event filtering. Or it can be run as a plugin itself inside another host (or inside itself) to add support for non-linear patching where only strictly linear connections are supported (e.g. as in most DAWs). Patching of audio channels is clickless.
Elven
- Elven - written for revision 2 of the LV2 specification and is NOT compatible with revisions 3 and later. It may work, it may break subtly or it may give your computer the swine flu.
zynjacku
- zynjacku - JACK based, GTK (2.x) host for LV2 synths. It has one JACK MIDI input port (routed to all hosted synths) and one (two for stereo synths) JACK audio output port per plugin. Such design provides multi-timbral sound by running several synth plugins.
MODEP
- MODEP - an open-source, community-based MOD DUO emulator that lets you play around with hundreds of LV2 audio plugins ranging from a simple reverb to a complex FM synth using your Raspberry Pi and Pisound or any other Raspberry Pi supported sound card!
- https://github.com/BlokasLabs/modep - fork of pi-gen
PedalPi
- https://github.com/auto3000/pedalpii - an affordable but complete computer-based pedalboard for guitar/bass.
- https://github.com/auto3000/meta-pedalpi - Yocto meta-layer for pedalpi
- https://github.com/auto3000/pedalpi-dev-platform - pedalpi development platform
- https://github.com/Rezzonics/pedalC2-dev-platform - PedalPi Development Platform for Hardkernel Odroid-C2
lv2host
- https://github.com/giuliomoro/lv2host - A lv2 host with Bela example.
jackwrap.c
- https://github.com/x42/robtk/blob/master/jackwrap.c - x42 jack wrapper / minimal LV2 host
lv2h
Audio formats
- Xiph.Org's Digital Show & Tell - a video on digital media that explores multiple facets of digital audio signals and how they really behave in the real world.
- Video Game Music Preservation Foundation - the Wikipedia of video game music!
- https://github.com/sandreas/tone - a cross platform utility to dump and modify audio metadata for a wide variety of formats. [30]
PCM
- https://en.wikipedia.org/wiki/Pulse-code_modulation - a method used to digitally represent sampled analog signals. It is the standard form of digital audio in computers, Compact Discs, digital telephony and other digital audio applications. In a PCM stream, the amplitude of the analog signal is sampled regularly at uniform intervals, and each sample is quantized to the nearest value within a range of digital steps.
Linear pulse-code modulation (LPCM) is a specific type of PCM where the quantization levels are linearly uniform. This is in contrast to PCM encodings where quantization levels vary as a function of amplitude (as with the A-law algorithm or the μ-law algorithm). Though PCM is a more general term, it is often used to describe data encoded as LPCM.
A PCM stream has two basic properties that determine the stream's fidelity to the original analog signal: the sampling rate, which is the number of times per second that samples are taken; and the bit depth, which determines the number of possible digital values that can be used to represent each sample.
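The two properties above can be sketched directly: a toy quantizer (function names are mine, not any library's API) maps samples in [-1.0, 1.0] to signed integer codes, and the sampling rate fixes how many such codes one signal cycle produces.

```python
import math

def quantize(x, bits=16):
    """Round a sample in [-1.0, 1.0] to the nearest signed integer code."""
    max_code = 2 ** (bits - 1) - 1            # 32767 for 16-bit
    return max(-max_code - 1, min(max_code, round(x * max_code)))

# sample one cycle of a 1 kHz sine at a 48 kHz sampling rate
rate, freq = 48000, 1000
pcm = [quantize(math.sin(2 * math.pi * freq * n / rate))
       for n in range(rate // freq)]
# 48 samples per cycle; the peaks reach the full-scale codes +/-32767
```

Raising `bits` adds quantization levels (lower noise); raising `rate` adds samples per cycle (higher representable frequencies).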
- https://en.wikipedia.org/wiki/Differential_pulse-code_modulation - DPCM, a signal encoder that uses the baseline of pulse-code modulation (PCM) but adds some functionalities based on the prediction of the samples of the signal. The input can be an analog signal or a digital signal. If the input is a continuous-time analog signal, it needs to be sampled first so that a discrete-time signal is the input to the DPCM encoder. DPCM was invented by C. Chapin Cutler at Bell Labs in 1950; his patent includes both methods.
Option 1: take the values of two consecutive samples; if they are analog samples, quantize them; calculate the difference between the first one and the next; the output is the difference, and it can be further entropy coded. Option 2: instead of taking a difference relative to a previous input sample, take the difference relative to the output of a local model of the decoder process; in this option, the difference can be quantized, which allows a good way to incorporate a controlled loss in the encoding. Applying one of these two processes, short-term redundancy (positive correlation of nearby values) of the signal is eliminated; compression ratios on the order of 2 to 4 can be achieved if differences are subsequently entropy coded, because the entropy of the difference signal is much smaller than that of the original discrete signal treated as independent samples.
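Option 1 above is easy to sketch: encode each sample as its difference from the previous one, so a smooth signal yields mostly small values that entropy-code cheaply (a toy illustration, not any particular codec).

```python
def dpcm_encode(samples, prev=0):
    """Emit each sample's difference from the previously seen sample."""
    out = []
    for s in samples:
        out.append(s - prev)
        prev = s
    return out

def dpcm_decode(diffs, prev=0):
    """Integrate the differences to recover the original samples."""
    out = []
    for d in diffs:
        prev += d
        out.append(prev)
    return out

sig = [100, 102, 103, 103, 101, 98]
enc = dpcm_encode(sig)          # [100, 2, 1, 0, -2, -3] -- mostly small values
assert dpcm_decode(enc) == sig  # lossless round trip
```

Option 2 differs in that the encoder differences against the *decoder's* reconstruction, which keeps quantization error from accumulating when the differences are quantized.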
- https://en.wikipedia.org/wiki/Adaptive_differential_pulse-code_modulation - ADPCM, is a variant of differential pulse-code modulation (DPCM) that varies the size of the quantization step, to allow further reduction of the required data bandwidth for a given signal-to-noise ratio. Typically, the adaptation to signal statistics in ADPCM consists simply of an adaptive scale factor before quantizing the difference in the DPCM encoder. ADPCM was developed in the early 1970s at Bell Labs for voice coding, by P. Cummiskey, N. S. Jayant and James L. Flanagan
PDM
- https://en.wikipedia.org/wiki/Pulse-density_modulation - a form of modulation used to represent an analog signal with a binary signal. In a PDM signal, specific amplitude values are not encoded into codewords of pulses of different weight as they would be in pulse-code modulation (PCM). Instead, it is the relative density of the pulses that corresponds to the analog signal's amplitude. The output of a 1-bit DAC is the same as the PDM encoding of the signal. Pulse-width modulation (PWM) is a special case of PDM where the switching frequency is fixed and all the pulses corresponding to one sample are contiguous in the digital signal. For a 50% voltage with a resolution of 8-bits, a PWM waveform will turn on for 128 clock cycles and then off for the remaining 128 cycles. With PDM and the same clock rate the signal would alternate between on and off every other cycle. The average is 50% for both waveforms, but the PDM signal switches more often. For 100% or 0% level, they are the same.
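The 50%-level comparison above can be checked in a few lines (a toy illustration; function names are mine): both waveforms carry the same average, but the PDM one switches on every clock cycle.

```python
def pwm_frame(level, bits=8):
    """PWM: one contiguous on-run of `level` cycles per 2**bits-cycle frame."""
    period = 2 ** bits
    return [1] * level + [0] * (period - level)

def pdm_frame_50(bits=8):
    """PDM at 50%: same average, but the pulses alternate every cycle."""
    return [1, 0] * (2 ** bits // 2)

pwm, pdm = pwm_frame(128), pdm_frame_50()
edges = lambda w: sum(a != b for a, b in zip(w, w[1:]))
assert sum(pwm) == sum(pdm) == 128   # identical 50% average
assert edges(pwm) == 1               # one falling edge per frame
assert edges(pdm) == 255             # switches on every cycle
```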
WAV / WAVE
- https://en.wikipedia.org/wiki/WAV - a Microsoft and IBM audio file format standard for storing an audio bitstream on PCs. It is an application of the Resource Interchange File Format (RIFF) bitstream format method for storing data in "chunks", and thus is also close to the 8SVX and the AIFF format used on Amiga and Macintosh computers, respectively. It is the main format used on Windows systems for raw and typically uncompressed audio. The usual bitstream encoding is the linear pulse-code modulation (LPCM) format.
- Intro to Audio Programming, Part 2: Demystifying the WAV Format – Game Theory - A blog by Microsoft Academic Developer Evangelist, Dan Waters
- https://github.com/Borewit/music-metadata/wiki/RIFF-WAVE - on metadata
- https://github.com/synesthesiam/wav-chunk - Read or write INFO chunks in WAV files
- https://github.com/K0F/genwav - generate audio files from txt floats
- https://github.com/volkertb/ich2player - AC'97 DOS .wav player for the Intel ICHx (ICH through ICH7) and 440MX chipsets.
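The RIFF "chunk" layout described above is small enough to emit by hand: a minimal sketch of the canonical 44-byte header for LPCM data (field order per the WAV format; the helper name is mine).

```python
import struct

def wav_header(num_samples, rate=44100, channels=1, bits=16):
    """Canonical 44-byte RIFF/WAVE header for LPCM (format tag 1)."""
    block_align = channels * bits // 8
    data_bytes = num_samples * block_align
    return (b"RIFF" + struct.pack("<I", 36 + data_bytes) + b"WAVE"
            + b"fmt " + struct.pack("<IHHIIHH", 16, 1, channels, rate,
                                    rate * block_align,  # byte rate
                                    block_align, bits)
            + b"data" + struct.pack("<I", data_bytes))

hdr = wav_header(44100)   # one second of mono 16-bit audio
assert len(hdr) == 44 and hdr[:4] == b"RIFF" and hdr[8:12] == b"WAVE"
```

All multi-byte fields are little-endian; the LPCM samples simply follow the `data` chunk size.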
Broadcast WAV
- https://en.wikipedia.org/wiki/Broadcast_Wave_Format - an extension of the popular Microsoft WAV audio format and is the recording format of most file-based non-linear digital recorders used for motion picture, radio and television production. It was first specified by the European Broadcasting Union in 1997, and updated in 2001 and 2003. The purpose of this file format is the addition of metadata to facilitate the seamless exchange of sound data between different computer platforms and applications. It specifies the format of metadata, allowing audio processing elements to identify themselves, document their activities, and supports timecode to enable synchronization with other recordings. This metadata is stored as extension chunks in a standard digital audio WAV file.
- BWF MetaEdit - developed by the Federal Agencies Digitization Guidelines Initiative (FADGI) supported by AudioVisual Preservation Solutions.This tool permits embedding, editing, and exporting of metadata in Broadcast WAVE Format (BWF) files. This tool can also enforce metadata guidelines developed by the Federal Agencies Audio-Visual Working Group, as well as recommendations and specifications from the European Broadcasting Union (EBU), Microsoft, and IBM.
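Since BWF metadata lives in ordinary RIFF extension chunks, a generic chunk walk is enough to locate it. A sketch against an in-memory buffer (the `bext` ID is the BWF broadcast extension chunk; the rest of the names are mine):

```python
import struct

def riff_chunks(buf):
    """Walk the chunks of a RIFF/WAVE buffer, yielding (id, payload) pairs."""
    pos = 12                                  # skip the 12-byte RIFF....WAVE header
    while pos + 8 <= len(buf):
        cid = buf[pos:pos + 4]
        size = struct.unpack("<I", buf[pos + 4:pos + 8])[0]
        yield cid, buf[pos + 8:pos + 8 + size]
        pos += 8 + size + (size & 1)          # chunks are word-aligned

# a BWF file is a WAV with an extra 'bext' chunk; fake a tiny one in memory
bext = b"bext" + struct.pack("<I", 4) + b"demo"
fake = b"RIFF" + struct.pack("<I", 4 + len(bext)) + b"WAVE" + bext
assert dict(riff_chunks(fake))[b"bext"] == b"demo"
```

The same walk finds `fmt `, `data`, or any other extension chunk, which is why BWF files remain readable by plain WAV software that skips unknown chunks.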
RF64
- PDF: Long-form file format for the international exchange of audio programme materials with metadata
- https://en.wikipedia.org/wiki/RF64 - a BWF-compatible multichannel audio file format enabling file sizes to exceed 4 GB. It has been specified by the European Broadcasting Union. It has been accepted as the ITU recommendation ITU-R BS.2088.The file format is designed to meet the requirements for multichannel sound in broadcasting and audio archiving. It is based on the Microsoft RIFF/WAVE format and Wave Format Extensible for multichannel parameters. Additions are made to the basic specification to allow for more than 4 GB file sizes when needed (the new maximum filesize is now approximately 16 exabytes). The format is transparent to the BWF and all its supplements and chunks.
- https://github.com/IRT-Open-Source/libbw64 - Broadcast Wave 64 (ITU-R BS.2088) library
AU
- https://en.wikipedia.org/wiki/Au_file_format - a simple audio file format introduced by Sun Microsystems. The format was common on NeXT systems and on early Web pages. Originally it was headerless, being simply 8-bit µ-law-encoded data at an 8000 Hz sample rate. Hardware from other vendors often used sample rates as high as 8192 Hz, often integer multiples of video clock signal frequencies. Newer files have a header that consists of six unsigned 32-bit words, an optional information chunk and then the data (in big endian format). Although the format now supports many audio encoding formats, it remains associated with the µ-law logarithmic encoding. This encoding was native to the SPARCstation 1 hardware, where SunOS exposed the encoding to application programs through the /dev/audio interface. This encoding and interface became a de facto standard for Unix sound.
- https://notabug.org/kd/au-utils - simple, easily sandboxed, pipeline components for audio processing, using the au(7) file-format as an intermediary.
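The six-word big-endian header described above is trivial to emit; a sketch (encoding code 1 is 8-bit µ-law, and the minimal data offset is 24 bytes; the helper name is mine):

```python
import struct

def au_header(data_size, encoding=1, rate=8000, channels=1):
    """Six big-endian 32-bit words: magic, data offset, data size,
    encoding, sample rate, channels."""
    return struct.pack(">4sIIIII", b".snd", 24, data_size,
                       encoding, rate, channels)

hdr = au_header(8000)   # one second of 8-bit mu-law at 8000 Hz
assert len(hdr) == 24 and hdr[:4] == b".snd"
```

The original headerless files are just the µ-law bytes with these parameters implied, which is why the header's defaults match them.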
MP3
- MP3 (MPEG-1 or MPEG-2 Audio Layer III) is a patented encoding format for digital audio which uses a form of lossy data compression. It is a common audio format for consumer audio streaming or storage, as well as a de facto standard of digital audio compression for the transfer and playback of music on most digital audio players.
- https://github.com/lieff/minimp3 - minimalistic, single-header library for decoding MP3. minimp3 is designed to be small, fast (with SSE and NEON support), and accurate (ISO conformant). The project README includes a rough benchmark measured using perf on an i7-6700K, IO included.
- https://github.com/anars/blank-audio - Set of blank MP3 audio files
Encoding
LAME
- LAME is a high quality MPEG Audio Layer III (MP3) encoder licensed under the LGPL.
- http://savvyadmin.com/batch-mp3-encoding-with-linux-and-lame/
for f in *.wav ; do lame "$f" ; done
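A hedged variant of the loop above (same idea, using LAME's documented VBR flag): encode at VBR quality 2 and derive the output name from the input, guarded so it does nothing when lame or the input files are absent.

```shell
# Batch-encode WAVs to VBR MP3 (-V 2, roughly 190 kbps average).
# A sketch: a no-op when the lame binary or the .wav files are missing.
if command -v lame >/dev/null 2>&1; then
  for f in *.wav; do
    [ -e "$f" ] || continue                 # skip the unexpanded glob
    lame --quiet -V 2 "$f" "${f%.wav}.mp3"  # song.wav -> song.mp3
  done
fi
```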
shine
- https://github.com/toots/shine - Super fast fixed-point MP3 encoder.
mp3fs
- mp3fs - a read-only FUSE filesystem which transcodes between audio formats (currently FLAC to MP3) on the fly when files are opened and read. It can let you use a FLAC collection with software and/or hardware which only understands the MP3 format, or transcode files through simple drag-and-drop in a file browser. [32]
Splitting
mp3splt
- mp3splt - a utility to split mp3, ogg vorbis and native FLAC files by selecting a begin and an end time position, without decoding. It's very useful for splitting large mp3/ogg vorbis/FLAC files into smaller ones, or for splitting entire albums to obtain the original tracks. If you want to split an album, you can select split points and filenames manually, or you can get them automatically from CDDB (internet or a local file) or from .cue files. It also supports automatic silence split, which can likewise be used to adjust CDDB/cue splitpoints, and trimming using silence detection. You can extract tracks from Mp3Wrap or AlbumWrap files in a few seconds. For mp3 files, both ID3v1 & ID3v2 tags are supported. The mp3splt project is split into 3 parts: libmp3splt, mp3splt and mp3splt-gtk.
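A couple of hedged examples (flags per the mp3splt manual; the filenames are hypothetical), splitting by cue sheet or by explicit time positions:

```shell
# Split an album into tracks using its cue sheet; @n/@t expand to track
# number/title in the output names. Guarded so the snippet is a no-op
# when mp3splt or the hypothetical input files are absent.
if command -v mp3splt >/dev/null 2>&1 && [ -e album.mp3 ]; then
  mp3splt -c album.cue -o '@n - @t' album.mp3
  mp3splt album.mp3 1.04 2.53.45   # or cut from 1m04s to 2m53.45s directly
fi
```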
unflac
- https://github.com/ftrvxmtrx/unflac - Frame accurate audio image + cue sheet splitting. No ReplayGain.
split2flac
- https://github.com/ftrvxmtrx/split2flac - Split flac/ape/wv/wav + cue sheet into separate tracks
Pcutmp3
- Pcutmp3 - a Java based program that lets you cut and trim MP3 files losslessly (i.e. without any quality loss as there is no re-encoding). Ideal for removing adverts or unwanted intros/outros from your radio mixes. Originally created by Sebastian Gesemann it is now maintained by Christopher Banes.
- https://bitbucket.org/gbouthenot/pcutmp3/src/default/ - gapless mp3 cutter tool. A fork of http://pcutmp3.googlecode.com/svn/trunk/. Author: Christopher Banes (New BSD Licence).
- https://github.com/hdijkema/pcutmp3-gui - Proper Cut MP3 with bug fixes and enhanced with a GUI
lossless-cut
- https://github.com/mifi/lossless-cut - Save space by quickly and losslessly trimming video and audio files
quelcom
- https://github.com/posixru/quelcom - provides assorted tools to perform simple editing operations on MP3 and WAV audio files. These include fading, check-and-clean, informational extraction and lossless cutting and joining without reencoding.
MP3-Splitter
- https://github.com/gitpan/MP3-Splitter - MP3::Splitter - Perl extension for splitting MP3 files
Flacon
- Flacon - extracts individual tracks from one big audio file containing the entire album of music and saves them as separate audio files. To do this, it uses information from the appropriate CUE file. Besides, Flacon makes it possible to conveniently revise or specify tags both for all tracks at once or for each tag separately.
album-splitter
- https://github.com/crisbal/album-splitter - Do you have a music album as a single file (locally or on YouTube), with all its tracks joined together? Do you want to split that album in its single tracks? Do you want to tag these tracks so your music player can get all the required info from them?
WavePad
- Audio Editing Software. Sound, Music, Voice & Mp3 Editor - This audio editing software is a full-featured professional audio and music editor for Windows and Mac. It lets you record and edit music, voice and other audio recordings. When editing audio files, you can cut, copy and paste parts of recordings, and then add effects like echo, amplification and noise reduction. WavePad works as a wav or mp3 editor, but it also supports a number of other file formats including vox, gsm, wma, real audio, au, aif, flac, ogg, and more.
Non-commercial usage only.
mp3DirectCut
- mp3DirectCut - a fast and extensive audio editor and recorder for encoded MP3. Without re-encoding you can directly cut, crop or split your MP3 and AAC tracks, change the volume on MP3 and much more. Direct editing saves encoding time and preserves the original audio quality of your tracks. The built in recorder creates MP3 on the fly. By using Cue sheets, Pause detection or Auto cue you can easily divide long files.
Metadata
- id3reader - a Python module that reads ID3 metadata tags in MP3 files. It can read ID3v1, ID3v2.2, ID3v2.3, or ID3v2.4 tags. It does not write tags at all.
- http://search.cpan.org/dist/MP3-Info/
- http://search.cpan.org/dist/MP3-Tag/
- http://search.cpan.org/dist/MP3-Find/
- https://github.com/quodlibet/mutagen - a Python module to handle audio metadata. It supports ASF, FLAC, MP4, Monkey's Audio, MP3, Musepack, Ogg Opus, Ogg FLAC, Ogg Speex, Ogg Theora, Ogg Vorbis, True Audio, WavPack, OptimFROG, and AIFF audio files. All versions of ID3v2 are supported, and all standard ID3v2.4 frames are parsed. It can read Xing headers to accurately calculate the bitrate and length of MP3s. ID3 and APEv2 tags can be edited regardless of audio format. It can also manipulate Ogg streams on an individual packet/page level.
- eyeD3 - a Python tool for working with audio files, specifically mp3 files containing ID3 metadata (i.e. song info). It provides a command-line tool (eyeD3) and a Python library (import eyed3) that can be used to write your own applications or plugins that are callable from the command-line tool.
- http://sourceforge.net/projects/bulkid3 - an en-masse ID3 tag editor. It is designed to allow bulk modification of mp3 files, as well as file renaming.
AAC
- http://en.wikipedia.org/wiki/Advanced_Audio_Coding - AAC, is a standardized, lossy compression and encoding scheme for digital audio. Designed to be the successor of the MP3 format, AAC generally achieves better sound quality than MP3 at similar bit rates.
- https://github.com/linnaea/faac - based on the ISO MPEG-4 reference code.
Ogg
- Ogg is a multimedia container format, and the native file and stream format for the Xiph.org multimedia codecs.
Container format.
- https://en.wikipedia.org/wiki/Vorbis_comment - a metadata container used in the Vorbis, FLAC, Theora, Speex and Opus file formats. It allows information such as the title, artist, album, track number or other information about the file to be added to the file itself. However, as the official Ogg Vorbis documentation notes, “[the comment header] is meant for short, text comments, not arbitrary metadata; arbitrary metadata belongs in a separate logical bitstream (usually an XML stream type) that provides greater structure and machine parseability.”
- https://github.com/manxorist/rogg - a small library for manipulating Ogg streams in memory. This makes it convenient to write quick scripts for checking and fixing simple bitstream errors in mmap()'d files.
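For plain command-line tag work, vorbiscomment from vorbis-tools covers the common cases; the tag and file names below are hypothetical, and the snippet is guarded so it does nothing when the tool or file is missing.

```shell
# List, then append a tag to, an Ogg Vorbis file.
if command -v vorbiscomment >/dev/null 2>&1 && [ -e song.ogg ]; then
  vorbiscomment -l song.ogg                      # -l: list existing comments
  vorbiscomment -a -t 'TITLE=My Song' song.ogg   # -a: append a TITLE comment
fi
```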
Opus
- Opus is a totally open, royalty-free, highly versatile audio codec. Opus is unmatched for interactive speech and music transmission over the Internet, but is also intended for storage and streaming applications. It is standardized by the Internet Engineering Task Force (IETF) as RFC 6716 which incorporated technology from Skype's SILK codec and Xiph.Org's CELT codec.
Best lossy audio format, replaces Vorbis and Speex.
ffmpeg -i input -acodec libopus -b:a bitrate -vbr on -compression_level 10 output # substitute a real bitrate, e.g. 128k; -compression_level 10 = slowest/best
- https://github.com/vlazzarini/opcode_compiler - experimental opcode builds on the initial work by Michael Goggins, and is based on the llvm/clang interpreter example code. It provides a just-in-time C module compiler, which can be used to add new opcodes to Csound on-the-fly.
Vorbis
Speex
- https://www.speex.org
- https://en.wikipedia.org/wiki/Speex - a lossy audio compression format specifically tuned for the reproduction of human speech and also a free software speech codec that may be used on VoIP applications and podcasts. It is based on the CELP speech coding algorithm. Speex claims to be free of any patent restrictions and is licensed under the revised (3-clause) BSD license. It may be used with the Ogg container format or directly transmitted over UDP/RTP. It may also be used with the FLV container format. The Speex designers see their project as complementary to the Vorbis general-purpose audio compression project.
FLAC
- FLAC - stands for Free Lossless Audio Codec, an audio format similar to MP3, but lossless, meaning that audio is compressed in FLAC without any loss in quality. This is similar to how Zip works, except with FLAC you will get much better compression because it is designed specifically for audio, and you can play back compressed FLAC files in your favorite player (or your car or home stereo, see supported devices) just like you would an MP3 file. FLAC stands out as the fastest and most widely supported lossless audio codec, and the only one that at once is non-proprietary, is unencumbered by patents, has an open-source reference implementation, has a well documented format and API, and has several other independent implementations.
flac --best --keep-foreign-metadata input.wav -f # encode at maximum compression, keeping non-audio RIFF chunks; -f overwrites existing output
metaflac --list file.flac # list all metadata of a FLAC file
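metaflac can also edit the Vorbis-comment tags FLAC uses; a guarded sketch with hypothetical tag and file names:

```shell
# Read and write FLAC tags with metaflac; a no-op without the tools or file.
if command -v metaflac >/dev/null 2>&1 && [ -e file.flac ]; then
  metaflac --show-tag=TITLE file.flac           # print the TITLE tag, if set
  metaflac --set-tag='TITLE=My Song' file.flac  # add a TITLE tag
  flac -t file.flac                             # -t: verify the file decodes cleanly
fi
```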
aptX
- https://en.wikipedia.org/wiki/AptX - a family of proprietary audio codec compression algorithms currently owned by Qualcomm. The original aptX algorithm was developed in the 1980s by Dr. Stephen Smyth as part of his Ph.D. research at Queen's University Belfast School of Electronics, Electrical Engineering and Computer Science; its design is based on time domain ADPCM principles without psychoacoustic auditory masking techniques.
Codec 2
- Codec 2 - Codec 2 is an open source speech codec designed for communications quality speech between 700 and 3200 bit/s. The main application is low bandwidth HF/VHF digital radio. It fills a gap in open source voice codecs beneath 5000 bit/s and is released under the GNU Lesser General Public License (LGPL). The Codec 2 project also contains several modems (FDMDV, COHPSK and mFSK) carefully designed for digital voice over HF radio; GNU Octave simulation code to support the codec and modem development; and FreeDV – an open source digital voice protocol that integrates the modems, codecs, and FEC. FreeDV is available as a GUI application, an open source library (FreeDV API), and in hardware (the SM1000 FreeDV adaptor).
Lyra
STEM
- NI: Stems - a completely new way to DJ. Stems is a new format for music that redefines creative live performance. Create spontaneous edits, a cappellas, instrumentals, and more with your tracks’ musical elements available independently.
- https://github.com/faroit/stempeg - Python tool to read and write STEM files
WavPack
- WavPack - a completely open audio compression format providing lossless, high-quality lossy, and a unique hybrid compression mode. For version 5.0.0, several new file formats and lossless DSD audio compression were added, making WavPack a universal audio archiving solution.
QOA
- https://github.com/phoboslab/qoa - QOA, the “Quite OK Audio Format” for fast, lossy audio compression. Single-file MIT licensed library for C/C++. See qoa.h for the documentation and format specification.
More info at https://phoboslab.org/log/2023/02/qoa-time-domain-audio-compression; audio samples in WAV & QOA format can be found at https://phoboslab.org/files/qoa-samples/
Older
- https://github.com/temisu/ancient_format_decompressor - Decompression routines for ancient formats
DSD / SACD
- https://en.wikipedia.org/wiki/Direct_Stream_Digital - DSD is the name of a trademark used by Sony and Philips for their system of digitally recreating audible signals for the Super Audio CD (SACD). DSD uses pulse-density modulation encoding—a technology to store audio signals on digital storage media that are used for the SACD. The signal is stored as delta-sigma modulated digital audio, a sequence of single-bit values at a sampling rate of 2.8224 MHz (64 times the CD audio sampling rate of 44.1 kHz, but only at 1/32768 of its 16-bit resolution). Noise shaping occurs by use of the 64-times oversampled signal to reduce noise and distortion caused by the inaccuracy of quantization of the audio signal to a single bit. Therefore, it is a topic of discussion whether it is possible to eliminate distortion in one-bit delta-sigma conversion.
- YouTube: DSD Explained part 1 - At the 1996 AES Convention in Copenhagen the former CBS Research Lab introduced Direct Stream Digital, DSD for short, as an archive format that offered compact files while containing sufficient information for conversion to sample rates as high as 352.8 kHz, unheard of in 1996. From there it went to SACD and now to DSD downloads. What is DSD all about? Hans Beekhuyzen explains.
- https://en.wikipedia.org/wiki/Super_Audio_CD - DSD on disc
- Super Audio CD decoder - a command-line application which takes a Super Audio CD source and extracts a 24-bit high resolution wave file. It handles both DST and DSD streams. The application reads the following input: SACD image files (*.iso), Sony DSF files (*.dsf), Philips DSDIFF files (*.dff). Supported output sample rates: 88.2 kHz, 96 kHz, 176.4 kHz, 192 kHz.
- What is DoP (DSD over PCM)? - It involves taking groups of 16 adjacent 1-bit samples from a DSD stream and packing them into the lower 16 bits of a 24/176.4 data stream. Data from the other channel of the stereo pair is packed the same way. A specific marker code in the top 8 bits identifies the data stream as DoP, rather than PCM. The resulting DoP stream can be transmitted through existing 24/192-capable USB, AES, Dual AES or SPDIF interfaces to a DoP-compatible DAC, which reassembles the original stereo DSD data stream completely unchanged. If something goes wrong and the data stream is decoded as PCM, the output will be low-level noise with faint music in the background, so it fails safely. This can happen if the computer erases the marker code by applying a volume adjustment.
- DoP isn't PCM - Paul McGowan, PS Audio
- New HDMI audio output format: native DSD (one bit audio) passthrough for Android set-top boxes (feature request) - Google Issue Tracker
- https://en.wikipedia.org/wiki/Digital_eXtreme_Definition - or DXD, a digital audio format that originally was developed for editing high-resolution recordings recorded in DSD, the audio standard used on Super Audio CD (SACD). As the 1-bit DSD format used on SACD is not suitable for editing, alternative formats such as DXD or DSD-Wide must be used during the mastering stage. In contrast with DSD-Wide or DSD Pure which offers level, EQ, and crossfade edits at the DSD sample rate (64fs, 2.822 MHz), DXD is a PCM signal with 24-bit resolution (8 bits more than the 16 bits used for Red Book CD) sampled at 352.8 kHz – eight times 44.1 kHz, the sampling frequency of Red Book CD. The data rate is 8.4672 Mbit/s per channel – three times that of DSD64.
- https://github.com/SqueezeOnArch/dsdplay - DSD to Flac / PCM/DoP conversion and resampling
- https://github.com/DocMarty84/sacd - Converts SACD image files, Philips DSDIFF and Sony DSF files to 24-bit high resolution wave files. Handles both DST and DSD streams. THIS IS ONLY A CLONE OF THE OFFICIAL REPO!!!
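The rates quoted in this section are easy to sanity-check with POSIX shell arithmetic:

```shell
echo $((44100 * 64))    # DSD64 sample rate: 64 x 44.1 kHz = 2822400 Hz
echo $((44100 * 8))     # DXD sample rate: 8 x 44.1 kHz = 352800 Hz
echo $((352800 * 24))   # DXD per-channel data rate: 8467200 bit/s = 8.4672 Mbit/s
```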
MQA
AMR
Dolby Digital / AC-3
- https://en.wikipedia.org/wiki/Dolby_Digital - the name for audio compression technologies developed by Dolby Laboratories. Originally named Dolby Stereo Digital until 1994; except for Dolby TrueHD, the audio compression is lossy. The first use of Dolby Digital was to provide digital sound in cinemas from 35mm film prints; today it is also used for other applications such as TV broadcast, radio broadcast via satellite, DVDs, Blu-ray discs and game consoles. The format goes by different names: Dolby Digital; DD (often combined with channel count, e.g. DD 2.0, DD 5.1); and AC-3 (Audio Codec 3, Advanced Codec 3, Acoustic Coder 3; these are backronyms, and Adaptive Transform Acoustic Coding 3 is a separate format developed by Sony). ATSC A/52 is the name of the standard.
- https://en.wikipedia.org/wiki/Dolby_Digital_Plus - also known as Enhanced AC-3 (and commonly abbreviated as DD+ or E-AC-3, or EC-3) is a digital audio compression scheme developed by Dolby Labs for transport and storage of multi-channel digital audio. It is a successor to Dolby Digital (AC-3), also developed by Dolby, and has a number of improvements including support for a wider range of data rates (32 Kbit/s to 6144 Kbit/s), increased channel count and multi-program support (via substreams), and additional tools (algorithms) for representing compressed data and counteracting artifacts. While Dolby Digital (AC-3) supports up to 5 full-bandwidth audio channels at a maximum bitrate of 640 Kbit/s, E-AC-3 supports up to 15 full-bandwidth audio channels at a maximum bitrate of 6.144 Mbit/s. The full set of technical specifications for E-AC-3 (and AC-3) are standardized and published in Annex E of ATSC A/52:2012, as well as Annex E of ETSI TS 102 366 V1.2.1 (2008–08), published by the Advanced Television Systems Committee.
Dolby AC-4
- https://en.wikipedia.org/wiki/Dolby_AC-4 - Dolby AC-4 is an audio compression standard supporting multiple audio channels and/or audio objects. Support for 5.1 channel audio is mandatory and additional channels up to 7.1.4 are optional. AC-4 provides a 50% reduction in bit rate over AC-3/Dolby Digital Plus.
Dolby TrueHD
- https://en.wikipedia.org/wiki/Dolby_TrueHD - a lossless multi-channel audio codec developed by Dolby Laboratories which is used in home-entertainment equipment such as Blu-ray Disc players and A/V receivers. It is one of the successors to the Dolby Digital (AC-3) surround sound codec, which is used as the audio standard for the DVD-Video format. In this application, Dolby TrueHD competes with DTS-HD Master Audio, a lossless codec from DTS.
Dolby TrueHD uses Meridian Lossless Packing (MLP) as its mathematical basis for compressing audio samples. MLP is also used in the DVD-Audio format, but details of Dolby TrueHD and the MLP Lossless format as used on DVD-Audio differ substantially. A Dolby TrueHD bitstream can carry up to 16 discrete audio channels. Sample depths up to 24 bits/sample and audio sample rates up to 192 kHz are supported. Like the more common legacy codec Dolby Digital, Dolby TrueHD bitstreams carry program metadata. Metadata is separate from the coding format and compressed audio samples, but stores relevant information about the audio waveform and provides control over the decoding process. For example, dialog normalization and dynamic range compression are controlled by metadata embedded in the Dolby TrueHD bitstream. Similarly, a Dolby Atmos encoded Dolby TrueHD stream contains metadata to extract and place the objects in relevant positions. Dolby TrueHD is a variable bit-rate codec.
Dirac
- https://github.com/kode54/dh - Dirac to Headphones, with convolution code
ATRAC / Minidisc
- https://en.wikipedia.org/wiki/Adaptive_Transform_Acoustic_Coding - a family of proprietary audio compression algorithms developed by Sony. MiniDisc was the first commercial product to incorporate ATRAC, in 1992. ATRAC allowed a relatively small disc like MiniDisc to have the same running time as a CD while storing audio information with minimal loss in perceptible quality. Improvements to the codec in the form of ATRAC3, ATRAC3plus, and ATRAC Advanced Lossless followed in 1999, 2002, and 2006 respectively. Other MiniDisc manufacturers such as Sharp and Panasonic also implemented their own versions of the ATRAC codec. Sony has all but dropped the ATRAC-related codecs in the USA and Europe, shutting down its SonicStage-powered 'Connect' Music Service (Sony's equivalent of iTunes) on 31 March 2008. However, ATRAC continues to be used in Japan and various other countries.
NICAM
- https://en.wikipedia.org/wiki/NICAM - an early form of lossy compression for digital audio. It was originally developed in the early 1970s for point-to-point links within broadcasting networks. In the 1980s, broadcasters began to use NICAM compression for transmissions of stereo TV sound to the public.
Vinyl
- https://en.wikipedia.org/wiki/Phonograph - a device, invented in 1877, for the mechanical recording and reproduction of sound. In its later forms, it is also called a gramophone (as a trademark since 1887, as a generic name in the UK since 1910), or, since the 1940s, a record player. The sound vibration waveforms are recorded as corresponding physical deviations of a spiral groove engraved, etched, incised, or impressed into the surface of a rotating cylinder or disc, called a "record". To recreate the sound, the surface is similarly rotated while a playback stylus traces the groove and is therefore vibrated by it, very faintly reproducing the recorded sound. In early acoustic phonographs, the stylus vibrated a diaphragm which produced sound waves which were coupled to the open air through a flaring horn, or directly to the listener's ears through stethoscope-type earphones.
- https://en.wikipedia.org/wiki/Phonograph_cylinder - the earliest commercial medium for recording and reproducing sound. Commonly known simply as "records" in their era of greatest popularity (c. 1896–1915), these hollow cylindrical objects have an audio recording engraved on the outside surface, which can be reproduced when they are played on a mechanical cylinder phonograph. In the 1910s, the competing disc record system triumphed in the marketplace to become the dominant commercial audio medium.
- https://en.wikipedia.org/wiki/Phonograph_record - also known as a gramophone record (especially in British English), or simply a record: an analog sound storage medium in the form of a flat disc with an inscribed, modulated spiral groove. The groove usually starts near the periphery and ends near the center of the disc. At first, the discs were commonly made from shellac; starting in the 1950s polyvinyl chloride became common. In recent decades, records have sometimes been called vinyl records, or simply vinyl, although this would exclude most records made until after World War II.
- Sound Experiments at the Volta Laboratory - Hear My Voice | Albert H. Small Documents Gallery | Smithsonian's National Museum of American History - [37]
- https://en.wikipedia.org/wiki/Magnetic_cartridge - more commonly called a phonograph cartridge or phono cartridge or (colloquially) a pickup, is an electromechanical transducer used in the playback of analog sound recordings called records on a record player, now commonly called a turntable because of its most prominent component but formally known as a phonograph in the US and a gramophone in the UK. The cartridge contains a removable or permanently mounted stylus, the tip - usually a gemstone like diamond or sapphire - of which makes physical contact with the record's groove. In popular usage and in disc jockey jargon, the stylus, and sometimes the entire cartridge, is often called the needle. As the stylus tracks the serrated groove, it vibrates a cantilever on which is mounted a permanent magnet which moves between the magnetic fields of sets of electromagnetic coils in the cartridge (or vice versa: the coils are mounted on the cantilever, and the magnets are in the cartridge). The shifting magnetic fields generate an electrical current in the coils. The electrical signal generated by the cartridge can be amplified and then converted into sound by a loudspeaker.
- YouTube: The world's cheapest phono cartridge - ubiquitous black red angular clone of Chuo Denshi 33 1/3 & 45 Cartridge+Stylus cartridge
- YouTube: What Phono Cartridge Should I Get?
- YouTube: DJ tips: needles
- https://github.com/mitsuhito/CuttingRecordGenerator - makes a "record" with a laser cutter. Uses Processing.
- https://github.com/kallaballa/sndcut - a program that generates LP records from audio files - it generates an SVG file that you can laser cut.
Playlist formats
playlist='play.m3u' ; if [ -f "$playlist" ]; then rm "$playlist" ; fi ; for f in *.mp3; do echo "$(pwd)/$f" >> "$playlist"; done # create m3u playlist with absolute file paths
ADM
- EBU ADM Guidelines - standardised metadata model for describing the technical properties of audio. ADM metadata can be attached to audio files to ensure the audio is correctly handled. These pages contain information to help you understand the ADM and provide examples and a reference to use it.
- https://github.com/ebu/ebu_adm_renderer - The EBU ADM Renderer, written in Python, is the reference implementation of EBU Tech 3388
- https://github.com/ebu/libadm - The libadm library is a modern C++11 library to parse, modify, create and write ITU-R BS.2076 conformant XML. It works well with the header-only library libbw64 to write ADM related applications with minimal dependencies.
- Audio Definition Model Software - BBC R&D - The Audio Definition Model is an ITU specification of metadata that can be used to describe object-based audio, scene-based audio and channel-based audio. It can be included in BWF WAVE files or used as a streaming format in production environments. The BBC Audio Toolbox is a suite of C++ libraries for manipulating object-based audio and ADM based BWF files.
- https://github.com/immersive-audio-live/ADM-OSC - An OSC dictionary that implements the Audio Definition Model (ADM)
SoundStream
- https://github.com/wesbz/SoundStream - an implementation of this article: https://arxiv.org/pdf/2107.03312.pdf
Playback
See also Playback, Sampling, Distros#Media
jplay2
- https://gareus.org/oss/jplay2/start - jplay2 is a command-line audio player gluing together JACK, libsamplerate and liblo (OSC control). It plays a single file (no playlist), but with ffmpeg & libsndfile it plays every file one throws at it (even DVD VOBs or timemachine W64 files). Once started, jplay2 can only be controlled via OSC or jack-transport.
Random Parallel Player
- https://github.com/diovudau/random-parallel-player - Takes a bunch of audio files as tracks and plays them back randomly creating new music each playthrough. The core rule of RPP: No human interaction once the playback has started. RPP is based on an idea of Louigi Verona. The included audio samples in example.rpp were created by him. You can read about the original project here: https://louigiverona.com/?page=projects&s=writings&t=linux&a=linux_randomhouse
loopnaut
- https://github.com/soenkehahn/loopnaut - tool to play audio files in a loop (via JACK)
mfl-gramophone
- https://github.com/eeeeeta/mfl-gramophone - A simple Rust application to play audio, using JACK, when it receives commands via OSC.
SQA: the Stuttery QLab Alternative
- https://github.com/eeeeeta/sqa - This project aims to create an audio player & cue system for live shows and staged productions, à la Figure53's QLab. All its code is written in the Rust programming language, a new language that prevents memory unsafety and improves programming ergonomics. This one large repo contains many different crates that all help accomplish that aim.
PEAR
- https://github.com/esologic/pear - a tool for sound installations. Take a directory with .wav files named in numeric order and play them over usb sound devices attached to the host computer over and over forever, looping all files once the longest one finishes.
Recording
fmedia
- fmedia - a fast asynchronous media player/recorder/converter for Windows, Linux and FreeBSD. It provides smooth playback and recording even if devices are very slow. It's highly customizable and can be easily extended with additional plugins. Its low CPU & memory consumption saves energy when running on a notebook's battery. Play or convert audio files, record new audio tracks from microphone, save songs from Internet radio, and much more! fmedia is free and open-source project, and you can use it as a standalone application or as a library for your own software. fmedia can decode: .mp3, .ogg (Vorbis, Opus), .opus, .m4a/.mp4 (AAC, ALAC, MPEG), .mka/.mkv (AAC, ALAC, MPEG, Vorbis), .avi (AAC, MPEG), .aac, .mpc, .flac, .ape, .wv, .wav. fmedia can encode into: .mp3, .ogg, .opus, .m4a (AAC), .flac, .wav.
arecord
arecord -D hw:0 -f cd test.wav # record from ALSA device hw:0 in CD quality (16-bit little-endian, 44100 Hz, stereo)
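Before settling on a -D value it helps to enumerate the capture devices; arecord -l is standard alsa-utils, guarded here so the snippet is harmless on machines without ALSA.

```shell
# List ALSA capture devices (card/device numbers feed the -D hw:CARD,DEV syntax).
command -v arecord >/dev/null 2>&1 && arecord -l 2>/dev/null || true
# Then record a fixed-length take from the chosen card, e.g.:
#   arecord -D hw:0 -f cd -d 10 take.wav   # -d N: stop after N seconds
```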
audio-recorder
Ecasound
- Ecasound is a software package designed for multitrack audio processing. It can be used for simple tasks like audio playback, recording and format conversions, as well as for multitrack effect processing, mixing, recording and signal recycling. Ecasound supports a wide range of audio inputs, outputs and effect algorithms. Effects and audio objects can be combined in various ways, and their parameters can be controlled by operator objects like oscillators and MIDI-CCs. A versatile console mode user-interface is included in the package.
ecasound -a:1,2 -i jack -o jack -a:1 -elv2:http://calf.sourceforge.net/plugins/Compressor,0,1,0,0,0,0,0.015625,20,0.01,2000,1,1,0,0,1,1 -a:2 -elv2:http://calf.sourceforge.net/plugins/Limiter,0,1,0.587231,0,0,0,0,0,0,0,0,0.0625,0.1,1000,1,1,0,0.5,4
- Nama - manages multitrack recording, mixing and mastering using the Ecasound audio processing engine developed by Kai Vehmanen.
- Ecasound Mastering Interface - a Python front end to ecasound. It looks a lot like Rackmount effect and can be used to create an Ecasound Chain Setup while playing with parameters in real time. It supports mixing, recording, filtering, and processing and can export to ECS files. It supports all ecasound options, chain operators, and controllers.
- Visecas - a graphical user interface for Ecasound (http://eca.cx/ecasound), a software package written by Kai Vehmanen (k@eca.cx) which is designed for multitrack audio processing. It starts Ecasound as a child process and communicates via a pipe using Ecasound's InterActive Mode (IAM) commands.
- ecaplugin.py - a tool to generate the unwieldy ecasound command lines for LADSPA and LV2 plugins from Ardour sessions or JACK Rack configurations.
Meterec
- meterec works as a basic multitrack tape recorder. The aim of this software is to minimise the user's interaction with the computer and let them focus on their instrumental performance. For this reason meterec's features are minimal. One of the main "limitations" is that meterec can only restart from time 0:00:00.00: if you screw up a take, start it over again! Rather than learning how to use specific software to correct what you screwed up, meterec forces you to learn and master your instrument. The good news is that previous takes are kept in the take history, and if, in the end, the first one was the best you played, you can choose it in your final mix.
jack_capture
- jack_capture is a program for recording soundfiles with jack. The default operation will record what you hear in your loudspeakers into a stereo wav file.
- https://github.com/danielappelt/caPiture - uses jack_capture to headlessly multitrack record the input of the Behringer XR18 Mixer. For now, it is hardcoded to work with the XR18.
jrec2
- jrec2 - a simple patch to the jack_capture example client that implements silence detection and splitting of output files, and can call hooks (invoke third-party software) upon detecting silence or audio. It includes an optional random-playback control script that was used in an installation to record voice and, on detecting silence, play back random snippets of previously recorded material.
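The silence-detect-and-split idea can be sketched in a few lines. The windowed RMS threshold below is a hypothetical simplification for illustration, not jrec2's actual algorithm:

```python
def rms(samples):
    """Root-mean-square level of a list of float samples."""
    if not samples:
        return 0.0
    return (sum(s * s for s in samples) / len(samples)) ** 0.5

def split_on_silence(samples, threshold, window):
    """Cut the stream into segments wherever a whole analysis
    window falls below the RMS threshold (simplified sketch)."""
    segments, current = [], []
    for i in range(0, len(samples), window):
        win = samples[i:i + window]
        if rms(win) < threshold:
            if current:          # close the running segment
                segments.append(current)
                current = []
        else:
            current.extend(win)  # keep accumulating audio
    if current:
        segments.append(current)
    return segments
```

A real implementation would additionally require the silence to persist for some minimum duration before splitting, to avoid cutting on brief pauses.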
jack-record
- jack-record is a light-weight JACK capture client to write an arbitrary number of channels to disk.
jack_playrec
- https://github.com/HoerTech-gGmbH/jack_playrec - provides an interface for synchronous recording/playback via the JACK Audio Connection Kit.
screcord.lv2
jamRecord.lv2
- https://github.com/tdufret/jamRecord.lv2 - a jam session recorder that only keeps the last x minutes of a jam on file
Freeze
- https://github.com/nickolas360/freeze - an LV2 plugin for freezing tracks in a digital audio workstation—that is, temporarily rendering a track as audio to reduce CPU/DSP load, as tracks with large chains of CPU-heavy effects can make buffer underruns (xruns) quite common. Some DAWs like Ardour support track freezing to a certain extent, but Ardour, for example, cannot freeze MIDI tracks.
QJackRcd
- qjackrcd - a simple stereo recorder for JACK with a few features such as silence processing for automatic pause, file splitting, and background file post-processing. It can be used with QJackCtl.
audio coffin
- https://github.com/UoC-Radio/audio-coffin - A simple audio recorder/logger on top of Jack, libsndfile and libsoxr
timemachine
- JACK Timemachine - I used to always keep a minidisc recorder in my studio running in a mode where, when you pressed record, it wrote the last 10 seconds of audio to the disk and then caught up to realtime and kept recording. The recorder died and I haven't been able to replace it, so this is a simple JACK app to do the same job. It has the advantage that it never clips and can be wired to any part of the JACK graph.
- http://www.64studio.com/manual/audio/timemachine - A JACK application that can retrospectively record audio.
- https://github.com/swh/timemachine
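The retrospective-record trick is just a pre-roll ring buffer that is always being written. A minimal sketch (the class and method names are made up for illustration):

```python
from collections import deque

class RetroRecorder:
    """Keep the last `keep` samples in a ring buffer; when triggered,
    the recording starts with that pre-roll already included
    (a sketch of the timemachine idea, not its actual code)."""

    def __init__(self, keep):
        self.preroll = deque(maxlen=keep)  # old samples fall off the front
        self.recording = None

    def process(self, block):
        """Called for every incoming audio block."""
        if self.recording is not None:
            self.recording.extend(block)
        else:
            self.preroll.extend(block)

    def trigger(self):
        """Start recording, beginning with the buffered pre-roll."""
        self.recording = list(self.preroll)
```

With a 10-second buffer at 48 kHz, `keep` would be 480000 samples per channel.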
Rotter
- Rotter is a Recording of Transmission / Audio Logger for JACK. It was designed for use by radio stations, which are legally required to keep a recording of all their output. Rotter runs continuously, writing to a new file every hour. Rotter can output files in several different structures, either placing all files in a single directory or creating a directory hierarchy. The advantage of using a folder hierarchy is that you can store related files in the hour's directory.
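Hour-based archive naming of this kind is essentially a strftime expansion. The directory layout below is an assumed example, not Rotter's exact scheme:

```python
from datetime import datetime

def hourly_path(root, layout, when):
    """Expand an hour-based archive path from a strftime layout
    (hypothetical helper; Rotter's own naming options differ)."""
    return when.strftime(f"{root}/{layout}")

# One file per hour, nested year/month/day/hour directories:
p = hourly_path("/archive", "%Y/%m/%d/%H/recording.flac",
                datetime(2024, 3, 1, 13, 5))
```

A logger would call this once per hour boundary and open a fresh file at the resulting path.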
listener
- https://github.com/folkertvanheusden/listener - will listen for sound and as soon as a certain loudness level is reached, it will start recording.
Transcoding
- https://en.wikipedia.org/wiki/Transcoding - the direct digital-to-digital conversion of one encoding to another,[1] such as for movie data files (e.g., PAL, SECAM, NTSC), audio files (e.g., MP3, WAV), or character encoding (e.g., UTF-8, ISO/IEC 8859). This is usually done in cases where a target device (or workflow) does not support the format or has limited storage capacity that mandates a reduced file size,[2] or to convert incompatible or obsolete data to a better-supported or modern format.
- SRC Comparisons - converting sample frequency
SoundConverter
- SoundConverter - the leading audio file converter for the GNOME Desktop. It reads anything GStreamer can read (Ogg Vorbis, AAC, MP3, FLAC, WAV, AVI, MPEG, MOV, M4A, AC3, DTS, ALAC, MPC, Shorten, APE, SID, MOD, XM, S3M, etc...), and writes to Opus, Ogg Vorbis, FLAC, WAV, AAC, and MP3 files, or use any GNOME Audio Profile.
Perl Audio Converter
- Perl Audio Converter - A tool for converting multiple audio types from one format to another. It supports the following audio formats: 3G2, 3GP, 8SVX, AAC, AC3, ADTS, AIFF, AL, AMB, AMR, APE, AU, AVR, BONK, CAF, CDR, CVU, DAT, DTS, DVMS, F32, F64, FAP, FLA, FLAC, FSSD, GSRT, HCOM, IMA, IRCAM, LA, MAT, MAUD, MAT4, MAT5, M4A, MP2, MP3, MP4, MPC, MPP, NIST, OFF, OFR, OFS, OPUS, OGA, OGG, PAF, PRC, PVF, RA, RAW, RF64, SD2, SF, SHN, SMP, SND, SOU, SPX, SRN, TAK, TTA, TXW, VOC, VMS, VQF, W64, WAV, WMA, and WV.
Secret Rabbit Code
- Secret Rabbit Code - aka libsamplerate, is a Sample Rate Converter for audio. One example of where such a thing would be useful is converting audio from the CD sample rate of 44.1kHz to the 48kHz sample rate used by DAT players. SRC is capable of arbitrary and time varying conversions; from downsampling by a factor of 256 to upsampling by the same factor. Arbitrary in this case means that the ratio of input and output sample rates can be an irrational number. The conversion ratio can also vary with time for speeding up and slowing down effects.
SRC provides a small set of converters to allow quality to be traded off against computation cost. The current best converter provides a signal-to-noise ratio of 145dB with -3dB passband extending from DC to 96% of the theoretical best bandwidth for a given pair of input and output sample rates. Since the library has few dependencies beyond that provided by the standard C library, it should compile and work on just about any operating system. It is known to work on Linux, MacOSX, Win32 and Solaris. With some relatively minor hacking it should also be relatively easy to port it to embedded systems and digital signal processors.
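The idea of arbitrary-ratio conversion can be illustrated with a naive linear-interpolation resampler. This is only a sketch of the concept; libsamplerate's converters use band-limited (sinc) interpolation for vastly better quality:

```python
def resample_linear(samples, ratio):
    """Resample by an arbitrary ratio (out_rate / in_rate) using
    linear interpolation between neighbouring input samples.
    Toy illustration only; not libsamplerate's algorithm."""
    n_out = int(len(samples) * ratio)
    out = []
    for i in range(n_out):
        pos = i / ratio              # fractional read position
        j = int(pos)
        frac = pos - j
        a = samples[j]
        b = samples[min(j + 1, len(samples) - 1)]
        out.append(a + (b - a) * frac)
    return out
```

For 44.1 kHz to 48 kHz the ratio would be 48000/44100, an example of a ratio that is not a simple integer factor.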
fre:ac
- fre:ac - a free audio converter and CD ripper with support for various popular formats and encoders. It currently converts between MP3, MP4/M4A, WMA, Ogg Vorbis, FLAC, AAC, WAV and Bonk formats. With fre:ac you can easily rip your audio CDs to MP3 or WMA files for use with your hardware player, or convert files that do not play with other audio software. You can even convert whole music libraries while retaining the folder and filename structure. The integrated CD ripper supports the CDDB/freedb online CD database. It will automatically query song information and write it to ID3v2 or other title information tags.
- https://github.com/enzo1982/freac - a free and open source audio converter. It supports audio CD ripping and tag editing and converts between various audio file formats.
audiomap
- audiomap - a program which converts from any audio format to any other in a uniform & sane fashion so you don't have to learn all the options for conversion program x. It will preserve all tags in the process. Its goal is bullcrap-free conversion. I wrote it because, while there are plenty of shell scripts out there to convert things to/from a few formats, they suck at handling weird characters and often even spaces! On top of this, they usually do not properly preserve metadata. Audiomap works with funky chars and spaces, and preserves metadata - all whilst providing encoding/decoding through many formats.
resample
transfercoder
- https://github.com/DarwinAwardWinner/transfercoder - Transfer and transcode your music at the same time
AudioMove
- AudioMove - a simple, easy to use GUI-based batch audio file copy-and-conversion program.
flacsync
- https://github.com/cmcginty/flacsync - Recursively mirror a directory tree of FLAC audio files to AAC or OGG.
lackey
- https://github.com/cassava/lackey - Automatically create and manage a lower-quality mirror of your music library
caudec
- caudec - a command-line utility for GNU/Linux and OS X that transcodes (converts) audio files from one format (codec) to another. It leverages multi-core CPUs with lots of RAM by using a ramdisk, and running multiple processes concurrently (one per file and per codec). It is Free Software, licensed under the GNU General Public License.
ffcvt
- ffcvt - ffmpeg convert wrapper tool
garrett
- https://github.com/hannesbraun/garrett - converts the given audio files into the "Waveform Audio File Format" (WAVE) with a static bit depth of 16 bits. You can choose between two output sample rates: 44100 Hz and 48000 Hz. The supported input formats are: WAVE, MP3, FLAC and AIFF.
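Producing fixed 16-bit WAVE output of this kind can be sketched with Python's stdlib wave module. This is a simplified mono example of the file format, not garrett's code:

```python
import io
import math
import struct
import wave

def write_wav16(fileobj, samples, rate=44100):
    """Write float samples in [-1, 1] as a mono 16-bit WAV
    (illustrative helper, not part of garrett)."""
    with wave.open(fileobj, "wb") as w:
        w.setnchannels(1)
        w.setsampwidth(2)      # 2 bytes = 16 bits per sample
        w.setframerate(rate)
        frames = b"".join(
            struct.pack("<h", int(max(-1.0, min(1.0, s)) * 32767))
            for s in samples)
        w.writeframes(frames)

# 100 samples of a 440 Hz sine, written to an in-memory file:
buf = io.BytesIO()
write_wav16(buf, [math.sin(2 * math.pi * 440 * t / 44100) for t in range(100)])
```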
Analysis
Sonic Visualiser
- Sonic Visualiser - an application for viewing and analysing the contents of music audio files. The aim of Sonic Visualiser is to be the first program you reach for when you want to study a musical recording rather than simply listen to it. We hope Sonic Visualiser will be of particular interest to musicologists, archivists, signal-processing researchers and anyone else looking for a friendly way to take a look at what lies inside the audio file. Sonic Visualiser is Free Software, distributed under the GNU General Public License (v2 or later) and available for Linux, OS/X, and Windows. It was developed at the Centre for Digital Music at Queen Mary, University of London.
Don't forget to install at least the QM vamp plugins.
- YouTube: 2 7 Sonic Visualiser Tutorial 2011
Baudline
- Baudline is a time-frequency browser designed for scientific visualization of the spectral domain. Signal analysis is performed by Fourier, correlation, and raster transforms that create colorful spectrograms with vibrant detail. Conduct test and measurement experiments with the built in function generator, or play back audio files with a multitude of effects and filters. The baudline signal analyzer combines fast digital signal processing, versatile high speed displays, and continuous capture tools for hunting down and studying elusive signal characteristics.
Friture
- Friture - a real-time audio analyzer. It works on Windows, Mac OS X and Linux. It is free and open source.
SoundRuler
- SoundRuler is a tool for measuring and graphing sound and for teaching acoustics. Its visual interactive approach to analysis brings you the best of two worlds: the control of manual analysis and the objectivity and speed of automated analysis.
Binary download needs 32-bit libxp to be installed.
BRP-PACU
- BRP-PACU - A cross platform dual channel FFT based Acoustic Analysis Tool to help engineers analyze live professional sound systems using the transfer function. One feature is the ability to capture four sample plots, average them, and invert to aid in final EQ.
DSP
Open Sound Meter
- Open Sound Meter - real time crossplatform dual-FFT measurement analysis tool for sound system tuning.
japa
- Japa (JACK and ALSA Perceptual Analyser) - a 'perceptual' or 'psychoacoustic' audio spectrum analyser.
Spek
- Spek - helps to analyse your audio files by showing their spectrogram. Spek is free software available for Unix, Windows and Mac OS X.
Visual only.
GTKWave
- https://github.com/gtkwave/gtkwave - a fully featured GTK+ based wave viewer for Unix and Win32 which reads LXT, LXT2, VZT, FST, and GHW files as well as standard Verilog VCD/EVCD files and allows their viewing.
zrtstr
- zrtstr is a small command line application for detecting faux-stereo WAV-files, that is, files with two identical channels that should have been saved as mono. Such files are sometimes generated by some audio-editing software and DAWs (I’m looking at you, old Cubase 5). Having gotten tired of receiving such files from clients for mixing, as they use twice the necessary space and require twice the processing power, I decided to deal with this nuisance once and for all. zrtstr is a cross-platform application which runs very fast, thanks to being written in Rust.
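The check zrtstr performs amounts to comparing the two channels sample by sample. A simplified sketch (the real tool also handles near-identical channels and WAV parsing):

```python
def is_faux_stereo(left, right, tolerance=0):
    """True if the two channels are (near-)identical, i.e. the file
    could have been saved as mono (simplified illustration of the
    condition zrtstr detects)."""
    if len(left) != len(right):
        return False
    return all(abs(l - r) <= tolerance for l, r in zip(left, right))
```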
DFasma
- DFasma is a free open-source software tool used to compare audio files in time and frequency. The comparison is first visual, using waveforms and spectra. It is also possible to listen to time-frequency segments in order to allow perceptual comparison. It is basically dedicated to analysis. Even though there are basic functionalities to align the signals in time and amplitude, this software does not aim to be an audio editor.
Pydiogment
- https://github.com/SuperKogito/pydiogment - aims to simplify audio augmentation. It generates multiple audio files based on a starting mono audio file. The library can generate files with higher or lower speed, different tones, etc.
ASAnnotation
- AS Annotation is an application for the analysis and automated or manual annotation of sound files. It features state of the art sound analysis algorithms, specialized sound inspection tools and can import Standard MIDI files. ASAnnotation is based on AudioSculpt, a sound analysis and transformation software developed at IRCAM since 1996. In addition to the analysis and annotation features present in AS Annotation, AudioSculpt comes with state of the art sound processing, mostly based on an enhanced version of the phase vocoder. To store and exchange analysis and annotation data, ASAnnotation can use two formats: MIDI for notes and text, and SDIF for all analyses. The MIDI support facilitates the verification, alignment and correction of Standard MIDI Files to soundfiles. SDIF is a specialized format for sound description data, which combines very high precision with efficiency and interchangeability. Numerous other programs support SDIF, such as Max/MSP, OpenMusic, CLAM and SPEAR. A collection of utility programs can be used to convert SDIF files to text.
harmony-analyser
- harmony-analyser is a set of visual tools for music harmony analysis of WAV/MIDI input, powered by JHarmonyAnalyser library
Toscanalyzer
- Toscanalyzer is a powerful audio analysis tool for mixing and mastering. Toscanalyzer helps you to mix and master better. It is not only an analysis tool but a complete guide to understanding why your song sounds as it sounds. Toscanalyzer lets you compare your project audibly and visually to any reference songs in a very convenient way. Toscanalyzer offers a clear project view including many options for analysis. The analysis gives you a detailed report about possible problems and, in addition, clear guidance on how to fix them.
Java. Doesn't work for me.
SPAN
- http://www.voxengo.com/product/span/ - windows/mac vst
Raven Lite
- http://www.birds.cornell.edu/brp/RavenLite/RavenLiteReadMe.htm - an interactive sound visualization tool for novice through advanced users who want to visualize sound in exciting ways. Windows and Mac. The Lite version is freeware.
pyAudioAnalysis
- https://github.com/tyiannak/pyAudioAnalysis - a Python library covering a wide range of audio analysis tasks, including: feature extraction, classification, segmentation and visualization.
librosa
- librosa - a python package for music and audio analysis. It provides the building blocks necessary to create music information retrieval systems.
QLoud
- QLoud - tool to measure a loudspeaker frequency response and distortions
- https://github.com/molke-productions/qloud - an attempt of porting QLoud to QT5
freqresp
- https://github.com/flok99/freqresp - Calculates a frequency response diagram
Room EQ Wizard
- REW - free room acoustics analysis software for measuring and analysing room and loudspeaker responses. The audio analysis features of REW help you optimise the acoustics of your listening room, studio or home theater and find the best locations for your speakers, subwoofers and listening position. It includes tools for generating audio test signals; measuring SPL and impedance; measuring frequency and impulse responses; measuring distortion; generating phase, group delay and spectral decay plots, waterfalls, spectrograms and energy-time curves; generating real time analyser (RTA) plots; calculating reverberation times; calculating Thiele-Small parameters; determining the frequencies and decay times of modal resonances; displaying equaliser responses and automatically adjusting the settings of parametric equalisers to counter the effects of room modes and adjust responses to match a target curve.
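For reference, the axial-mode resonance frequencies that such tools help locate follow the textbook formula f_n = n·c/(2L) for each room dimension. REW estimates modes from measurements rather than dimensions; this just shows the formula:

```python
def axial_modes(length_m, count=3, c=343.0):
    """First `count` axial-mode frequencies (Hz) for one room
    dimension of `length_m` metres, using the standard formula
    f_n = n * c / (2 * L) with c the speed of sound in air."""
    return [n * c / (2 * length_m) for n in range(1, count + 1)]
```

A 3.43 m dimension, for example, puts its first axial mode at 50 Hz, with harmonics at 100 Hz and 150 Hz.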
PostQC
- PostQC - a tool for measuring, logging and reporting audio levels for professional audio production and post production studios.
Gist
- https://github.com/adamstark/Gist - a C++ based audio analysis library
timbreIDLib
- https://github.com/wbrent/timbreIDLib - a library of audio analysis externals for Pure Data. The classification external [timbreID] accepts arbitrary lists of audio features and attempts to find the best match between an input feature and previously stored training instances. The library can be used for a variety of real-time and non-real-time applications, including sound classification, sound searching, sound visualization, automatic segmenting, ordering of sounds by timbre, key and tempo estimation, and concatenative synthesis.
musii-kit
- https://github.com/otsob/musii-kit - A collection of tools for computational musicology, especially using Jupyter Notebooks for music analysis. This has been greatly inspired by the example notebooks associated with the excellent book Fundamentals of Music Processing.
Mixers
Command-line mixers
amixer
- amixer - a command-line program for controlling the mixer in the ALSA soundcard driver. amixer supports multiple soundcards.
amixer -c 0 | pcregrep "control" # shows all audio channels
JackMiniMix
- JackMiniMix - a simple mixer for the Jack Audio Connection Kit with an OSC based control interface. It supports a user configurable number of stereo inputs, which can then be queried and controlled by sending it OSC messages. It is released under the GPL license.
kbd2jackmix
- https://github.com/dsheeler/kbd2jackmix - Listen on a keyboard event device and respond to key combinations with jack midi volume messages for use with a midi-aware jack mixer, like jackmix or jack_mixer.
jack_switch
- https://github.com/MaurizioB/jack_switch - switches outputs from any input client
jack-balancer
- https://github.com/OldShatterham/jack-balancer - provides rudimentary volume and balance control via MIDI controls.
ncurses mixers
alsamixer
aumix
Graphical mixers
Kmix
- Kmix - an application to allow you to change the volume of your sound card. Though small, it is full-featured, and it supports several platforms and sound drivers. Features: Support for ALSA and OSS sound systems, Plasma Desktop integrated on-screen-display for volume changes
Recommended.
QasMixer
- QasMixer - a desktop mixer application for ALSA's "Simple Mixer Interface".
QasHctl
- QasHctl - a mixer for ALSA's more complex "High level Control Interface".
alsamixergui
- alsamixergui - a FLTK based frontend for alsamixer. It is written directly on top of the alsamixer source, leaving the original source intact, only adding a couple of ifdefs and some calls to the GUI part, so it provides exactly the same functionality, but with a graphical user interface.
gnome-alsamixer
volti
- https://github.com/gen2brain/volti - a GTK+ application for controlling ALSA volume from the system tray/notification area.
Buggy, forgets card sometimes, menu/preferences don't open.
Xmixer
- https://www.freshports.org/audio/xmixer - audio/xmixer: Audio mixer (gtk and Xlib) for X11R6
XMMIX
- XMMIX - Motif Audio Mixer
NewMixer
- https://github.com/jatinchowdhury18/NewMixer - An audio mixing tool that allows the user to visualize audio sources by their location in space rather than as channels on a mixing board.
DAMC
- https://github.com/amurzeau/damc - Audio mixing console for Jack also supporting external devices and OSC remote control.
Web mixer
alsa-json-gateway
- https://github.com/fulup-bzh/AlsaJsonGateway - Provides an HTTP REST interface to the ALSA mixer for HTML5 UI support. The main objective of AJG is to decouple ALSA from the UI, especially for music-oriented sound boards like the Focusrite Scarlett and others.
JACK mixers
Non Mixer
- Non Mixer
- non-mixer manual
- http://non.tuxfamily.org/wiki/UsingMidiWithNon - doesn't handle MIDI directly, requires CV or OSC conversion
/strip/[STRIP_NAME]/[MODULE_NAME]/[PARAMETER_NAME]
See also non-midi-mapper [39]
mx2482
- https://github.com/cybercatalyst/mx2482 - uses QJackAudio, works with JACK. 24 channels routed to 8 subgroups, each with direct out. Three-band parametric EQ for each channel. Aux send/return for each channel, so you can hook in other effects processors. Save and restore complete EQ states. Clean source code and free software licensed under the GPL. Uses the latest Qt5, which means it runs on all major platforms.
JackMix
- JackMix - Matrix Mixer with dials for Jack. Not quite MIDI controllable?
jack_mixer
- https://github.com/jack-mixer/jack_mixer
- https://github.com/dsheeler/jack_mixer - old
- https://github.com/relascope/jack_mixer - old. A GTK+ JACK audio mixer app with a look similar to its hardware counterpart. Input to output, and send outputs. MIDI control of only one level and stereo balance! Manual send groups, solo and mute.
- https://github.com/sen87/jack_mixer_cc - jack_mixer companion that provides a CLI for channel adjustments. Creates a JACK client that controls jack_mixer via MIDI Control Change messages.
jackmixdesk
- jackmixdesk - an audio mixer for JACK with an OSC control interface and LASH support. It has a configurable number of inputs and pre/post sends/outs which can be controlled by sending it OSC messages. There is a XML config file and a GTK interface.
JackMaster
- JackMaster - "Master Console" for the jack-audio-connection-kit. Number of inputs/subs and screen layout can only be changed by recompiling.
qtjmix
- https://github.com/ycollet/qtjmix - A Jack mixer GUI
jackmaster
- https://github.com/linuxmao-org/jackmaster - A "Master Console" for the jack-audio-connection-kit
jack-volume
- https://github.com/voidseg/jack-volume - JACK client for controlling the volume of multiple audio channels via OSC.
JJack
- https://github.com/MrLetsplay2003/JJack - JJack Audio Mixer
injector
- https://github.com/dastax/injector - A simple mixer application for jack audio written in Qt4
MU1
- MU1 - a simple Jack app used to organise stereo monitoring. It was written originally for use with Ardour2, but still useful with Ardour3 as it provides some extra functions.
audio-multiplexer
- https://github.com/knoellle/audio-multiplexer - interlace two streams of speech while maintaining comprehension.
LV2 mixers
Some of these are also standalone.
xfade.lv2
- https://github.com/x42/xfade.lv2 - an audio-plugin for stereo cross-fading 2 x 2 input channels to 2 output channels.
balance.lv2
- https://github.com/x42/balance.lv2 - for stereo balance control with optional per-channel delay. balance.lv2 facilitates adjusting stereo-microphone recordings (X-Y, A-B, ORTF), but it is also generally useful as an "input channel conditioner". It allows attenuating the signal on one of the channels as well as delaying the signals (moving away from the microphone). To round off the feature set, channels can be swapped or the signal can be downmixed to mono after the delay.
sonejostudios
- https://github.com/sonejostudios/LiveFader - LiveFader is a very simple stereo passive volume fader
- https://github.com/sonejostudios/StereoKnot - Simple Stereo Through with Volume Slider
- https://github.com/sonejostudios/AudioThrough16 - very simple 16-channel audio through
- https://github.com/sonejostudios/StereoSwitch - A simple Stereo Switch (send stereo signal to output A or to output B)
- https://github.com/sonejostudios/ABswitchStereo - Stereo source comparison tool.
- https://github.com/sonejostudios/Mixer4x - A simple 4-channel stereo mixer. The main goal is to use it as a submixer on a 4 channel track, but you can use it everywhere you need a small 4 channel stereo mixer.
- https://github.com/sonejostudios/LiveMixer - Stereo Mixer Strip with 2 Aux Sends (post Fader)
- https://github.com/sonejostudios/XYMatrix - XY Surround Matrix for one Source (Mono Input) with 4 Outputs (Left, Right, Surround Left, Surround Right) and Position Lock.
BalanceGain / BalanceWidth
- https://github.com/johnflynnjohnflynn/BalanceGain - Stepped gain audio plugin for Balance Mastering
- https://github.com/johnflynnjohnflynn/BalanceWidth - Stepped stereo width plugin for Balance Mastering
vopa
- https://github.com/ycollet/vopa - Volume/panning MIDI CC controllable LV2 plugin
x42-mixtrix
- mixtri(x) - a matrix mixer and trigger processor intended to be used with the oscilloscope, but also useful for other applications.
matrixmixer.lv2
- https://github.com/x42/matrixmixer.lv2 - matrixmixer.lv2 is a matrix mixer. It is available as an LV2 plugin and as a standalone JACK application.
BadAmp
- https://github.com/badosu/BadAmp - A simple amplifier LV2 plugin
BAmp
- https://github.com/sjaehn/BAmp - Simple amplifier LV2 plugin using BWidgets GUI
Simple Amplifier
- https://github.com/ul/simple-amplifier - A very simple example of LV2 plugin built in Zig
Plujain
- https://github.com/Houston4444/plujain-plugins - utility LV2 plugins. The fadeswitch is a mono audio switch that progressively fades between the 2 output channels. The triswitch follows the same principle with 3 outputs. For the Quadriswitch, guess!
Kn0ck0ut
- Kn0ck0ut - takes two mono 44.1KHz inputs and spectrally subtracts one from the other. It can be used to help create 'acapellas' - to extract vocals from a track - if an instrumental version (or section) of the track is available.
- YouTube: KnOckOut by St3pan0va
- https://github.com/jeremysalwen/kn0ck0ut-LV2 - Port of kn0ck0ut to LV2 plugin
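The core of this kind of spectral subtraction is per-bin magnitude subtraction. A toy sketch that works on precomputed magnitude spectra and ignores windowing and phase, which a real plugin like Kn0ck0ut must of course handle:

```python
def spectral_subtract(mix_mag, inst_mag, floor=0):
    """Subtract an instrumental's magnitude spectrum from the full
    mix's, bin by bin, clamping at `floor` so bins never go negative.
    Simplified illustration of the technique, not Kn0ck0ut's code."""
    return [max(m - i, floor) for m, i in zip(mix_mag, inst_mag)]
```

Bins where the instrumental is as loud as the mix are driven to the floor, leaving (ideally) only the vocal's energy behind.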
intersect-lv2
- https://github.com/sboukortt/intersect-lv2 - an LV2 plugin which, given a stereo audio stream, “expands” it to three channels. Everything that is present in both input channels will be in the center channel of the output, and what is specific to each channel will be in the corresponding output channel. This can be useful, for example, to rediscover some of your favorite music by hearing things that you had never noticed before. (With that said, note that it does not necessarily work equally well on all songs, depending on how they were mixed.)
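A time-domain cousin of this is classic mid/side decomposition, where content common to both channels lands in the mid signal. intersect-lv2 works in the frequency domain, but the sketch shows the basic idea:

```python
def to_mid_side(left, right):
    """Mid/side decomposition: mid holds what the channels share,
    side holds their difference (basic illustration only; not how
    intersect-lv2 computes its centre channel)."""
    mid = [(l + r) / 2 for l, r in zip(left, right)]
    side = [(l - r) / 2 for l, r in zip(left, right)]
    return mid, side
```

The transform is its own inverse up to scaling: left = mid + side, right = mid - side.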
sm.lv2
- https://github.com/nettings/sm.lv2 - A simple speaker management LV2 plugin with global master volume and per-channel trim, delay, and low-shelf. This plugin lets you optimize a stereo or multichannel speaker system for your listening environment.
tinyamp.lv2
- https://github.com/x42/tinyamp.lv2 - minimalistic gain control with small MOD GUI
mod-volume-lv2
- https://github.com/moddevices/mod-volume-lv2 - LV2 volume plugin
SendMixer
- https://github.com/SpotlightKid/sendmixer - A stereo channel strip with one master gain and two pre/post-fader sends
Other mixers
ladspa-xfade
- https://github.com/diizy/ladspa-xfade - LADSPA 2-channel crossfader
BackgroundMusic
faderratic
- faderratic - brings you cross-fading of 2 stereo inputs, but with a mind of its own and a ton of options to change the fade shape, length, limits, frequency and probability. faderratic works by generating a pulse on a tempo-sync frequency, and depending on the probability it may trigger a cross-fade event. You can optionally make the fader auto-return to either side and if you feel like it, trigger a fade manually or control the fader movement totally manually. Windows VST.
Metering
- https://en.m.wikipedia.org/wiki/K-system - an audio level measuring technique proposed by mastering engineer Bob Katz in the paper "An integrated approach to Metering, Monitoring and Levelling". It proposes a studio monitor calibration system and a set of meter ballistics to help engineers produce consistent sounding music while preserving appropriate dynamic range.
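Meter ballistics aside, the underlying measurement is an RMS level in dB relative to full scale, to which the K-System scales add a calibration offset (e.g. K-14 places its 0 dB mark at -14 dBFS RMS):

```python
import math

def rms_dbfs(samples):
    """RMS level of float samples (full scale = 1.0) in dBFS.
    A K-System meter would display this plus the scale's offset."""
    rms = math.sqrt(sum(s * s for s in samples) / len(samples))
    return 20 * math.log10(rms) if rms > 0 else float("-inf")
```

A full-scale square wave reads 0 dBFS RMS; halving the amplitude drops the reading by about 6 dB.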
JackMeter
- Jack Meter is a basic console based DPM (Digital Peak Meter) for JACK. I wrote it for quickly checking remote signal levels, without having to run X11 to use a pretty graphical meter such as meterbridge.
JACK Meterbridge
- JACK Meterbridge - software meterbridge for the UNIX based JACK audio system. It supports a number of different types of meter, rendered using the SDL library and user-editable pixmaps.
Ebumeter
- Ebumeter provides level metering according to the EBU R-128 recommendation. The current release implements all features required by the EBU document except the oversampled peak level monitoring.
mod-peakmeter
lv2
- meters.lv2 is a collection of audio-level meters with GUI in LV2 plugin format.
- traKmeter - Loudness meter for correctly setting up tracking and mixing levels.
- K-Meter - Implementation of a K-System meter according to Bob Katz’ specifications.
JACK bitmeter
- JACK bitmeter - a diagnosis tool for JACK audio software on Linux (and perhaps other systems which have JACK and GTK+ 2.x). As its name might suggest, the bitmeter operates at the bare metal of JACK's I/O layer, looking at the 32 binary digits in each individual sample.
jack-peak-meter
- https://github.com/gethiox/jack-peak-meter - terminal-based peak meter for the JACK audio system written in Go
vu-meter
- https://github.com/xkr47/vu-meter - Audio VU meter for JACK with any number of channels. This is heavily inspired by the cadence-jackmeter included in the Cadence tools. I rewrote it in Rust, with a freely configurable number of channels through command-line parameters. It uses XCB, i.e. the X11 protocol, for graphics. It does NOT currently autoconnect to any source.
jack_led_peak
- https://github.com/fps/jack_led_peak - A small jack utility to drive LEDs based on the peak values of the inputs. This was developed primarily for use on a raspberry pi4 (the default line offsets), but it should be usable on any linux system that has some LEDs attached via GPIO lines addressable by libgpiod.
Visualisation
Oscilloscope
- YouTube: Jerobeam Fenderson - channel
XXY Oscilloscope
- XXY Oscilloscope - WebGL, version 1.0, April 2017, by Neil Thapen [41]
woscope
xoscope
- xoscope - a digital oscilloscope for Linux
x42-scope
- x42-scope - aka sisco.lv2, audio oscilloscope with variable time scale in LV2 plugin format.
jack_oscrolloscope
- jack_oscrolloscope - a simple waveform viewer for JACK. The waveform is displayed in realtime, so you can always see the signal the instant it comes through JACK's input port.
jack-scope
- jack-scope - an oscilloscope for JACK under X11. jack-scope draws either a time domain signal trace or a self correlation trace. Multiple input channels are superimposed, each channel is drawn in a different color. jack-scope accepts OSC packets for interactive control of drawing parameters.
QOscC
- QOscC - a highly flexible and configurable software oscilloscope with a large number of features. This includes support for any number of audio devices (ALSA or OSS), each with any number of channels. Each scope display can be configured individually to different display types and variants, e.g. you can choose from the standard y-t mode (as on a usual oscilloscope), x-y mode (e.g. for measuring the phase shift between two signals) or the FFT mode (to view a spectrum plot of the signal). This software is intended for electronics hobbyists who cannot afford a hardware oscilloscope or need a simple spectrum analyzer, as well as for musicians doing basic signal analysis.
jack-analyser
- https://github.com/alxdb/jack-analyser - an Oscilloscope for the JACK Audio API
DSSI Oscilloscope
- DSSI Oscilloscope - old
Oscilloscope
- https://github.com/kritzikratzi/Oscilloscope - Oscilloscope for Mac/Windows written in OF
PrettyScope
- PrettyScope - by Soundemote - $
OsciStudio
- OsciStudio - Convert 3d shapes and animations to sounds (through obj files or blender addons). Modify sounds through a small plugin system. Animate parameters through timelines. Works on Mac OS and Windows (only intel/amd processors). Livecoding in C++ (no STL) $
Spectrum graph
- cava - Console-based Audio Visualizer for Alsa
- https://github.com/wayou/HTML5_Audio_Visualizer - An audio spectrum visualizer built with HTML5 Audio API
Spectrogram
- https://github.com/johnhldavis/xjackfreak - audio analysis/EQ tool for GNU/Linux/X11/Jack Audio Connection Kit. It can display the FFT of any input, modify it and output the result.
- spectrojack - A little spectrogram/audiogram/sonogram/whatever for jack. gtk 2 and fftw 3.
- Spectrum 3D - displays a 3D audio spectrogram, in real time or not, from the microphone or an audio file (including files recorded from the microphone); it is compatible with JACK (jack-audio-connection-kit). Optionally, it supports multitouch gestures from touchscreens and touchpads. It is built with the GStreamer, SDL (or GtkGLExt), OpenGL, GTK+-2.0 and uTouch-Geis free libraries and is released under the GPL license.
- xspect3d - uses a bespoke drawing algorithm and buffer-scheduling paradigm to render a 3D sonic landscape in real time, at up to several hundred frames a second.
- https://github.com/pdesaulniers/wolf-spectrum - a spectrogram plugin. It can be built as an LV2 or VST plugin and as a standalone Jack application.
- Jack Live Spectrum - a small (only one file) C program for Linux that displays the frequency spectrum of live streaming sound as a floating-down animation.
- https://github.com/jcarrano/rtfi - Audio visualization & analysis using the RTFI, mono JACK client
- https://github.com/james34602/SuperSpectrogram - Super spectrogram Qt cross platform!
- Photosounder Spiral - a music analysis plugin. It's a fresh take on spectral analysis focused on allowing you to see and understand music and the notes that make it up instantly. This is achieved mainly by coiling the spectrum into a spiral framed by a chromatic circle, thus allowing you to instantly see what's happening musically and spectrally. - $
Waveform
- Peaks.js - a JavaScript component from BBC Research and Development that allows users to view and interact with audio waveforms in the browser. Peaks.js uses the HTML <canvas> element to display the waveform at different zoom levels, and synchronises the display with playback of an associated <audio> or <video> element. The component also allows point and segment markers to be added to the waveform, e.g., for distinguishing music from speech, or identifying different music tracks.
- https://github.com/andrewrk/waveform - simultaneously transcode and generate visuals for an audio file
- https://github.com/ayyi/libwaveform - Libwaveform attempts to provide versatile easy-to-use interactive display of audio waveforms for Gtk+-2 and X11 applications. Suitable for anything from simple display, up to a full digital audio workstation.
- https://github.com/endolith/freesound-thumbnailer - A script to generate thumbnails for audio files, derived from the method used on Freesound.org (amplitude envelope with color representing spectral centroid)
Various
- xoscope for Linux - a digital oscilloscope for Linux! ALSA, ESD, and COMEDI data sources; Sweep rates from 2 ns to 2 seconds per division; Eight simultaneous display channels; Scrollable memory buffers; Triggers; Cursors; Both analog and digital inputs; Sweep, accumulate, and strip chart display modes
- Signalizer - an all-in-one signal visualizing package with a bunch of unique focus-points; real-time audio visualization with optimized 3D GPU graphics, everything being scalable and zoomable gridlessly as well as being arbitrarily precise in both settings and display. Combined with a rich feature set, Signalizer is suited both for electrical/audio engineers fullscreen-inspecting signals, or for general small windows giving an overview of your audio as you create it.
- RepoVizz - a data repository and visualization tool for structured storage and user-friendly browsing of music performance multi-modal recordings. The primary purpose of RepoVizz is to offer means for the scientific community to gain on-line access to a music performance multi-modal database shared among researchers.
- sndpeek - real-time 3D animated display/playback, can use mic-input or wav/aiff/snd/raw/mat file (with playback), time-domain waveform, FFT magnitude spectrum, 3D waterfall plot
- https://bitbucket.org/asiniscalchi/visualjackm - connect projectM visualisation to Jack
- VSXu - VSX Ultra, is an OpenGL-based (hardware-accelerated), modular visual programming environment with its main purpose to visualize music and create graphic effects in real-time. Its intention is to bridge the gap between programmer and artist, enabling a creative and inspiring environment to work in for all parties involved. VSXu is built on a modular plug-in-based architecture so anyone can extend it and/or make visualization presets ("visuals" or "states"). The program is free software which means it's free from restrictions, free to share and copy, free to adapt/modify and use it any way you like.
- Le Biniou - As an artist/creator/DJ/VJ, to create live visuals based on your audio performances. As a user/listener, to watch an everlasting and totally unseen creation reacting to the music.
- https://github.com/party/tv - TV display for Party.
- https://github.com/jcarrano/rtfi - Resonator Time-Frequency Image
Phase
Windows VST
- Voxengo Correlometer - a free analog-style stereo multi-band correlation meter AudioUnit, AAX and VST plugin for professional music production applications, based on the correlation meter found in the PHA-979 phase-alignment plugin. Multi-band correlation metering is an advanced way to check for out-of-phase elements in the mix: broadband correlation metering reports overall phase issues and may misrepresent problems present in select spectral bands, while a multi-band correlation meter easily highlights problems in mid to high frequencies that are not easily heard by ear but may still reduce the clarity of the mix. Another application of multi-band correlation metering is phase- and time-aligning channels and tracks, especially bass and bass-drum pairs, guitar mic and D.I. source pairs, two-microphone stereo recordings, etc. Correlometer can display 4 to 64 individual spectral bands, with an adjustable band quality factor that controls the degree of each band's selectivity. The averaging time of the correlation estimator can be adjusted. Correlometer supports side-chain inputs for easy correlation estimation between separate audio tracks.
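The figure such meters display is, at its core, the zero-lag normalized cross-correlation of the two channels: +1 for identical signals, -1 for polarity-inverted ones, near 0 for decorrelated material. A minimal sketch (a hypothetical helper, not Voxengo's implementation; a multi-band meter runs the same figure per band after a filter bank):

```python
import math

def stereo_correlation(left, right):
    """Zero-lag normalized cross-correlation of two equal-length
    channels. +1.0 = identical, -1.0 = polarity-inverted,
    near 0 = unrelated material."""
    num = sum(l * r for l, r in zip(left, right))
    den = math.sqrt(sum(l * l for l in left) * sum(r * r for r in right))
    return num / den if den else 0.0
```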
Transcription
Transcribe!
- Transcribe! - an assistant for people who want to work out a piece of music from a recording, in order to write it out, or play it themselves, or both. It doesn't do the transcribing for you, but it is essentially a specialised player program which is optimised for the purpose of transcription. It has many transcription-specific features not found on conventional music players. It is also used by many people for play-along practice. It can change pitch and speed instantly, and you can store and recall any number of named loops. So you can practice in all keys, and you can speed up as well as slow down. There is some advice about play-along practice in Transcribe!'s help, under the heading "Various Topics". And it is also used for speech transcription. With its support for foot pedals and its superior slowed-down sound quality, it is an excellent choice for this purpose. There is some advice about speech transcription in Transcribe!'s help, under the heading "Various Topics".
MT3
- https://github.com/magenta/mt3 - a multi-instrument automatic music transcription model that uses the T5X framework. This is not an officially supported Google product.
Pop2Piano
Feature detection/extraction
aubio
- aubio is a tool designed for the extraction of annotations from audio signals. Its features include segmenting a sound file before each of its attacks, performing pitch detection, tapping the beat and producing midi streams from live audio. Because these tasks are difficult, we thought it was important to gather them in a dedicated library. To increase the fun, we have made these algorithms work in a causal way, so as to be used in real time applications with as low delay as possible. Functions can be used offline in sound editors and software samplers, or online in audio effects and virtual instruments.
- Aubio-LV2-Plugins is an unofficial set of LV2 plugins which wrap the functionality of the audio analysis library Aubio. Currently it consists of a transient/steady-state separator and an onset detector.
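Onset detection of the kind aubio provides is commonly built on spectral flux: the summed positive change in per-bin magnitude between consecutive spectral frames, whose peaks over time mark attacks. A minimal sketch (illustrative only, not aubio's actual algorithm):

```python
def spectral_flux(prev_mags, cur_mags):
    """Summed positive per-bin magnitude increase between two
    consecutive spectral frames. Energy appearing counts toward
    an onset; energy disappearing does not."""
    return sum(max(c - p, 0.0) for p, c in zip(prev_mags, cur_mags))
```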
Vamp
- Vamp - an audio processing plugin system for plugins that extract descriptive information from audio data — typically referred to as audio analysis plugins or audio feature extraction plugins.
- Find and Download Plugins - listing
- QM Vamp Plugins - a set of plugins for feature extraction from audio data, using the Vamp plugin format suitable for use in programs such as Sonic Visualiser and Sonic Annotator. This plugin set includes note onset detector, beat and barline tracker, tempo estimator, key estimator, tonal change detector, structural segmenter, timbral and rhythmic similarity, wavelet scaleogram, adaptive spectrogram, note transcription, chromagram, constant-Q spectrogram, and MFCC plugins.
- Segmentino - a Vamp plugin for automatic music structural segmentation, based on an algorithm first used in Mauch et al.'s paper on Using Musical Structure to Enhance Automatic Chord Transcription.
- Silvet Note Transcription - or Shift-Invariant Latent Variable Transcription, a Vamp plugin for polyphonic music transcription (from audio to note times and pitches). In conjunction with a Vamp plugin host like Sonic Visualiser, you can use Silvet to help you work out what notes are being played in a piece of music, for example if you want to learn to play it yourself. You might also use it to study performances for musicological features such as timing and dynamics. Silvet also serves as a useful stable baseline for comparative purposes, for researchers working on other methods related to note transcription. Silvet uses a high-quality and quite flexible method, but it has various limitations which are described in the README file. Although you can easily get interesting and useful results for many kinds of music, don't expect it to take you straight from the audio to a complete and readable score!
- Melodia - automatically estimates the pitch of a song's main melody. More specifically, it implements an algorithm that automatically estimates the fundamental frequency corresponding to the pitch of the predominant melodic line of a piece of polyphonic (or homophonic or monophonic) music. Given a song, the algorithm estimates when the melody is present and when it is not (a.k.a. voicing detection), and the pitch of the melody when it is present.
- HPCP - a Vamp plug-in for audio feature extraction that computes the instantaneous evolution of the HPCP (Harmonic Pitch Class Profile) of a signal. The HPCP is an approach to chroma feature estimation which represents the pitch content of polyphonic music signals, mapped to a single octave. HPCPs have been extensively used in several end applications such as key and chord estimation, similarity computation (cover version identification) and music classification.
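The core of any chroma/HPCP-style feature is folding spectral content onto 12 pitch classes, with all octaves collapsed into one. A minimal sketch of that mapping (hypothetical helpers; the real HPCP plugin adds peak interpolation, harmonic weighting and normalization):

```python
import math

A4 = 440.0  # reference tuning

def pitch_class(freq_hz):
    """Map a frequency in Hz to one of 12 pitch classes (0 = A)."""
    semitones = 12.0 * math.log2(freq_hz / A4)
    return round(semitones) % 12

def chroma_profile(peaks):
    """Fold spectral peaks [(freq_hz, magnitude), ...] into a
    12-bin energy profile, octave-invariant."""
    bins = [0.0] * 12
    for freq, mag in peaks:
        if freq > 0:
            bins[pitch_class(freq)] += mag * mag
    return bins
```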
auditok
- https://github.com/amsehili/auditok - an Audio Activity Detection tool that can process online data (read from an audio device or from standard input) as well as audio files. It can be used as a command line program and offers an easy to use API.
audioid
- https://github.com/sai-soum/audioid - Audio Identification using Audio Fingerprinting technique.
Audioflux
- AudioFlux - library for audio and music analysis, feature extraction. Can be used for deep learning, pattern recognition, signal processing, bioinformatics, statistics, finance, etc
Tempo
BPM, or beats per minute
- bonk - Pure Data unit [bonk~] is a very useful musical tool for performance and composition. It processes a stream of audio on its input and produces messages when it thinks the signal matches certain patterns. It doesn't have an audio output, just messages. What [bonk~] does is analyse the incoming signal.
- MiniBPM is a simple, reliable tempo estimator for use in music audio applications. It quickly gets you a fixed beats-per-minute estimate from a sample of audio, provided the tempo doesn't change too much in it.
- https://github.com/mguentner/libbeat libbeat - a lightweight beat detection library for Qt. It currently supports ALSA and PulseAudio. It uses fftw to process the samples.
- bpm-tools - the result of some experiments in automatically calculating and tagging the tempo (in beats per minute) of music files. Right now the code serves as the best explanation of the algorithm, a relatively simple application of autocorrelation by statistical sampling. As yet, there is no scientific comparison of the algorithm with other software.
- BeatDetektor - uses a very simple statistical model designed from scratch by myself to detect the BPM of music and provides real-time feedback useful for visualization and synchronization.
- https://github.com/adamstark/BTrack - a causal beat tracking algorithm intended for real-time use. It is implemented in C++ with wrappers for Python and the Vamp plug-in framework.
- https://github.com/metachronica/audio-dsp-midi-trigger MIDI Trigger - LV2 plugin which detects peaks by audio signal and sends MIDI notes.
- BeatCounter - a simple plugin designed to facilitate beatmatching software and turntables. It displays the current tempo in beats per minute (BPM), and an accumulated average over the last few seconds. BeatCounter is the perfect tool for DJ’s that want to integrate computer effects with turntables or a live band.
- Tapita - ('snack' in Spanish) a BPM detector through keyboard, MIDI and JACK, written in C with GTK2.
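Several of the tools above (bpm-tools in particular) describe their method as autocorrelation: the candidate tempo whose beat period best lines an onset-strength envelope up with a delayed copy of itself wins. A minimal sketch (hypothetical function, pure Python):

```python
def estimate_bpm(onset_env, hop_seconds, lo=60, hi=180):
    """Pick the tempo whose beat period maximizes the autocorrelation
    of an onset-strength envelope (one value per analysis hop)."""
    n = len(onset_env)
    best_bpm, best_score = lo, float("-inf")
    for bpm in range(lo, hi + 1):
        lag = round(60.0 / bpm / hop_seconds)  # beat period in hops
        if lag < 1 or lag >= n:
            continue
        score = sum(onset_env[i] * onset_env[i - lag]
                    for i in range(lag, n))
        if score > best_score:
            best_bpm, best_score = bpm, score
    return best_bpm
```

Real implementations weight candidates toward plausible tempi and feed in a proper onset-detection function rather than a raw impulse train.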
Frequency
Tony
- Tony - a software program for high quality scientific pitch and note transcription in three steps: automatic visualisation/sonification, easy correction, and export. First, Tony automatically analyses the audio to visualise pitch tracks and notes from monophonic recordings. As a Tony user, you can play back pitches and notes alongside the recording, which makes it easy to spot the inevitable extraction errors. Then you can use several tools that make correction of such errors a breeze, including alternative pitch track selection, octave shift, and easy note split, merge and deletion. Finally, you can export pitch track and note track in .csv (comma-separated values) format, or simply save a session file to continue annotating another time.
libKeyFinder
- https://github.com/ibsh/libKeyFinder - Musical key detection for digital audio, GPL v3
pitch-detection
- https://github.com/sevagh/pitch-detection - collection of O(NlogN) pitch detection implementations
Pitch-Tracking
- https://github.com/orchidas/Pitch-Tracking - pitch detection algorithms. Methods implemented:
  - YIN: "YIN, a fundamental frequency estimator for speech and music" - Alain de Cheveigné, Hideki Kawahara - Journal of the Acoustical Society of America, 2002.
  - Cepstrum: "Cepstrum Pitch Determination" - A. M. Noll - Journal of the Acoustical Society of America, 1967.
  - Maximum likelihood: "Maximum Likelihood Pitch Estimation" - James D. Wise, James R. Caprio, Thomas W. Parks - IEEE Transactions on Acoustics, Speech and Signal Processing, 1976.
  - Extended Kalman filter: "Real-time Pitch Tracking in Audio Signals with the Extended Complex Kalman Filter" - Orchisama Das, Julius O. Smith, Chris Chafe - Proc. of the International Conference on Digital Audio Effects (DAFx), 2017; "Improved Real-time Monophonic Pitch Tracking with the Extended Complex Kalman Filter" - Orchisama Das, Julius O. Smith, Chris Chafe - Journal of the Audio Engineering Society, Vol 68, No. 1/2, 2020.
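Several of the estimators listed share one underlying idea: find the lag at which the signal best matches a delayed copy of itself, and read the fundamental frequency off that period. A naive autocorrelation sketch of that idea (YIN refines exactly this with a cumulative-mean-normalized difference function):

```python
import math

def detect_pitch(signal, sample_rate, fmin=50.0, fmax=1000.0):
    """Naive autocorrelation pitch detector: the fundamental period
    is the lag with the strongest self-similarity within the
    search range [1/fmax, 1/fmin]."""
    n = len(signal)
    lag_min = max(1, int(sample_rate / fmax))
    lag_max = min(int(sample_rate / fmin), n - 1)
    best_lag, best_acf = lag_min, float("-inf")
    for lag in range(lag_min, lag_max + 1):
        acf = sum(signal[i] * signal[i - lag] for i in range(lag, n))
        if acf > best_acf:
            best_lag, best_acf = lag, acf
    return sample_rate / best_lag
```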
Silence
silan
- https://github.com/x42/silan - audio file -silence- analyzer
SilentJack
- SilentJack is a silence/dead air detector for the Jack Audio Connection Kit.
SilenceRemover
- https://github.com/sagamusix/SilenceRemover - removes silence from the beginning of a WAV or FLAC file. Getting the job done was more important than beautiful code. For example, using FILE* instead of std::ifstream was simply done so that the file feeds directly into libflac.
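The basic operation behind tools like silan and SilenceRemover is scanning for the first sample whose level exceeds a threshold. A minimal sketch (hypothetical function; the real tools work on windowed levels and handle both ends of the file):

```python
def trim_leading_silence(samples, threshold=0.001):
    """Return the samples from the first one whose absolute value
    exceeds the threshold (samples normalized to [-1, 1]);
    an all-silent input yields an empty list."""
    for i, s in enumerate(samples):
        if abs(s) > threshold:
            return samples[i:]
    return []
```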
Essentia
- Essentia - an open-source C++ library for audio analysis and audio-based music information retrieval, released under the Affero GPLv3 license (also available under a proprietary license upon request). It contains an extensive collection of reusable algorithms which implement audio input/output functionality, standard digital signal processing blocks, statistical characterization of data, and a large set of spectral, temporal, tonal and high-level music descriptors.
- https://github.com/MTG/gaia - C++ library to apply similarity measures and classifications on the results of audio analysis, including Python bindings. Together with Essentia it can be used to compute high-level descriptions of music.
openSMILE
- openSMILE - a feature extraction tool that enables you to extract large audio feature spaces in real time. It combines features from music information retrieval and speech processing. SMILE is an acronym for Speech & Music Interpretation by Large-space Extraction. It is written in C++ and is available as both a standalone command-line executable and a dynamic library. The main features of openSMILE are its capability for on-line incremental processing and its modularity. Feature extractor components can be freely interconnected to create new and custom features, all via a simple configuration file. New components can be added to openSMILE via an easy binary plugin interface and a comprehensive API.
Madmom
- https://github.com/CPJKU/madmom - an audio signal processing library written in Python with a strong focus on music information retrieval (MIR) tasks. The library is internally used by the Department of Computational Perception, Johannes Kepler University, Linz, Austria (http://www.cp.jku.at) and the Austrian Research Institute for Artificial Intelligence (OFAI), Vienna, Austria (http://www.ofai.at). Possible acronyms are: Madmom Analyzes Digitized Music Of Musicians, Mostly Audio / Dominantly Music Oriented Modules
jMIR
- jMIR - an open-source software suite implemented in Java for use in music information retrieval (MIR) research. It can be used to study music in the form of audio recordings, symbolic encodings and lyrical transcriptions, and can also mine cultural information from the Internet. It also includes tools for managing and profiling large music collections and for checking audio for production errors. jMIR includes software for extracting features, applying machine learning algorithms, applying heuristic error checkers, mining metadata and analyzing metadata.
KeyFinder
- KeyFinder - an open source key detection tool, for DJs interested in harmonic and tonal mixing. It's intended to be very focused: no library management, no track suggestions, no media player. Just a fast, efficient workflow tool. It supports a huge range of codecs thanks to LibAV, and writes to metadata tags using TagLib.
polyscribe
- https://github.com/joelrobichaud/polyscribe - Convert polyphonic multi-track audio to sheet music.
kbd-audio
- https://github.com/ggerganov/kbd-audio - collection of command-line and GUI tools for capturing and analyzing audio data.
Surfboard
- Surfboard: Audio Feature Extraction for Modern Machine Learning - We introduce Surfboard, an open-source Python library for extracting audio features with application to the medical domain. Surfboard is written with the aim of addressing pain points of existing libraries and facilitating joint use with modern machine learning frameworks. The package can be accessed both programmatically in Python and via its command line interface, allowing it to be easily integrated within machine learning workflows. It builds on state-of-the-art audio analysis packages and offers multiprocessing support for processing large workloads. We review similar frameworks and describe Surfboard's architecture, including the clinical motivation for its features. Using the mPower dataset, we illustrate Surfboard's application to a Parkinson's disease classification task, highlighting common pitfalls in existing research. The source code is opened up to the research community to facilitate future audio research in the clinical domain. [43]
DrumClassifier CNN-LSTM
- https://github.com/faraway1nspace/DrumClassifer-CNN-LSTM - classifies percussion audio samples with a CNN-LSTM, written in Python and PyTorch. Also exports to drumkv1 (an LV2 plugin)
FMA
- https://github.com/mdeff/fma - We introduce the Free Music Archive (FMA), an open and easily accessible dataset suitable for evaluating several tasks in MIR, a field concerned with browsing, searching, and organizing large music collections. The community's growing interest in feature and end-to-end learning is however restrained by the limited availability of large audio datasets. The FMA aims to overcome this hurdle by providing 917 GiB and 343 days of Creative Commons-licensed audio from 106,574 tracks from 16,341 artists and 14,854 albums, arranged in a hierarchical taxonomy of 161 genres. It provides full-length and high-quality audio, pre-computed features, together with track- and user-level metadata, tags, and free-form text such as biographies. We here describe the dataset and how it was created, propose a train/validation/test split and three subsets, discuss some suitable MIR tasks, and evaluate some baselines for genre recognition. Code, data, and usage examples are available at mdeff/fma.
Wavebeat
- https://github.com/csteinmetz1/wavebeat - End-to-end beat and downbeat tracking in the time domain.
Buzz
- https://github.com/chidiwilliams/buzz - Buzz transcribes and translates audio offline on your personal computer. Powered by OpenAI's Whisper.
AudioCLIP
- https://github.com/AndreyGuzhov/AudioCLIP - Source code for models described in the paper "AudioCLIP: Extending CLIP to Image, Text and Audio" (https://arxiv.org/abs/2106.13043)
Editing
Audacity
- Audacity is a free, easy-to-use, multi-track audio editor and recorder for Windows, Mac OS X, GNU/Linux and other operating systems. The interface is translated into many languages. You can use Audacity to record live audio, record computer playback on any Windows Vista or later machine, convert tapes and records into digital recordings or CDs, edit WAV, AIFF, FLAC, MP2, MP3 or Ogg Vorbis sound files, cut, copy, splice or mix sounds together, change the speed or pitch of a recording, etc.
- https://www.theregister.com/2021/07/05/audacity
- https://yro.slashdot.org/story/21/07/05/2155212/open-source-audio-editor-audacity-has-become-spyware
- https://github.com/hugofloresgarcia/torchaudacity - This package contains utilities for prepping PyTorch audio models for use in Audacity. More specifically, it provides abstract classes for you to wrap your waveform-to-waveform and waveform-to-labels models (see the Deep Learning for Audacity website to learn more about deep learning models for audacity).
Tenacity
AudioMass
- AudioMass - a free, open source, web-based audio and waveform editor. It runs entirely in the browser with no backend and no plugins required!
mhWaveEdit
- mhWaveEdit is a graphical program for editing, playing and recording sound files. It is lightweight, portable, user-friendly and handles large files very well. The program itself has only simple editing features such as cut'n'paste and volume adjustment but it can also use Ladspa effect plugins and the effects provided by the SoX application. It can also support additional file formats besides wav through libsndfile and mp3/ogg import and export through lame and oggenc/oggdec.
Sweep
- Sweep is an audio editor and live playback tool for GNU/Linux, BSD and compatible systems. It supports many music and voice formats including WAV, AIFF, Ogg Vorbis, Speex and MP3, with multichannel editing and LADSPA effects plugins.
ReZound
- ReZound aims to be a stable, open source, and graphical audio file editor primarily for but not limited to the Linux operating system.
ocenaudio
- ocenaudio is a cross-platform, easy to use, fast and functional audio editor. It is the ideal software for people who need to edit and analyze audio files without complications. ocenaudio also has powerful features that will please more advanced users.
Eisenkraut
- Eisenkraut - A multi-channel and hi-res capable audio file editor.
WaveSurfer
- WaveSurfer is an open source tool for sound visualization and manipulation. Typical applications are speech/sound analysis and sound annotation/transcription. WaveSurfer may be extended by plug-ins as well as embedded in other applications.
Jokosher
soniK
- soniK is an open source digital audio editor for Linux, using the KDE platform. soniK allows you to record, edit and process sounds on your computer.
EKO
- EKO - a simple sound editor.
wavbreaker
- wavbreaker is a GTK wave file splitter for Linux and Unix-like operating systems licensed under the terms of the GNU General Public License. This application's purpose in life is to take a wave file and break it up into multiple wave files. It makes a clean break at the correct position to burn the files to an audio cd without any dead air between the tracks. It will only read wave files, so use an appropriate tool to convert ogg, mp3, etc. files and then break them up.
AudioAlign
- https://github.com/protyposis/AudioAlign - a tool written for research purposes to automatically synchronize audio and video recordings that have either been recorded in parallel at the same event or contain the same aural information. AudioAlign is basically a GUI for the Aurio library with a little bit of glue code in between.
LAoE
- LAoE means Layer-based Audio Editor, and it is a rich featured graphical audiosample-editor, based on multi-layers, floating-point samples, volume-masks, variable selection-intensity, and many plugins suitable to manipulate sound, such as filtering, retouching, resampling, graphical spectrogram editing by brushes and rectangles, sample-curve editing by freehand-pen and spline and other interpolation curves, effects like reverb, echo, compress, expand, pitch-shift, time-stretch, and much more... And it is free of charge, under GPL license!
Snd
- Snd is a sound editor modelled loosely after Emacs. It can be customized and extended using either s7 (included in the Snd sources), Ruby, or Forth.
- San Dysth - a standalone realtime soft-synth written in Snd. This softsynth has controls to generate various kinds of sounds between white noise and pure tones. It also provides controllers to disturb the generated sound, using a "period counter" to extend the variety of the generated output. Common usages for the softsynth are organ-like sounds, organic sounds, alien-like sounds, water-like sounds, and various kinds of noise (noise artists could find this softsynth most useful).
GNUsound
- GNUsound is a multitrack sound editor for GNOME 1 and 2. The current version is 0.7.5, which was released 6 July 2008.
Marlin
- Marlin - A GNOME Sample Editor. last updated 03-08-2004
GNoise
- GNoise - gtk+ or gnome (you can ./configure it either way) wave file editor for Linux. Prime considerations were for it to be speedy and be able to handle big files. So far it can: load and display files, generate a display cache, play the file, cut, copy, paste, (unlimited) undo, mute, fade in/out, reverse, normalize, and more. 2003
WaveShop
- WaveShop - a free, open-source audio editor for Windows XP/Vista/7/8 32-bit and 64-bit. WaveShop is fast, lightweight, and bit-perfect, meaning samples aren't altered unless they need to be. Editing a portion of an audio file only affects that portion; the rest of the file is untouched. Blocks of audio can be cut and pasted without changing their contents at all. This is especially useful for patching a finished master without corrupting its dither. Waveshop's features include peak, RMS and spectral analysis, normalizing, fading, sample rate conversion, audio generation, plug-ins, and more, all with unlimited undo and comprehensive help.
Source separation
ISSE
- ISSE - An Interactive Source Separation Editor
Spleeter
- Releasing Spleeter: Deezer Research source separation engine | by Manuel Moussallam | Deezer I/O
- https://github.com/deezer/spleeter - Deezer source separation library including pretrained models. [45]
- https://github.com/gvne/spleeterpp - A C++ Inference library for the Spleeter project [46]
- https://github.com/james34602/SpleeterRT - real-time monaural source separation based on a fully convolutional neural network operating in the time-frequency domain. An AI source separator written in C running a U-Net model trained by Deezer; separates your audio input into drums, bass, accompaniment and vocal/speech with the Spleeter model.
Deep-Audio-Prior
- https://github.com/adobe/Deep-Audio-Prior - Audio Source Separation Without Any Training Data.
sudo_rm_rf
- https://github.com/etzinis/sudo_rm_rf - Code for SuDoRm-Rf networks for efficient audio source separation. SuDoRm-Rf stands for SUccessive DOwnsampling and Resampling of Multi-Resolution Features which enables a more efficient way of separating sources from mixtures.
python_source_separation
- https://github.com/masahitotogami/python_source_separation - this repository contains the sample code for "Pythonで学ぶ音源分離" (Sound Source Separation with Python) from Impress's machine learning practice series. The source code is released under the MIT license; see LICENSE.txt.
ssspy
- https://github.com/tky823/ssspy - A Python toolkit for sound source separation.
Adaptive and Focus Layer for Multi-Speaker Separation Problem
- https://github.com/Totoketchup/Adaptive-MultiSpeaker-Separation - Adaptive and Focusing Neural Layers for Multi-Speaker Separation Problem
Hierarchical Music Source Separation Using Mixing Secret Multi-track Dataset
- https://github.com/felixCheungcheung/mixing_secrets_v2 - an accompanying repository for the author's master's thesis and a submission to the Late Breaking Demo track of the International Society for Music Information Retrieval 2022: MS500: A Multi-Track Dataset for Hierarchical Music Source Separation
Deep Learning For Monaural Source Separation
- Deep Learning for Monaural Source Separation - Monaural source separation is important for many real world applications. It is challenging in that, given only single channel information is available, there is an infinite number of solutions without proper constraints. In this paper, we explore joint optimization of masking functions and deep recurrent neural networks for monaural source separation tasks, including the monaural speech separation task, monaural singing voice separation task, and speech denoising task. The joint optimization of the deep recurrent neural networks with an extra masking layer enforces a reconstruction constraint. Moreover, we explore a discriminative training criterion for the neural networks to further enhance the separation performance. We evaluate our proposed system on TSP, MIR-1K, and TIMIT dataset for speech separation, singing voice separation, and speech denoising tasks, respectively.
- https://github.com/posenhuang/deeplearningsourceseparation - Deep Recurrent Neural Networks for Source Separation
separateLeadStereo
- https://github.com/wslihgt/separateLeadStereo - Separate the lead from the accompaniment, in polyphonic audio music excerpts, in Python/Numpy
Asteroid
- Asteroid - a Pytorch-based audio source separation toolkit that enables fast experimentation on common datasets. It comes with a source code that supports a large range of datasets and architectures, and a set of recipes to reproduce some important papers.
- https://github.com/saurjya/asteroid - The PyTorch-based audio source separation toolkit for researchers || Pretrained models available
Music Separation Enhancement With Generative Modeling
- https://github.com/interactiveaudiolab/MSG - the official implementation of the Make it Sound Good (MSG) model from the 2022 ISMIR paper "Music Separation Enhancement with Generative Modeling". We introduce Make it Sound Good (MSG), a post-processor that enhances the output quality of source separation systems like Demucs, Wavenet, Spleeter, and OpenUnmix.
Vocal separation
Voiceolation
- Voiceolation - a music source separator that extracts vocals from songs. It is written in Python and uses image-segmentation methods, with a U-Net model implemented in Keras and TensorFlow.
vocal-remover
- https://github.com/tsurumeso/vocal-remover - a deep-learning-based tool to extract the instrumental track from your songs.
vocal-music-separation
- https://github.com/zingmars/vocal-music-separation - Software that performs the separation of vocals from music using neural networks (part of my Bachelor's thesis). This CNN attempts to separate the vocals from the music. It does so by training on the amplitude data of the audio file and tries to estimate where the voiced parts are. Vocal separation is done by generating a binary mask of the time-frequency bins that the network thinks contain the vocals and applying it to the original file.
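The binary-mask approach described above is easy to sketch: estimate which time-frequency bins belong to the vocals, threshold that estimate into a boolean mask, and multiply it with the mixture spectrogram. A toy NumPy illustration, with random data standing in for a real STFT and network output:

```python
import numpy as np

# Toy sketch (not the model above): apply a binary time-frequency mask
# to a magnitude spectrogram, keeping only bins estimated to contain vocals.
rng = np.random.default_rng(0)
spectrogram = rng.random((4, 5))          # |STFT|, shape (freq_bins, frames)
vocal_estimate = rng.random((4, 5))       # hypothetical network output in [0, 1]

mask = vocal_estimate > 0.5               # binary mask over time-frequency bins
vocals = spectrogram * mask               # masked bins keep their magnitude
accompaniment = spectrogram * ~mask       # complement recovers the rest

# The two masked spectrograms always sum back to the original mixture.
assert np.allclose(vocals + accompaniment, spectrogram)
```

In a real system the mask is applied to the complex STFT and inverted back to audio; the hard 0/1 mask is what distinguishes this scheme from soft (ratio) masking.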
blind-audio-source-separation-cnn
- https://github.com/ivasique/blind-audio-source-separation-cnn - A convolutional neural network for blind audio source separation.
SnnAsp
- https://github.com/NeuroSumbaD/SnnAsp - This project explores applications of spiking neural networks for blind source audio separation. Using ARTRS, an eight-channel, multiple-talker dataset has been synthesized to simulate input from an eight-microphone linear array.
voicefilter
- https://github.com/mindslab-ai/voicefilter - Unofficial PyTorch implementation of Google AI's VoiceFilter system
- VoiceFilter: Targeted Voice Separation by Speaker-Conditioned Spectrogram Masking | Papers With Code - In this paper, we present a novel system that separates the voice of a target speaker from multi-speaker signals, by making use of a reference signal from the target speaker. We achieve this by training two separate neural networks: (1) A speaker recognition network that produces speaker-discriminative embeddings; (2) A spectrogram masking network that takes both noisy spectrogram and speaker embedding as input, and produces a mask. Our system significantly reduces the speech recognition WER on multi-speaker signals, with minimal WER degradation on single-speaker signals.
- 2009.04323 VoiceFilter-Lite: Streaming Targeted Voice Separation for On-Device Speech Recognition - We introduce VoiceFilter-Lite, a single-channel source separation model that runs on the device to preserve only the speech signals from a target user, as part of a streaming speech recognition system. Delivering such a model presents numerous challenges: It should improve the performance when the input signal consists of overlapped speech, and must not hurt the speech recognition performance under all other acoustic conditions. Besides, this model must be tiny, fast, and perform inference in a streaming fashion, in order to have minimal impact on CPU, memory, battery and latency. We propose novel techniques to meet these multi-faceted requirements, including using a new asymmetric loss, and adopting adaptive runtime suppression strength. We also show that such a model can be quantized as a 8-bit integer model and run in realtime.
- 1810.04826 VoiceFilter: Targeted Voice Separation by Speaker-Conditioned Spectrogram Masking - the arXiv preprint of the VoiceFilter paper summarised above.
DNN-based source separation
- https://github.com/tky823/DNN-based_source_separation - A PyTorch implementation of DNN-based source separation.
- https://github.com/haoheliu/torchsubband - Pytorch implementation of subband decomposition
XSpeech
- https://github.com/tky823/XSpeech - A PyTorch implementation of target speaker extraction.
TasNet / Conv-TasNet
- https://github.com/kaituoxu/TasNet - A PyTorch implementation of "TasNet: Time-domain Audio Separation Network for Real-time, single-channel speech separation", published in ICASSP2018, by Yi Luo and Nima Mesgarani.
- https://github.com/kaituoxu/Conv-TasNet - A PyTorch implementation of Conv-TasNet described in "Conv-TasNet: Surpassing Ideal Time-Frequency Magnitude Masking for Speech Separation".
LSTM/BLSTM based PIT for Two Speakers
- https://github.com/aishoot/LSTM_PIT_Speech_Separation - Two-talker Speech Separation with LSTM/BLSTM by Permutation Invariant Training method.
Ultimate Vocal Remover GUI
- https://github.com/Anjok07/ultimatevocalremovergui - uses state-of-the-art source separation models to remove vocals from audio files. UVR's core developers trained all of the models provided in this package (except for the Demucs v3 and v4 4-stem models). These bundles contain the UVR interface, Python, PyTorch, and other dependencies needed to run the application effectively. No prerequisites are required.
Video separation
Move2Hear
- Move2Hear - Active Audio-Visual Source Separation. We introduce the active audio-visual source separation problem, where an agent must move intelligently in order to better isolate the sounds coming from an object of interest in its environment. The agent hears multiple audio sources simultaneously (e.g., a person speaking down the hall in a noisy household) and it must use its eyes and ears to automatically separate out the sounds originating from a target object within a limited time budget. Towards this goal, we introduce a reinforcement learning approach that trains movement policies controlling the agent's camera and microphone placement over time, guided by the improvement in predicted audio separation quality. We demonstrate our approach in scenarios motivated by both augmented reality (system is already co-located with the target object) and mobile robotics (agent begins arbitrarily far from the target object). Using state-of-the-art realistic audio-visual simulations in 3D environments, we demonstrate our model's ability to find minimal movement sequences with maximal payoff for audio source separation.
- https://github.com/SAGNIKMJR/move2hear-active-AV-separation - PyTorch implementation of the ICCV 2021 paper "Move2Hear: Active Audio-Visual Source Separation" by Sagnik Majumder, Ziad Al-Halah and Kristen Grauman (The University of Texas at Austin, Facebook AI Research), with the associated datasets.
VoiceMe
- https://github.com/polvanrijn/VoiceMe - Novel text-to-speech systems can generate entirely new voices that were not seen during training. However, it remains a difficult task to efficiently create personalized voices from a high dimensional speaker space. In this work, we use speaker embeddings from a state-of-the-art speaker verification model (SpeakerNet) trained on thousands of speakers to condition a TTS model. We employ a human sampling paradigm to explore this speaker latent space. We show that users can create voices that fit well to photos of faces, art portraits, and cartoons. We recruit online participants to collectively manipulate the voice of a speaking face. We show that (1) a separate group of human raters confirms that the created voices match the faces, (2) speaker gender apparent from the face is well-recovered in the voice, and (3) people are consistently moving towards the real voice prototype for the given face. Our results demonstrate that this technology can be applied in a wide number of applications including character voice development in audiobooks and games, personalized speech assistants, and individual voices for people with speech impairment.
Diarisation
- https://en.wikipedia.org/wiki/Speaker_diarisation - the process of partitioning an input audio stream into homogeneous segments according to the speaker identity. It can enhance the readability of an automatic speech transcription by structuring the audio stream into speaker turns and, when used together with speaker recognition systems, by providing the speaker’s true identity. It is used to answer the question "who spoke when?" Speaker diarisation is a combination of speaker segmentation and speaker clustering. The first aims at finding speaker change points in an audio stream. The second aims at grouping together speech segments on the basis of speaker characteristics.
- https://github.com/wq2012/awesome-diarization - A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.
- https://github.com/hitachi-speech/EEND - End-to-End Neural Diarization is a neural-network-based speaker diarization method.
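The clustering half of diarisation ("who spoke when?") can be sketched with a toy greedy scheme: assign each segment embedding to an existing speaker centroid if its cosine similarity clears a threshold, otherwise start a new speaker. This is a simplification (real systems typically use spectral or agglomerative clustering over all segments); the embeddings and threshold below are made up:

```python
import numpy as np

# Toy illustration of the clustering stage of diarisation (not EEND):
# greedily group segment embeddings by cosine similarity to cluster centroids.
def cluster_segments(embeddings, threshold=0.9):
    centroids, labels = [], []
    for e in embeddings:
        e = e / np.linalg.norm(e)
        sims = [float(c @ e) for c in centroids]
        if sims and max(sims) >= threshold:
            labels.append(int(np.argmax(sims)))     # reuse the closest speaker
        else:
            centroids.append(e)                     # start a new speaker cluster
            labels.append(len(centroids) - 1)
    return labels

# Two synthetic "speakers" alternating over four segments:
segs = [np.array([1.0, 0.0]), np.array([0.0, 1.0]),
        np.array([0.9, 0.1]), np.array([0.1, 0.9])]
print(cluster_segments(segs))  # → [0, 1, 0, 1]
```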
Processing tools
See also Effects
SoX
- SoX - Sound eXchange, the Swiss Army knife of sound processing programs. SoX is a cross-platform (Windows, Linux, MacOS X, etc.) command line utility that can convert various formats of computer audio files in to other formats. It can also apply various effects to these sound files, and, as an added bonus, SoX can play and record audio files on most platforms.
play --show-progress -c 2 --null synth brownnoise reverb bass 6 treble -3 echos 0.8 0.9 1000 0.3 1800 0.25 [47]
play -n -c1 synth whitenoise band -n 100 20 band -n 50 20 gain +25 fade h 1 864000 1
play -c2 -n synth pinknoise band -n 280 80 band -n 60 25 gain +20 treble +40 500 bass -3 20 flanger 4 2 95 50 .3 sine 50 lin [48]
- sonfilade - allows the user to rapidly strip junk audio from the beginning and end of audio files. It can be used, for example, to clean up files recorded with Streamripper (e.g., streamripper --xs_padding=5000:5000). Sonfilade is designed to be as effortless and fun as possible to use. An entire edit session can be carried out using only three keys and sound feedback as the entire user interface. (There is also text output, but it is non-essential.) Uses sox.
- https://github.com/tartina/sox-plugins - Some additional plugins for SoX
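Long SoX effect chains like the noise recipes above are easier to keep track of when generated from a script. A minimal sketch that builds (but does not execute) a `play` argument list; `sox_play_args` is a hypothetical helper, and SoX's `play` must be installed to actually run the result:

```python
# Sketch: assemble a SoX `play` invocation programmatically instead of
# hand-editing long effect chains. Mirrors the brown-noise example above.
def sox_play_args(channels=2, effects=()):
    args = ["play", "--show-progress", "-c", str(channels),
            "--null", "synth", "brownnoise"]
    for effect in effects:
        args.extend(effect.split())      # e.g. "bass 6" -> ["bass", "6"]
    return args

cmd = sox_play_args(effects=["reverb", "bass 6", "treble -3"])
print(" ".join(cmd))
# → play --show-progress -c 2 --null synth brownnoise reverb bass 6 treble -3
# To actually play it: subprocess.run(cmd)
```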
pyDub
- https://github.com/jiaaro/pydub - Manipulate audio with a simple and easy high level interface, in Python.
Uos
- http://wiki.lazarus.freepascal.org/uos - a multi-platform package of audio handling routines that unifies the best open-source audio libraries. It plays .mp3, .ogg, .wav, .flac, .m4a, .opus and cdrom audio files at 16-bit, 32-bit or 32-bit float resolution; records all types of input into a file, in 16- or 32-bit resolution, mono or stereo; adds as many DSP effects and filters as you want and records the result; plays multiple inputs and outputs simultaneously; streams mp3 and opus audio over the internet; and produces sound with the built-in synthesizer. Uos can use the SoundTouch, PortAudio, SndFile, Mpg123, Faad, OpusFile and Mp4ff audio libraries. Included in the package: examples and binary libraries for Linux 32/64, arm-Raspberry Pi, Windows 32/64, Mac OSX 32 and FreeBSD 32/64.
Signet
- https://github.com/SamWindell/Signet - Command-line program for editing audio files, and assisting sample library development
jackdiff
- https://github.com/resinbeard/jackdiff - takes your DSP algorithm and uses it to process a signal as a JACK client, vomiting plots of input and output
python-mix
- https://github.com/j3ffhubb/python-mix - CLI audio file mixer using python-wavefile and numpy. Warning: Alpha-grade, under-tested code, use at your own risk
Composers Desktop Project
- CDP Home Page - The CDP software, first released in 1987, now contains hundreds of ways to transform digitally sampled sound. Its software belongs to the musique concrète category, as realised on computer. Processing is off-line. Although the processing is often faster than real-time, several processes could not run in real-time for technical reasons. The CDP software can be run via one of the two available GUIs or via command line / batch file. Based in the UK, CDP is an international network of composers and programmers guided by a vision of amazing sonic possibilities and how they can be woven into the fabric of music. We have been working together since 1986.
- CDP-Soundfiles - The CDP system is one of the most comprehensive and innovative sound design suites available. Written largely by English electro-acoustic composer Trevor Wishart and reflecting his musical aesthetics in many ways, its processes cover almost every aspect of sound manipulation you've ever heard of, plus many that will be unfamiliar, and usually from an original or compositional viewpoint. CDP has over 430 processes covering EDIT-MIX functions, SOUNDFILE processes (time domain), SPECTRAL and PITCH processes, a small but significant SYNTH group, DATA GENERATING functions and a large INFO section. In addition there are over 100 DATA massaging functions and an extensive HELP.
- Soundshaper - a free control interface for the CDP sound transformation software, with an emphasis on speed and ease of use. CDP is a suite of over 430 command-line processes for creating, altering and manipulating sounds to make music. Soundshaper (PC only) fully supports the latest CDP Release 7. Soundshaper provides quick and easy access to CDP processes and parameters and assembles scripts which run CDP in the background. Soundshaper saves CDP output to temporary files, which you can save at any stage. Parameter values can be adjusted at any point, even after further processes have been run. Soundshaper's auto-conversion makes it possible to move seamlessly from one process to another while the program handles the different CDP file types. When run, processes are displayed in a table called the Patch Grid. Soundshaper patches are an easy way to store and recall whole sequences of CDP processes in a fully editable form. All values are retained and the patch can be re-run with any source. Soundshaper patches support up to 16 separate process chains, which can come from different sources. Soundshaper also supports bulk processing, presets and multiple parameter sets.
- The Sound Loom - an integrated graphic interface to the CDP sound-processing software, a comprehensive collection of over 500 instruments for sound transformation developed as practical working tools by composers over many years, available from the Composers' Desktop Project. The Sound Loom + CDP software is a powerful toolbox for composers, not a performance instrument. Using it, you can specify the parameters of any process to any degree of time-varying detail, detail you may have composed or have extracted from some other complex sound-event. You cannot, however, alter these parameters while the process is running. In compensation, the system offers almost any conceivable process for transforming sounds and sound-data (the data might be loudness envelopes, pitch-tracking information, spectral analysis data, filter specifications etc.) all running in a unified, intelligent environment.
- YouTube: Trevor Wishart - Imago
- YouTube: Trevor Wishart - Tongues of Fire
- Xenakios's Blog: CDP frontend Reaper extension plugin
Mammut
- Mammut does an FFT of the whole sound (no windows). Various operations can subsequently be done in the frequency domain, such as nonlinear stretching of the spectrum, spectrum shifting, etc. How is the program useful? Doing a giant FFT of the entire sound, as opposed to splitting the sound up into short windows, is unusual. Such a method implies that time-related parameters are included in the spectral coefficients in a non-intuitive manner, and changes in the frequency domain may radically change developments in the time domain. Mammut is a fairly unpredictable program, and the user will need to get used to letting go of controlling the time axis. The sounding results are often surprising and exciting.
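The whole-sound-FFT idea is easy to reproduce with NumPy: one FFT over the entire signal, an edit on the raw bins, one inverse FFT. A minimal sketch using a crude spectrum shift (not an actual Mammut operation):

```python
import numpy as np

# Sketch of the Mammut approach: one FFT over the *entire* signal
# (no windowing), an edit in the frequency domain, one inverse FFT.
n = 1024
t = np.arange(n)
x = np.sin(2 * np.pi * 8 * t / n)      # an 8-cycle sine across the whole "file"

spectrum = np.fft.rfft(x)              # single giant FFT, no windows
shifted = np.roll(spectrum, 4)         # shift the spectrum up by 4 bins
shifted[:4] = 0                        # zero the wrapped-around low bins
y = np.fft.irfft(shifted, n=n)         # back to the time domain

assert len(y) == n
# The dominant frequency bin moved from 8 to 12:
assert np.argmax(np.abs(np.fft.rfft(y))) == 12
```

Because the transform covers the whole file, this "shift" also scrambles the signal's timing information, which is exactly the unpredictability the description above refers to.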
Fscape
- FScape - a standalone, cross-platform audio rendering software for time-domain and spectral signal processing.
FreqTweak
- FreqTweak is a tool for FFT-based realtime audio spectral manipulation and display. It provides several algorithms for processing audio data in the frequency domain and a highly interactive GUI to manipulate the associated filters for each. It also provides high-resolution spectral displays in the form of scrolling-raster spectrograms and energy vs frequency plots displaying both pre- and post-processed spectra.
- https://github.com/ycollet/freqtweak - Mirror of the original freqtweak repository
- https://github.com/nettings/freqtweak - additional compile fixes on top of ycollet's fork of Jesse Chappell's freqtweak
TAPESTREA
- TAPESTREA (Techniques And Paradigms for Expressive Synthesis, Transformation, and Rendering of Environmental Audio), or taps, is a unified framework for interactively analyzing, transforming and synthesizing complex sounds. Given one or more recordings, it provides well-defined means to: identify points of interest in the sound and extract them into reusable templates; transform sound components independently of the background and/or other events; continually resynthesize the background texture in a perceptually convincing manner; controllably place event templates over backgrounds, using a novel graphical user interface and/or scripts written in the ChucK audio programming language
- YouTube: Tapestrea demo
Build fails on Linux. Fixing two or three indirect includes gets further, but it then fails building its included [old] ChucK.
SPEAR
- SPEAR (Sinusoidal Partial Editing Analysis and Resynthesis) is an application for audio analysis, editing and synthesis. The analysis procedure (which is based on the traditional McAulay-Quatieri technique) attempts to represent a sound with many individual sinusoidal tracks (partials), each corresponding to a single sinusoidal wave with time varying frequency and amplitude. Something which closely resembles the original input sound (a resynthesis) can be generated by computing and adding all of the individual time varying sinusoidal waves together. In almost all cases the resynthesis will not be exactly identical to the original sound (although it is possible to get very close).
Aside from offering a very detailed analysis of the time varying frequency content of a sound, a sinusoidal model offers a great deal of flexibility for editing and manipulation. SPEAR supports flexible selection and immediate manipulation of analysis data, cut and paste, and unlimited undo/redo. Hundreds of simultaneous partials can be synthesized in real-time and documents may contain thousands of individual partials dispersed in time. SPEAR also supports a variety of standard file formats for the import and export of analysis data.
Windows/Mac only :(
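The sinusoidal model SPEAR uses can be sketched directly: each partial is a sinusoid whose frequency and amplitude vary over time, and resynthesis is just their sum, with phase accumulated from the instantaneous frequency. A minimal NumPy sketch with two made-up partials:

```python
import numpy as np

# Sketch of sinusoidal-model resynthesis: each partial is a sinusoid with
# time-varying frequency and amplitude; the output is their sum.
sr = 8000
t = np.arange(sr) / sr                       # one second of samples

def partial(freqs, amps):
    # Phase is the cumulative sum (discrete integral) of instantaneous frequency.
    phase = 2 * np.pi * np.cumsum(freqs) / sr
    return amps * np.sin(phase)

# Two partials: a steady 440 Hz fundamental and a decaying, gliding overtone.
f1 = np.full(sr, 440.0)
f2 = np.linspace(880.0, 870.0, sr)           # slight downward glide
resynth = (partial(f1, np.full(sr, 0.5)) +
           partial(f2, np.linspace(0.4, 0.0, sr)))

assert resynth.shape == (sr,)
assert np.max(np.abs(resynth)) <= 0.9        # amplitudes sum below clipping
```

Real analysis (McAulay-Quatieri) tracks hundreds of such partials from STFT peaks; the resynthesis step, however, is exactly this sum.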
Ceres3
- Ceres3 is a cut-and-paste spectral editor with musically enhanced graphic control over spectral activity of a sound file. It is a free educational program with no other aims, and it owes most of its framework to Oyvind Hammer's Ceres and Jonathan Lee's Ceres2. It has an X-window Motif/OpenMotif based GUI, organized around four principal menus with simple keyboard shortcuts.
- https://github.com/jeremysalwen/Ceres4 - build fails with linked use of deprecated OSS code
ATS
- ATS is a LISP environment for a spectral modeling system based on a sinusoidal plus critical-band noise decomposition. The system can be used to analyze recorded sounds, transform their spectrum using a wide variety of algorithms and resynthesize them both out of time and in real time.
- ATS is a software library of functions for spectral Analysis, Transformation, and Synthesis of sound based on a sinusoidal plus critical-band noise model. A sound in ATS is a symbolic object representing a spectral model that can be sculpted using a variety of transformation functions. Spectral data can be accessed through an API, and saved to/loaded from disk. ATS is written in LISP; its analysis and synthesis algorithms are implemented using the CLM (Common Lisp Music) synthesis and sound processing language.
Only takes mono .wav files
Cecilia
- Cecilia is an audio signal processing environment aimed at sound designers. Cecilia mangles sound in ways unheard of. Cecilia lets you create your own GUI using a simple syntax. Cecilia comes with many original built-in modules and presets for sound effects and synthesis.
Loris
- Loris is an Open Source sound modeling and processing software package based on the Reassigned Bandwidth-Enhanced Additive Sound Model. Loris supports modified resynthesis and manipulations of the model data, such as time- and frequency-scale modification and sound morphing. The Loris programmers' interface supports the C, C++, and Python programming languages, and SWIG interface files are provided so that the API can be easily extended to a variety of other languages. The package includes a handful of utility programs for basic sound modeling and resynthesis, and standard UNIX/Linux tools that build and install the libraries, headers, and utilities.
SMS Tools
- Spectral Modeling Synthesis Tools - SMS Tools is a set of techniques and software implementations for the analysis, transformation, and synthesis of musical sounds based on various spectral modeling approaches. These techniques can be used for synthesis, processing and coding applications, while some of the intermediate results might also be applied to other music related problems, such as sound source separation, musical acoustics, music perception, or performance analysis. The basic model and implementation were developed by Xavier Serra as part of his PhD thesis published 1989. Since then many extensions have been proposed at MTG-UPF and by other researchers.
FxEngine
- FxEngine - an Open C++ Framework under the LGPL license. The FxEngine Framework simplifies the plugin architecture for data flow processing. It provides full control over the plugin architecture for applications that require custom solutions.
- FxJackPack - contains two plugins for the FxEngine framework which enable recording and playing back sound through JACK (Jack Audio Connection Kit).
NASPRO
- NASPRO - acronym for "NASPRO Architecture for Sound PROcessing" is a collection of free and open source sound processing software built around the LV2 plugin standard.
Aglaophone
SpectMorph
- SpectMorph is a free software project which lets you analyze samples of musical instruments and combine them (morphing). It can be used to construct hybrid sounds, for instance a sound between a trumpet and a flute; or smooth transitions, for instance a sound that starts as a trumpet and then gradually changes to a flute.
- YouTube: SpectMorph Overview
Spectral Toolbox
- The Spectral Toolbox - a suite of analysis-resynthesis programs that locate relevant partials of a sound and allow them to be resynthesized at any specified frequencies. This enables a variety of techniques including spectral mappings (sending all partials of a sound to fixed destinations), spectral morphing (continuously interpolating between the partials of a source sound and a destination) and dynamic tonality (a way of organizing the relationship between a family of tunings and a set of related timbres). A complete application called the TransFormSynth concretely demonstrates the methods using either a one-dimensional controller such as a MIDI keyboard or a two-dimensional control surface (such as a MIDI guitar, a computer keyboard, or the forthcoming Thummer controller). Requires installing either Max Runtime (free from cycling74) or Max/MSP (not free) and some Java routines.
Wav2Spectrum
- https://github.com/paulnasca/wav2spectrum - a simple application which takes a small chunk (window) from an input wav and outputs the frequencies one by one (a sweep) into another wav file. It is very useful for hearing the harmonics (one by one) of a sound. It can also be used as a spectrum tool for blind people who are interested in sound analysis.
wave-tools
- https://github.com/ZoeB/wave-tools - Command line tools for wave files
swingify
- Swingify - Upload any audio file and make it swing.
Melodyne
$
- YouTube: Melodyne - Direct Note Access
Denoise
Gnome Wave Cleaner
- Gnome Wave Cleaner - mainly to remove clicks from recorded vinyl
Postfish
audio-declipper
- https://github.com/kripton/audio-declipper - Proof-of-concept (or more) to declip sample-based audio files
Neural network
- https://github.com/GuitarML/NeuralPi - Raspberry Pi guitar pedal using neural networks to emulate real amps and pedals.
- RAVE: A variational autoencoder for fast and high-quality neural audio synthesis - Deep generative models applied to audio have improved by a large margin the state-of-the-art in many speech and music related tasks. However, as raw waveform modelling remains an inherently difficult task, audio generative models are either computationally intensive, rely on low sampling rates, are complicated to control or restrict the nature of possible signals. Among those models, Variational AutoEncoders (VAE) give control over the generation by exposing latent variables, although they usually suffer from low synthesis quality. In this paper, we introduce a Realtime Audio Variational autoEncoder (RAVE) allowing both fast and high-quality audio waveform synthesis. We introduce a novel two-stage training procedure, namely representation learning and adversarial fine-tuning. We show that using a post-training analysis of the latent space allows a direct control between the reconstruction fidelity and the representation compactness. By leveraging a multi-band decomposition of the raw waveform, we show that our model is the first able to generate 48kHz audio signals, while simultaneously running 20 times faster than real-time on a standard laptop CPU. We evaluate synthesis quality using both quantitative and qualitative subjective experiments and show the superiority of our approach compared to existing models. Finally, we present applications of our model for timbre transfer and signal compression. All of our source code and audio examples are publicly available.
- https://github.com/caillonantoine/RAVE - variational autoencoder for fast and high-quality neural audio synthesis
- Efficient neural networks for real-time analog audio effect modeling - Deep learning approaches have demonstrated success in the task of modeling analog audio effects such as distortion and overdrive. Nevertheless, challenges remain in modeling more complex effects along with their variable parameters, such as dynamic range compressors. Previous methods not only exhibit a high level of noise and artifacts, they also require large training datasets, are computationally complex, and noncausal, prohibiting real-time operation. In this work, we demonstrate that more efficient temporal convolution networks (TCNs), shallow networks that exploit very large dilation factors to attain significant receptive field, can achieve state-of-the-art performance. We demonstrate that not only do these models produce results perceptually indistinguishable from the original effect, but unlike previous methods, are also capable of running in real-time on GPU and CPU, and can be trained using only 1% of the training data (~10 min) from previous methods.
- https://github.com/erl-j/neural-instrument-cloning - In this project we combine techniques from neural voice cloning and musical instrument synthesis to achieve good results from as little as 16 seconds of target data.
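The receptive-field arithmetic behind the TCN paper above is simple enough to check by hand: each causal dilated convolution layer adds (kernel_size - 1) × dilation samples of context. A back-of-the-envelope sketch (the layer configuration here is made up, not the paper's):

```python
# Receptive field of a stack of causal dilated convolutions, the mechanism
# that lets shallow TCNs cover long audio contexts.
def tcn_receptive_field(kernel_size, dilations):
    # Each layer adds (kernel_size - 1) * dilation samples of context.
    return 1 + sum((kernel_size - 1) * d for d in dilations)

# Five layers, kernel 3, dilations growing by 10x per layer:
rf = tcn_receptive_field(3, [1, 10, 100, 1000, 10000])
print(rf)     # → 22223 samples, roughly half a second at 44.1 kHz
```

This is why very large dilation factors matter: five thin layers reach a context that an undilated stack would need thousands of layers to cover.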
Audio and MIDI looping
See also Sampling#Audio looping and MIDI#MIDI looping
Giada
- Giada - a free, minimal, hardcore audio tool for DJs, live performers and electronic musicians. How does it work? Just pick up your channel, fill it with samples or MIDI events and start the show by using this tiny piece of software as a loop machine, drum machine, sequencer, live sampler or yet as a plugin/effect host. Giada aims to be a compact and portable virtual device for Linux, Mac OS X and Windows for production use and live sets.
- YouTube: Giada LoopMachine
What you can control with MIDI:
- Global elements — sequencer, metronome, main volumes and so on, stored inside the configuration file and you set them once;
- Per-channel elements — channel on/off, mute, volume, solo and so on, stored inside the patch and you set them whenever you create a new song.
There is no MIDI mapping system; each binding is 'channel'-specific ('channel' being the Giada term for a sample or sequence), which doesn't seem like it would scale well.
Soundscape
Boodler
- Boodler is an open-source soundscape tool -- continuous, infinitely varying streams of sound. Boodler is designed to run in the background on a computer, maintaining whatever sound environment you desire. Boodler is extensible, customizable, and modular. Each soundscape is a small piece of Python code -- typically less than a page. A soundscape can incorporate other soundscapes; it can combine other soundscapes, switch between them, fade them in and out. This package comes with many example soundscapes. You can use these, modify them, combine them to arbitrary levels of complexity, or write your own.
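Boodler's core idea, a soundscape as a small piece of Python that keeps scheduling sounds, can be caricatured in a few lines. This is not Boodler's actual API, just a toy scheduler that produces (start_time, sample_name) pairs with irregular gaps:

```python
import random

# Toy sketch of the soundscape-as-code idea (not Boodler's API): build a
# schedule of sound events with irregular gaps so the result never loops.
def schedule_soundscape(samples, events=10, max_gap=5.0, seed=42):
    rng = random.Random(seed)
    t, plan = 0.0, []
    for _ in range(events):
        t += rng.uniform(0.5, max_gap)        # irregular gap before each event
        plan.append((round(t, 2), rng.choice(samples)))
    return plan

plan = schedule_soundscape(["rain", "wind", "birds"])
assert len(plan) == 10
assert all(a[0] < b[0] for a, b in zip(plan, plan[1:]))  # strictly increasing
```

A real soundscape agent would hand each event to an audio player as its start time arrives, and could recurse into sub-soundscapes the same way Boodler's agents do.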
Klangwunder3000
- https://github.com/bk138/klangwunder3000/ - a cross-platform soundscape generator. It loads a set of sound files and associated control data and generates a constantly changing aural ambience.
Random Parallel Player
- https://github.com/hilbrichtsoftware/random-parallel-player - Takes a bunch of audio files as tracks and plays them back randomly creating new music each playthrough. The core rule of RPP: No human interaction once the playback has started. RPP is based on an idea of Louigi Verona. The included audio samples in example.rpp were created by him. You can read about the original project here
Atmosfear
- https://github.com/teragonaudio/Atmosfear - a VSTi plugin which generates random atmospheric soundscapes with samples scraped from FreeSound. We had originally imagined that the plugin could generate soundscapes resembling parks, public places, nature, etc. However, the resulting sounds that it makes are generally quite surreal and creepy, hence the name. :)
Foco
- https://github.com/akashnimare/foco - a cross-platform desktop app which runs in the menubar. Foco boosts your productivity by creating the perfect productive environment. It has the best sounds for getting work done.
Blanket
- https://github.com/rafaelmardojai/blanket - Improve focus and increase your productivity by listening to different sounds, or fall asleep in a noisy environment.
jungle
- jungle - an audio system that lets you create an ambiance using random audio samples. It works with systems that have one or more audio devices (with one or more channels) and can also use multiple systems (e.g. a couple of Raspberry Pis) in a cluster. It requires Linux and uses the ALSA subsystem. The author runs it with a server PC and 3 Raspberry Pis.
Web
- myNoise - background noises and relaxing soundscape generator, web/app
Sonification
See also Synthesis#Graphics synthesis
- https://en.wikipedia.org/wiki/Sonification - the use of non-speech audio to convey information or perceptualize data. Auditory perception has advantages in temporal, spatial, amplitude, and frequency resolution that open possibilities as an alternative or complement to visualization techniques. For example, the rate of clicking of a Geiger counter conveys the level of radiation in the immediate vicinity of the device.
- PDF: The Sounding Object - Edited by Davide Rocchesso and Federico Fontana
- The Sonification Handbook - Edited by Thomas Hermann, Andy Hunt, John G. Neuhoff
- CodeSounding - an open source sonification framework which makes it possible to hear how any existing Java program "sounds like", by assigning instruments and pitches to code statements (if, for, etc.) and playing them as they are executed at runtime. In this way the flow of execution is played as a flow of music, and its rhythm changes depending on user interaction.
- https://github.com/markandrus/Sonify - JACK plugin that encodes images into audio and vice-versa, in realtime.
- HyperMammut - transform sounds to images and vice-versa using single BIG Fourier Transforms (or DCT/DST,etc.).
- https://github.com/chronopoulos/heliosonic - Tools for the sonification of helioseismic data.
- [Listen to your web pages](https://gist.github.com/tomhicks/6cb5e827723c4eaef638bf9f7686d2d8) - tomhicks/plink-plonk.js
- https://github.com/rodneydup/SuperMarioBrosSonification - Maps of Super Mario Bros - Sonified
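The Geiger-counter example above is the simplest kind of parameter mapping: a data value drives an audible parameter. A minimal, hypothetical Python sketch of the same idea, mapping a data series to pitch (higher values give higher sine-tone frequencies; all names are invented for illustration):

```python
import math

def sonify(data, sr=8000, note_dur=0.25, f_lo=220.0, f_hi=880.0):
    """Map each data point to a short sine tone: low values -> low pitch."""
    lo, hi = min(data), max(data)
    span = (hi - lo) or 1.0  # avoid division by zero for constant data
    samples = []
    for x in data:
        freq = f_lo + (x - lo) / span * (f_hi - f_lo)
        n = int(sr * note_dur)
        samples.extend(math.sin(2 * math.pi * freq * i / sr) for i in range(n))
    return samples

tones = sonify([1, 5, 3, 9])   # rising data -> rising pitch
print(len(tones))              # 4 notes x 2000 samples = 8000
```

Real sonification toolkits map many more parameters (timbre, spatial position, click rate), but the pattern is the same: normalize the data, then feed it into a synthesis parameter.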
Retargeting
- Scalable Music: Automatic Music Retargeting and Synthesis - S. Wenner, J.C. Bazin, A. Sorkine-Hornung, C. Kim, M. Gross. In this paper we propose a method for dynamic rescaling of music, inspired by recent works on image retargeting, video reshuffling and character animation in the computer graphics community. Given the desired target length of a piece of music and optional additional constraints such as position and importance of certain parts, we build on concepts from seam carving, video textures and motion graphs and extend them to allow for a global optimization of jumps in an audio signal. Based on an automatic feature extraction and spectral clustering for segmentation, we employ length-constrained least-costly path search via dynamic programming to synthesize a novel piece of music that best fulfills all desired constraints, with imperceptible transitions between reshuffled parts. We show various applications of music retargeting such as part removal, decreasing or increasing music duration, and in particular consistent joint video and audio editing.
- https://github.com/ucbvislab/radiotool - a python library that aims to make it easy to create audio by piecing together bits of other audio files. This library was originally written to enable my research in audio editing user interfaces, but perhaps someone else might find it useful.
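The length-constrained least-costly path search described in the Scalable Music abstract can be sketched as a small dynamic program: given per-segment lengths and pairwise jump costs, find the cheapest sequence of segments with a given total length. This is a hypothetical illustration of the idea, not the authors' code; all names are invented:

```python
def retarget(trans_cost, seg_len, target_len, start, end):
    """trans_cost[i][j]: cost of jumping from segment i to segment j.
    Find a segment sequence from `start` to `end` whose lengths sum to
    exactly target_len, minimising the summed jump cost."""
    n = len(seg_len)
    # state: (current segment, total length used) -> (cost, path)
    best = {(start, seg_len[start]): (0.0, [start])}
    for _ in range(target_len):  # each hop adds >= 1, so this many suffice
        nxt = dict(best)
        for (i, used), (cost, path) in best.items():
            for j in range(n):
                u = used + seg_len[j]
                if u > target_len:
                    continue
                c = cost + trans_cost[i][j]
                if (j, u) not in nxt or c < nxt[(j, u)][0]:
                    nxt[(j, u)] = (c, path + [j])
        best = nxt
    return best.get((end, target_len))  # (cost, path), or None if infeasible

cost, path = retarget(
    trans_cost=[[9, 0, 9], [9, 9, 0], [9, 9, 9]],  # cheap forward jumps
    seg_len=[2, 2, 2], target_len=6, start=0, end=2)
print(cost, path)  # cheapest length-6 sequence just plays 0 -> 1 -> 2
```

Because segments may repeat, the same search can lengthen a piece (looping cheap jumps) or shorten it (skipping segments), which is the "retargeting" in the paper's title.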
Web
See also Web Audio, Drumming#Web
- Clyp - Upload high-quality audio, collaborate or get feedback, and easily share anywhere.
- Vocaroo - Online voice recorder
- SCM Music Player - seamless music for your website
Background
- Rainy Mood - #1 Rain Sounds • Sleep & Study
- myNoise - Focus at Work • Relax at Home • Sleep at Night
- Rainy Cafe - Ambient White Noise Generator. 雨のカフェ
- Time for Zen - The best meditation, nature music collection on the web.
- raining.fm - Relaxing rain audio for work, play and sleep
Tuning
Dart-Mic
- Dart-Mic - a JavaScript library which listens to microphone input and performs pitch/note detection, volume detection, recording, and general purpose data processing. It makes use of the Web Audio API (which is only supported by Chrome currently) and DSP.js.
AudioNotch
- AudioNotch - Tinnitus Treatment Sound Therapy - Tuner and Tone Generator
x42-tuner
- x42-tuner - aka Tuna.LV2, a musical instrument tuner with strobe characteristics in LV2 plugin format.
XTuner
- https://github.com/brummer10/XTuner - Virtual Tuner for Jack Audio Connection Kit, including NSM support
tunescope
- https://github.com/dack/tunescope - an oscilloscope-style guitar tuner. It uses JACK for audio input and OpenGL for rendering. The signal is displayed in both normal and XY mode, using an automatically selected note as the reference.
FMIT
- FMIT - Free Music Instrument Tuner, a graphical utility for tuning your musical instruments, with error and volume history and advanced features.
LINGOT
- LINGOT - a musical instrument tuner. It's accurate, easy to use, and highly configurable. Originally conceived to tune electric guitars, it can now be used to tune other instruments. It looks like an analogue tuner, with a gauge indicating the relative shift to a certain note, found automatically as the closest note to the estimated frequency.
jackstrobe
- https://github.com/jessecrossen/jackstrobe - A simple strobe tuner using JACK and Qt 5.
Guitar Tuning Database
alt-tuner
- alt-tuner - a commercial DAW microtonal tuning plug-in that retunes almost every MIDI keyboard or softsynth. It runs on PCs, Macs and Linux/Wine machines.
MTuner
- https://www.meldaproduction.com/MTuner - a simple Windows VST audio frequency analyzer designed mostly for tuning guitars and other instruments. It detects frequency, note and deviation from correct pitch in cents, resolving frequencies in the range 50Hz to 2kHz, which is enough for most instruments and vocals.
tuner
- https://github.com/logsol/tuner - A simple standalone app for macOS based on JUCE that detects the frequency of an instrument and shows its note based on autocorrelation.
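Several of the tuners above estimate pitch from the input signal; autocorrelation, which the last entry names explicitly, is one classic approach: the lag at which a signal best correlates with itself is one period of the fundamental. A minimal pure-Python sketch (function name and parameters are invented for illustration; real tuners interpolate between lags for sub-Hz accuracy):

```python
import math

def detect_pitch(samples, sr, f_min=50.0, f_max=2000.0):
    """Estimate the fundamental frequency as sr / lag, where lag
    maximises the autocorrelation within the plausible pitch range."""
    lag_min = int(sr / f_max)          # shortest period to consider
    lag_max = int(sr / f_min)          # longest period to consider
    n = len(samples)
    best_lag, best_r = lag_min, float("-inf")
    for lag in range(lag_min, min(lag_max, n // 2)):
        r = sum(samples[i] * samples[i + lag] for i in range(n - lag))
        if r > best_r:
            best_r, best_lag = r, lag
    return sr / best_lag

sr = 8000
tone = [math.sin(2 * math.pi * 400 * i / sr) for i in range(2048)]
print(detect_pitch(tone, sr))  # 400 Hz has a period of exactly 20 samples
```

The 50 Hz to 2 kHz range mirrors the span MTuner above quotes as sufficient for most instruments and vocals.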
Machine learning
See also Synth vox
- https://github.com/bmcfee/pumpp - practically universal music pre-processor ("pumpp up the jams"). The goal of this package is to make it easy to convert pairs of (audio, jams) into data that can be easily consumed by statistical algorithms.
- Audio Style Transfer with Rhythmic Constraints | Maciek Tomczak - https://github.com/maciek-tomczak/audio-style-transfer-with-rhythmic-constraints
- https://github.com/SoMA-group/ADS - TensorFlow implementation of adversarial drum synth (ADS) from the paper Adversarial Synthesis of Drum Sounds @ The 2020 DAFx Conference.
- https://github.com/csteinmetz1/auraloss - Collection of audio-focused loss functions in PyTorch
- https://github.com/KinWaiCheuk/nnAudio - an audio processing toolbox using PyTorch convolutional neural networks as its backend. Spectrograms can thus be generated from audio on-the-fly during neural network training, and the Fourier kernels (e.g. CQT kernels) can themselves be trained. Kapre has a similar concept, also using a 1D convolutional neural network to extract spectrograms, based on Keras. Other GPU audio processing tools are torchaudio and tf.signal, but they do not use the neural network approach, so the Fourier basis cannot be trained. As of PyTorch 1.6.0, torchaudio was still very difficult to install under Windows due to sox; nnAudio is a more compatible audio processing tool across operating systems since it relies mostly on PyTorch convolutional neural networks. The name nnAudio comes from torch.nn.
- MuseGAN - a project on music generation. In a nutshell, we aim to generate polyphonic music of multiple tracks (instruments). The proposed models are able to generate music either from scratch, or by accompanying a track given a priori by the user.
- https://github.com/SoMA-group/style-drumsynth - Style-based Neural Drum Synthesis with GAN inversion
- https://github.com/serkansulun/deep-music-enhancer - Source code for paper "On Filter Generalization for Music Bandwidth Extension Using Deep Neural Networks", Serkan Sulun, Matthew E. P. Davies, 2020. https://arxiv.org/abs/2011.07274v2
- Steerable discovery of neural audio effects - Applications of deep learning for audio effects often focus on modeling analog effects or learning to control effects to emulate a trained audio engineer. However, deep learning approaches also have the potential to expand creativity through neural audio effects that enable new sound transformations. While recent work demonstrated that neural networks with random weights produce compelling audio effects, control of these effects is limited and unintuitive. To address this, we introduce a method for the steerable discovery of neural audio effects. This method enables the design of effects using example recordings provided by the user. We demonstrate how this method produces an effect similar to the target effect, along with interesting inaccuracies, while also providing perceptually relevant controls.
- https://github.com/jhtonyKoo/e2e_music_remastering_system - This repository includes source code and pre-trained models of the work End-to-end Music Remastering System Using Self-supervised and Adversarial Training by Junghyun Koo, Seungryeol Paik, and Kyogu Lee.
- https://github.com/SonyCSLParis/music-inpainting-ts - A collection of web interfaces for AI-assisted interactive music creation
- https://github.com/teticio/audio-diffusion - Apply diffusion models using the new Hugging Face diffusers package to synthesize music instead of images.
- AudioLDM: Text-to-Audio Generation with Latent Diffusion Models - Text-to-audio (TTA) systems have recently gained attention for their ability to synthesize general audio based on text descriptions. However, previous studies in TTA have limited generation quality with high computational costs. In this study, we propose AudioLDM, a TTA system that is built on a latent space to learn the continuous audio representations from contrastive language-audio pretraining (CLAP) latents. The pretrained CLAP models enable us to train latent diffusion models (LDMs) with audio embedding while providing text embedding as a condition during sampling. By learning the latent representations of audio signals and their compositions without modeling the cross-modal relationship, AudioLDM is advantageous in both generation quality and computational efficiency. Trained on AudioCaps with a single GPU, AudioLDM achieves state-of-the-art TTA performance measured by both objective and subjective metrics (e.g. Fréchet distance). Moreover, AudioLDM is the first TTA system that enables various text-guided audio manipulations (e.g. style transfer) in a zero-shot fashion. * https://github.com/haoheliu/AudioLDM
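nnAudio's core trick above, computing spectrograms by convolving the signal with fixed (or trainable) Fourier kernels, can be illustrated without any deep-learning framework. A toy pure-Python sketch of the idea, with invented names (nnAudio itself builds these kernels as conv1d weights so they run on the GPU and can receive gradients):

```python
import math

def conv_spectrogram(x, n_fft=64, hop=32):
    """Magnitude spectrogram via explicit 'convolution' of each frame
    with one cosine and one sine kernel per frequency bin -- the same
    structure nnAudio expresses as a 1D convolution."""
    kernels = [
        ([math.cos(2 * math.pi * k * n / n_fft) for n in range(n_fft)],
         [-math.sin(2 * math.pi * k * n / n_fft) for n in range(n_fft)])
        for k in range(n_fft // 2 + 1)          # bins 0 .. Nyquist
    ]
    frames = [x[i:i + n_fft] for i in range(0, len(x) - n_fft + 1, hop)]
    spec = []
    for frame in frames:
        row = []
        for cos_k, sin_k in kernels:
            re = sum(f * c for f, c in zip(frame, cos_k))
            im = sum(f * s for f, s in zip(frame, sin_k))
            row.append(math.hypot(re, im))      # magnitude of the DFT bin
        spec.append(row)
    return spec  # frames x (n_fft // 2 + 1) bins

n_fft = 64
x = [math.sin(2 * math.pi * 8 * n / n_fft) for n in range(256)]  # energy at bin 8
spec = conv_spectrogram(x, n_fft=n_fft, hop=32)
peaks = [max(range(len(row)), key=row.__getitem__) for row in spec]
print(peaks)  # every frame peaks at bin 8
```

Once the kernels are parameters rather than constants, as in nnAudio, gradient descent can reshape this Fourier basis for the task at hand.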
Video
See also Video
- Xjadeo - a software video player that displays a video clip in sync with an external time source (MTC, LTC, JACK transport). Xjadeo is useful in soundtrack composition, video monitoring, or any task that requires synchronizing movie frames with external events.
Gaze
AR / VR
See also Spatial audio
- https://github.com/bDunph/ImmersAV - an open source toolkit for immersive audiovisual composition. It was built around a focused approach to composition based on generative audio, raymarching and interactive machine learning techniques.
- Mixthesia - an exploratory project focused on discovering intuitive interactions in VR that introduce novice musicians to sound mixing tools and concepts. Sound mixing is an arcane craft performed in closed-off rooms for extended periods of time through a complex interface. By prototyping different visualizations and interactions, the project aims to demystify the craft of sound mixing by avoiding the complex traditional interface.
Performance
- The Box of No Return - a Linux-based musical synthesizer platform, suitable for live musicianship, designed to handle multiple patches with enormous demands, and switch between them with zero delay and zero cutout. If you sit in your home studio and use single SoundFonts with a laptop and simple GUI, you don't need this. If you play live, and pile on the tone generators and filters in patch development in order to feel and deliver the unyielding power of the musical harmonic roar, a full implementation of the BNR may suit you well. There are obviously middle grounds too ☺, and there are articles here to help in general.
- https://github.com/EliasKesh/LiveMusicApp - GTK Application to configure and control music software for live performances
Games
- FRACT - a musical exploration game. You arrive in a forgotten place and explore the unfamiliar landscape to discover the secrets of an abandoned world that was once built on sound. As you start to make sense of this strange new environment, you work to rebuild its machinery by solving puzzles and bring the world back to life by shaping sound and creating music in the game.
- https://sunebear.github.io/Piano-Flow - A music game that follows piano flows.
DOS
macOS
Other
- https://github.com/ggerganov/kbd-audio - Tools for capturing and analysing keyboard input paired with microphone capture [52]
- https://en.wikipedia.org/wiki/ReWire_(software_protocol) - a software protocol, jointly developed by Propellerhead and Steinberg, allowing remote control and data transfer among digital audio editing and related software. Originally appearing in the ReBirth software synthesizer in 1998, the protocol has since evolved into an industry standard. Currently used in macOS and Microsoft Windows 32-bit or 64-bit audio applications, ReWire enables the simultaneous transfer of up to 256 audio tracks of arbitrary resolution and 4080 channels of MIDI data. This allows, for example, the output from synthesizer software to be fed directly into a linear editor without the use of intermediate files or analog transfers. There are also provisions to remotely trigger actions, such as starting and stopping recording. The protocol is licensed free of charge to companies only, but comes with a "non-disclosure of source code" license that is incompatible with most free-software licenses. The ReWire system consists of "Hosts", "Panels", and "Devices". Hosts are the host applications which typically do the sequencing at one end and the final mixdown at the other end. A Device is a dynamic link library that only generates sound; it has no user interface. A Panel is a graphical interface for setting the parameters of one Device. A typical setup would be to use Ableton Live in "Host" mode, and use Propellerhead Reason as a synthesizer. In this case Reason would provide Device/Panel pairs to Ableton, which could then send midi commands, sync timing and mix Reason's output into its own effects chains. Many applications support either mode. In fact, an application could (at the discretion of a developer) act as both a Host and a Panel at the same time.