ML / AI

Things and Stuff Wiki - An organically evolving personal wiki knowledge base. An on-the-fly taxonomy containing a patchwork trail of topic outlines, descriptions, notes, stubs and breadcrumbs, with links to sites, systems, software, manuals, organisations, people, articles, guides, slides, papers, books, comments, videos, screencasts, webcasts, scratchpads and more. Content is orientated towards mostly free/libre/open, mostly Linux. Quality and age vary drastically. Sometimes old things are first, sometimes last. Use the Table of Contents menu to navigate long pages. Zoom in if text is too small. Dead link? Wayback Machine. I probably need to fix the theme CSS after an update. See also libreav.org. Chat to msg me (not checking tho atm).

Machine learning

https://en.wikipedia.org/wiki/Machine_learning

https://news.ycombinator.com/item?id=12713056

YouTube: Neural Network Architectures

A Course in Machine Learning - a set of introductory materials that covers most major aspects of modern machine learning (supervised learning, unsupervised learning, large margin methods, probabilistic modeling, learning theory, etc.). Its focus is on broad applications with a rigorous backbone. A subset can be used for an undergraduate course; a graduate course could probably cover the entire material and then some.

https://news.ycombinator.com/item?id=12936891

Tom Mitchell: Never Ending Language Learning

Neural Networks, Manifolds, and Topology -- colah's blog - [1]

OCDevel

YouTube: From Deep Learning of Disentangled Representations to Higher-level Cognition

https://github.com/iamtrask/Grokking-Deep-Learning - this repository accompanies the book "Grokking Deep Learning"

https://en.wikipedia.org/wiki/Artificial_neuron

https://en.wikipedia.org/wiki/Perceptron

https://news.ycombinator.com/item?id=12751585

https://github.com/rasbt/python-machine-learning-book/blob/master/faq/difference-deep-and-normal-learning.md [2]

"In applications of "usual" machine learning, there is typically a strong focus on the feature engineering part; the model learned by an algorithm can only be so good as its input data. Of course, there must be sufficient discriminatory information in our dataset, however, the performance of machine learning algorithms can suffer substantially when the information is buried in meaningless features. The goal behind deep learning is to automatically learn the features from (somewhat) noisy data; it's about algorithms that do the feature engineering for us to provide deep neural network structures with meaningful information so that it can learn more effectively. We can think of deep learning as algorithms for automatic "feature engineering," or we could simply call them "feature detectors," which help us to overcome the vanishing gradient challenge and facilitate the learning in neural networks with many layers."

Backpropagation 101 · Thinc · A refreshing functional take on deep learning - [3]

http://deeplearning4j.org/

http://www.damninteresting.com/on-the-origin-of-circuits/ [4]

NNdef - Java and XML based Neural Networks and Knowledge Modeling toolkit and library

https://probmods.org/

http://karpathy.github.io/2015/05/21/rnn-effectiveness/

https://en.wikipedia.org/wiki/T-distributed_stochastic_neighbor_embedding
- https://news.ycombinator.com/item?id=12713388

https://news.ycombinator.com/item?id=12942169

http://www.cs.cmu.edu/~tom7/mario/
- https://www.youtube.com/watch?v=xOCurBYI_gY

http://neuralnetworksanddeeplearning.com/ [5]

https://news.ycombinator.com/item?id=8258652

http://www.technologyreview.com/view/530561/the-revolutionary-technique-that-quietly-changed-machine-vision-forever/

https://news.ycombinator.com/item?id=8553307

http://openssi.net/
- http://records.sigmm.ndlab.net/2014/10/ssi-an-open-source-platform-for-social-signal-interpretation/ [6]

YouTube: Artificial Intelligence: Neural Networks in Machine Learning

https://en.wikipedia.org/wiki/Convolutional_neural_network

https://news.ycombinator.com/item?id=9109157

https://news.ycombinator.com/item?id=11530304

https://news.ycombinator.com/item?id=19144280

http://quantombone.blogspot.co.uk/2015/04/deep-learning-vs-probabilistic.html [7]

https://www.gnu.org/software/gneuralnetwork/ [10]

https://open_nsfw.gitlab.io/ [11]

Caffe - a deep learning framework made with expression, speed, and modularity in mind. It is developed by Berkeley AI Research (BAIR) and by community contributors. Yangqing Jia created the project during his PhD at UC Berkeley. Caffe is released under the BSD 2-Clause license.
- Caffe Model Zoo

Torch - a scientific computing framework with wide support for machine learning algorithms that puts GPUs first. It is easy to use and efficient, thanks to an easy and fast scripting language, LuaJIT, and an underlying C/CUDA implementation.
- https://github.com/torch/torch7

https://github.com/karpathy/char-rnn - Multi-layer Recurrent Neural Networks (LSTM, GRU, RNN) for character-level language models in Torch

YouTube: Connections between physics and deep learning - https://news.ycombinator.com/item?id=13062139

DeepRhyme (D-Prime) – generating dope rhymes with deep learning [12]

Composing Music With Recurrent Neural Networks [13]

Data Science Machine - an end-to-end software system that is able to automatically develop predictive models from relational data. The Machine was created by Max Kanter and Kalyan Veeramachaneni at the Computer Science and Artificial Intelligence Laboratory (CSAIL) at MIT. The system automates two of the most human-intensive components of a data science endeavor: feature engineering, and selection and tuning of the machine learning methods that build predictive models from those features. First, an algorithm called Deep Feature Synthesis automatically engineers features. Next, through an approach called Deep Mining, the Machine composes a generalized machine learning pipeline that includes dimensionality reduction methods, feature selection methods, clustering, and classifier design. Finally, it tunes the parameters through a Gaussian Copula Process.
- System that replaces human intuition with algorithms outperforms human teams

What's Next in Deep Learning [14]

Deep Residual Learning for Image Recognition https://news.ycombinator.com/item?id=10715628

http://www.wildml.com/2016/01/attention-and-memory-in-deep-learning-and-nlp/ [15]

TensorFlow - an open source software library for high performance numerical computation. Its flexible architecture allows easy deployment of computation across a variety of platforms (CPUs, GPUs, TPUs), and from desktops to clusters of servers to mobile and edge devices. Originally developed by researchers and engineers from the Google Brain team within Google’s AI organization, it comes with strong support for machine learning and deep learning and the flexible numerical computation core is used across many other scientific domains.

http://tflearn.org
- https://github.com/tflearn/tflearn [16]

http://playground.tensorflow.org/ [17]

https://www.oreilly.com/learning/hello-tensorflow [18]

https://github.com/sherjilozair/char-rnn-tensorflow

https://nucl.ai/blog/enhance-pixel-art/

http://www-personal.umich.edu/~reedscot/iclr_project.html [19]

https://deepmind.com/blog/wavenet-generative-model-raw-audio/ [20]

http://www.wildml.com/2016/10/learning-reinforcement-learning [21]

https://blog.acolyer.org/2016/10/12/towards-deep-symbolic-reinforcement-learning/ [22]

http://www.wildml.com/2016/10/learning-reinforcement-learning/ [23]

https://research.googleblog.com/2016/12/open-sourcing-embedding-projector-tool.html [24]

https://tryolabs.com/blog/2016/12/06/major-advancements-deep-learning-2016/ [25]

https://nips.cc/Conferences/2016/SpotlightVideos

https://arxiv.org/abs/1612.03770 [26]

https://jalammar.github.io/visual-interactive-guide-basics-neural-networks/ [27]

Operational calculus on programming spaces - https://news.ycombinator.com/item?id=13280818

https://www.darpa.mil/news-events/2016-06-17 [28]

http://distill.pub/about/ [29]

https://vectordash.com

https://github.com/neo-ai/neo-ai-dlr - a compiler and runtime for machine learning models. The compiler optimizes machine learning models for various target hardware. The runtime executes the model on the target hardware. A stand-alone, light-weight and portable runtime for CNN and decision-tree models. Built on top of TVM and Treelite runtime, DLR provides simple and unified Python/C++ APIs for loading and running TVM/Treelite compiled models on a wide range of devices, including X86, TRT-enabled GPU and Arm devices.

https://github.com/nihalpasham/fingerprinting_radios_w_ML - The key idea behind radio fingerprinting is to extract unique patterns (or features) and use them as signatures to identify devices (or more precisely ID a radio embedded within a device).

NuPIC - the Numenta Platform for Intelligent Computing, comprises a set of learning algorithms that were first described in a white paper published by Numenta in 2009. The learning algorithms faithfully capture how layers of neurons in the neocortex learn.

https://github.com/jadore801120/attention-is-all-you-need-pytorch - PyTorch implementation of the Transformer model in "Attention is All You Need" (Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N. Gomez, Lukasz Kaiser, Illia Polosukhin, arXiv, 2017). The Transformer is a sequence-to-sequence framework that uses the self-attention mechanism instead of convolution operations or recurrent structure, and achieved state-of-the-art performance on the WMT 2014 English-to-German translation task at the time of publication. (2017/06/12)
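
As a rough illustration of the self-attention mechanism mentioned above, here is a minimal single-head sketch in plain Python. It is not code from the linked repository: the function names, the list-of-rows matrix layout and the toy inputs are assumptions made for this example, and real implementations use tensor libraries, multiple heads, masking and learned projections.

```python
import math


def softmax(row):
    """Numerically stable softmax over a list of floats."""
    m = max(row)
    exps = [math.exp(x - m) for x in row]
    total = sum(exps)
    return [e / total for e in exps]


def matmul(a, b):
    """Multiply two matrices stored as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def transpose(m):
    """Swap rows and columns of a matrix stored as a list of rows."""
    return [list(col) for col in zip(*m)]


def self_attention(x, w_q, w_k, w_v):
    """Single-head scaled dot-product self-attention (illustrative only).

    x is (seq_len, d_model); w_q, w_k, w_v are (d_model, d_k).
    Returns (output, attention_weights).
    """
    q = matmul(x, w_q)  # queries
    k = matmul(x, w_k)  # keys
    v = matmul(x, w_v)  # values
    d_k = len(q[0])
    scores = matmul(q, transpose(k))  # pairwise query-key dot products
    # Scale by sqrt(d_k), then normalise each row into a distribution.
    weights = [softmax([s / math.sqrt(d_k) for s in row]) for row in scores]
    return matmul(weights, v), weights


# Toy usage: three 2-dimensional tokens, identity projections (hypothetical values).
tokens = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
identity = [[1.0, 0.0], [0.0, 1.0]]
output, attention = self_attention(tokens, identity, identity, identity)
```

Each row of `attention` sums to 1 and says how strongly one token attends to every other token; the output is the corresponding weighted mix of value vectors.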

https://github.com/weihaox/awesome-neural-rendering - A collection of resources on neural rendering.

YouTube: The Surreal Dreams of AI-Generated Art

High-level framework

https://github.com/apache/incubator-mxnet - a deep learning framework designed for both efficiency and flexibility. It allows you to mix symbolic and imperative programming to maximize efficiency and productivity. At its core, MXNet contains a dynamic dependency scheduler that automatically parallelizes both symbolic and imperative operations on the fly. A graph optimization layer on top of that makes symbolic execution fast and memory efficient. MXNet is portable and lightweight, scaling effectively to multiple GPUs and multiple machines.

https://github.com/keras-team/keras - a high-level neural networks API, written in Python and capable of running on top of TensorFlow, CNTK, or Theano. It was developed with a focus on enabling fast experimentation. Being able to go from idea to result with the least possible delay is key to doing good research.

Stable Diffusion

https://github.com/Lightning-AI/stable-diffusion-deploy - Learn to serve Stable Diffusion models on cloud infrastructure at scale. This Lightning App shows load-balancing, orchestrating, pre-provisioning, dynamic batching, GPU-inference, micro-services working together via the Lightning Apps framework.

https://github.com/mlc-ai/web-stable-diffusion - Bringing stable diffusion models to web browsers. Everything runs inside the browser with no server support.

News

Distill — Latest articles about machine learning

GPT-3 vs Water Cooler Trivia participants: A Human vs Robot Showdown

How I Used DALL·E 2 to Generate The Logo for OctoSQL | Jacob Martin - [30]

DALL·E 2 prompt book [pdf] | Hacker News

https://github.com/plaidml/plaidml - PlaidML is a framework for making deep learning work everywhere.

AI

How Elon Musk and Y Combinator Plan to Stop Computers From Taking Over

There is a blind spot in AI research : Nature News & Comment - [31]

OpenAI - a non-profit artificial intelligence research company. Our goal is to advance digital intelligence in the way that is most likely to benefit humanity as a whole, unconstrained by a need to generate financial return. Since our research is free from financial obligations, we can better focus on a positive human impact. We believe AI should be an extension of individual human wills and, in the spirit of liberty, as broadly and evenly distributed as is possible safely. The outcome of this venture is uncertain and the work is difficult, but we believe the goal and the structure are right. We hope this is what matters most to the best in the field. [32]

http://www.humanbrainproject.eu/

http://www.nih.gov/science/brain/

http://wiki.opencog.org/w/The_Open_Cognition_Project

https://github.com/hades217/awesome-ai

https://github.com/lucidrains/PaLM-rlhf-pytorch - Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Maybe I'll add retrieval functionality too, à la RETRO

The Expanding Dark Forest and Generative AI - [33]

https://github.com/peterw/Chat-with-Github-Repo - contains two Python scripts that demonstrate how to create a chatbot using Streamlit, OpenAI GPT-3.5-turbo, and Activeloop's Deep Lake.

https://github.com/kyegomez/Sophia - Sophia is a second-order clipped stochastic optimization algorithm that uses an inexpensive stochastic estimate of the diagonal of the Hessian as a pre-conditioner and a clipping mechanism to control the worst-case update size. It achieves better performance than Adam in terms of validation pre-training loss, total compute, and wall-clock time. By cutting model training cost in half, Sophia can help save millions if not billions of dollars in computational resources.
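
A rough sketch of the idea described above (a gradient moving average divided by a diagonal Hessian estimate, then clipped element-wise), in plain Python. This is not the API of the linked repository: the function name `sophia_like_step`, the hyperparameter defaults and the assumption that the caller supplies the Hessian-diagonal estimate are all made up for illustration. As far as I understand it, the real optimizer computes that estimate stochastically and only refreshes it every few steps.

```python
def clip(value, bound):
    """Clamp value to the interval [-bound, bound]."""
    return max(-bound, min(bound, value))


def sophia_like_step(params, grads, hess_diag, m, h,
                     lr=0.1, beta1=0.9, beta2=0.99, gamma=0.05, eps=1e-12):
    """One clipped, diagonally pre-conditioned update (illustrative only).

    params, grads, hess_diag, m and h are equal-length lists of floats.
    hess_diag is an externally supplied estimate of the Hessian diagonal
    (an assumption of this sketch). Returns (new_params, new_m, new_h).
    """
    new_params, new_m, new_h = [], [], []
    for p, g, h_hat, m_i, h_i in zip(params, grads, hess_diag, m, h):
        m_i = beta1 * m_i + (1.0 - beta1) * g      # moving average of gradients
        h_i = beta2 * h_i + (1.0 - beta2) * h_hat  # moving average of curvature
        # Pre-condition by curvature, then clip so that no coordinate
        # moves by more than lr in a single step.
        update = clip(m_i / max(gamma * h_i, eps), 1.0)
        new_params.append(p - lr * update)
        new_m.append(m_i)
        new_h.append(h_i)
    return new_params, new_m, new_h
```

For example, on the toy objective f(x) = 2x² (gradient 4x, Hessian 4) starting at x = 10, every early step is clipped, so x decreases by exactly `lr` per step regardless of how large the gradient is.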

Scaling Transformer to 1M tokens and beyond with RMT | Hacker News

ChatGPT

https://github.com/lencx/ChatGPT - ChatGPT Desktop Application (Mac, Windows and Linux)

https://github.com/asrul10/linux-command-gpt - Get Linux commands in natural language with the power of ChatGPT.

ChatGPT passes the 2022 APCSA free response section - [34]

https://github.com/TheAppleTucker/backend-GPT - We've built an entire Backend+Database powered by an LLM. It infers business logic based on the name of the API call and can persist a kilobyte of state!

Tell HN: ChatGPT can reply like a specific Reddit or HN user, including you | Hacker News

The Mechanical Professor - by Ethan Mollick

Where Does ChatGPT Fall on the Political Compass?

Alarmed by A.I. Chatbots, Universities Start Revamping How They Teach - The New York Times

GPT-3 Is the Best Journal I've Ever Used - Superorganizers - Every

Capturing the Flag with GPT-4 [35]

https://github.com/TheR1D/shell_gpt - A command-line interface (CLI) productivity tool powered by OpenAI's text-davinci-003 model, will help you accomplish your tasks faster and more efficiently.

https://github.com/Nutlope/aicommits - A CLI that writes your git commit messages for you with AI

https://github.com/fedenunez/tulp - Tulp is a command-line tool that can help you create and process piped content using the power of ChatGPT directly from the terminal.

https://github.com/Nutlope/roomGPT - Upload a photo of your room to generate your dream room with AI.

https://github.com/xtekky/gpt4free - decentralising the AI industry: free GPT-4/3.5 scripts through several reverse-engineered APIs (poe.com, phind.com, chat.openai.com, writesonic.com, sqlchat.ai, t3nsor.com, you.com etc.)

GPT-4 System Card [pdf] | Hacker News

LLaMA

A brief history of LLaMA models - AGI Sphere [36]

https://old.reddit.com/r/LocalLLaMA/top/?t=month

https://github.com/ggerganov/llama.cpp - Port of Facebook's LLaMA model in C/C++

Audio

https://github.com/haoheliu/audioldm_eval - This toolbox aims to unify audio generation model evaluation for easier future comparison.

https://github.com/d3n7/riffusionDJ - Multichannel Looper/Feedback System for Riffusion

https://github.com/d3n7/GPT-4-To-MIDI - Text prompt to MIDI File using OpenAI's GPT-4. Now with polyphony and MIDI input!

https://github.com/suno-ai/bark - a transformer-based text-to-audio model created by Suno. Bark can generate highly realistic, multilingual speech as well as other audio - including music, background noise and simple sound effects. The model can also produce nonverbal communications like laughing, sighing and crying. To support the research community, we are providing access to pretrained model checkpoints ready for inference.

https://github.com/samim23/polymath - uses machine learning to convert any music library (e.g. from a hard drive or YouTube) into a music production sample library. The tool automatically separates songs into stems (beats, bass, etc.), quantizes them to the same tempo and beat-grid (e.g. 120bpm), analyzes musical structure (e.g. verse, chorus, etc.), key (e.g. C4, E3, etc.) and other info (timbre, loudness, etc.), and converts audio to MIDI. The result is a searchable sample library that streamlines the workflow for music producers, DJs, and ML audio developers.

https://github.com/AIGC-Audio/AudioGPT - Understanding and Generating Speech, Music, Sound, and Talking Head
- AudioGPT - a Hugging Face Space by AIGC-Audio

https://github.com/LAION-AI/CLAP - Contrastive Language-Audio Pretraining

https://github.com/declare-lab/tango - Codes and Model of the paper "Text-to-Audio Generation using Instruction Tuned LLM and Latent Diffusion Model"

AudioLDM: Text-to-Audio Generation with Latent Diffusion Models - Speech Research

Text-to-audio (TTA) system has recently gained attention for its ability to synthesize general audio based on text descriptions. However, previous studies in TTA have limited generation quality with high computational costs. In this study, we propose AudioLDM, a TTA system that is built on a latent space to learn the continuous audio representations from contrastive language-audio pretraining (CLAP) latents. The pretrained CLAP models enable us to train latent diffusion models (LDMs) with audio embedding while providing text embedding as a condition during sampling. By learning the latent representations of audio signals and their compositions without modeling the cross-modal relationship, AudioLDM is advantageous in both generation quality and computational efficiency. Trained on AudioCaps with a single GPU, AudioLDM achieves state-of-the-art TTA performance measured by both objective and subjective metrics (e.g., frechet distance). Moreover, AudioLDM is the first TTA system that enables various text-guided audio manipulations (e.g., style transfer) in a zero-shot fashion.

- https://github.com/haoheliu/AudioLDM - AudioLDM: Generate speech, sound effects, music and beyond, with text.

https://github.com/tuneflow/AudioLDM - Fork of https://github.com/haoheliu/AudioLDM as a TuneFlow Plugin

Groove2Groove - One-shot music style transfer. Groove2Groove (Grv2Grv) is an AI system for music accompaniment style transfer. Given two MIDI files (a content input and a style input), it generates a new accompaniment for the first file in the style of the second one.
- https://github.com/cifkao/groove2groove

Music AI

Medicine

Algorithm for Optimized mRNA Design Improves Stability and Immunogenicity | Nature [37]