ML / AI

Things and Stuff Wiki - An organically evolving personal wiki knowledge base. An on-the-fly taxonomy containing a patchwork trail of topic outlines, descriptions, notes, stubs and breadcrumbs, with links to sites, systems, software, manuals, organisations, people, articles, guides, slides, papers, books, comments, videos, screencasts, webcasts, scratchpads and more. Content is orientated towards mostly free/libre/open, mostly Linux. Quality and age varies drastically. Sometimes old things are first, sometimes last. Use the Table of Contents menu to navigate long pages. Zoom in if text is too small. Dead link? Wayback Machine. I probably need to fix the theme CSS after an update. See also libreav.org. Chat to msg me (not checking tho atm). e

Machine learning

https://en.wikipedia.org/wiki/Machine_learning

https://news.ycombinator.com/item?id=12713056

YouTube: Neural Network Architectures

A Course in Machine Learning - a set of introductory materials that covers most major aspects of modern machine learning (supervised learning, unsupervised learning, large margin methods, probabilistic modeling, learning theory, etc.). It's focus is on broad applications with a rigorous backbone. A subset can be used for an undergraduate course; a graduate course could probably cover the entire material and then some.

https://news.ycombinator.com/item?id=12936891

Tom Mitchell: Never Ending Language Learning

Neural Networks, Manifolds, and Topology -- colah's blog - [1]

OCDevel

YouTube: From Deep Learning of Disentangled Representations to Higher-level Cognition

https://en.wikipedia.org/wiki/Artificial_neuron

https://en.wikipedia.org/wiki/Perceptron

https://news.ycombinator.com/item?id=12751585

https://github.com/rasbt/python-machine-learning-book/blob/master/faq/difference-deep-and-normal-learning.md [2]

"In applications of "usual" machine learning, there is typically a strong focus on the feature engineering part; the model learned by an algorithm can only be so good as its input data. Of course, there must be sufficient discriminatory information in our dataset, however, the performance of machine learning algorithms can suffer substantially when the information is buried in meaningless features. The goal behind deep learning is to automatically learn the features from (somewhat) noisy data; it's about algorithms that do the feature engineering for us to provide deep neural network structures with meaningful information so that it can learn more effectively. We can think of deep learning as algorithms for automatic "feature engineering," or we could simply call them "feature detectors," which help us to overcome the vanishing gradient challenge and facilitate the learning in neural networks with many layers."

Backpropagation 101 · Thinc · A refreshing functional take on deep learning - [3]

http://deeplearning4j.org/

http://www.damninteresting.com/on-the-origin-of-circuits/ [4]

NNdef - Java and XML based Neural Networks and Knowledge Modeling toolkit and library

https://probmods.org/

http://karpathy.github.io/2015/05/21/rnn-effectiveness/

https://en.wikipedia.org/wiki/T-distributed_stochastic_neighbor_embedding
- https://news.ycombinator.com/item?id=12713388

https://news.ycombinator.com/item?id=12942169

http://www.cs.cmu.edu/~tom7/mario/
- https://www.youtube.com/watch?v=xOCurBYI_gY

http://neuralnetworksanddeeplearning.com/ [5]

https://news.ycombinator.com/item?id=8258652

http://www.technologyreview.com/view/530561/the-revolutionary-technique-that-quietly-changed-machine-vision-forever/

https://news.ycombinator.com/item?id=8553307

http://openssi.net/
- http://records.sigmm.ndlab.net/2014/10/ssi-an-open-source-platform-for-social-signal-interpretation/ [6]

YouTube: Artificial Intelligence: Neural Networks in Machine Learning

https://en.wikipedia.org/wiki/Convolutional_neural_network

https://news.ycombinator.com/item?id=9109157

https://news.ycombinator.com/item?id=11530304

https://news.ycombinator.com/item?id=19144280

http://quantombone.blogspot.co.uk/2015/04/deep-learning-vs-probabilistic.html [7]

https://www.gnu.org/software/gneuralnetwork/ [10]

https://open_nsfw.gitlab.io/ [11]

Caffe - a deep learning framework made with expression, speed, and modularity in mind. It is developed by Berkeley AI Research (BAIR) and by community contributors. Yangqing Jia created the project during his PhD at UC Berkeley. Caffe is released under the BSD 2-Clause license.
- Caffe Model Zoo

Torch - a scientific computing framework with wide support for machine learning algorithms that puts GPUs first. It is easy to use and efficient, thanks to an easy and fast scripting language, LuaJIT, and an underlying C/CUDA implementation.
- https://github.com/torch/torch7

https://github.com/karpathy/char-rnn - Multi-layer Recurrent Neural Networks (LSTM, GRU, RNN) for character-level language models in Torch

YouTube: Connections between physics and deep learning [https://news.ycombinator.com/item?id=13062139

DeepRhyme (D-Prime) – generating dope rhymes with deep learning [12]

Composing Music With Recurrent Neural Networks [13]

Data Science Machine - an end-to-end software system that is able to automatically develop predictive models from relational data. The Machine was created by Max Kanter and Kalyan Verramachaneni at the Computer Science and Artificial Intelligence Laboratory (CSAIL) at MIT. The system automates two of the most human-intensive components of a data science endeavor: feature engineering, and selection and tuning of the machine learning methods that build predictive models from those features. First, an algorithm called Deep Feature Synthesis automatically engineers features. Next, through an approach called Deep Mining, the Machine composes a generalized machine learning pipeline that includes dimensionality reduction methods, feature selection methods, clustering, and classifier design. Finally, it tunes the parameters through a Gaussian Copula Process.
- System that replaces human intuition with algorithms outperforms human teams

What's Next in Deep Learning [14]

Deep Residual Learning for Image Recognition https://news.ycombinator.com/item?id=10715628

http://www.wildml.com/2016/01/attention-and-memory-in-deep-learning-and-nlp/ [15]

TensorFlow - an open source software library for high performance numerical computation. Its flexible architecture allows easy deployment of computation across a variety of platforms (CPUs, GPUs, TPUs), and from desktops to clusters of servers to mobile and edge devices. Originally developed by researchers and engineers from the Google Brain team within Google’s AI organization, it comes with strong support for machine learning and deep learning and the flexible numerical computation core is used across many other scientific domains.

http://tflearn.org
- https://github.com/tflearn/tflearn [16]

http://playground.tensorflow.org/ [17]

https://www.oreilly.com/learning/hello-tensorflow [18]

https://github.com/sherjilozair/char-rnn-tensorflow

https://nucl.ai/blog/enhance-pixel-art/

http://www-personal.umich.edu/~reedscot/iclr_project.html [19]

https://deepmind.com/blog/wavenet-generative-model-raw-audio/ [20]

http://www.wildml.com/2016/10/learning-reinforcement-learning [21]

https://blog.acolyer.org/2016/10/12/towards-deep-symbolic-reinforcement-learning/ [22]

http://www.wildml.com/2016/10/learning-reinforcement-learning/ [23]

https://research.googleblog.com/2016/12/open-sourcing-embedding-projector-tool.html [24]

https://tryolabs.com/blog/2016/12/06/major-advancements-deep-learning-2016/ [25]

https://nips.cc/Conferences/2016/SpotlightVideos

https://arxiv.org/abs/1612.03770 [26]

https://jalammar.github.io/visual-interactive-guide-basics-neural-networks/ [27]

Operational calculus on programming spaces - https://news.ycombinator.com/item?id=13280818

https://www.darpa.mil/news-events/2016-06-17 [28]

http://distill.pub/about/ [29]

https://vectordash.com

https://github.com/neo-ai/neo-ai-dlr - a compiler and runtime for machine learning models. The compiler optimizes machine learning models for various target hardware. The runtime executes the model on the target hardware. A stand-alone, light-weight and portable runtime for CNN and decicion-tree models. Built on top of TVM and Treelite runtime, DLR provides simple and unified Python/C++ APIs for loading and running TVM/Treelite compiled models on a wide range of devices, including X86, TRT-enabled GPU and Arm devices.

https://github.com/nihalpasham/fingerprinting_radios_w_ML - The key idea behind radio ﬁngerprinting is to extract unique patterns (or features) and use them as signatures to identify devices (or more precisely ID a radio embedded within a device).

NuPIC - the Numenta Platform for Intelligent Computing, comprises a set of learning algorithms that were first described in a white paper published by Numenta in 2009. The learning algorithms faithfully capture how layers of neurons in the neocortex learn.

https://github.com/jadore801120/attention-is-all-you-need-pytorch - PyTorch implementation of the Transformer model in "Attention is All You Need" (Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N. Gomez, Lukasz Kaiser, Illia Polosukhin, arxiv, 2017).A novel sequence to sequence framework utilizes the self-attention mechanism, instead of Convolution operation or Recurrent structure, and achieve the state-of-the-art performance on WMT 2014 English-to-German translation task. (2017/06/12)

https://github.com/weihaox/awesome-neural-rendering - A collection of resources on neural rendering.

High-level framework

https://github.com/apache/incubator-mxnet - a deep learning framework designed for both efficiency and flexibility. It allows you to mix symbolic and imperative programming to maximize efficiency and productivity. At its core, MXNet contains a dynamic dependency scheduler that automatically parallelizes both symbolic and imperative operations on the fly. A graph optimization layer on top of that makes symbolic execution fast and memory efficient. MXNet is portable and lightweight, scaling effectively to multiple GPUs and multiple machines.

https://github.com/keras-team/keras - a high-level neural networks API, written in Python and capable of running on top of TensorFlow, CNTK, or Theano. It was developed with a focus on enabling fast experimentation. Being able to go from idea to result with the least possible delay is key to doing good research.

News

Distill — Latest articles about machine learning

GPT-3 vs Water Cooler Trivia participants: A Human vs Robot Showdown -

AI

How Elon Musk and Y Combinator Plan to Stop Computers From Taking Over

There is a blind spot in AI research : Nature News & Comment - [30]

OpenAI - a non-profit artificial intelligence research company. Our goal is to advance digital intelligence in the way that is most likely to benefit humanity as a whole, unconstrained by a need to generate financial return. Since our research is free from financial obligations, we can better focus on a positive human impact. We believe AI should be an extension of individual human wills and, in the spirit of liberty, as broadly and evenly distributed as is possible safely. The outcome of this venture is uncertain and the work is difficult, but we believe the goal and the structure are right. We hope this is what matters most to the best in the field. [31]