ML / AI

Things and Stuff Wiki - An organically evolving personal wiki knowledge base. An on-the-fly taxonomy containing a patchwork trail of topic outlines, descriptions, notes, stubs and breadcrumbs, with links to sites, systems, software, manuals, organisations, people, articles, guides, slides, papers, books, comments, videos, screencasts, webcasts, scratchpads and more. Content is orientated towards mostly free/libre/open, mostly Linux. Quality and age varies drastically. Sometimes old things are first, sometimes last. Use the Table of Contents menu to navigate long pages. Zoom in if text is too small. Dead link? Wayback Machine. I probably need to fix the theme CSS after an update. See also libreav.org. Chat to msg me (not checking tho atm). e

to sort

Automation

https://en.wikipedia.org/wiki/Artificial_intelligence

http://theai.wiki/Home [1]

https://medium.com/backchannel/how-elon-musk-and-y-combinator-plan-to-stop-computers-from-taking-over-17e0e27dd02a#.m7ju9koio

https://www.nature.com/news/there-is-a-blind-spot-in-ai-research-1.20805 [2]

OpenAI is a non-profit artificial intelligence research company. Our goal is to advance digital intelligence in the way that is most likely to benefit humanity as a whole, unconstrained by a need to generate financial return. Since our research is free from financial obligations, we can better focus on a positive human impact. We believe AI should be an extension of individual human wills and, in the spirit of liberty, as broadly and evenly distributed as is possible safely. The outcome of this venture is uncertain and the work is difficult, but we believe the goal and the structure are right. We hope this is what matters most to the best in the field. [3]

BotLibre.org - an open source platform based on an advanced artificial intelligence engine developed in Java. The Bot Libre AI engine can be used in any Java platform, such as a Java webserver, Java client, or on Android. The Bot Libre SDK supports access to Bot Libre's web API from JavaScript, Android, Java, iOS, and objective C. [4]

Mycroft Mycroft - the world’s first open source assistant. Mycroft runs anywhere – on a desktop computer, inside an automobile, or on a Raspberry Pi. This is open source software which can be freely remixed, extended, and improved. Mycroft may be used in anything from a science project to an enterprise software application.
- https://github.com/MycroftAI/mycroft-core
- https://github.com/MycroftAI/mycroft-skills

http://www.humanbrainproject.eu/

http://www.nih.gov/science/brain/

http://wiki.opencog.org/w/The_Open_Cognition_Project

Wit.ai - makes it easy for developers to build applications and devices that you can talk or text to. Our vision is to empower developers with an open and extensible natural language platform. Wit.ai learns human language from every interaction, and leverages the community: what's learned is shared across developers.

SUSI.AI - Open Source Artificial Intelligence for Personal Assistants, Robots, Help Desks and Chatbots.
- https://github.com/fossasia/susi_server

Leon - an open-source personal assistant who can live on your server. He does stuff when you ask for it. You can talk to him and he can talk to you. You can also text him and he can also text you. If you want to, Leon can communicate with you by being offline to protect your privacy.
- https://github.com/leon-ai/leon [5]

NLP

https://en.wikipedia.org/wiki/Natural_language_processing

http://thetokenizer.com/2013/04/28/build-your-own-summary-tool/

https://textblob.readthedocs.org/en/latest/

https://github.com/Tabule/Sherlock

http://eacl2014.org/tutorial-metaphor

http://languagengine.co/blog/what-happened-to-old-school-nlp/ [6]

https://news.ycombinator.com/item?id=16713964

https://github.com/zalandoresearch/flair [7]

Metacat - a computer model of analogy-making and perception that builds on the foundations of an earlier model called Copycat. Copycat was originally developed by Douglas Hofstadter and Melanie Mitchell as part of a research program aimed at computationally modeling the fundamental mechanisms underlying human thought processes. Central to the philosophy of this research is the belief that the mind's ability to perceive connections between apparently dissimilar things, and to make analogies based on these connections, lies at the heart of intelligence. According to this view, to understand the analogical mechanisms of thinking and perception is to understand the source of the remarkable fluidity of the human mind, including its hidden wellsprings of creativity.

Like Copycat, Metacat operates in an idealized world of analogy problems involving short strings of letters. Although the program understands only a limited set of concepts about its letter-string world, its emergent processing mechanisms give it considerable flexibility in recognizing and applying these concepts in a wide variety of situations. The program's high-level behavior emerges in a bottom-up manner from the collective actions of many small nondeterministic processing agents (called codelets) working in parallel, in much the same way that an ant colony's high-level behavior emerges from the individual behaviors of the underlying ants, without any central executive directing the course of events.

Metacat focuses on the issue of self-watching: the ability of a system to perceive and respond to patterns that arise not only in its immediate perceptions of the world, but also in its own processing of those perceptions. Copycat lacked such an "introspective" capacity, and consequently lacked insight into how it arrived at its answers. It was unable to notice similarities between analogies, or to explain the differences between them or why one might be considered to be better or worse than another. In contrast, Metacat's self-watching mechanisms enable it to create much richer representations of analogies, allowing it to compare and contrast answers in an insightful way. Furthermore, it is able to recognize, remember, and recall patterns that occur in its own "train of thought" as it makes analogies. For instance, by monitoring its own processing, Metacat can often recognize when it has fallen into a repetitive cycle of behavior, enabling it to break out of its "rut" and try something else. [8]

Bots

https://dev.botframework.com/ [9]

https://github.com/itsabot/abot

https://news.ycombinator.com/item?id=11503998

https://home-assistant.io/

https://pagenodes.com/
- https://github.com/monteslu/pagenodes

Rasa Core - robot open source chatbot framework with machine learning-based dialogue management - Build contextual AI assistants
- https://github.com/RasaHQ/rasa_core

Machine learning

https://en.wikipedia.org/wiki/Machine_learning

https://news.ycombinator.com/item?id=12713056

YouTube: Neural Network Architectures

https://news.ycombinator.com/item?id=12936891

Tom Mitchell: Never Ending Language Learning

Neural Networks, Manifolds, and Topology -- colah's blog - [10]

OCDevel

YouTube: From Deep Learning of Disentangled Representations to Higher-level Cognition

https://en.wikipedia.org/wiki/Artificial_neuron

https://en.wikipedia.org/wiki/Perceptron

https://news.ycombinator.com/item?id=12751585

https://github.com/rasbt/python-machine-learning-book/blob/master/faq/difference-deep-and-normal-learning.md [11]

"In applications of "usual" machine learning, there is typically a strong focus on the feature engineering part; the model learned by an algorithm can only be so good as its input data. Of course, there must be sufficient discriminatory information in our dataset, however, the performance of machine learning algorithms can suffer substantially when the information is buried in meaningless features. The goal behind deep learning is to automatically learn the features from (somewhat) noisy data; it's about algorithms that do the feature engineering for us to provide deep neural network structures with meaningful information so that it can learn more effectively. We can think of deep learning as algorithms for automatic "feature engineering," or we could simply call them "feature detectors," which help us to overcome the vanishing gradient challenge and facilitate the learning in neural networks with many layers."

http://deeplearning4j.org/

http://www.damninteresting.com/on-the-origin-of-circuits/ [12]

https://probmods.org/

http://karpathy.github.io/2015/05/21/rnn-effectiveness/

https://en.wikipedia.org/wiki/T-distributed_stochastic_neighbor_embedding
- https://news.ycombinator.com/item?id=12713388

https://news.ycombinator.com/item?id=12942169

http://www.cs.cmu.edu/~tom7/mario/
- https://www.youtube.com/watch?v=xOCurBYI_gY

http://neuralnetworksanddeeplearning.com/ [13]

https://news.ycombinator.com/item?id=8258652

http://www.technologyreview.com/view/530561/the-revolutionary-technique-that-quietly-changed-machine-vision-forever/

https://news.ycombinator.com/item?id=8553307

http://openssi.net/
- http://records.sigmm.ndlab.net/2014/10/ssi-an-open-source-platform-for-social-signal-interpretation/ [14]

YouTube: Artificial Intelligence: Neural Networks in Machine Learning

https://en.wikipedia.org/wiki/Convolutional_neural_network

https://news.ycombinator.com/item?id=9109157

https://news.ycombinator.com/item?id=11530304

https://news.ycombinator.com/item?id=19144280

http://quantombone.blogspot.co.uk/2015/04/deep-learning-vs-probabilistic.html [15]

https://www.gnu.org/software/gneuralnetwork/ [18]

https://open_nsfw.gitlab.io/ [19]

Caffe - a deep learning framework made with expression, speed, and modularity in mind. It is developed by Berkeley AI Research (BAIR) and by community contributors. Yangqing Jia created the project during his PhD at UC Berkeley. Caffe is released under the BSD 2-Clause license.
- Caffe Model Zoo

Torch - a scientific computing framework with wide support for machine learning algorithms that puts GPUs first. It is easy to use and efficient, thanks to an easy and fast scripting language, LuaJIT, and an underlying C/CUDA implementation.
- https://github.com/torch/torch7

https://github.com/karpathy/char-rnn - Multi-layer Recurrent Neural Networks (LSTM, GRU, RNN) for character-level language models in Torch

YouTube: Connections between physics and deep learning [https://news.ycombinator.com/item?id=13062139

DeepRhyme (D-Prime) – generating dope rhymes with deep learning [20]

Composing Music With Recurrent Neural Networks [21]

Data Science Machine - an end-to-end software system that is able to automatically develop predictive models from relational data. The Machine was created by Max Kanter and Kalyan Verramachaneni at the Computer Science and Artificial Intelligence Laboratory (CSAIL) at MIT. The system automates two of the most human-intensive components of a data science endeavor: feature engineering, and selection and tuning of the machine learning methods that build predictive models from those features. First, an algorithm called Deep Feature Synthesis automatically engineers features. Next, through an approach called Deep Mining, the Machine composes a generalized machine learning pipeline that includes dimensionality reduction methods, feature selection methods, clustering, and classifier design. Finally, it tunes the parameters through a Gaussian Copula Process.
- System that replaces human intuition with algorithms outperforms human teams

What's Next in Deep Learning [22]

Deep Residual Learning for Image Recognition https://news.ycombinator.com/item?id=10715628

http://www.wildml.com/2016/01/attention-and-memory-in-deep-learning-and-nlp/ [23]

TensorFlow - an open source software library for high performance numerical computation. Its flexible architecture allows easy deployment of computation across a variety of platforms (CPUs, GPUs, TPUs), and from desktops to clusters of servers to mobile and edge devices. Originally developed by researchers and engineers from the Google Brain team within Google’s AI organization, it comes with strong support for machine learning and deep learning and the flexible numerical computation core is used across many other scientific domains.

http://tflearn.org
- https://github.com/tflearn/tflearn [24]

http://playground.tensorflow.org/ [25]

https://www.oreilly.com/learning/hello-tensorflow [26]

https://github.com/sherjilozair/char-rnn-tensorflow

https://nucl.ai/blog/enhance-pixel-art/

http://www-personal.umich.edu/~reedscot/iclr_project.html [27]

https://deepmind.com/blog/wavenet-generative-model-raw-audio/ [28]

http://www.wildml.com/2016/10/learning-reinforcement-learning [29]

https://blog.acolyer.org/2016/10/12/towards-deep-symbolic-reinforcement-learning/ [30]

http://www.wildml.com/2016/10/learning-reinforcement-learning/ [31]

https://research.googleblog.com/2016/12/open-sourcing-embedding-projector-tool.html [32]

https://tryolabs.com/blog/2016/12/06/major-advancements-deep-learning-2016/ [33]

https://nips.cc/Conferences/2016/SpotlightVideos

https://arxiv.org/abs/1612.03770 [34]

https://jalammar.github.io/visual-interactive-guide-basics-neural-networks/ [35]

Operational calculus on programming spaces - https://news.ycombinator.com/item?id=13280818

https://www.darpa.mil/news-events/2016-06-17 [36]

http://distill.pub/about/ [37]

https://vectordash.com

https://github.com/neo-ai/neo-ai-dlr - a compiler and runtime for machine learning models. The compiler optimizes machine learning models for various target hardware. The runtime executes the model on the target hardware. A stand-alone, light-weight and portable runtime for CNN and decicion-tree models. Built on top of TVM and Treelite runtime, DLR provides simple and unified Python/C++ APIs for loading and running TVM/Treelite compiled models on a wide range of devices, including X86, TRT-enabled GPU and Arm devices.

https://github.com/nihalpasham/fingerprinting_radios_w_ML - The key idea behind radio ﬁngerprinting is to extract unique patterns (or features) and use them as signatures to identify devices (or more precisely ID a radio embedded within a device).

ML / AI

Contents

to sort

NLP

Bots

Machine learning

Navigation menu

Search