Programming

Things and Stuff Wiki - An organically evolving personal wiki knowledge base. An on-the-fly taxonomy containing a patchwork trail of topic outlines, descriptions, notes, stubs and breadcrumbs, with links to sites, systems, software, manuals, organisations, people, articles, guides, slides, papers, books, comments, videos, screencasts, webcasts, scratchpads and more. Content is orientated towards mostly free/libre/open, mostly Linux. Quality and age varies drastically. Sometimes old things are first, sometimes last. Use the Table of Contents menu to navigate long pages. Zoom in if text is too small. Dead link? Wayback Machine. I probably need to fix the theme CSS after an update. See also libreav.org. Chat to msg me (not checking tho atm). e

General

Syntax

https://en.wikipedia.org/wiki/Syntax_(programming_languages)

https://en.wikipedia.org/wiki/Lexical_grammar - the process of converting a sequence of characters (such as in a computer program or web page) into a sequence of tokens (strings with an identified "meaning"). A program that performs lexical analysis may be called a lexer, tokenizer, or scanner (though "scanner" is also used to refer to the first stage of a lexer). Such a lexer is generally combined with a parser, which together analyze the syntax of programming languages, web pages, and so forth.

For instance, the lexical grammar for many programming languages specifies that a string literal starts with a " character and continues until a matching " is found (escaping makes this more complicated), that an identifier is an alphanumeric sequence (letters and digits, usually also allowing underscores, and disallowing initial digits), and that an integer literal is a sequence of digits. So in the following character sequence "abc" xyz1 23 the tokens are string, identifier and number (plus whitespace tokens) because the space character terminates the sequence of characters forming the identifier. Further, certain sequences are categorized as keywords – these generally have the same form as identifiers (usually alphabetical words), but are categorized separately; formally they have a different token type.

https://en.wikipedia.org/wiki/Lexical_analysis - the process of converting a sequence of characters (such as in a computer program or web page) into a sequence of tokens (strings with an identified "meaning"). A program that performs lexical analysis may be called a lexer, tokenizer, or scanner (though "scanner" is also used to refer to the first stage of a lexer). Such a lexer is generally combined with a parser, which together analyze the syntax of programming languages, web pages, and so forth.

https://en.wikipedia.org/wiki/Language_construct - a syntactically allowable part of a program that may be formed from one or more lexical tokens in accordance with the rules of a programming language. The term "language construct" is often used as a synonym for control structure, and should not be confused with a function.

https://en.wikipedia.org/wiki/Category:Programming_constructs

https://en.wikipedia.org/wiki/Catalan_number - a sequence of natural numbers that occur in various counting problems, often involving recursively-defined objects. They are named after the Belgian mathematician Eugène Charles Catalan (1814–1894).

Using zero-based numbering, the nth Catalan number is given directly in terms of binomial coefficients. The first Catalan numbers for n = 0, 1, 2, 3, … are: 1, 1, 2, 5, 14, 42, 132, 429, 1430, 4862, 16796, 58786, 208012, 742900, 2674440, 9694845, 35357670, 129644790, 477638700, 1767263190, 6564120420, 24466267020, 91482563640, 343059613650, 1289904147324, 4861946401452, … (sequence A000108 in OEIS).

https://en.wikipedia.org/wiki/Dyck_language - the language consisting of balanced strings of square brackets [ and ]. It is important in the parsing of expressions that must have a correctly nested sequence of brackets, such as arithmetic or algebraic expressions.

http://interactivepython.org/runestone/static/pythonds/BasicDS/SimpleBalancedParentheses.html

https://en.wikipedia.org/wiki/Reserved_word - also known as a reserved identifier is a word that cannot be used as an identifier, such as the name of a variable, function, or label – it is "reserved from use". This is a syntactic definition, and a reserved word may have no meaning.

A closely related and often conflated notion is a keyword which is a word with special meaning in a particular context. This is a semantic definition. By contrast, names in a standard library but not built into the language are not considered reserved words or keywords. The terms "reserved word" and "keyword" are often used interchangeably – one may say that a reserved word is "reserved for use as a keyword" – and formal use varies from language to language; for this article we distinguish as above.

In general reserved words and keywords need not coincide, but in most modern languages keywords are a subset of reserved words, as this makes parsing easier, since keywords cannot be confused with identifiers. In some languages, like C or Python, reserved words and keywords coincide, while in other languages, like Java, all keywords are reserved words, but some reserved words are not keywords – these are "reserved for future use". In yet other languages, such as ALGOL and PL/I there are keywords but no reserved words, with keywords being distinguished from identifiers by other means.

http://crasseux.com/books/ctutorial/Reserved-words-in-C.html

Why Does "=" Mean Assignment? - [6]

https://en.wikipedia.org/wiki/Identifier#In_computer_languages - are tokens (also called symbols) which name language entities. Some of the kinds of entities an identifier might denote include variables, types, labels, subroutines, and packages.

https://en.wikipedia.org/wiki/Declaration_(computer_programming) - specifies properties of an identifier: it declares what a word (identifier) means. Declarations are most commonly used for functions, variables, constants, and classes, but can also be used for other entities such as enumerations and type definitions. Beyond the name (the identifier itself) and the kind of entity (function, variable, etc.), declarations typically specify the data type (for variables and constants), or the type signature (for functions); types may also include dimensions, such as for arrays. A declaration is used to announce the existence of the entity to the compiler; this is important in those strongly typed languages that require functions, variables, and constants, and their types to be specified with a declaration before use, and is used in forward declaration. The term "declaration" is frequently contrasted with the term "definition", but meaning and usage varies significantly between languages

https://en.wikipedia.org/wiki/Initialization_(programming) - the assignment of an initial value for a data object or variable. The manner in which initialization is performed depends on programming language, as well as type, storage class, etc., of an object to be initialized. Programming constructs which perform initialization are typically called initializers and initializer lists. Initialization is distinct from (and preceded by) declaration, although the two can sometimes be conflated in practice. The complement of initialization is finalization, which is primarily used for objects, but not variables. Initialization is done either by statically embedding the value at compile time, or else by assignment at run time.

https://en.wikipedia.org/wiki/Finalization - the process of preparing an object for deallocation; strictly speaking, finalization is all parts of object destruction until memory deallocation itself. Finalization is formally complementary to initialization, which is the part of object creation that follows allocation, but differs significantly in practice

https://en.wikipedia.org/wiki/Directive_(programming) - a language construct that specifies how a compiler (or assembler or interpreter) should process its input. Directives are not part of the language proper – they are not part of the grammar, and may vary from compiler to compiler – but instead function either as an in-band form of a command-line option, specifying compiler behavior, or are processed by a preprocessor. In some cases directives specify global behavior, while in other cases they only affect a local section, such as a block of programming code. In some cases, such as some C pragmas, directives are optional compiler hints, and may be ignored, but normally they are prescriptive, and must be followed. However, a directive does not perform any action in the language itself, but rather only a change in the behavior of the compiler.

This term could be used to refer to proprietary third party tags and commands (or markup) embedded in code that result in additional executable processing that extend the existing compiler, assembler and language constructs present in the development environment. The term "directive" is also applied in a variety of ways that are similar to the term command.

https://en.wikipedia.org/wiki/First-class_citizen - In programming language design, a first-class citizen (also type, object, entity, or value) in a given programming language is an entity which supports all the operations generally available to other entities. These operations typically include being passed as an argument, returned from a function, and assigned to a variable.

https://en.wikipedia.org/wiki/Free-form_language

https://en.wikipedia.org/wiki/Indentation_style - a convention governing the indentation of blocks of code to convey the program's structure. This article largely addresses the free-form languages, such as C programming language and its descendants, but can be (and frequently is) applied to most other programming languages (especially those in the curly bracket family), where whitespace is otherwise insignificant. Indent style is just one aspect of programming style.

https://en.wikipedia.org/wiki/Syntactic_sugar - syntax within a programming language that is designed to make things easier to read or to express. It makes the language "sweeter" for human use: things can be expressed more clearly, more concisely, or in an alternative style that some may prefer.

For example, many programming languages provide special syntax for referencing and updating array elements. Abstractly, an array reference is a procedure of two arguments: an array and a subscript vector, which could be expressed as get_array(Array, vector(i,j)). Instead, many languages provide syntax like Array[i,j]. Similarly an array element update is a procedure of three arguments, something like set_array(Array, vector(i,j), value), but many languages provide syntax like Array[i,j] = value.

Specifically, a construct in a language is called syntactic sugar if it can be removed from the language without any effect on what the language can do: functionality and expressive power will remain the same. Language processors, including compilers, static analyzers, and the like, often expand sugared constructs into more fundamental constructs before processing, a process sometimes called "desugaring".

https://en.wikipedia.org/wiki/Lex_(software)

https://en.wikipedia.org/wiki/Flex_(lexical_analyser_generator)

https://en.wikipedia.org/wiki/Syntax_analysis#Programming_languages

https://en.wikipedia.org/wiki/Backus–Naur_Form - BNF (Backus Normal Form or Backus–Naur Form) is one of the two main notation techniques for context-free grammars, often used to describe the syntax of languages used in computing, such as computer programming languages, document formats, instruction sets and communication protocols; the other main technique for writing context-free grammars is the van Wijngaarden form. They are applied wherever exact descriptions of languages are needed: for instance, in official language specifications, in manuals, and in textbooks on programming language theory. Many extensions and variants of the original Backus–Naur notation are used; some are exactly defined, including Extended Backus–Naur Form (EBNF) and Augmented Backus–Naur Form (ABNF).

https://en.wikipedia.org/wiki/Extended_Backus%E2%80%93Naur_Form

https://en.wikipedia.org/wiki/Van_Wijngaarden_grammar

https://en.wikipedia.org/wiki/Parsing_expression_grammar - or PEG, is a type of analytic formal grammar, i.e. it describes a formal language in terms of a set of rules for recognizing strings in the language. The formalism was introduced by Bryan Ford in 2004 and is closely related to the family of top-down parsing languages introduced in the early 1970s. Syntactically, PEGs also look similar to context-free grammars (CFGs), but they have a different interpretation: the choice operator selects the first match in PEG, while it is ambiguous in CFG. This is closer to how string recognition tends to be done in practice, e.g. by a recursive descent parser. Unlike CFGs, PEGs cannot be ambiguous; if a string parses, it has exactly one valid parse tree. It is conjectured that there exist context-free languages that cannot be parsed by a PEG, but this is not yet proven. PEGs are well-suited to parsing computer languages, but not natural languages where their performance is comparable to general CFG algorithms such as the Earley algorithm

Compared to pure regular expressions (i.e. without back-references), PEGs are strictly more powerful, but require significantly more memory. For example, a regular expression inherently cannot find an arbitrary number of matched pairs of parentheses, because it is not recursive, but a PEG can. However, a PEG will require an amount of memory proportional to the length of the input, while a regular expression matcher will require only a constant amount of memory.

The Packrat Parsing and Parsing Expression Grammars Page

https://en.wikipedia.org/wiki/Recursive_descent_parser

https://en.wikipedia.org/wiki/Pratt_parser

Paradigms

http://c2.com/cgi/wiki?BigIdea

http://en.wikipedia.org/wiki/Programming_paradigm
- http://upload.wikimedia.org/wikipedia/commons/f/f7/Programming_paradigms.svg

http://en.wikipedia.org/wiki/Concatenative_programming_language

http://concatenative.org/

We Really Don't Know How To Compute!

Imperative

http://en.wikipedia.org/wiki/Imperative_programming - a programming paradigm that uses statements that change a program's state. In much the same way that the imperative mood in natural languages expresses commands, an imperative program consists of commands for the computer to perform. Imperative programming focuses on describing how a program operates.

http://en.wikipedia.org/wiki/Procedural_programming - a programming paradigm, derived from structured programming, based upon the concept of the procedure call. Procedures, also known as routines, subroutines, or functions (not to be confused with mathematical functions, but similar to those used in functional programming), simply contain a series of computational steps to be carried out. Any given procedure might be called at any point during a program's execution, including by other procedures or itself. Procedural programming languages include C, Go, Fortran, Pascal, and BASIC. Computer processors provide hardware support for procedural programming through a stack register and instructions for calling procedures and returning from them. Hardware support for other types of programming is possible, but no attempt was commercially successful (for example Lisp machines or Java processors).

http://en.wikipedia.org/wiki/Structured_programming - a programming paradigm aimed at improving the clarity, quality, and development time of a computer program by making extensive use of subroutines, block structures, for and while loops—in contrast to using simple tests and jumps such as the goto statement which could lead to "spaghetti code" which is difficult both to follow and to maintain. It emerged in the late 1950s with the appearance of the ALGOL 58 and ALGOL 60 programming languages, with the latter including support for block structures.

Contributing factors to its popularity and widespread acceptance, at first in academia and later among practitioners, include the discovery of what is now known as the structured program theorem in 1966, and the publication of the influential "Go To Statement Considered Harmful" open letter in 1968 by Dutch computer scientist Edsger W. Dijkstra, who coined the term "structured programming". Structured programming is most frequently used with deviations that allow for clearer programs in some particular cases, such as when exception handling has to be performed.

Recursive

Recursive functions of symbolic expressions and their computation by machine, Part I

Object orientated

http://en.wikipedia.org/wiki/Object-oriented_programming - OOP

https://news.ycombinator.com/item?id=11808551 - Alan Kay on objects et al

http://lists.squeakfoundation.org/pipermail/squeak-dev/1998-October/017019.html [7]

http://c2.com/cgi/wiki?ClosuresAndObjectsAreEquivalent

https://www.cs.utexas.edu/users/wcook/papers/OOPvsADT/CookOOPvsADT90.pdf [8]

YouTube: Pong & Object Oriented Programming - Computerphile

YouTube: CS\M04\T01 - Classes, Objects, Properties and Method Concepts

https://en.wikipedia.org/wiki/Object_(computer_science)

https://en.wikipedia.org/wiki/Class_(computer_science)

https://en.wikipedia.org/wiki/Instance_(computer_science)

https://en.wikipedia.org/wiki/Method_(computer_programming) - a procedure associated with an object. An object is made up of data and behavior, which form the interface that an object presents to the outside world. Data is represented as properties of the object and behavior as methods. For example, a Window object would have methods such as open and close, while its state (whether it is opened or closed) would be a property.

http://stackoverflow.com/questions/4203163/how-do-i-design-a-class-in-python/4203836#4203836

https://medium.com/@simplyianm/why-gos-structs-are-superior-to-class-based-inheritance-b661ba897c67 [9]

https://en.wikipedia.org/wiki/Resource_Acquisition_Is_Initialization

https://en.wikipedia.org/wiki/Inheritance_(object-oriented_programming)

https://en.wikipedia.org/wiki/Multiple_inheritance

https://en.wikipedia.org/wiki/Prototype-based_programming

https://en.wikipedia.org/wiki/Differential_inheritance

https://en.wikipedia.org/wiki/Object_composition

https://en.wikipedia.org/wiki/Composition_over_inheritance

https://en.wikipedia.org/wiki/Encapsulation_(object-oriented_programming)

https://en.wikipedia.org/wiki/Resource_acquisition_is_initialization - a programming idiom used in several object-oriented languages to describe a particular language behavior. In RAII, holding a resource is a class invariant, and is tied to object lifetime: resource allocation (or acquisition) is done during object creation (specifically initialization), by the constructor, while resource deallocation (release) is done during object destruction (specifically finalization), by the destructor. Thus the resource is guaranteed to be held between when initialization finishes and finalization starts (holding the resources is a class invariant), and to be held only when the object is alive. Thus if there are no object leaks, there are no resource leaks.

https://en.wikipedia.org/wiki/Mixin

https://en.wikipedia.org/wiki/Coupling_(computer_programming)

https://en.wikipedia.org/wiki/Member_variable

https://en.wikipedia.org/wiki/Command_pattern

https://en.wikipedia.org/wiki/Resource_Acquisition_Is_Initialization

https://en.wikipedia.org/wiki/SOLID_%28object-oriented_design%29 - Single responsibility, Open-closed, Liskov substitution, Interface segregation and Dependency inversion

https://news.ycombinator.com/item?id=14660895

https://en.wikipedia.org/wiki/Jeroo

http://blog.ircmaxell.com/2012/07/oop-vs-procedural-code.html

YouTube: Object-Oriented Programming is Bad - Brian Will

http://raganwald.com/2014/03/10/writing-oop-using-oop.html

http://lwn.net/SubscriberLink/548560/16388a4dbede3884/

http://en.wikipedia.org/wiki/Aspect-oriented_software_development

http://web.media.mit.edu/~lieber/Lieberary/OOP/Delegation/Delegation.html [11]

http://steve-yegge.blogspot.co.uk/2006/03/execution-in-kingdom-of-nouns.html

https://news.ycombinator.com/item?id=6932089

http://raganwald.com/2014/03/31/class-hierarchies-dont-do-that.html [12]

https://news.ycombinator.com/item?id=8830794

http://wcook.blogspot.co.uk/2012/07/proposal-for-simplified-modern.html [13]

https://blog.inf.ed.ac.uk/sapm/2014/02/04/the-anaemic-domain-model-is-no-anti-pattern-its-a-solid-design/

https://en.wikipedia.org/wiki/Literate_programming

http://akkartik.name/post/literate-programming [14]

Declarative

http://en.wikipedia.org/wiki/Declarative_programming - a programming paradigm—a style of building the structure and elements of computer programs—that expresses the logic of a computation without describing its control flow. Declarative programming often considers programs as theories of a formal logic, and computations as deductions in that logic space. Declarative programming may greatly simplify writing parallel programs. Common declarative languages include those of database query languages (e.g., SQL, XQuery), regular expressions, logic programming, functional programming, and configuration management systems.

http://latentflip.com/imperative-vs-declarative

We could do this in an imperative style like so:

var numbers = [1,2,3,4,5]
var doubled = []

for(var i = 0; i < numbers.length; i++) {
  var newNumber = numbers[i] * 2
  doubled.push(newNumber)
}
console.log(doubled) //=> [2,4,6,8,10]

We explicitly iterate over the length of the array, pull each element out of the array, double it, and add the doubled value to the new array, mutating the doubled array at each step until we are done. A more declarative approach might use the Array.map function and look like:

var numbers = [1,2,3,4,5]
 
var doubled = numbers.map(function(n) {
  return n * 2
})
console.log(doubled) //=> [2,4,6,8,10]

map creates a new array from an existing array, where each element in the new array is created by passing the elements of the original array into the function passed to map (function(n) { return n*2 } in this case). What the map function does is abstract away the process of explicitly iterating over the array, and lets us focus on what we want to happen. Note that the function we pass to map is pure; it doesn't have any side effects (change any external state), it just takes in a number and returns the number doubled.

Design, Composition and Performance - Rich Hickey explores the nature of design and composition and how it impacts the software development practice and tools.

Crash-only software: More than meets the eye [15]

Dataflow

See Dataflow

http://en.wikipedia.org/wiki/Dataflow_programming - a programming paradigm that models a program as a directed graph of the data flowing between operations, thus implementing dataflow principles and architecture. Dataflow programming languages share some features of functional languages, and were generally developed in order to bring some functional concepts to a language more suitable for numeric processing.

http://en.wikipedia.org/wiki/Reactive_programming - a programming paradigm oriented around data flows and the propagation of change. This means that it should be possible to express static or dynamic data flows with ease in the programming languages used, and that the underlying execution model will automatically propagate changes through the data flow.

http://staltz.com/dont-react/ [16]

https://en.wikipedia.org/wiki/Flow-based_programming - FBP, a programming paradigm that defines applications as networks of "black box" processes, which exchange data across predefined connections by message passing, where the connections are specified externally to the processes. These black box processes can be reconnected endlessly to form different applications without having to be changed internally. FBP is thus naturally component-oriented. FBP is a particular form of dataflow programming based on bounded buffers, information packets with defined lifetimes, named ports, and separate definition of connections.

Apache NiFi is a dataflow system based on the concepts of flow-based programming. It supports powerful and scalable directed graphs of data routing, transformation, and system mediation logic. NiFi has a web-based user interface for design, control, feedback, and monitoring of dataflows. It is highly configurable along several dimensions of quality of service, such as loss-tolerant versus guaranteed delivery, low latency versus high throughput, and priority-based queuing. NiFi provides fine-grained data provenance for all data received, forked, joined cloned, modified, sent, and ultimately dropped upon reaching its configured end-state. [17]

Constraint

https://en.wikipedia.org/wiki/Constraint_programming

Logic

https://en.wikipedia.org/wiki/Logic_programming - a programming paradigm based on formal logic. A program written in a logic programming language is a set of sentences in logical form, expressing facts and rules about some problem domain. Major logic programming language families include Prolog, Answer set programming (ASP) and Datalog. In all of these languages, rules are written in the form of clauses and are read declaratively as logical implications.

Functional

https://en.wikipedia.org/wiki/Functional_programming

https://en.wikipedia.org/wiki/Purely_functional

http://maryrosecook.com/blog/post/a-practical-introduction-to-functional-programming [18]

Functional Programming 101 - With Haskell [19]

λ Lessons - Pattern matching, first-class functions, and abstracting over recursion in Haskell. This is a short, interactive lesson that teaches core functional programming concepts. It was designed to transform the way you think about performing operations on lists of things, by showing you how functions are executed. [20]

http://www.infoq.com/presentations/functional-pros-cons

http://peteratt.com/programming-thinking-functional-way/ [21]

https://fsharpforfunandprofit.com/posts/recipe-part2/ [22]

http://www.harukizaemon.com/blog/2010/03/01/functional-programming-in-object-oriented-languages/
DrupalCon Munich 2012: Functional PHP

"Controlling Time and Spaaaaaace: understanding the many formulations of FRP" by Evan Czaplicki

http://en.wikipedia.org/wiki/Applicative_programming_language

http://en.wikipedia.org/wiki/Typed_lambda_calculus

http://en.wikipedia.org/wiki/First-class_function

Monads in Pictures - Aug 20th, 2012

Google Video: Type-driven testing in Haskell - Simon Peyton Jones

http://www.ibm.com/developerworks/library/j-ft20/

http://cstheory.stackexchange.com/questions/1539/whats-new-in-purely-functional-data-structures-since-okasaki

https://en.wikipedia.org/wiki/Hindley%E2%80%93Milner_type_system

http://www.infoq.com/presentations/functional-category-theory [24]

http://www.chrisstucchio.com/blog/2014/computers_are_made_of_metal.html [25]

http://bartoszmilewski.com/2014/12/23/kleisli-categories/ [26]

http://www.jpaulmorrison.com/fbp/introduction.html

https://en.wikipedia.org/wiki/Functional_logic_programming

https://news.ycombinator.com/item?id=11729270

http://philipnilsson.github.io/Badness10k/posts/2016-08-10-functional-patterns-monoid-morphism.html [27]

Values and types

https://en.wikipedia.org/wiki/Value_(computer_science) - an expression which cannot be evaluated any further (a normal form). The members of a type are the values of that type. For example, the expression 1 + 2 is not a value as it can be reduced to the expression 3. This expression cannot be reduced any further (and is a member of the type Nat) and therefore is a value. The "value of a variable" is given by the corresponding mapping in the environment. In languages with assignable variables it becomes necessary to distinguish between the r-value (or contents) and the l-value (or location) of a variable. In declarative (high-level) languages, values have to be referentially transparent. This means that the resulting value is independent of the location in which a (sub-)expression needed to compute the value is stored. Only the contents of the location (the bits, whether they are 1 or 0) and their interpretation are significant.

https://en.wikipedia.org/wiki/Data_type - simply type is a classification identifying one of various types of data, such as real, integer or Boolean, that determines the possible values for that type; the operations that can be done on values of that type; the meaning of the data; and the way values of that type can be stored.

https://en.wikipedia.org/wiki/Type_theory - any of a class of formal systems, some of which can serve as alternatives to set theory as a foundation for all mathematics. In type theory, every "term" has a "type" and operations are restricted to terms of a certain type.

YouTube: What type of thing is a type? by Ron Garcia

Type systems

http://c2.com/cgi/wiki?TypingQuadrant

https://en.wikipedia.org/wiki/Type_system - a collection of rules that assign a property called a type to constructs such as variables, expressions, functions or modules—a computer program is composed of. reduces bugs by defining interfaces between different parts of a computer program, and then checking that the parts have been connected in a consistent way. This checking can happen statically (at compile time), dynamically (at run time), or as a combination thereof.

Type systems have other purposes as well, such as enabling certain compiler optimizations, allowing for multiple dispatch, providing a form of documentation, etc. A type system associates a type with each computed value and, by examining the flow of these values, attempts to ensure or prove that no type errors can occur. The particular type system in question determines exactly what constitutes a type error, but in general the aim is to prevent operations expecting a certain kind of value from being used with values for which that operation does not make sense (logic errors); memory errors will also be prevented. Type systems are often specified as part of programming languages, and built into the interpreters and compilers for them; although the type system of a language can be extended by optional tools that perform additional kinds of checks using the language's original type syntax and grammar.

http://c2.com/cgi/wiki?TypeSystem

What to know before debating type systems

Seven deadly sins of talking about “types” [28]

https://en.wikipedia.org/wiki/Type_safety - the extent to which a programming language discourages or prevents type errors.

http://c2.com/cgi/wiki?TypeSafe

https://en.wikipedia.org/wiki/Type_signature - type annotation defines the inputs and outputs for a function, subroutine or method.
http://www.cs.unm.edu/~storm/C++/ProgrammingTerms/FunctionSignatures.html

https://en.wikipedia.org/wiki/Type_conversion - typecasting, and coercion are different ways of, implicitly or explicitly, changing an entity of one data type into another. This is done to take advantage of certain features of type hierarchies or type representations. One example would be small integers, which can be stored in a compact format and converted to a larger representation when used in arithmetic computations. In object-oriented programming, type conversion allows programs to treat objects of one type as one of their ancestor types to simplify interacting with them.

Type aliasing provides a way to redefine existing types as new type names. For example, type aliases may be used to define names for object types, effectively modeling interface types.

The Strict Aliasing Situation is Pretty Bad [29]

https://en.wikipedia.org/wiki/Type_inference - automatic deduction of the type of an expression in a programming language. If some, but not all, type annotations are already present it is referred to as type reconstruction. The opposite operation of type inference is called type erasure.

http://c2.com/cgi/wiki?TypeInference

https://en.wikipedia.org/wiki/Strong_and_weak_typing - in general, a strongly typed language is more likely to generate an error or refuse to compile if the argument passed to a function does not closely match the expected type. On the other hand, a very weakly typed language may produce unpredictable results or may perform implicit type conversion. A different but related concept is latent typing.

https://en.wikipedia.org/wiki/Latent_typing - type system where types are associated with values and not variables, requires run-time type checking

https://www.emacswiki.org/emacs/DynamicBindingVsLexicalBinding

https://en.wikipedia.org/wiki/Structural_type_system - a major class of type system, in which type compatibility and equivalence are determined by the type's actual structure or definition, and not by other characteristics such as its name or place of declaration. Structural systems are used to determine if types are equivalent and whether a type is a subtype of another. It contrasts with nominative systems, where comparisons are based on the names of the types or explicit declarations, and duck typing, in which only the part of the structure accessed at runtime is checked for compatibility.

https://en.wikipedia.org/wiki/Nominative_type_system - or nominative type system (or name-based type system) is a major class of type system, in which compatibility and equivalence of data types is determined by explicit declarations and/or the name of the types. Nominal systems are used to determine if types are equivalent, as well as if a type is a subtype of another. It contrasts with structural systems, where comparisons are based on the structure of the types in question and do not require explicit declarations.

https://en.wikipedia.org/wiki/Automatic_variable - a local variable which is allocated and deallocated automatically when program flow enters and leaves the variable's scope. The scope is the lexical context, particularly the function or block in which a variable is defined. Local data is typically (in most languages) invisible outside the function or lexical context where it is defined. Local data is also invisible and inaccessible to a called function,[note 1] but is not deallocated, coming back in scope as the execution thread returns to the caller.

https://en.wikipedia.org/wiki/Principal_type - In type theory, a type system is said to have the principal type property if, given a term and an environment, there exists a principal type for this term in this environment, i.e. a type such that all other types for this term in this environment are an instance of the principal type.

https://en.wikipedia.org/wiki/Duck_typing - style of typing in which an object's methods and properties determine the valid semantics, rather than its inheritance from a particular class or implementation of a specific interface

https://en.wikipedia.org/wiki/Substructural_type_system - family of type systems analogous to substructural logics where one or more of the structural rules are absent or allowed under controlled circumstances. Such systems are useful for constraining access to system resources such as files, locks and memory by keeping track of changes of state that occur and preventing invalid states

http://wphomes.soic.indiana.edu/jsiek/what-is-gradual-typing/ [30]

https://en.wikipedia.org/wiki/Type_constructor - a feature of a typed formal language that builds new types from old

https://en.wikipedia.org/wiki/Kind_%28type_theory%29 - the type of a type constructor or, less commonly, the type of a higher-order type operator. A kind system is essentially a simply typed lambda calculus "one level up", endowed with a primitive type, denoted * and called "type", which is the kind of any data type which does not need any type parameters.

https://en.wikipedia.org/wiki/Hindley%E2%80%93Milner_type_system - a classical type system for the lambda calculus with parametric polymorphism, first described by J. Roger Hindley and later rediscovered by Robin Milner. Luis Damas contributed a close formal analysis and proof of the method in his PhD thesis.mong HM's more notable properties is completeness and its ability to deduce the most general type of a given program without the need of any type annotations or other hints supplied by the programmer. Algorithm W is a fast algorithm, performing type inference in almost linear time with respect to the size of the source, making it practically usable to type large programs.[note 1] HM is preferably used for functional languages. It was first implemented as part of the type system of the programming language ML. Since then, HM has been extended in various ways, most notably by constrained types as used in Haskell.

http://c2.com/cgi/wiki?HindleyMilnerTypeInference

http://www.lispcast.com/Hindley-Milner-in-Clojure [31]

https://en.wikipedia.org/wiki/Substructural_type_system

http://blog.tweag.io/posts/2017-03-13-linear-types.html

Ideology - Some people claim that unit tests make type systems unnecessary: "types are just simple unit tests written for you, and simple unit tests aren't the important ones". Other people claim that type systems make unit tests unnecessary: "dynamic languages only need unit tests because they don't have type systems." What's going on here? These can't both be right. We'll use this example and a couple others to explore the unknown beliefs that structure our understanding of the world.

Machine data

Machine data types - All data in computers based on digital electronics is represented as bits (alternatives 0 and 1) on the lowest level. The smallest addressable unit of data is usually a group of bits called a byte (usually an octet, which is 8 bits). The unit processed by machine code instructions is called a word (as of 2011, typically 32 or 64 bits). Most instructions interpret the word as a binary number, such that a 32-bit word can represent unsigned integer values from 0 to 2^{32}-1 or signed integer values from -2^{31} to 2^{31}-1. Because of two's complement, the machine language and machine doesn't need to distinguish between these unsigned and signed data types for the most part.

There is a specific set of arithmetic instructions that use a different interpretation of the bits in word as a floating-point number. Machine data types need to be exposed or made available in systems or low-level programming languages, allowing fine-grained control over hardware. The C programming language, for instance, supplies integer types of various widths, such as short and long. If a corresponding native type does not exist on the target platform, the compiler will break them down into code using types that do exist. For instance, if a 32-bit integer is requested on a 16 bit platform, the compiler will tacitly treat it as an array of two 16 bit integers.

http://www.math.grin.edu/~rebelsky/Courses/152/97F/Readings/machine-data.html

Boolean

https://en.wikipedia.org/wiki/Boolean_data_type - a data type, having two values (usually denoted true and false), intended to represent the truth values of logic and Boolean algebra. It is named after George Boole, who first defined an algebraic system of logic in the mid 19th century. The Boolean data type is primarily associated with conditional statements, which allow different actions and change control flow depending on whether a programmer-specified Boolean condition evaluates to true or false. It is a special case of a more general logical data type; logic does not always have to be Boolean.

1,0

Numbers

https://en.wikipedia.org/wiki/Arbitrary-precision_arithmetic - bignum

https://en.wikipedia.org/wiki/Fixed-point_arithmetic

https://en.wikipedia.org/wiki/Integer_(computer_science)

https://en.wikipedia.org/wiki/Decimal_data_type

https://en.wikipedia.org/wiki/Real_data_type

https://en.wikipedia.org/wiki/Rational_data_type

https://en.wikipedia.org/wiki/Complex_data_type

https://en.wikipedia.org/wiki/Signedness - a property of data types representing numbers in computer programs. A numeric variable is signed if it can represent both positive and negative numbers, and unsigned if it can only represent non-negative numbers (zero or positive numbers).

As signed numbers can represent negative numbers, they lose a range of positive numbers that can only be represented with unsigned numbers of the same size (in bits) because half the possible values are non-positive values (so if an 8-bit is signed, positive unsigned values 128 to 255 are gone while -128 to 127 are present). Unsigned variables can dedicate all the possible values to the positive number range. For example, a two's complement signed 16-bit integer can hold the values −32768 to 32767 inclusively, while an unsigned 16 bit integer can hold the values 0 to 65535. For this sign representation method, the leftmost bit (most significant bit) denotes whether the value is positive or negative (0 for positive, 1 for negative).

Floating point

https://en.wikipedia.org/wiki/Floating_point

https://en.wikipedia.org/wiki/IEEE_754 - a technical standard for floating-point computation established in 1985 by the Institute of Electrical and Electronics Engineers (IEEE). The standard addressed many problems found in the diverse floating point implementations that made them difficult to use reliably and portably. Many hardware floating point units now use the IEEE 754 standard.

https://news.ycombinator.com/item?id=15359574

https://lobste.rs/s/eiwnnj/eight_bit_floating_point

p0907r0: Signed Integers are Two’s Complement - [34]

to sort

https://en.wikipedia.org/wiki/Primitive_data_type - a basic type, often built-in, used as a building block for composite types

https://en.wikipedia.org/wiki/Value_type - commonly used to refer to one of two kinds of data types: Types of values or Types of objects with deep copy semantics.

https://en.wikipedia.org/wiki/Reference_type - a data type that refers to an object in memory. A pointer type on the other hand refers to a memory address. Reference types can be thought of as pointers that are implicitly dereferenced. The objects being referred to are dynamically allocated on the heap whereas value types are allocated automatically on the stack. In languages supporting garbage collection the objects being referred to are destroyed automatically after they become unreachable.

https://en.wikipedia.org/wiki/Passive_data_structure - also termed a plain old data structure, or plain old data (POD)), is a term for a record, to contrast with objects. It is a data structure that is represented only as passive collections of field values (instance variables), without using object-oriented features. Passive data structures are appropriate when there is a part of a system where it should be clearly indicated that the detailed logic for data manipulation and integrity are elsewhere. PDSs are often found at the boundaries of a system, where information is being moved to and from other systems or persistent storage and the problem domain logic that is found in other parts of the system is irrelevant. For example, PDS would be convenient for representing the field values of objects that are being constructed from external data, in a part of the system where the semantic checks and interpretations needed for valid objects are not applied yet.

https://en.wikipedia.org/wiki/Reference_(computer_science) - a value that enables a program to indirectly access a particular datum, such as a variable or a record, in the computer's memory or in some other storage device. The reference is said to refer to the datum, and accessing the datum is called dereferencing the reference. A reference is distinct from the data itself. Typically, for references to data stored in memory on a given system, a reference is implemented as the physical address of where the data is stored in memory or in the storage device. For this reason, a reference is often erroneously confused with a pointer or address, and is said to "point to" the data. However a reference may also be implemented in other ways, such as the offset (difference) between the datum's address and some fixed "base" address, as an index into an array, or more abstractly as a handle. More broadly, in networking, references may be network addresses, such as URLs.

The concept of reference must not be confused with other values (keys or identifiers) that uniquely identify the data item, but give access to it only through a non-trivial lookup operation in some table data structure. References are widely used in programming, especially to efficiently pass large or mutable data as arguments to procedures, or to share such data among various uses. In particular, a reference may point to a variable or record that contains references to other data. This idea is the basis of indirect addressing and of many linked data structures, such as linked lists. References can cause significant complexity in a program, partially due to the possibility of dangling and wild references and partially because the topology of data with references is a directed graph, whose analysis can be quite complicated.

https://en.wikipedia.org/wiki/Symbol_(programming) - a primitive datatype whose instances have a unique human-readable form. Symbols can be used as identifiers. In some programming languages, they are called atoms.In the most trivial implementation, they are essentially named integers (e.g. the enumerated type in C).

https://en.wikipedia.org/wiki/Composite_data_type - any data type constructed using primitive data types and other composite types

https://en.wikipedia.org/wiki/Literal_(computer_programming) - a notation for representing a fixed value in source code. Almost all programming languages have notations for atomic values such as integers, floating-point numbers, and strings, and usually for booleans and characters; some also have notations for elements of enumerated types and compound values such as arrays, records, and objects. An anonymous function is a literal for the function type.

In contrast to literals, variables or constants are symbols that can take on one of a class of fixed values, the constant being constrained not to change. Literals are often used to initialize variables, for example, in the following, 1 is an integer literal and the three letter string in "cat" is a string literal:

int a = 1;
String s = "cat";

In lexical analysis, literals of a given type are generally a token type, with a grammar rule, like "a string of digits" for an integer literal. Some literals are specific keywords, like true for the boolean literal "true". In some object-oriented languages (like ECMAScript), objects can also be represented by literals. Methods of this object can be specified in the object literal using function literals.

https://en.wikipedia.org/wiki/String_literal

https://en.wikipedia.org/wiki/Variable_(programming)

https://en.wikipedia.org/wiki/Constant_(computer_science)

https://en.wikipedia.org/wiki/Record_(computer_science) - also called struct or compound data, is a basic data structure. A record is a collection of fields, possibly of different data types, typically in fixed number and sequence. The fields of records may also be called elements, though these risk confusion with elements of a collection, or members, particularly in object-oriented programming. A tuple may or may not be considered a record, and vice versa, depending on conventions and the specific programming language.

https://en.wikipedia.org/wiki/Tuple
https://en.wikipedia.org/wiki/Dependent_type - a type that depends on a value

https://en.wikipedia.org/wiki/Algebraic_data_type
https://en.wikipedia.org/wiki/Product_type - a product of types is another, compounded, type in a structure. associated with logical conjunction (AND) in logic.
https://en.wikipedia.org/wiki/Tagged_union - sum type. data structure used to hold a value that could take on several different, but fixed types. Only one of the types can be in use at any one time, and a tag field explicitly indicates which one is in use.

https://en.wikipedia.org/wiki/Recursive_data_type - inductive data type, a data type for values that may contain other values of the same type. Data of recursive types are usually viewed as directed graphs.

http://stackoverflow.com/questions/47882/what-is-a-magic-number-and-why-is-it-bad

https://news.ycombinator.com/item?id=7188165

https://en.wikipedia.org/wiki/Queue_(abstract_data_type)

https://en.wikipedia.org/wiki/Object_composition - a way to combine simple objects or data types into more complex ones

http://cosmic.mearie.org/2014/01/periodic-table-of-rust-types/ [35]

https://en.wikipedia.org/wiki/Type_constructor - feature of a typed formal language that builds new types from old, such as product types, function types, power types and list types

https://en.wikipedia.org/wiki/Abstract_data_type - a mathematical model for data types where a data type is defined by its behavior (semantics) from the point of view of a user of the data, specifically in terms of possible values, possible operations on data of this type, and the behavior of these operations. This contrasts with data structures, which are concrete representations of data, and are the point of view of an implementer, not a user.

https://en.wikipedia.org/wiki/Semaphore_(programming) - a variable or abstract data type that is used for controlling access, by multiple processes, to a common resource in a concurrent system such as a multiprogramming operating system. A trivial semaphore is a plain variable that is changed (for example, incremented or decremented, or toggled) depending on programmer-defined conditions. The variable is then used as a condition to control access to some system resource.

The Little Book of Semaphores - a free (in both senses of the word) textbook that introduces the principles of synchronization for concurrent programming. [36] [37]

Data structures

http://en.wikipedia.org/wiki/Data_structure

https://en.wikipedia.org/wiki/Plain_old_data_structure

http://www.cs.usfca.edu/~galles/visualization/Algorithms.html [39]

http://en.wikipedia.org/wiki/Hash_table

http://en.wikipedia.org/wiki/Array_data_structure

http://en.wikipedia.org/wiki/Array_data_type

http://en.wikipedia.org/wiki/Associative_array

http://en.wikipedia.org/wiki/Set_data_structure

http://en.wikipedia.org/wiki/Abstract_semantic_graph

http://en.wikipedia.org/wiki/Bit_string

http://en.wikipedia.org/wiki/Tree_(data_structure)

https://en.wikipedia.org/wiki/Tree_traversal

https://eugene-eeo.github.io/blog/tree-traversal-storage.html [40]

http://en.wikipedia.org/wiki/Binary_tree

http://en.wikipedia.org/wiki/Binary_search_tree

http://en.wikipedia.org/wiki/B-tree

http://en.wikipedia.org/wiki/Rope_%28data_structure%29

http://randomcomputation.blogspot.co.uk/2013/08/blog-post.html

https://github.com/antirez/rax
- https://news.ycombinator.com/item?id=101987

http://courses.csail.mit.edu/6.851/spring12/

http://www.careerride.com/c-arrays-and-linked-list.aspx

The difference between arrays and linked lists are:

- Arrays are linear data structures. Linked lists are linear and non-linear data structures. - Linked lists are linear for accessing, and non-linear for storing in memory - Array has homogenous values. And each element is independent of each other positions. Each node in the linked list is connected with its previous node which is a pointer to the node. - Array elements can be modified easily by identifying the index value. It is a complex process for modifying the node in a linked list. - Array elements can not be added, deleted once it is declared. The nodes in the linked list can be added and deleted from the list.

http://www.infoq.com/presentations/Data-Structures

Data Structures for Text Sequences [41]

https://www.snellman.net/blog/archive/2016-12-13-ring-buffers/ [42]

http://algoviz.org/OpenDSA/

http://www.brpreiss.com/books/opus5/html/book.html

http://okasaki.blogspot.ru/2008/02/ten-years-of-purely-functional-data.html

http://www.xanadu.com/zigzag/
YouTube: Ted Nelson on Zigzag data structures - relationship vector, hyperthoginal

https://news.ycombinator.com/item?id=13263038

https://en.wikipedia.org/wiki/Semaphore_%28programming%29

https://news.ycombinator.com/item?id=8761539

http://beautifulracket.com/explainer/continuations.html

https://en.wikipedia.org/wiki/Stack_(abstract_data_type)

https://en.wikipedia.org/wiki/Stack_register

https://en.wikipedia.org/wiki/Symbol_table

http://kukuruku.co/hub/cpp/lock-free-data-structures-the-evolution-of-a-stack [43]

"Visualizing Persistent Data Structures" by Dann Toliver

Mutability

http://en.wikipedia.org/wiki/Persistent_data_structure

http://davesquared.net/2013/03/reasoning-and-mutability.html

Evaluation

http://en.wikipedia.org/wiki/Evaluation

http://en.wikipedia.org/wiki/Expression_(mathematics)

http://en.wikipedia.org/wiki/Expression_(computer_science)

An expression evaluates to a value. A statement does something.

x = 1
y = x + 1     # an expression
print y       # a statement, prints 2

http://en.wikipedia.org/wiki/Statement_(programming)

Heuristic definition of an expression [44]

https://en.wikipedia.org/wiki/Evaluation_strategy

https://en.wikipedia.org/wiki/Expression-oriented_programming_language

https://en.wikipedia.org/wiki/Referential_transparency_(computer_science)

https://en.wikipedia.org/wiki/Side_effect_(computer_science)

https://en.wikipedia.org/wiki/Information_hiding

https://en.wikipedia.org/wiki/Conditional_(computer_programming)

https://en.wikipedia.org/wiki/Strict_function - in the denotational semantics of programming languages is a function f where f\left(\perp\right) = \perp. The entity \perp, called bottom, denotes an expression which does not return a normal value, either because it loops endlessly or because it aborts due to an error such as division by zero. A function which is not strict is called non-strict. A strict programming language is one in which user-defined functions are always strict.

Intuitively, non-strict functions correspond to control structures. Operationally, a strict function is one which always evaluates its argument; a non-strict function is one which may not evaluate some of its arguments. Functions having more than one parameter may be strict or non-strict in each parameter independently, as well as jointly strict in several parameters simultaneously.

https://en.wikipedia.org/wiki/Non-strict_programming_language - A strict programming language is one in which only strict functions (functions whose parameters must be evaluated completely before they may be called) may be defined by the user. A non-strict programming language allows the user to define non-strict functions, and hence may allow lazy evaluation.

https://en.wikipedia.org/wiki/Control_flow

http://en.wikipedia.org/wiki/Rewriting

Vimeo: Impossible Programs - Scottish Ruby Conf

Operators

https://en.wikipedia.org/wiki/Operator_(programming) - constructs which behave generally like functions, but which differ syntactically or semantically from usual functions. Common simple examples include arithmetic (addition with +, comparison with >) and logical operations (such as AND or &&). More involved examples include assignment (usually = or :=), field access in a record or object (usually .), and the scope resolution operator (often ::). Languages usually define a set of built-in operators, and in some cases allow user-defined operators.

https://en.wikipedia.org/wiki/Operators_in_C_and_C%2B%2B

Functions

http://en.wikipedia.org/wiki/Subroutine or function is a sequence of program instructions that perform a specific task, packaged as a unit. This unit can then be used in programs wherever that particular task should be performed. Subprograms may be defined within programs, or separately in libraries that can be used by multiple programs. In different programming languages a subroutine may be called a procedure, a function, a routine, a method, or a subprogram. The generic term callable unit is sometimes used.

http://en.wikipedia.org/wiki/Closure_(computer_science) - also lexical closures or function closures are a technique for implementing lexically scoped name binding in languages with first-class functions. Operationally, a closure is a record storing a function[a] together with an environment:[1] a mapping associating each free variable of the function (variables that are used locally, but defined in an enclosing scope) with the value or storage location to which the name was bound when the closure was created.[b] A closure—unlike a plain function—allows the function to access those captured variables through the closure's reference to them, even when the function is invoked outside their scope.

What's a Closure?

http://en.wikipedia.org/wiki/Call_site - a call site of a function or subroutine is the location (line of code) where the function is called (or may be called, through dynamic dispatch). A call site is where zero or more arguments are passed to the function, and zero or more return values are received.

http://en.wikipedia.org/wiki/Anonymous_function - also function literal or lambda abstraction is a function definition that is not bound to an identifier. Anonymous functions are often: arguments being passed to higher-order functions, or used for constructing the result of a higher-order function that needs to return a function.

If the function is only used once, or a limited number of times, an anonymous function may be syntactically lighter than using a named function. Anonymous functions are ubiquitous in functional programming languages and other languages with first-class functions, where they fulfill the same role for the function type as literals do for other data types.

http://en.wikipedia.org/wiki/Function_type - the type of a variable or parameter to which a function has or can be assigned, or an argument or result type of a higher-order function taking or returning a function.

http://en.wikipedia.org/wiki/Function_prototype - or function interface is a declaration of a function that specifies the function's name and type signature (arity, parameter types, and return type), but omits the function body. The term is particularly used in C and C++. While a function definition specifies how the function does what it does (the "implementation"), a function prototype merely specifies its interface, i.e. what data types go in and come out of it.

In a prototype, parameter names are optional (and in C/C++ have function prototype scope, meaning their scope ends at the end of the prototype), however, the type is necessary along with all modifiers (e.g. if it is a pointer or a const parameter). In object-oriented programming, interfaces and abstract methods serve much the same purpose.

http://en.wikipedia.org/wiki/Function_objects - allows an object to be invoked or called as if it were an ordinary function, usually with the same syntax (a function parameter that can also be a function).

http://journal.stuffwithstuff.com/2013/08/26/what-is-open-recursion/ [46]

http://en.wikipedia.org/wiki/Callback_(computer_science) - a piece of executable code that is passed as an argument to other code, which is expected to call back (execute) the argument at some convenient time. The invocation may be immediate as in a synchronous callback, or it might happen at a later time as in an asynchronous callback. In all cases, the intention is to specify a function or subroutine as an entity that is, depending on the language, more or less similar to a variable. Programming languages support callbacks in different ways, often implementing them with subroutines, lambda expressions, blocks, or function pointers.

http://en.wikipedia.org/wiki/Tail_call - a subroutine call performed as the final action of a procedure. If a tail call might lead to the same subroutine being called again later in the call chain, the subroutine is said to be tail-recursive, which is a special case of recursion. Tail recursion (or tail-end recursion) is particularly useful, and often easy to handle in implementations.

https://medium.com/binary-dreams/from-java-to-scala-tail-recursion-a6acdd71a94d

http://en.wikipedia.org/wiki/Function_pointer - Instead of referring to data values, a function pointer points to executable code within memory. When dereferenced, a function pointer can be used to invoke the function it points to and pass it arguments just like a normal function call. Such an invocation is also known as an "indirect" call, because the function is being invoked indirectly through a variable instead of directly through a fixed name or address. Function pointers can be used to simplify code by providing a simple way to select a function to execute based on run-time values.

http://en.wikipedia.org/wiki/Funarg_problem - refers to the difficulty in implementing first-class functions (functions as first-class objects) in programming language implementations so as to use stack-based memory allocation of the functions. The difficulty only arises if the body of a nested function refers directly (i.e., not via argument passing) to identifiers defined in the environment in which the function is defined, but not in the environment of the function call. To summarize the discussion below, two standard resolutions are to either forbid such references or to create closures.

http://en.wikipedia.org/wiki/Hash_function - any function that can be used to map data of arbitrary size to data of fixed size. The values returned by a hash function are called hash values, hash codes, hash sums, or simply hashes. One use is a data structure called a hash table, widely used in computer software for rapid data lookup. Hash functions accelerate table or database lookup by detecting duplicated records in a large file. An example is finding similar stretches in DNA sequences. They are also useful in cryptography.

https://en.wikipedia.org/wiki/Foreign_function_interface

http://arxiv.org/abs/1406.2294v1 [47] [48]

http://en.wikipedia.org/wiki/Inline_expansion

http://en.wikipedia.org/wiki/Defunctionalization

From callback to (Future -> Functor -> Monad) [49]

http://www.math.upenn.edu/~wilf/gfologyLinked2.pdf [50]

Polymorphism

https://en.wikipedia.org/wiki/Polymorphism_(computer_science) - the provision of a single interface to entities of different types. A polymorphic type is one whose operations can also be applied to values of some other type, or types

https://en.wikipedia.org/wiki/Ad_hoc_polymorphism - a kind of polymorphism in which polymorphic functions can be applied to arguments of different types, because a polymorphic function can denote a number of distinct and potentially heterogeneous implementations depending on the type of argument(s) to which it is applied. It is also known as function overloading or operator overloading. The term ad hoc in this context is not intended to be pejorative; it refers simply to the fact that this type of polymorphism is not a fundamental feature of the type system.

https://en.wikipedia.org/wiki/Function_overloading

https://en.wikipedia.org/wiki/Operator_overloading

https://en.wikipedia.org/wiki/Parametric_polymorphism - a way to make a language more expressive, while still maintaining full static type-safety. Using parametric polymorphism, a function or a data type can be written generically so that it can handle values identically without depending on their type. Such functions and data types are called generic functions and generic datatypes respectively and form the basis of generic programming.

https://en.wikipedia.org/wiki/Subtyping

https://en.wikipedia.org/wiki/Polymorphism_in_object-oriented_programming

https://en.wikipedia.org/wiki/Dynamic_dispatch

https://en.wikipedia.org/wiki/Multiple_dispatch

http://eli.thegreenplace.net/2016/a-polyglots-guide-to-multiple-dispatch/ [51]

Control structures

https://en.wikipedia.org/wiki/Control_flow

http://degoes.net/articles/destroy-all-ifs [52]

https://en.wikipedia.org/wiki/Exception_handling

https://en.wikipedia.org/wiki/Continuation

https://en.wikipedia.org/wiki/Container_(abstract_data_type)

https://en.wikipedia.org/wiki/Coroutines

https://en.wikipedia.org/wiki/Generator_(computer_science) - a special routine that can be used to control the iteration behaviour of a loop. In fact, all generators are iterators. A generator is very similar to a function that returns an array, in that a generator has parameters, can be called, and generates a sequence of values. However, instead of building an array containing all the values and returning them all at once, a generator yields the values one at a time, which requires less memory and allows the caller to get started processing the first few values immediately. In short, a generator looks like a function but behaves like an iterator.

Generators can be implemented in terms of more expressive control flow constructs, such as coroutines or first-class continuations. Generators, also known as semicoroutines, are a special case of (and weaker than) coroutines, in that they always yield control back to the caller (when passing a value back), rather than specifying a coroutine to jump to

Algorithms

http://en.wikipedia.org/wiki/Algorithm

https://news.ycombinator.com/item?id=15469660

http://en.wikipedia.org/wiki/Algorithm_characterizations

https://en.wikipedia.org/wiki/Algorithmic_information_theory - a subfield of information theory and computer science that concerns itself with the relationship between computation and information. According to Gregory Chaitin, it is "the result of putting Shannon's information theory and Turing's computability theory into a cocktail shaker and shaking vigorously."

http://www.jamisbuck.org/presentations/rubyconf2011/index.html

http://www3.cs.stonybrook.edu/~algorith/video-lectures/

http://en.wikipedia.org/wiki/Memoization

http://interactivepython.org/runestone/static/pythonds/index.html [53]

http://www.nayuki.io/page/fast-fibonacci-algorithms [54]

http://visualgo.net/

https://idea-instructions.com [58]

http://bost.ocks.org/mike/algorithms/ [61]

https://news.ycombinator.com/item?id=8088842

http://justin.abrah.ms/computer-science/how-to-calculate-big-o.html

http://accidentallyquadratic.tumblr.com/
- http://accidentallyquadratic.tumblr.com/post/113840433022/why-accidentally-quadratic [62]

http://news.ycombinator.com/item?id=4783301

http://leftnode.com/entry/i-am-a-great-programmer-but-horrible-algorithmist.html

http://jake.simvla.com/posts/531-mr-mrs-kipling

https://www.varnish-cache.org/trac/wiki/ArchitectNotes

http://developers.memsql.com/blog/common-pitfalls-in-writing-lock-free-algorithms/

https://queue.acm.org/detail.cfm?id=1814327

http://compgeom.cs.uiuc.edu/~jeffe/teaching/algorithms/

http://keyj.emphy.de/balanced-shuffle/ [63]

http://www.eternallyconfuzzled.com/jsw_home.aspx [64]

http://files.righto.com/calculator/sinclair_scientific_simulator.html

http://planning.cs.uiuc.edu/

http://www.rockpapershotgun.com/2010/11/02/genetic-algorithms-find-build-order-from-hell/

http://theory.stanford.edu/~amitp/GameProgramming/AStarComparison.html [65]

http://jimkang.com/quadtreevis/

https://flafla2.github.io/2014/08/09/perlinnoise.html [https://news.ycombinator.com/item?id=8158572

http://katrinaeg.com/simulated-annealing.html [67]

https://en.wikipedia.org/wiki/Correctness_(computer_science)

https://en.wikipedia.org/wiki/Hoare_logic

https://en.wikipedia.org/wiki/Logical_assertion

https://en.wikipedia.org/wiki/Assertion_(software_development) - a statement that a predicate (Boolean-valued function, a true–false expression) is expected to always be true at that point in the code. If an assertion evaluates to false at run time, an assertion failure results, which typically causes the program to crash, or to throw an assertion exception.

https://en.wikipedia.org/wiki/Invariant_%28computer_science%29 - a condition that can be relied upon to be true during execution of a program, or during some portion of it. It is a logical assertion that is held to always be true during a certain phase of execution. For example, a loop invariant is a condition that is true at the beginning and end of every execution of a loop.

https://en.wikipedia.org/wiki/Loop_invariant

https://en.wikipedia.org/wiki/Class_invariant

https://www.cs.princeton.edu/~rs/talks/AC11-Cardinality.pdf

Libraries

Module pattern

http://depstack.io/

System calls

See *nix

https://en.wikipedia.org/wiki/System_call

http://stackoverflow.com/questions/10149621/where-can-i-find-system-call-source-code

http://jvns.ca/blog/2016/01/23/sendfile-a-new-to-me-system-call/

Macros

http://en.wikipedia.org/wiki/Macro_(computer_science)

http://www.loper-os.org/?p=401

http://www.wilfred.me.uk/blog/2014/09/15/comparative-macrology/ [68]

Metaprogramming

https://en.wikipedia.org/wiki/Metaprogramming - the writing of computer programs with the ability to treat programs as their data. It means that a program could be designed to read, generate, analyse or transform other programs, and even modify itself while running. In some cases, this allows programmers to minimize the number of lines of code to express a solution (hence reducing development time), or it gives programs greater flexibility to efficiently handle new situations without recompilation. The language in which the metaprogram is written is called the metalanguage. The language of the programs that are manipulated is called the object language. The ability of a programming language to be its own metalanguage is called reflection or reflexivity

http://c2.com/cgi/wiki?MetaProgramming

https://hookrace.net/blog/introduction-to-metaprogramming-in-nim/ [69]

http://weblog.therealadam.com/2011/12/09/why-metaprogram-when-you-can-program/

Monads

Aspect of functional. See Haskell, etc. for related.

http://cs.coloradocollege.edu/~bylvisaker/MonadMotivation/ [70]

http://gbracha.blogspot.com.au/2011/01/maybe-monads-might-not-matter.html

http://adit.io/posts/2013-06-10-three-useful-monads.html

http://www.stephendiehl.com/posts/monads.html

http://www.clojure.net/2013/02/02/Comonads/

http://marijnhaverbeke.nl/blog/common-lisp-monads.html [71]

https://gist.github.com/kachayev/b5887f66e2985a21a466 [72]

http://rcrowley.org/2011/09/21/monads.html [73]

http://roscidus.com/blog/blog/2014/10/27/visualising-an-asynchronous-monad/ [74]

http://okmij.org/ftp/Computation/IO-monad-history.html [75]

Events

http://en.wikipedia.org/wiki/Event_loop

http://en.wikipedia.org/wiki/Event_(computing)

http://en.wikipedia.org/wiki/Event-driven_programming

http://en.wikipedia.org/wiki/Futures_and_promises

Messaging

https://en.wikipedia.org/wiki/Interface_description_language

https://en.wikipedia.org/wiki/Messaging_pattern

http://c2.com/cgi/wiki?AlanKayOnMessaging [76]

https://en.wikipedia.org/wiki/Publish–subscribe_pattern

https://en.wikipedia.org/wiki/Message_queue - and mailboxes are software-engineering components used for interprocess communication, or for inter-thread communication within the same process.

http://blog.iron.io/2012/12/top-10-uses-for-message-queue.html

zeromq - Distributed Messaging. Connect your code in any language, on any platform. Carries messages across inproc, IPC, TCP, TIPC, multicast. Smart patterns like pub-sub, push-pull, and router-dealer. High-speed asynchronous I/O engines, in a tiny library. Backed by a large and active open source community. Supports every modern language and platform. Build any architecture: centralized, distributed, small, or large. Free software with full commercial support.
- ØMQ - The Guide
- Chapter Eight

https://github.com/hyperglyph/hyperglyph - ducked typed ipc over http
- Vimeo: Thomas Figg @iamtef - RESTful-RPC or how I learned to stop worrying and love hypermedia

MessagePack - an efficient binary serialization format. It lets you exchange data among multiple languages like JSON but it's faster and smaller. For example, small integers (like flags or error code) are encoded into a single byte, and typical short strings only require an extra byte in addition to the strings themselves.

AMQP - Advanced Message Queuing Protocol, an open standard for passing business messages between applications or organizations. It connects systems, feeds business processes with the information they need and reliably transmits onward the instructions that achieve their goals.

RabbitMQ - the most widely deployed open source message broker. RabbitMQ is lightweight and easy to deploy on premises and in the cloud. It supports multiple messaging protocols. RabbitMQ can be deployed in distributed and federated configurations to meet high-scale, high-availability requirements.

https://news.ycombinator.com/item?id=8377345 - using a DB for messaging?

https://blog.jooq.org/2014/09/26/using-your-rdbms-for-messaging-is-totally-ok/

Concurrency

https://en.wikipedia.org/wiki/Concurrent_computing - a form of computing in which several computations are executing during overlapping time periods—concurrently—instead of sequentially (one completing before the next starts). This is a property of a system—this may be an individual program, a computer, or a network—and there is a separate execution point or "thread of control" for each computation ("process"). A concurrent system is one where a computation can make progress without waiting for all other computations to complete—where more than one computation can make progress at "the same time".

https://en.wikipedia.org/wiki/Concurrency_(computer_science)

https://en.wikipedia.org/wiki/Concurrency_control

Concurrency is not parallelism - The Go Blog - But when people hear the word concurrency they often think of parallelism, a related but quite distinct concept. In programming, concurrency is the composition of independently executing processes, while parallelism is the simultaneous execution of (possibly related) computations. Concurrency is about dealing with lots of things at once. Parallelism is about doing lots of things at once. With talk from Rob Pike on Go.

Hand wavey definition

Concurrency = dealing with a lot of thigns at one
Parallelism = doing a lot of things at once

https://en.wikipedia.org/wiki/Thread_(computing)

http://250bpm.com/blog:81 [77]

https://github.com/0xAX/linux-insides/blob/master/SyncPrim/sync-4.md [78]

https://en.wikipedia.org/wiki/Mutual_exclusion - the requirement of ensuring that no two concurrent processes are in their critical section at the same time; it is a basic requirement in concurrency control, to prevent race conditions. Here, a critical section refers to a period when the process accesses a shared resource, such as shared memory. The requirement of mutual exclusion was first identified and solved by Edsger W. Dijkstra in his seminal 1965 paper titled Solution of a problem in concurrent programming control, and is credited as the first topic in the study of concurrent algorithms.

https://en.wikipedia.org/wiki/Race_condition#Software

https://en.wikipedia.org/wiki/Critical_section - a part of a multi-process program that may not be concurrently executed by more than one of the program's processes.[a] In other words, it is a piece of a program that requires mutual exclusion of access.[1] Typically, the critical section accesses a shared resource, such as a data structure, a peripheral device, or a network connection, that does not allow multiple concurrent accesses.

https://en.wikipedia.org/wiki/Spinlock

https://en.wikipedia.org/wiki/Concurrent_algorithm

https://drive.google.com/file/d/0B1n48l1C3eK8OWR5d0daTko2X2s/view [79]

https://en.wikipedia.org/wiki/Synchronization_(computer_science)#Thread_or_process_synchronization

https://en.wikipedia.org/wiki/Deadlock - a situation in which two or more competing actions are each waiting for the other to finish, and thus neither ever does.

https://en.wikipedia.org/wiki/Starvation_(computer_science) - where a process is perpetually denied necessary resources to process its work.

https://en.wikipedia.org/wiki/Busy_waiting

https://en.wikipedia.org/wiki/Lock_(computer_science) - or mutex (from mutual exclusion) is a synchronization mechanism for enforcing limits on access to a resource in an environment where there are many threads of execution. A lock is designed to enforce a mutual exclusion concurrency control policy.

https://en.wikipedia.org/wiki/Message_passing

https://en.wikipedia.org/wiki/Channel_(programming)

Futures and promises

http://en.wikipedia.org/wiki/Futures_and_promises - future, promise, delay, and deferred refer to constructs used for synchronizing in some concurrent programming languages. They describe an object that acts as a proxy for a result that is initially unknown, usually because the computation of its value is yet incomplete.

https://news.ycombinator.com/item?id=11526887

Parallelism

https://en.wikipedia.org/wiki/Parallel_computing - a type of computation in which many calculations or the execution of processes are carried out concurrently. Large problems can often be divided into smaller ones, which can then be solved at the same time. There are several different forms of parallel computing: bit-level, instruction-level, data, and task parallelism. Parallelism has long been employed in high-performance computing, but it's gaining broader interest due to the physical constraints preventing frequency scaling.[2] As power consumption (and consequently heat generation) by computers has become a concern in recent years, parallel computing has become the dominant paradigm in computer architecture, mainly in the form of multi-core processors.

https://en.wikipedia.org/wiki/Analysis_of_parallel_algorithms - Like in the analysis of "ordinary", sequential, algorithms, one is typically interested in asymptotic bounds on the resource consumption (mainly time spent computing), but the analysis is performed in the presence of multiple processor units that cooperate to perform computations. Thus, one can determine not only how many "steps" a computation takes, but also how much faster it becomes as the number of processors goes up.

https://en.wikipedia.org/wiki/Bit-level_parallelism - a form of parallel computing based on increasing processor word size. Increasing the word size reduces the number of instructions the processor must execute in order to perform an operation on variables whose sizes are greater than the length of the word. (For example, consider a case where an 8-bit processor must add two 16-bit integers. The processor must first add the 8 lower-order bits from each integer, then add the 8 higher-order bits, requiring two instructions to complete a single operation. A 16-bit processor would be able to complete the operation with single instruction.) Originally, all electronic computers were serial (single-bit) computers. The first electronic computer that was not a serial computer—the first bit-parallel computer—was the 16-bit Whirlwind from 1951. From the advent of very-large-scale integration (VLSI) computer chip fabrication technology in the 1970s until about 1986, advancements in computer architecture were done by increasing bit-level parallelism,[1] as 4-bit microprocessors were replaced by 8-bit, then 16-bit, then 32-bit microprocessors. This trend generally came to an end with the introduction of 32-bit processors, which have been a standard in general purpose computing for two decades. Only recently, with the advent of x86-64 architectures, have 64-bit processors become commonplace. On 32-bit processors, external data bus width continues to increase. For example, DDR1 SDRAM transfers 128 bits per clock cycle. DDR2 SDRAM transfers a minimum of 256 bits per burst.

https://en.wikipedia.org/wiki/Instruction-level_parallelism - a measure of how many of the instructions in a computer program can be executed simultaneously. There are two approaches to instruction level parallelism: in hardware and in software. Hardware level works upon dynamic parallelism whereas, the software level works on static parallelism. Dynamic parallelism means the processor decides at run time which instructions to execute in parallel, whereas static parallelism means the compiler decides which instructions to execute in parallel. The Pentium processor works on the dynamic sequence of parallel execution but the Itanium processor works on the static level parallelism.

https://en.wikipedia.org/wiki/Data_parallelism - a form of parallelization across multiple processors in parallel computing environments. It focuses on distributing the data across different nodes, which operate on the data in parallel. It can be applied on regular data structures like arrays and matrices by working on each element in parallel.

https://en.wikipedia.org/wiki/Task_parallelism - also known as function parallelism and control parallelism, is a form of parallelization of computer code across multiple processors in parallel computing environments. Task parallelism focuses on distributing tasks—concurrently performed by processes or threads—across different processors. In contrast to data parallelism which involves running the same task on different components of data, task parallelism is distinguished by running many different tasks at the same time on the same data. A common type of task parallelism is pipelining which consists of moving a single set of data through a series of separate tasks where each task can execute independently of the others.

Garbage collection

https://en.wikipedia.org/wiki/Garbage_collection_(computer_science)

https://en.wikipedia.org/wiki/Cheney%27s_algorithm

Old New Thing: Everybody thinks about garbage collection the wrong way

http://xtzgzorex.wordpress.com/2012/10/11/demystifying-garbage-collectors/

https://news.ycombinator.com/item?id=10794026

http://blog.brownplt.org/2013/02/19/teaching-gc.html

http://michaelrbernste.in/2013/06/03/real-time-garbage-collection-is-real.html

http://journal.stuffwithstuff.com/2013/12/08/babys-first-garbage-collector/ [81]

https://news.ycombinator.com/item?id=7499101

https://news.ycombinator.com/item?id=8263811

Patterns

https://en.wikipedia.org/wiki/Design_pattern

https://github.com/kamranahmedse/design-patterns-for-humans/blob/master/README.md [82]

https://en.wikipedia.org/wiki/Software_design_pattern

https://news.ycombinator.com/item?id=12849757

https://en.wikipedia.org/wiki/Architectural_pattern

WikiWikiWeb - aka Wards Wiki

"ExtremeProgramming -- all programming is maintenance."

Portland Pattern Repository
- https://en.wikipedia.org/wiki/Portland_Pattern_Repository

http://stackoverflow.com/questions/92225/what-are-the-best-resources-for-design-patterns-and-their-uses

http://perldesignpatterns.com/?HomePage

http://blog.plover.com/prog/design-patterns.html

http://www.mcdonaldland.info/files/designpatterns/designpatternscard.pdf

http://en.wikipedia.org/wiki/Architectural_pattern_%28computer_science%29

http://en.wikipedia.org/wiki/Common_layers_in_an_information_system_logical_architecture

http://en.wikipedia.org/wiki/Module_pattern

http://modulecounts.com/

http://en.wikipedia.org/wiki/Singleton_pattern - restrict the instantiation of a class to one object.

https://programmers.stackexchange.com/questions/40373/so-singletons-are-bad-then-what

http://en.wikipedia.org/wiki/Behavioral_pattern

https://en.wikipedia.org/wiki/Actor_model

https://en.wikipedia.org/wiki/Inversion_of_control

http://c2.com/cgi/wiki?CollectionOrientedProgramming

http://www.perldesignpatterns.com/?PerlDesignPatterns

http://www.embeddedrelated.com/showarticle/455.php

http://www.jroller.com/landers/entry/the_design_pattern_facade_pattern

https://news.ycombinator.com/item?id=6790392

http://ssdl-wiki.cs.technion.ac.il/wiki/index.php/Spartan_programming

http://meyerweb.com/eric/comment/chech.html

http://www.confreaks.com/videos/3337-railsconf-keynote-10-years - convention over configuration

YouTube: Lambda Jam 2014 - Eric Meijer - The Lost Art of Denotational Semantics

MV*

http://mvc.givan.se/ [83]

http://mikepackdev.com/blog_posts/34-a-review-of-mvc

"create your views, express your models or develop a controller"

http://gee.cs.oswego.edu/dl/acs/acs/acs.html?1 - reactive [84]

Push/pull

http://en.wikipedia.org/wiki/Publish%E2%80%93subscribe_pattern

http://en.wikipedia.org/wiki/Push_technology

GUI

See UI, Graphics, etc.

https://en.wikipedia.org/wiki/Layout_engine - a computer program that combines content and formatting information for electronic or printed display. It is not a stand-alone program but a core component of larger programs. There are different types of layout engines. One is the web browser engines used in web browsers and email clients. Another type is the layout managers used in widget toolkits.

https://en.wikipedia.org/wiki/List_of_layout_engines

Compile and link time

to resort

https://en.wikipedia.org/wiki/Preprocessor - a program that processes its input data to produce output that is used as input to another program. The output is said to be a preprocessed form of the input data, which is often used by some subsequent programs like compilers. The amount and kind of processing done depends on the nature of the preprocessor; some preprocessors are only capable of performing relatively simple textual substitutions and macro expansions, while others have the power of full-fledged programming languages.

Compiler

https://en.wikipedia.org/wiki/Compiler

https://en.wikipedia.org/wiki/Compile_time

http://c2.com/cgi/wiki?TheKenThompsonHack - Ken's acceptance speech Reflections On Trusting Trust describes a hack (in every sense), the most subversive ever perpetrated, nothing less than the root password of all evil. Ken describes how he injected a virus into a compiler. Not only did his compiler know it was compiling the login function and inject a backdoor, but it also knew when it was compiling itself and injected the backdoor generator into the compiler it was creating. The source code for the compiler thereafter contains no evidence of either virus.

https://en.wikipedia.org/wiki/History_of_compiler_construction

How to compile with continuations

Compiler Explorer - Run compilers interactively from your web browser and interact with the assembly] - an interactive compiler. The left-hand pane shows editable C, C++, Rust, Go, D, Haskell, Swift and Pascal code. The right, the assembly output of having compiled the code with a given compiler and settings. Multiple compilers are supported, and the UI layout is configurable (thanks to GoldenLayout). There is also an ispc compiler ? for a C variant with extensions for SPMD.
- https://github.com/mattgodbolt/compiler-explorer

https://en.wikipedia.org/wiki/Object_file - a file containing object code, meaning relocatable format machine code that is usually not directly executable. There are various formats for object files, and the same object code can be packaged in different object files. An object file also works like an Application Extension (.dll). In addition to the object code itself, object files may contain metadata used for linking or debugging, including: information to resolve symbolic cross-references between different modules, relocation information, stack unwinding information, comments, program symbols, debugging or profiling information.

https://en.wikipedia.org/wiki/Dynamic_compilation

Assembler

https://en.wikipedia.org/wiki/Assembler_(computing) - creates object code by translating combinations of mnemonics and syntax for operations and addressing modes into their numerical equivalents. This representation typically includes an operation code ("opcode") as well as other control bits. The assembler also calculates constant expressions and resolves symbolic names for memory locations and other entities. The use of symbolic references is a key feature of assemblers, saving tedious calculations and manual address updates after program modifications.

https://en.wikipedia.org/wiki/Inline_assembler - a feature of some compilers that allows very low level code written in assembly to be embedded in a high level language like C or Ada.

https://en.wikipedia.org/wiki/High-level_assembler

to sort

https://en.wikipedia.org/wiki/Compiler-compiler - or compiler generator is a programming tool that creates a parser, interpreter, or compiler from some form of formal description of a language and machine. The earliest and still most common form of compiler-compiler is a parser generator, whose input is a grammar (usually in BNF) of a programming language, and whose generated output is the source code of a parser often used as a component of a compiler.

The ideal compiler-compiler takes a description of a programming language and a target instruction set architecture, and automatically generates a usable compiler from them. In practice, the state of the art has yet to reach this degree of sophistication and most compiler generators are not capable of handling semantic or target architecture information.

https://en.wikipedia.org/wiki/Comparison_of_parser_generators

https://en.wikipedia.org/wiki/GNU_bison

https://en.wikipedia.org/wiki/Compile_and_go_system

https://en.wikipedia.org/wiki/Metacompiler - a software development tool used chiefly in the construction of compilers, translators, and interpreters for other programming languages. They are a subset of a specialized class of compiler writing tools called compiler-compilers that employ metaprogramming languages.

https://lobste.rs/s/0cv1pu/reductio_exploration_code

Linking

https://en.wikipedia.org/wiki/Link_time

https://en.wikipedia.org/wiki/Linker_(computing)

https://en.wikipedia.org/wiki/Relocation_(computing) - the process of assigning load addresses to various parts of a program and adjusting the code and data in the program to reflect the assigned addresses. A linker usually performs relocation in conjunction with symbol resolution, the process of searching files and libraries to replace symbolic references or names of libraries with actual usable addresses in memory before running a program. Relocation is typically done by the linker at link time, but it can also be done at run time by a relocating loader, or by the running program itself. Some architectures avoid relocation entirely by deferring address assignment to run time; this is known as zero address arithmetic.

https://en.wikipedia.org/wiki/Runtime_library - a set of low-level routines used by a compiler to invoke some of the behaviors of a runtime environment, by inserting calls to the runtime library into compiled executable binary. The runtime environment implements the execution model, built-in functions, and other fundamental behaviors of a programming language. During execution (run time) of that computer program, execution of those calls to the runtime library cause communication between the executable binary and the runtime environment. A runtime library often includes built-in functions for memory management or exception handling. Therefore, a runtime library is always specific to the platform and compiler.

https://en.wikipedia.org/wiki/Strip_(Unix)

http://www.kix.in/2008/06/19/an-alternative-to-shared-libraries/ [85]

Software

pcc

https://en.wikipedia.org/wiki/Portable_C_Compiler - an early compiler for the C programming language written by Stephen C. Johnson of Bell Labs in the mid-1970s, based in part on ideas proposed by Alan Snyder in 1973,[2][3] and "distributed as the C compiler by Bell Labs... with the blessing of Dennis Ritchie."

One of the first compilers that could easily be adapted to output code for different computer architectures, the compiler had a long life span. It debuted in Seventh Edition Unix and shipped with BSD Unix until the release of 4.4BSD in 1994, when it was replaced by the GNU C Compiler. It was very influential in its day, so much so that at the beginning of the 1980s, the majority of C compilers were based on it. Anders Magnusson and Peter A Jonsson restarted development of pcc in 2007, rewriting it extensively to support the C99 standard.

gcc

GNU Compiler Collection includes front ends for C, C++, Objective-C, Fortran, Java, Ada, and Go, as well as libraries for these languages (libstdc++, libgcj,...). GCC was originally written as the compiler for the GNU operating system. The GNU system was developed to be 100% free software, free in the sense that it respects the user's freedom.
- http://en.wikipedia.org/wiki/GNU_Compiler_Collection

https://gcc.gnu.org/wiki/History

https://gcc.gnu.org/egcs-1.1/

http://v3.sk/~lkundrak/dev86/

http://unix.stackexchange.com/questions/165105/packages-to-rebuild-after-upgrading-gcc-on-gentoo-systems

LLVM

LLVM is a collection of modular and reusable compiler and toolchain technologies. Despite its name, LLVM has little to do with traditional virtual machines, though it does provide helpful libraries that can be used to build them. The name "LLVM" itself is not an acronym; it is the full name of the project.
- https://idea.popcount.org/2013-07-24-ir-is-better-than-assembly/

Emscripten is an LLVM to JavaScript compiler. It takes LLVM bitcode (which can be generated from C/C++ using Clang, or any other language that can be converted into LLVM bitcode) and compiles that into JavaScript, which can be run on the web (or anywhere else JavaScript can run).

Clang

http://en.wikipedia.org/wiki/Clang - compiler front end for the C, C++, Objective-C, Objective-C++, OpenCL and CUDA programming languages. It uses LLVM as its back end and has been part of the LLVM release cycle since LLVM 2.6.

It is designed to offer a complete replacement to the GNU Compiler Collection (GCC). It is open-source,[4] and its contributors include Apple, Microsoft, Google, ARM, Sony, Intel and AMD. Its source code is available under the University of Illinois/NCSA License, a permissive free software licence.

Other

Parrot is a virtual machine designed to efficiently compile and execute bytecode for dynamic languages. Parrot currently hosts a variety of language implementations in various stages of completion, including Tcl, Javascript, Ruby, Lua, Scheme, PHP, Python, Perl 6, APL, and a .NET bytecode translator. Parrot is not about parrots, though we are rather fond of them for obvious reasons.
- http://en.wikipedia.org/wiki/Parrot_virtual_machine

http://honza.ca/2013/05/building-a-lisp-to-javascript-compiler

http://compilers.iecc.com/crenshaw/ [86]

https://en.wikipedia.org/wiki/Abstract_syntax_tree

http://www.johndcook.com/blog/2013/05/16/extreme-syntax/

https://en.wikipedia.org/wiki/Parsing#Computer_languages

http://dinosaur.compilertools.net/

https://en.wikipedia.org/wiki/Yacc - a computer program for the Unix operating system. It is a LALR parser generator, generating a parser, the part of a compiler that tries to make syntactic sense of the source code, specifically a LALR parser, based on an analytic grammar written in a notation similar to BNF. Yacc itself used to be available as the default parser generator on most Unix systems, though it has since been supplanted as the default by more recent, largely compatible, programs.

XMLVM is to offer a flexible and extensible cross-compiler toolchain. Instead of cross-compiling on a source code level, XMLVM cross-compiles byte code instructions from Sun Microsystem's virtual machine and Microsoft's Common Language Runtime. The benefit of this approach is that byte code instructions are easier to cross-compile and the difficult parsing of a high-level programming language is left to a regular compiler. In XMLVM, byte code-based programs are represented as XML documents. This allows manipulation and translation of XMLVM-based programs using advanced XML technologies such as XSLT, XQuery, and XPath.

http://danluu.com/edit-binary/ [87]

http://jvns.ca/blog/2014/09/06/how-to-read-an-executable/ [88]

http://bellard.org/tcc/

https://github.com/thejameskyle/the-super-tiny-compiler [89]

https://github.com/elfet/ultra-tiny-compiler [90]

https://gcc.godbolt.org [91]

http://www.imatix.com/
- http://legacy.imatix.com/html/smt/

https://github.com/imatix/gsl GSL/4.1 is a code construction tool. It will generate code in all languages and for all purposes. If this sounds too good to be true, welcome to 1996, when we invented these techniques. Magic is simply technology that is twenty years ahead of its time. In addition to code construction, GSL has been used to generate database schema definitions, user interfaces, reports, system administration tools and much more. [92]

http://hintjens.com/blog:75

https://news.ycombinator.com/item?id=11823949

https://github.com/avast-tl/retdec
- https://blog.avast.com/avast-open-sources-its-machine-code-decompiler [93]

Programming

General

Syntax

Paradigms

Imperative

Recursive

Object orientated

Declarative

Dataflow

Constraint

Logic

Functional

Values and types

Type systems

Machine data

Boolean

Numbers

Floating point

to sort

Data structures

Mutability

Evaluation

Operators

Functions

Polymorphism

Control structures

Algorithms

Libraries

System calls

Macros

Metaprogramming

Monads

Events

Messaging

Concurrency

Futures and promises

Parallelism

Garbage collection

Patterns

MV*

Push/pull

GUI

Compile and link time

Compiler

Assembler

to sort

Linking

Software

pcc

gcc

LLVM

Clang

Other

Navigation menu

Search