This article relies excessively on references to primary sources. Please improve this article by adding secondary or tertiary sources. Find sources: "Probabilistic programming" – news · newspapers · books · scholar · JSTOR (December 2014) (Learn how and when to remove this message)

Probabilistic programming (PP) is a programming paradigm in which probabilistic models are specified and inference for these models is performed automatically.^[1] It represents an attempt to unify probabilistic modeling and traditional general purpose programming in order to make the former easier and more widely applicable.^[2]^[3] It can be used to create systems that help make decisions in the face of uncertainty.

Programming languages used for probabilistic programming are referred to as "probabilistic programming languages" (PPLs).

Applications

Probabilistic reasoning has been used for a wide variety of tasks such as predicting stock prices, recommending movies, diagnosing computers, detecting cyber intrusions and image detection.^[4] However, until recently (partially due to limited computing power), probabilistic programming was limited in scope, and most inference algorithms had to be written manually for each task.

Nevertheless, in 2015, a 50-line probabilistic computer vision program was used to generate 3D models of human faces based on 2D images of those faces. The program used inverse graphics as the basis of its inference method, and was built using the Picture package in Julia.^[4] This made possible "in 50 lines of code what used to take thousands".^[5]^[6]

The Gen probabilistic programming library (also written in Julia) has been applied to vision and robotics tasks.^[7]

More recently, the probabilistic programming system Turing.jl has been applied in various pharmaceutical^[8] and economics applications.^[9]

Probabilistic programming in Julia has also been combined with differentiable programming by combining the Julia package Zygote.jl with Turing.jl. ^[10]

Probabilistic programming languages are also commonly used in Bayesian cognitive science to develop and evaluate models of cognition. ^[11]

Probabilistic programming languages

PPLs often extend from a basic language. For instance, Turing.jl^[12] is based on Julia, Infer.NET is based on .NET Framework,^[13] while PRISM extends from Prolog.^[14] However, some PPLs, such as WinBUGS, offer a self-contained language that maps closely to the mathematical representation of the statistical models, with no obvious origin in another programming language.^[15]^[16]

The language for WinBUGS was implemented to perform Bayesian computation using Gibbs Sampling and related algorithms. Although implemented in a relatively unknown programming language (Component Pascal), this language permits Bayesian inference for a wide variety of statistical models using a flexible computational approach. The same BUGS language may be used to specify Bayesian models for inference via different computational choices ("samplers") and conventions or defaults, using a standalone program WinBUGS (or related R packages, rbugs and r2winbugs) and JAGS (Just Another Gibbs Sampler, another standalone program with related R packages including rjags, R2jags, and runjags). More recently, other languages to support Bayesian model specification and inference allow different or more efficient choices for the underlying Bayesian computation, and are accessible from the R data analysis and programming environment, e.g.: Stan, NIMBLE and NUTS. The influence of the BUGS language is evident in these later languages, which even use the same syntax for some aspects of model specification.

Several PPLs are in active development, including some in beta test. Two popular tools are Stan and PyMC.^[17]

Relational

A probabilistic relational programming language (PRPL) is a PPL specially designed to describe and infer with probabilistic relational models (PRMs).

A PRM is usually developed with a set of algorithms for reducing, inference about and discovery of concerned distributions, which are embedded into the corresponding PRPL.

Probabilistic logic programming

Main article: Probabilistic logic programming

Probabilistic logic programming is a programming paradigm that extends logic programming with probabilities.

Most approaches to probabilistic logic programming are based on the distribution semantics, which splits a program into a set of probabilistic facts and a logic program. It defines a probability distribution on interpretations of the Herbrand universe of the program.^[18]

List of probabilistic programming languages

This list summarises the variety of PPLs that are currently available, and clarifies their origins.

This article may contain an excessive amount of intricate detail that may interest only a particular audience. Please help by spinning off or relocating any relevant information, and removing excessive detail that may be against Wikipedia's inclusion policy. (October 2019) (Learn how and when to remove this message)

Name	Extends from	Host language
Analytica^[19]		C++
bayesloop^[20]^[21]	Python	Python
Bean Machine^[22]	PyTorch	Python
Venture^[23]	Scheme	C++
BayesDB^[24]	SQLite, Python
PRISM^[14]	B-Prolog
Infer.NET^[13]	.NET Framework	.NET Framework
diff-SAT^[25]	Answer set programming, SAT (DIMACS CNF)
PSQL^[26]	SQL
BUGS^[15]		Component Pascal
Dyna^[27]	Prolog
Figaro^[28]	Scala	Scala
ProbLog^[29]	Prolog	Python
ProBT^[30]	C++, Python
Stan^[16]	BUGS	C++
Hakaru^[31]	Haskell	Haskell
BAli-Phy (software)^[32]	Haskell	C++
ProbCog^[33]		Java, Python
PyMC^[34]	Python	Python
Rainier^[35]^[36]	Scala	Scala
greta^[37]	TensorFlow	R
pomegranate^[38]	Python	Python
Lea^[39]	Python	Python
WebPPL^[40]	JavaScript	JavaScript
Picture^[4]	Julia	Julia
Turing.jl^[12]	Julia	Julia
Gen^[41]	Julia	Julia
Edward^[42]	TensorFlow	Python
TensorFlow Probability^[43]	TensorFlow	Python
Edward2^[44]	TensorFlow Probability	Python
Pyro^[45]	PyTorch	Python
NumPyro^[46]	JAX	Python
Birch^[47]		C++
PSI^[48]		D
Blang^[49]
MultiVerse^[50]	Python	Python

Difficulty

Reasoning about variables as probability distributions causes difficulties for novice programmers, but these difficulties can be addressed through use of Bayesian network visualisations and graphs of variable distributions embedded within the source code editor.^[51]

Notes

External links

Programming paradigms (Comparison by language)

Imperative

Structured	Jackson structures Block-structured Modular Non-structured Procedural Programming in the large and in the small Design by contract Invariant-based Nested function
Object-oriented (comparison, list)	Class-based, Prototype-based, Object-based Agent Immutable object Persistent Uniform Function Call Syntax

Declarative

Functional (comparison)	Recursive Anonymous function (Partial application) Higher-order Purely functional Total Strict GADTs Dependent types Functional logic Point-free style Expression-oriented Applicative, Concatenative Function-level, Value-level
Dataflow	Flow-based Reactive (Functional reactive) Signals Streams Synchronous
Logic	Abductive logic Answer set Constraint (Constraint logic) Inductive logic Nondeterministic Ontology Probabilistic logic Query
DSL	Algebraic modeling Array Automata-based (Action) Command (Spacecraft) Differentiable End-user Grammar-oriented Interface description Language-oriented List comprehension Low-code Modeling Natural language Non-English-based Page description Pipes and filters Probabilistic Quantum Scientific Scripting Set-theoretic Simulation Stack-based System Tactile Templating Transformation (Graph rewriting, Production, Pattern) Visual