The term "information algebra" refers to mathematical techniques of information processing. Classical information theory goes back to Claude Shannon. It is a theory of information transmission, looking at communication and storage. However, it has not been considered so far that information comes from different sources and that it is therefore usually combined. It has furthermore been neglected in classical information theory that one wants to extract those parts out of a piece of information that are relevant to specific questions.
A mathematical phrasing of these operations leads to an algebra of information, describing basic modes of information processing. Such an algebra involves several formalisms of computer science, which seem to be different on the surface: relational databases, multiple systems of formal logic or numerical problems of linear algebra. It allows the development of generic procedures of information processing and thus a unification of basic methods of computer science, in particular of distributed information processing.
Information relates to precise questions, comes from different sources, must be aggregated, and can be focused on questions of interest. Starting from these considerations, information algebras (Kohlas 2003) are two-sorted algebras :
Where is a semigroup, representing combination or aggregation of information, and
is a lattice of domains (related to questions) whose partial order reflects the granularity of the domain or the question, and a mixed operation representing focusing or extraction of information.
Information and its operations
More precisely, in the two-sorted algebra , the following operations are defined
Combination
Focusing
Additionally, in the usual lattice operations (meet and join) are defined.
Axioms and definition
The axioms of the two-sorted algebra , in addition to the axioms of the lattice :
Semigroup
is a commutative semigroup under combination with a neutral element (representing vacuous information).
Distributivity of Focusing over Combination
To focus an information on combined with another information to domain , one may as well first focus the second information to and then combine.
Transitivity of Focusing
To focus an information on and , one may focus it to .
Idempotency
An information combined with a part of itself gives nothing new.
Support
such that
Each information refers to at least one domain (question).
A two-sorted algebra satisfying these axioms is called an Information Algebra.
Order of information
A partial order of information can be introduced by defining if . This means that is less informative than if it adds no new information to . The semigroup is a semilattice relative to this order, i.e. . Relative to any domain (question) a partial order can be introduced by defining if . It represents the order of information content of and relative to the domain (question) .
Labeled information algebra
The pairs , where and such that form a labeled Information Algebra. More precisely, in the two-sorted algebra , the following operations are defined
Labeling
Combination
Projection
Models of information algebras
Here follows an incomplete list of instances of information algebras:
Relational algebra: The reduct of a relational algebra with natural join as combination and the usual projection is a labeled information algebra, see Example.
Constraint systems: Constraints form an information algebra (Jaffar & Maher 1994).
Let be a set of symbols, called attributes (or column names). For each let be a non-empty set, the set of all possible values of the attribute . For example, if
, then could
be the set of strings, whereas and are both the set of non-negative integers.
Let . An -tuple is a function so that
and for each The set
of all -tuples is denoted by . For an -tuple and a subset
the restriction is defined to be the
-tuple so that for all .
A relation over is a set of -tuples, i.e. a subset of .
The set of attributes is called the domain of and denoted by
. For the projection of onto is defined
as follows:
The join of a relation over and a relation over is
defined as follows:
As an example, let and be the following relations:
Then the join of and is:
A relational database with natural join as combination and the usual projection is an information algebra.
The operations are well defined since
If , then .
It is easy to see that relational databases satisfy the axioms of a labeled
information algebra:
semigroup
and
transitivity
If , then .
combination
If and , then .
idempotency
If , then .
support
If , then .
Connections
This section needs expansion. You can help by adding to it. (March 2014)
Information algebras introduce semantics by relating information to questions through focusing and combination (Groenendijk & Stokhof 1984);(Floridi 2004).
Information flow
Information algebras are related to information flow, in particular classifications (Barwise & Seligman 1997).
Tree decomposition
Information algebras are organized into a hierarchical tree structure, and decomposed into smaller problems.
Extended axiomatic foundations of information and valuation algebras
The concept of conditional independence is basic for information algebras and a new axiomatic foundation of information algebras, based on conditional independence, extending the old one (see above) is available: https://arxiv.org/abs/1701.02658
Historical Roots
The axioms for information algebras are derived from
the axiom system proposed in (Shenoy and Shafer, 1990), see also (Shafer, 1991).
References
Barwise, J.; Seligman, J. (1997), Information Flow: The Logic of Distributed Systems, Cambridge U.K.: Number 44 in Cambridge Tracts in Theoretical Computer Science, Cambridge University Press
Bergstra, J.A.; Heering, J.; Klint, P. (1990), "Module algebra", Journal of the ACM, 73 (2): 335–372, doi:10.1145/77600.77621, S2CID7910431
de Lavalette, Gerard R. Renardel (1992), "Logical semantics of modularisation", in Egon Börger; Gerhard Jäger; Hans Kleine Büning; Michael M. Richter (eds.), CSL: 5th Workshop on Computer Science Logic, Volume 626 of Lecture Notes in Computer Science, Springer, pp. 306–315, ISBN978-3-540-55789-0
Groenendijk, J.; Stokhof, M. (1984), Studies on the Semantics of Questions and the Pragmatics of Answers, PhD thesis, Universiteit van Amsterdam
Haenni, R.; Kohlas, J.; Lehmann, N. (2000), "Probabilistic argumentation systems"(PDF), in J. Kohlas; S. Moral (eds.), Handbook of Defeasible Reasoning and Uncertainty Management Systems, Dordrecht: Volume 5: Algorithms for Uncertainty and Defeasible Reasoning, Kluwer, pp. 221–287, archived from the original on January 25, 2005
Jaffar, J.; Maher, M. J. (1994), "Constraint logic programming: A survey", Journal of Logic Programming, 19/20: 503–581, doi:10.1016/0743-1066(94)90033-7
Kohlas, J. (2003), Information Algebras: Generic Structures for Inference, Springer-Verlag, ISBN978-1-85233-689-9
Kohlas, J.; Shenoy, P.P. (2000), "Computation in valuation algebras", in J. Kohlas; S. Moral (eds.), Handbook of Defeasible Reasoning and Uncertainty Management Systems, Volume 5: Algorithms for Uncertainty and Defeasible Reasoning, Dordrecht: Kluwer, pp. 5–39
Larsen, K. G.; Winskel, G. (1984), "Using information systems to solve recursive domain equations effectively", in Gilles Kahn; David B. MacQueen; Gordon D. Plotkin (eds.), Semantics of Data Types, International Symposium, Sophia-Antipolis, France, June 27–29, 1984, Proceedings, vol. 173 of Lecture Notes in Computer Science, Berlin: Springer, pp. 109–129
Lauritzen, S. L.; Spiegelhalter, D. J. (1988), "Local computations with probabilities on graphical structures and their application to expert systems", Journal of the Royal Statistical Society, Series B, 50: 157–224
Pouly, Marc; Kohlas, Jürg (2011), Generic Inference: A Unifying Theory for Automated Reasoning, John Wiley & Sons, ISBN978-1-118-01086-0
Scott, Dana S. (1970), Outline of a mathematical theory of computation, Technical Monograph PRG–2, Oxford University Computing Laboratory, Programming Research Group
Scott, D.S. (1982), "Domains for denotational semantics", in M. Nielsen; E.M. Schmitt (eds.), Automata, Languages and Programming, Springer, pp. 577–613
Shafer, G. (1991), An axiomatic study of computation in hypertrees, Working Paper 232, School of Business, University of Kansas
Shenoy, P. P.; Shafer, G. (1990). "Axioms for probability and belief-function proagation". In Ross D. Shachter; Tod S. Levitt; Laveen N. Kanal; John F. Lemmer (eds.). Uncertainty in Artificial Intelligence 4. Vol. 9. Amsterdam: Elsevier. pp. 169–198. doi:10.1016/B978-0-444-88650-7.50019-6. hdl:1808/144. ISBN978-0-444-88650-7. ((cite book)): |journal= ignored (help)
Wilson, Nic; Mengin, Jérôme (1999), "Logical deduction using the local computation framework", in Anthony Hunter; Simon Parsons (eds.), Symbolic and Quantitative Approaches to Reasoning and Uncertainty, European Conference, ECSQARU'99, London, UK, July 5–9, 1999, Proceedings, volume 1638 of Lecture Notes in Computer Science, Springer, pp. 386–396, ISBN978-3-540-66131-3