January 26, 2025
An improvement over Shannon entropy
Shannon entropy is defined for measures, not random variables. It is a common mistake to define the entropy of a discrete random variable , where the probability takes on the value with probability , as . Note this definition of entropy does not involve any .
It is an even worse mistake to define the entropy of a random variable with density as . It is true that every random variable with cumulative distribution can be modelled as the identity function on the sample space of real numbers with probablility measure .
…
Joint distributions. Negative entropy.
Logical entropy can incorporate values or random variables…
Boole and De Morgan layed down the foundation for computer science by showing predicate calculus can be reduced to arithmetic.
Frege introduced the notion of parameterizing a predicate with a variable.
Frege advanced the predicate calculus of logic developed by Boole and De Morgan by
A set is a collection of elements that are members of the set.
.
A relation on sets and is a subset of their cartesian product . The domain of is and the codomain of is . We write and when . The codset and the domset . If is a relation on sets and , then the composition is a relation on where if and only if there is a with and . This is used to define the join of relational database tables.
A relation is a function if the codset has exactly one element for every . To indicate this we write and where is the unique element of . A function is injective, or one-to-one, if has at most one element for all .
Exercise. If is injective and are functions where then .
A function is surjective, or onto, if for all . There is some with .
Exercise. If is surjective and are functions where then .
A part of a set is an injective function . It corresponds to the subset .
A partition of a set is a surjective function . Let .
Exercise. Show either or for .
Exercise. Show .
Partitions are a mathematically rigorous way of representing partial information. Given a set , full information is the partition of singletons , no information is , partial information is knowing only which in the partition that belongs to.