Patrick's Theorem
Preliminaries
Patrick's theorem is a theorem on a posteriori probabilities that will allow for multiple
classes active at the same time. As defined in Bayes Theorem, the category space Γ consists
of M mutually exclusive and exhaustive categories γ1*, γ2*, ....γM*. In addition, define a
feature space Χ with x є Χ. There exists category-feature relationships p(x|γi* ), i = 1 to M
as defined in Bayes Framework created by Patrick but not present in Bayes Theorem.
A departure from Bayes now begins with the definition of a class space Ώ:
Ώ: ώ1, ώ2, ...., ώM1 .
The category space Γ and class space Ώ must be considered separate, although there will be
components from the respective spaces which are the same. For example, the two spaces will
have certain features in common. As another example, a class in Ώ is a logical union of all
categories in Γ which "contain" the class. Even more important are certain properties of categories
in Γ which exist for Patrick's theorem but not for Bayes theorem. For Patrick's theorem, categories
γ1*, γ2*, ....γM* are mutually exclusive but there can exist relationships among the
category conditional probability density functions. These relationships constitute knowledge in terms
of significant and insignificant features of each category. Significance or insignificance is
defied in terms of probability or relative frequency This relative frequency itself becomes part
of the knowledge base. It is used to provide knowledge which relates some of the category-
conditional probability density functions. The models relating certain category conditional probability
density functions may be considered in another space, say Μ, distinct from the class space Ώ,
category space Γ, feature space Ғ, or cross-product space Ώ x Ғ and Γ x Ғ. The space Μ,
called the Model Space, contains the "glue" relating classes, only classes, and complex classes.
This "glue can be functional relationships, logical relationships, logical relationships involving
probability weights, and so on.
Likelihood (Patrick)
p (x|ώi*) p(Ώξ*)
p (x|ώi) = p (x|ώi*) -------- + Σ p(x | Ώξ*) -------
p (x|ώi) ώi є Ώξ* p (x|ώi)
Theorem on A Posteriori Class Probabilities (Patrick)
Two Class Example
It is helpful to consider a two class problem where the two classes are denoted ώ1 and ώ2 .
But these classes are "intermediate concepts," not themselves in the decision space. What are in
the decision space are three categories defined:
ώ1* = (ώ1,ώ2) : means class ώ1 but not class ώ2
ώ2* = (ώ2,ώ1) : means class ώ2 but not class ώ1
Ώ12* = (ώ1,ώ2): means class ώ1 and class ώ2.
The concepts ώ1* and ώ2* are called only classes, while the concept Ώ12* is called a complex
class. The superscript * denotes a category.
Categories ώ1* , ώ2* , and Ώ12* are mutually exclusive as in Bayes Theorem. We will show
how to learn the concept Ώ12* by discovery without a teacher. During the learning process
there are statistical dependencies among the categories, a property not present in Bayes
Theorem.