Patrick's Theorem

 

Preliminaries

     Patrick's theorem is a theorem on a posteriori probabilities that will allow for multiple

classes active at the same time. As defined in Bayes Theorem, the category space Γ consists

of M mutually exclusive and exhaustive categories γ1*, γ2*, ....γM*. In addition, define a

feature space Χ with x є Χ. There exists category-feature relationships p(x|γi* ), i = 1 to M

as defined in Bayes Framework created by Patrick but not present in Bayes Theorem.  

 

     A departure from Bayes now begins with the definition of a class space Ώ:

 

                                             Ώ: ώ1, ώ2, ...., ώM1 .

 

The category space Γ and class space  Ώ must be considered separate, although there will be

components from the respective spaces which are the same. For example, the two spaces will

have certain features in common. As another example, a class in Ώ is a logical union of all

categories in Γ which "contain" the class. Even more important are certain properties of categories

in Γ which exist for Patrick's theorem but not for Bayes theorem. For Patrick's theorem, categories

γ1*, γ2*, ....γMare mutually exclusive but there can exist relationships among the

category conditional probability density functions. These relationships constitute knowledge in terms

of significant and insignificant features of each category. Significance or insignificance is

defied in terms of probability or relative frequency  This relative frequency itself becomes part   

of the knowledge base. It is used to provide knowledge which relates some of the category-

conditional probability density functions. The models relating certain category conditional probability

density functions may be considered in another space, say Μ, distinct from the class space Ώ,

category space Γ, feature space Ғ, or cross-product space ΏҒ and  Γ x Ғ. The space Μ,

called the Model Space, contains the "glue" relating classes, only classes, and complex classes.

This "glue can be functional relationships, logical relationships, logical relationships involving

probability weights, and so on.

 

 

     Likelihood (Patrick)

                                                     p (xi*)                                                  p(Ώξ*)

                p (xi)  =  p (xi*)      --------                +          Σ p(x | Ώξ*)   -------

                                                     p (xi)                        ώi є  Ώξ*            p (xi)

 

     Theorem on A Posteriori Class Probabilities (Patrick)

 

 

Two Class Example

     It is helpful to consider a two class problem where the two classes are denoted ώ1  and  ώ2 .

But these classes are "intermediate concepts," not themselves in the decision space. What are in

the decision space are three categories defined:

                                                                    

                                                  ώ1*  =  (ώ1,ώ2) :        means class ώ1   but not class ώ2

 

                                                                   ώ2*  =  (ώ2,ώ1) :        means class ώ2   but not class ώ1

 

                                                                   Ώ12*  =   12):       means class ώ1 and class ώ2.

 

The concepts ώ1*  and  ώ2* are called only classes, while the concept  Ώ12*  is called a complex

class. The superscript * denotes a category.

 

     Categories ώ1* , ώ2* , and  Ώ12* are mutually exclusive as in Bayes Theorem. We will show

how to learn the concept  Ώ12  by discovery without a teacher. During the learning process

there are statistical dependencies among the categories, a property not present in Bayes

Theorem.