Statistical Pattern Recognition

 

Introduction

     A reviewer of the book Fundamentals of Pattern Recognition in Science suggested that a better title

might be Statistical Pattern Recognition. The author would agree that such is a good alternative title for the

book, and also that the name Statistical Pattern Recognition (SPR) well describes a discipline that has

evolved over the last 30+ years.

 

     Statistical Pattern Recognition involves:         

          Learning conditional probability density functions.

          Likelihood functions.

          Classical statistics.

          Decision making:

               Minimum probability of error.

               Minimum risk.

               Maximum likelihood.

               Nearest neighbor decision rule.

               Generalized Nearest Neighbor Rule.

          Bayes Theorem.

          Patrick's Theorem.           

          Clustering

          Supervised learning (learning with a teacher)

          Unsupervised learning (learning without a teacher)

          Expert knowledge.

          Feature Extraction:

               Basic functions.

          Feature Selection.         

 

 

 

 

 Decision Making

             Given that there are M mutually exclusive and exhaustive events B1, B2, ....,BM considered

                    outcomes or decisions, Bayes Theorem computes the probability of any of these events Bi,

                    conditioned on (given) an event A called the findings. This conditional probability, p(Bi|A) is

                    called the a posteriori probability ("after the fact" or after the findings), and is computed in

                    terms of an a priori probability ("before the fact" or "before the findings") p(Bi), the

                    probability of the findings p(A), and an expression p(A|Bi) which also. is called a likelihood

                    function (the probability of the findings given the event Bi.). That is,

 

                                                          p(A/Bi) * p(Bi)                               p(A/Bi) * p(Bi)

                        p(Bi|A)      =           ------------------                     ---------------------------                                                                                                               M

                                                                 p(A)                                      Σ p(A|Bj)*p(Bj)

                                                                                                               j =1

                  Minimum Probability of Error Decision Rule

                       A classic decision rule in SPR is to make a decision that minimizes probability of error.

                  Given M categories B1, B2, ....,BM and findings A, the decision that minimizes probability of

                  error in deciding the correct category is:

 

                       Decide category  Bj such that:

 

                            p(Bj|A)     =    Max{p(B1|A), p(B2|A),.....p(BM|A)}

 

 

         

                   Maximum Likelihood Decision Rule

                        A classic decision rule in SPR is to make a decision that maximizes the likelihood.

                  Given M categories B1, B2, ....,BM and findings A, the decision that maximizes likelihood

                  of deciding the correct category given findings A is:

                 

                       Decide category  Bj such that:

 

                            p(A/Bj)     =    Max{p(A/B1), p(A/B2),.....p(A/BM)}