US5181259A - General method of pattern classification using the two domain theory - Google Patents

Publication number
US5181259A
Authority
US
United States
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
US07/587,922
Inventor
Mark E. Rorvig
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
National Aeronautics and Space Administration NASA
Original Assignee
National Aeronautics and Space Administration NASA
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by National Aeronautics and Space Administration NASA
Priority to US07/587,922
Assigned to NATIONAL AERONAUTICS AND SPACE ADMINISTRATION, THE UNITED STATES OF AMERICA AS REPRESENTED BY THE ADMINISTRATOR OF THE (assignment of assignors interest). Assignors: RORVIG, MARK E.
Application granted
Publication of US5181259A

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00 Pattern recognition
    • G06F18/20 Analysing
    • G06F18/21 Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation

Definitions

  • the present invention relates to a method for automatic classification of a collection of patterns which uses the judgments of human experts on a plurality of sample patterns to organize the collection into sets of similar patterns.
  • the present invention relates to a method for the automatic classification of a collection of patterns, such as image patterns, which uses the so-called "Two Domain Theory" of pattern classification.
  • Pattern classification by computational devices is usually approached in two phases.
  • the first, a so-called "training" phase, is the specification by an expert of pattern exemplars representing the classes as a training set.
  • in the classification phase, pattern features extracted from the target pattern population are joined with the features similarly extracted from the specified exemplars.
  • information from the expert must often be encoded as specific programs for identification and matching, thus restricting the applicable domain of the algorithm.
  • the Fisher linear discriminant where neither the features of the exemplar nor the domain features of the target population of images need be exactly specified, suffers from the noise introduced in exemplars when the expert makes judgments on only a few features of a multi-featured pattern.
  • the principal object of the present invention is to provide a method of pattern classification which requires neither explicit decoding of expert judgments nor domain specific feature matching and which, further, removes from consideration the noise introduced in the Fisher method.
  • the Two Domain Method according to the present invention comprises the steps of:
  • the collection C of patterns is organized into sets of similar patterns using the judgments of human experts on the set of sample patterns.
  • the comparing step includes the steps of manually marking a line, for each pair of sample patterns, which indicates on an arbitrary scale, from dissimilar to similar, the degree of similarity of each pair, and then sensing the line to produce a signal representative of the position of the mark on the line.
  • the step of processing the signal S includes the steps of producing a histogram for each of the primitive features and then converting the feature histograms for each pattern into Lorenz information measures.
  • the calculating step thus preferably includes the step of calculating the Euclidean distance among pairs of the patterns over the Lorenz information measures to produce the matrix M.
  • the step of creating a mapping includes the step of creating a linear mapping of the ordering Φ on a matrix M by regressing the ordering Φ with the sample of matrix M corresponding to the sample manually compared to obtain a matrix of weights β by multiple regression and multiplying the matrix M by the matrix β. Thereafter, the results of the matrix multiplication are submitted to multi-dimensional scaling to produce the final ordering Φ', consisting of patterns segregated into classes in an n-dimensional space.
  • multi-dimensional scaling refers to a technique described by F. W. Young and R. M. Hamer in MultiDimensional Scaling: History, Theory and Applications, Lawrence Erlbaum Associates, Publishers; Hillsdale, N.J. and London (1987).
  • multi-dimensional scaling refers to a family of data analysis methods, all of which portray the data structure in a spatial fashion easily assimilated by the relatively untrained human eye. They construct a geometric representation of the data, usually in a Euclidean space of fairly low dimensionality. The essential ingredient found in all multi-dimensional scaling methods is the spatial representation of data structure.
  • an attribute corresponds to the straight line (a unidimensional space), and the quantity of this attribute to a point on the line
  • the attribute corresponds to an n-dimensional space, and the quantity to a point in that space.
  • the process of assigning numbers in unidimensional measurement corresponds to the location of points on a line, in terms of the order of points, their distance from one another, and/or their distances from an origin, so, in multi-dimensional scaling, the process of assigning numbers corresponds to locating the points in a multidimensional space, in terms of a set of relations between the points as specified by the particular geometrical model.
  • each image in this collection has been digitized and processed so as to extract a number of general, primitive features rendered as histograms.
  • six features are extracted: grey level, edge intensity, edge slope, line length, line distance from the origin, and angle distance from the origin.
  • a matrix, denoted M, of primitive machine image interpretations may be produced. In this manner, the complex problem of image classification is reduced to the far simpler one of creating a linear mapping of Φ on M.
  • the mapping is performed by extracting from C the original machine measures matching the subset of C judged by the human expert, calculating Euclidean distances for both machine measurements and human coordinates, deriving weights, β, by multiple regression (where the Euclidean distances from the MDS solution for the human judgments are the dependent variable and the Euclidean distances among images based on machine measurements are the independent variables), and multiplying M by β.
  • the final ordering is produced, consisting of patterns segregated into classes in an n-dimensional space. This last result is denoted as Φ'.
  • FIG. 1 is a detailed block diagram of the procedural steps of the Two Domain Method, according to the present invention, for classifying a collection of image patterns.
  • FIGS. 2 and 3 are multi-dimensional scaling (MDS ALSCAL) plots of the original human view of a sample of eight images (photographic slides) of peripheral white blood cells. The human judgments were collected through the method of paired comparisons, and show a clear separation between the slides from Subject 1 and Subject 2.
  • FIG. 4 is an MDS ALSCAL plot of the primitive machine views of a set A of sixteen slides (slides 1-16) from a photographic film, rated as ASA 200 and exposed at ASA 200, which includes both Subject 1 and Subject 2. This plot exhibits some natural clustering by machine features alone.
  • FIG. 5 is an MDS ALSCAL plot of the primitive machine views of a set B of sixteen slides (slides 17-32) from a photographic film, rated at ASA 200 but exposed at ASA 400, including both Subject 1 and Subject 2. This plot exhibits little machine differentiation between the two subjects.
  • FIG. 6 is an MDS ALSCAL plot of both slide sets A and B and Subjects 1 and 2. It exhibits distortion of the natural clustering effect displayed in set A of FIG. 4 when set A and set B are combined.
  • FIG. 7 is an MDS ALSCAL plot of slide sets A and B and Subjects 1 and 2. This plot exhibits the reordering of Subject 1 and Subject 2 classes when weighted by the human view displayed in FIG. 2.
  • FIGS. 8 and 9 are MDS ALSCAL plots (in numbered display) of both primitive and human weighted views of all 32 peripheral blood cell slides, corresponding to the datapoints shown in FIGS. 6 and 7, respectively.
  • FIG. 9 exhibits the substantial "learning" effect created by imposition of human judgments on machine interpretations.
  • FIG. 1 illustrates the preferred embodiment of the Two Domain Method according to the present invention as applied to a collection of images. Each numbered block in this figure represents a separate and distinct step of this method.
  • the collection of images is initially sensed by machine and converted to a format--in particular, a signal S representing the patterns--which is useable by a computer.
  • the signal S is then processed in block 1 to extract primitive image features as histograms of these features.
  • the features may be grey level, edge intensity, edge slope, line length, line distance from the origin, and angle distance from the origin.
  • the histogram for each image is converted into Lorenz information measures.
  • the Lorenz information measures associated with those images which are used for the expert, human judgments are extracted from the group for later use.
  • a set of sample images is selected at random from the collection of images.
  • the sample images are compared, in pairs, by a human expert to determine the degree of dissimilarity of each pair.
  • These expert judgments are then processed using conventional multi-dimensional scaling (MDS) techniques to produce a real-valued ordering Φ of these images by their dissimilarity, as indicated in block 7.
  • the geometric representation produced by the MDS process is converted to Euclidean distances which, in turn, are converted, in block 9, to a column vector.
  • the matrix M produced in block 4 is multiplied by the matrix of weights β from block 11.
  • the resulting vector is converted to an off-diagonal matrix in block 13 for submission to MDS in block 14.
  • the result of this MDS is the final ordering Φ'.
  • the Two Domain Method according to the invention will be applied to a problem of discriminating two populations of microscopic images of circulating human white blood cells (leukocytes).
  • the Two Domain Method has been tested for its power to discriminate two distinct patterns of human blood leukocyte distribution: An abnormal pattern associated with acute liver failure exhibiting abnormal circulating white blood cell frequency and distribution (Subject 1) and a normal pattern from a normal, healthy subject (Subject 2).
  • Circulating human leukocytes were separated by flotation from red blood cells by a standard flotation method, and uniform monolayer films prepared and cytochemically stained by a routine clinical laboratory automated instrument using hematoxylin and eosin dyes.
  • the resulting slides therefore included all nucleated circulating white blood cells, predominately neutrophils, eosinophils, lymphocytes and monocytes, as well as platelets.
  • the difference in exposure levels substantially alters the machine measurements of these images and is typical of problems that confound image pattern classification generally, in that "noise" introduced by one element or another distorts the machine classification algorithms.
  • the purpose of this application is thus to demonstrate that the Two Domain Method is sufficiently robust not only to classify properly Set A (by segregation in an n-dimensional space), but also to reduce or eliminate the noise artificially introduced by the difference in Set B film exposure levels.
  • FIGS. 2 and 3 which are MDS ALSCAL plots of this manual examination of slides 1-8, exhibit a strong separation between the cell populations of the two subjects.
  • the images represented by datapoints in FIG. 4 appear to have some natural clustering tendency along the same lines as those provided directly by human judgments, probably due to the increased light levels in the images produced from Subject 1 and caused by the generally lower levels of white blood cells in the sample drawn from that subject.
  • FIG. 6 reveals the strong confounding effect of the Set B data when combined with Set A and scaled together.
  • each item acts to influence the scale value of every other item, so that the pure machine view, or interpretation, of these images becomes extremely confused.
  • FIG. 7 shows the effect of the Two Domain Method on the disordered data of FIG. 6.
  • FIG. 7 was produced according to the procedures of FIG. 1 with the detailed calculations described below.
  • Subject 1 and Subject 2 data are perfectly segregated for Set A and, with the exception of one image, also perfectly segregated for Set B.
  • Set B the only, confounding effect introduced by combining Set B with Set A images is eliminated.
  • FIGS. 8 and 9 are MDS ALSCAL plots, in numbered display, of both the primitive and human weighted views of the thirty-two peripheral blood cell slides.
  • FIG. 9 exhibits the substantial "learning" effect created by imposition of the human judgments on the machine interpretations.
  • Q_k is a column of matrix Q
  • p is a matrix of 8 × 6,
  • p_ik is the machine measurement k for image i
  • p_jk is the machine measurement k for image j.
  • x_ik is the coordinate of image i on dimension k
  • x_jk is the coordinate of image j on dimension k
  • r is the number of dimensions in the solution.
  • Equation 3 is the multiple regression equation in standard form and equation 4 is the standard least squares solution.
  • V is the final vector converted to an off-diagonal matrix for submission to MDS.
  • M is the 496 × 6 matrix from the procedure of equation 1.
  • the Two Domain Method is effective simply because it reduces the intense machine activity associated with pattern matching to the simple operations of ratio scale value relations.
  • the scaling theory underlying the method is easily transferable to operations involving classifications among higher dimensions.
  • multi-dimensional scaling has, for some time, been more often used to record human judgments in higher dimensions for a variety of marketing applications.
  • P. E. Green and F. J. Carmone "Multi-dimensional Scaling: An Introduction and Comparison of Nonmetric Unfolding Techniques," Journal of Marketing Research, Vol. 6, 1969, pgs. 330-41.
  • the opinions of multiple experts may be combined in the creation of Φ.
  • the Two Domain Method is also applicable to image classification systems that routinely use Bayesian methods.
  • the operations of the Bayesian classifiers would use, as their inputs, the dissimilarity values output from multi-dimensional scaling matrix transforms, ignoring the plotted values that are derived from the dissimilarity values anyway.
  • the Two Domain Method may facilitate neural net pattern classification, both by making the net more efficient due to the reduction of information that must be submitted (dissimilarities or Euclidean distances rather than vectors of pixel values), and by the increased rigor of the training set expression that reduces noise when particular aspects of patterns are judged, rather than patterns as a whole.
  • the Two Domain Method may be used in the searching of large databases of images, where image representations are stored as feature components.
  • the method would be applied to image classes iteratively, by segregating and mapping successively smaller classes of imagery. This application may be critical to locating desired sets of images that cannot be described linguistically due either to intellectual or economic constraints.

Abstract

Human beings judge patterns (such as images) by complex mental processes, some of which may not be known, while computing machines extract features. By representing the human judgements with simple measurements and reducing them and the machine extracted features to a common metric space and fitting them by regression, the judgements of human experts rendered on a sample of patterns may be imposed on a pattern population to provide automatic classification.

Description

ORIGIN OF THE INVENTION
The invention described herein was made by an employee of the U.S. Government and may be manufactured and used by or for the Government of the United States of America for governmental purposes without the payment of any royalties thereon or therefor.
BACKGROUND OF THE INVENTION
The present invention relates to a method for automatic classification of a collection of patterns which uses the judgments of human experts on a plurality of sample patterns to organize the collection into sets of similar patterns.
More particularly, the present invention relates to a method for the automatic classification of a collection of patterns, such as image patterns, which uses the so-called "Two Domain Theory" of pattern classification.
Pattern classification by computational devices is usually approached in two phases. The first, a so-called "training" phase, is the specification by an expert of pattern exemplars representing the classes as a training set. In the subsequent, so-called "classification" phase, pattern features extracted from the target pattern population are joined with the features similarly extracted from the specified exemplars. Various difficulties arise with these techniques in both phases. For example, in the training phase, the expert's knowledge must be properly decoded to record accurately the salient features used for exemplar classification, a process of recognized difficulty with many pitfalls. Additionally, in the classification phase, information from the expert must often be encoded as specific programs for identification and matching, thus restricting the applicable domain of the algorithm. Even the most robust of these methods, the Fisher linear discriminant, where neither the features of the exemplar nor the domain features of the target population of images need be exactly specified, suffers from the noise introduced in exemplars when the expert makes judgments on only a few features of a multi-featured pattern.
SUMMARY OF THE INVENTION
The principal object of the present invention is to provide a method of pattern classification which requires neither explicit decoding of expert judgments nor domain specific feature matching and which, further, removes from consideration the noise introduced in the Fisher method.
This object, as well as further objects which will become apparent from the discussion that follows, is achieved, according to the present invention, by providing a method, hereinafter called the "Two Domain Method", that introduces two unique processes in both the training and classification phases. First, expert knowledge is acquired through multi-dimensional scaling of judgments of dissimilarities rendered by a human expert on a sample of patterns from the target population. Second, general pattern features extracted from the patterns of the target population are transformed to points in a Euclidean space. With this method, the problem of pattern classification is reduced from the complex one of creating machine-based validity rules to the simple matter of creating a linear mapping between two datasets derived from the human domain and the machine domain, respectively.
More specifically, the Two Domain Method according to the present invention comprises the steps of:
(a) selecting a set of sample patterns, preferably by random selection from the collection C of the patterns which are to be classified;
(b) manually comparing members of the set of sample patterns to determine the degree of dissimilarity of each member of the set with respect to some, and preferably all, other members of the set;
(c) producing an ordering Φ of the members of the set by their degree of dissimilarity, preferably by multi-dimensional scaling;
(d) sensing the collection C of patterns to produce a signal S representing the patterns, for example by digitization;
(e) processing the signal S to produce a plurality of signatures representing distributions of primitive features of interest;
(f) calculating the spatial distance among pairs of the patterns from the signatures to produce a matrix M of interpoint distances; and
(g) creating a mapping of the ordering Φ on the matrix M by multiple regression.
By means of this method, the collection C of patterns is organized into sets of similar patterns using the judgments of human experts on the set of sample patterns.
According to a preferred embodiment of the invention, the comparing step, referred to above, includes the steps of manually marking a line, for each pair of sample patterns, which indicates on an arbitrary scale, from dissimilar to similar, the degree of similarity of each pair, and then sensing the line to produce a signal representative of the position of the mark on the line.
According to another preferred embodiment of the invention, the step of processing the signal S includes the steps of producing a histogram for each of the primitive features and then converting the feature histograms for each pattern into Lorenz information measures.
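The text does not define the Lorenz information measure at this point. As a minimal sketch, assuming the measure is the area under the Lorenz curve of the normalized histogram (bin probabilities sorted in ascending order and accumulated), the conversion of one feature histogram might look as follows. The function name `lorenz_information_measure` and the definition itself are assumptions made for illustration, not details taken from this document.

```python
import numpy as np

def lorenz_information_measure(histogram):
    """Area under the Lorenz curve of a normalized histogram (assumed definition)."""
    counts = np.asarray(histogram, dtype=float)
    total = counts.sum()
    if total <= 0:
        raise ValueError("histogram must contain at least one count")
    # Normalize to probabilities and sort ascending, as a Lorenz curve requires.
    probs = np.sort(counts / total)
    # Cumulative shares, with the curve starting at the origin (0, 0).
    cumulative = np.concatenate(([0.0], np.cumsum(probs)))
    n = probs.size
    # Trapezoidal area under the piecewise-linear curve over the interval [0, 1].
    return float(np.sum((cumulative[:-1] + cumulative[1:]) / 2.0) / n)

# Hypothetical grey-level histogram with four bins.
grey_level_histogram = [12, 30, 7, 51]
lim_value = lorenz_information_measure(grey_level_histogram)
```

Under this assumed definition, the result does not depend on the order of the bins, a uniform histogram gives the maximum value of 0.5, and a histogram concentrated in a single one of n bins gives 1/(2n).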
The calculating step thus preferably includes the step of calculating the Euclidean distance among pairs of the patterns over the Lorenz information measures to produce the matrix M.
According to another preferred embodiment of the invention, the step of creating a mapping includes the step of creating a linear mapping of the ordering Φ on a matrix M by regressing the ordering Φ with the sample of matrix M corresponding to the sample manually compared to obtain a matrix of weights β by multiple regression and multiplying the matrix M by the matrix β. Thereafter, the results of the matrix multiplication are submitted to multi-dimensional scaling to produce the final ordering Φ', consisting of patterns segregated into classes in an n-dimensional space.
As used herein, the term "multi-dimensional scaling" refers to a technique described by F. W. Young and R. M. Hamer in MultiDimensional Scaling: History, Theory and Applications, Lawrence Erlbaum Associates, Publishers; Hillsdale, N.J. and London (1987). The term "multi-dimensional scaling" refers to a family of data analysis methods, all of which portray the data structure in a spatial fashion easily assimilated by the relatively untrained human eye. They construct a geometric representation of the data, usually in a Euclidean space of fairly low dimensionality. The essential ingredient found in all multi-dimensional scaling methods is the spatial representation of data structure.
Whereas in unidimensional measurement, an attribute corresponds to the straight line (a unidimensional space), and the quantity of this attribute to a point on the line, in multi-dimensional scaling, the attribute corresponds to an n-dimensional space, and the quantity to a point in that space. Whereas the process of assigning numbers in unidimensional measurement corresponds to the location of points on a line, in terms of the order of points, their distance from one another, and/or their distances from an origin, so, in multi-dimensional scaling, the process of assigning numbers corresponds to locating the points in a multidimensional space, in terms of a set of relations between the points as specified by the particular geometrical model.
By way of explanation of the Two Domain Method, consider a collection of patterns (in this case, images) denoted "C". Let the goal of the expert be to define pairwise dissimilarities among a sample set of these images chosen by a random process. These dissimilarity judgments may be collected by presenting all possible pairs of the images in the sample and asking the expert to place a mark on a line labeled "dissimilar" at one end and "similar" at the other. A ruler applied to these lines thus establishes a matrix of dissimilarity values among the sampled images. By processing these judgments in an n-dimensional space using conventional multi-dimensional scaling (MDS) techniques, a unique, real-valued ordering of these images by their dissimilarity may be produced. Let this ordering be denoted Φ. With this procedure it becomes unnecessary to know explicitly the portions, features, or aspects of the image, or even the deductive rules used by the expert, in rendering the judgments. Whatever features, aspects, or rules the expert may have attended to or employed are already implicit in the ordering, Φ.
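The paired-comparison procedure above can be sketched in code. The sketch assumes that each ruler reading is measured from the "similar" end of the line, so that larger readings mean greater dissimilarity, and it substitutes classical (Torgerson) scaling for the ALSCAL procedure named in the figure descriptions, because classical scaling needs only an eigendecomposition. The function names and the readings are hypothetical.

```python
import numpy as np
from itertools import combinations

def dissimilarity_matrix(n_items, marks):
    """Build a symmetric matrix from ruler readings keyed by (i, j) pairs, i < j."""
    d = np.zeros((n_items, n_items))
    for i, j in combinations(range(n_items), 2):
        d[i, j] = d[j, i] = marks[(i, j)]
    return d

def classical_mds(d, n_dims=2):
    """Classical (Torgerson) scaling: coordinates from a dissimilarity matrix."""
    n = d.shape[0]
    centering = np.eye(n) - np.ones((n, n)) / n
    # Double-centre the squared dissimilarities to obtain a Gram matrix.
    b = -0.5 * centering @ (d ** 2) @ centering
    eigvals, eigvecs = np.linalg.eigh(b)
    # Keep the n_dims largest eigenvalues; clip small negative values to zero.
    order = np.argsort(eigvals)[::-1][:n_dims]
    scale = np.sqrt(np.clip(eigvals[order], 0.0, None))
    return eigvecs[:, order] * scale

# Hypothetical ruler readings (in centimetres) for a sample of four images.
readings = {(0, 1): 1.2, (0, 2): 8.5, (0, 3): 9.1,
            (1, 2): 8.0, (1, 3): 8.8, (2, 3): 1.5}
phi = classical_mds(dissimilarity_matrix(4, readings), n_dims=2)
```

Each row of `phi` holds the coordinates of one sampled image, playing the role of the ordering Φ in this illustration. A non-metric procedure such as ALSCAL would generally give a different configuration from the same readings.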
Considering again the collection C, let it be assumed that each image in this collection has been digitized and processed so as to extract a number of general, primitive features rendered as histograms. In the example given below, six features are extracted: grey level, edge intensity, edge slope, line length, line distance from the origin, and angle distance from the origin. These features are not the only possible features that might be used, or even the optimal features, but are used as examples because they are very general and convenient.
By converting the histograms for each image into Lorenz information measures, and calculating the Euclidean distance among all pairs of images over all feature measures, a matrix, denoted M, of primitive machine image interpretations may be produced. In this manner, the complex problem of image classification is reduced to the far simpler one of creating a linear mapping of Φ on M.
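As an illustration of how M might be assembled, the sketch below takes one row of feature measures per image and produces one row per image pair and one column per feature. This layout is an assumption, chosen because the page describes M as a 496-by-6 matrix for 32 images and six features (32 images give 496 unordered pairs). Using the absolute difference as the per-feature distance is likewise an assumption, and the input values are random placeholders rather than data from the patent.

```python
import numpy as np
from itertools import combinations

def pairwise_feature_distances(measures):
    """Return (pairs, M): one row per image pair, one column per feature."""
    measures = np.asarray(measures, dtype=float)
    pairs = list(combinations(range(measures.shape[0]), 2))
    left = measures[[i for i, _ in pairs]]
    right = measures[[j for _, j in pairs]]
    # In one dimension the Euclidean distance is the absolute difference.
    return pairs, np.abs(left - right)

rng = np.random.default_rng(0)
lim = rng.uniform(0.0, 0.5, size=(32, 6))  # placeholder measures, not patent data
pairs, M = pairwise_feature_distances(lim)
print(M.shape)  # (496, 6)
```

Keeping the list of pairs next to M makes it straightforward to select the rows that belong to the images judged by the expert.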
In the present method, the mapping is performed by extracting from C the original machine measures matching the subset of C judged by the human expert, calculating Euclidean distances for both machine measurements and human coordinates, deriving weights, β, by multiple regression (where the Euclidean distances from the MDS solution for the human judgments are the dependent variable and the Euclidean distances among images based on machine measurements are the independent variables), and multiplying M by β. By resubmitting the predicted values to the multidimensional scaling process, the final ordering is produced, consisting of patterns segregated into classes in an n-dimensional space. This last result is denoted as Φ'.
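The regression step can be illustrated with ordinary least squares. In this sketch the rows of the sample matrix are the image pairs from the judged sample, the columns are the per-feature machine distances, and the dependent variable is the vector of distances from the human MDS solution. Fitting without an intercept is an assumption, made so that the prediction is exactly the product of M and β. All numbers are synthetic.

```python
import numpy as np

def fit_weights(machine_dist, human_dist):
    """Least-squares weights (no intercept) for human_dist ~ machine_dist @ beta."""
    beta, _residuals, _rank, _sv = np.linalg.lstsq(machine_dist, human_dist, rcond=None)
    return beta

def predict_distances(M, beta):
    """Apply the fitted weights to every pair in the full collection."""
    return np.asarray(M, dtype=float) @ beta

rng = np.random.default_rng(1)
sample_machine = rng.uniform(size=(28, 6))   # 8 sampled images -> 28 pairs, 6 features
true_beta = np.array([0.5, 1.0, 0.0, 2.0, 0.25, 1.5])  # made-up weights
sample_human = sample_machine @ true_beta    # stand-in for human MDS distances
beta = fit_weights(sample_machine, sample_human)

M = rng.uniform(size=(496, 6))               # all 32 images -> 496 pairs
predicted = predict_distances(M, beta)       # one predicted distance per pair
```

Because the synthetic human distances here are an exact linear function of the machine distances, the fit recovers the made-up weights. With real judgments the fit would be approximate, and some predicted distances could come out negative, which would need handling before scaling.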
The preferred embodiments of the invention will now be described with the aid of the accompanying drawings.
BRIEF DESCRIPTION OF THE DRAWINGS
FIG. 1 is a detailed block diagram of the procedural steps of the Two Domain Method, according to the present invention, for classifying a collection of image patterns.
FIGS. 2 and 3 are multi-dimensional scaling (MDS ALSCAL) plots of the original human view of a sample of eight images (photographic slides) of peripheral white blood cells. The human judgments were collected through the method of paired comparisons, and show a clear separation between the slides from Subject 1 and Subject 2.
FIG. 4 is an MDS ALSCAL plot of the primitive machine views of a set A of sixteen slides (slides 1-16) from a photographic film, rated as ASA 200 and exposed at ASA 200, which includes both Subject 1 and Subject 2. This plot exhibits some natural clustering by machine features alone.
FIG. 5 is an MDS ALSCAL plot of the primitive machine views of a set B of sixteen slides (slides 17-32) from a photographic film, rated at ASA 200 but exposed at ASA 400, including both Subject 1 and Subject 2. This plot exhibits little machine differentiation between the two subjects.
FIG. 6 is an MDS ALSCAL plot of both slide sets A and B and Subjects 1 and 2. It exhibits distortion of the natural clustering effect displayed in set A of FIG. 4 when set A and set B are combined.
FIG. 7 is an MDS ALSCAL plot of slide sets A and B and Subjects 1 and 2. This plot exhibits the reordering of Subject 1 and Subject 2 classes when weighted by the human view displayed in FIG. 2.
FIGS. 8 and 9 are MDS ALSCAL plots (in numbered display) of both primitive and human weighted views of all 32 peripheral blood cell slides, corresponding to the datapoints shown in FIGS. 6 and 7, respectively. FIG. 9 exhibits the substantial "learning" effect created by imposition of human judgments on machine interpretations.
DESCRIPTION OF THE PREFERRED EMBODIMENTS
The preferred embodiments of the present invention will now be described with reference to FIGS. 1-9 of the drawings.
The Two Domain Method
FIG. 1 illustrates the preferred embodiment of the Two Domain Method according to the present invention as applied to a collection of images. Each numbered block in this figure represents a separate and distinct step of this method.
The collection of images is initially sensed by machine and converted to a format--in particular, a signal S representing the patterns--which is useable by a computer. The signal S is then processed in block 1 to extract primitive image features as histograms of these features. By way of example and not limitation, the features may be grey level, edge intensity, edge slope, line length, line distance from the origin, and angle distance from the origin. Thereafter, in block 2, the histogram for each image is converted into Lorenz information measures. In block 3, the Lorenz information measures associated with those images which are used for the expert, human judgments are extracted from the group for later use.
Subsequently, in block 4, the Euclidean distances among all pairs of images are calculated over all Lorenz information measures to produce a matrix M of primitive machine image interpretations.
In block 5, a set of sample images is selected at random from the collection of images. In block 6, the sample images are compared, in pairs, by a human expert to determine the degree of dissimilarity of each pair. These expert judgments are then processed using conventional multi-dimensional scaling (MDS) techniques to produce a real valued ordering Φ of these images by their dissimilarity, as indicated in block 7. Thereafter, in block 8, the geometric representation produced by the MDS process is converted to Euclidean distances which, in turn, are converted, in block 9, to a column vector.
Thereafter, in block 10, the extracted sample of images, in Lorenz information measures, is converted to Euclidean distances which are regressed with the results of the conversion in block 9 to obtain a matrix of weights β, in block 11.
In block 12, the matrix M produced in block 4 is multiplied by the matrix of weights β from block 11. The resulting vector is converted to an off-diagonal matrix in block 13 for submission to MDS in block 14. The result of this MDS is the final ordering Φ'.
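The conversion in block 13 can be pictured as unpacking a vector of pairwise values into a symmetric matrix with a zero diagonal. The sketch below assumes the vector lists the pairs in the order (0,1), (0,2), ..., (1,2), ..., which is the order produced by `numpy.triu_indices`. The function name is hypothetical.

```python
import numpy as np

def vector_to_offdiagonal(v, n_items):
    """Place pairwise values into a symmetric matrix with a zero diagonal."""
    v = np.asarray(v, dtype=float)
    expected = n_items * (n_items - 1) // 2
    if v.size != expected:
        raise ValueError(f"expected {expected} values for {n_items} items, got {v.size}")
    out = np.zeros((n_items, n_items))
    rows, cols = np.triu_indices(n_items, k=1)
    out[rows, cols] = v   # upper triangle
    out[cols, rows] = v   # mirror into the lower triangle
    return out

# Three items give three pairs: (0,1), (0,2), (1,2).
example = vector_to_offdiagonal([1.0, 2.0, 3.0], 3)
```

For the 32-image example the input vector would hold 496 values and the result would be a 32-by-32 matrix suitable for submission to an MDS routine.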
Application of the Two Domain Method to the Classification of Two Populations of Human Peripheral Blood Leukocytes
As an example, the Two Domain Method according to the invention will be applied to a problem of discriminating two populations of microscopic images of circulating human white blood cells (leukocytes).
Specifically, the Two Domain Method has been tested for its power to discriminate two distinct patterns of human blood leukocyte distribution: an abnormal pattern associated with acute liver failure exhibiting abnormal circulating white blood cell frequency and distribution (Subject 1) and a normal pattern from a normal, healthy subject (Subject 2).
Circulating human leukocytes were separated by flotation from red blood cells by a standard flotation method, and uniform monolayer films prepared and cytochemically stained by a routine clinical laboratory automated instrument using hematoxylin and eosin dyes. The resulting slides therefore included all nucleated circulating white blood cells, predominately neutrophils, eosinophils, lymphocytes and monocytes, as well as platelets.
Eight representative sample fields were selected for each subject. A photographic recording was standardized using one continuous film strip of Kodak Ektachrome color reversal film rated at ASA 200. All slides were photographed at the same magnification. Effects of exposure variations and background density were tested in the Two Domain Method by recording each image at two different exposures. Sixteen Set A images were exposed at ASA 200, while sixteen Set B images were exposed at ASA 400. Samples used in the test thus consisted of sixteen images from each subject, at two levels of exposure, on the same photographic film strip.
The difference in exposure levels substantially alters the machine measurements of these images and is typical of problems that confound image pattern classification generally, in that "noise" introduced by one element or another distorts the machine classification algorithms. The purpose of this application is thus to demonstrate that the Two Domain Method is sufficiently robust not only to classify properly Set A (by segregation in an n-dimensional space), but also to reduce or eliminate the noise artificially introduced by the difference in Set B film exposure levels.
Expert judgments of dissimilarities were made by an experienced pathologist (C.T.L.) primarily on the basis of the segmentation of leukocyte nuclei, and lymphocyte and monocyte shape and size. Other cell types present in the images were ignored for judgment purposes. Judgments were provided in a single session on slides 1-8 of Set A according to the procedure described above, and submitted (as are all datasets discussed herein) to the ALSCAL procedure in SAS, a common multi-dimensional scaling software package.
FIGS. 2 and 3, which are MDS ALSCAL plots of this manual examination of slides 1-8, exhibit a strong separation between the cell populations of the two subjects. The primitive machine interpretations derived from both Set A and Set B, scaled by ALSCAL, appear in FIGS. 4 and 5, respectively. The images represented by datapoints in FIG. 4 appear to have some natural clustering tendency along the same lines as those provided directly by human judgments, probably due to the increased light levels in the images produced from Subject 1 and caused by the generally lower levels of white blood cells in the sample drawn from that subject. FIG. 5, on the other hand, derived from the deliberately overexposed images, reveals very little meaningful segregation.
FIG. 6 reveals the strong confounding effect of the Set B data when combined with Set A and scaled together. When the sets are combined, each item acts to influence the scale value of every other item, so that the pure machine view, or interpretation, of these images becomes extremely confused. There is, for example, some segregation of Subject 1 and Subject 2, but still much less than that appearing in the human classification of these images provided in FIG. 2.
FIG. 7 shows the effect of the Two Domain Method on the disordered data of FIG. 6. FIG. 7 was produced according to the procedures of FIG. 1 with the detailed calculations described below. In FIG. 7, Subject 1 and Subject 2 data are perfectly segregated for Set A and, with the exception of one image, also perfectly segregated for Set B. Clearly, the strong, confounding effect introduced by combining Set B with Set A images is eliminated.
FIGS. 8 and 9 are MDS ALSCAL plots, in numbered display, of both the primitive and human weighted views of the thirty-two peripheral blood cell slides. FIG. 9 exhibits the substantial "learning" effect created by imposition of the human judgments on the machine interpretations.
Detailed Calculations
The calculations used to produce the plot of FIG. 7 will now be described in detail. First, the primitive machine measurements (Lorenz information measures) for images 17-24 corresponding to the human judgments rendered on Set A for images 1-8 were converted to six sets of squared Euclidean distances (one for each machine measurement) according to the following equation:
$$Q_k = (p_{ik} - p_{jk})^2, \quad i < j, \quad k = 1, \dots, 6 \tag{1}$$
where
$Q$ is a 28×6 matrix,
$Q_k$ is column $k$ of matrix $Q$,
$P$ is an 8×6 matrix,
$p_{ik}$ is the machine measurement $k$ for image $i$, and
$p_{jk}$ is the machine measurement $k$ for image $j$.
Since a column of Q contains the squared difference between all pairs of images on the corresponding machine measurements, there are [n(n-1)]/2 elements in each column, where n is the number of images.
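The construction of Q in equation 1 can be sketched as follows. The function name `pairwise_squared_differences` and the random placeholder data are illustrative assumptions; only the shapes (8 images, 6 measurements, 28 pairs) come from the text.

```python
from itertools import combinations

import numpy as np

def pairwise_squared_differences(P):
    """Return one row per image pair (i < j) and one column per measurement."""
    P = np.asarray(P, dtype=float)
    pairs = list(combinations(range(P.shape[0]), 2))  # all (i, j) with i < j
    i_idx = [i for i, _ in pairs]
    j_idx = [j for _, j in pairs]
    # Squared difference of each measurement, pair by pair.
    return (P[i_idx] - P[j_idx]) ** 2

# Placeholder measurements standing in for the Lorenz information measures.
rng = np.random.default_rng(0)
P = rng.random((8, 6))
Q = pairwise_squared_differences(P)
print(Q.shape)  # (28, 6), since 8 * 7 / 2 = 28
```

The same function applied to all thirty-two images yields 32 × 31 / 2 = 496 rows, which matches the 496×6 matrix M used later in the calculation.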
Second, the squared Euclidean distances between all pairs of slides 1-8 of Set A, that is, Φ, were computed from the spatial coordinates of the MDS solution for the human judgments of FIG. 2 according to equation 2:
$$D_{ij} = \sum_{k=1}^{r} (x_{ik} - x_{jk})^2 \tag{2}$$
where
$D$ is the square symmetric matrix,
$x_{ik}$ is the coordinate of image $i$ on dimension $k$,
$x_{jk}$ is the coordinate of image $j$ on dimension $k$, and
$r$ is the number of dimensions in the solution.
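A minimal sketch of this second step, assuming the MDS coordinates are available as an array with one row per image and one column per dimension. The name `squared_distance_matrix` is hypothetical.

```python
import numpy as np

def squared_distance_matrix(X):
    """Square symmetric matrix of squared Euclidean distances between rows of X."""
    X = np.asarray(X, dtype=float)
    # diff[i, j, k] is the coordinate difference of images i and j on dimension k.
    diff = X[:, None, :] - X[None, :, :]
    return (diff ** 2).sum(axis=2)
```

The result has zeros on the diagonal and is symmetric, so only the elements above the diagonal carry information.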
Third, the square symmetric matrix was converted to a column vector containing the top off-diagonal elements (for convenience also denoted D) and regressed on the matrix Q of equation 1 to produce the vector of weights, β. Equation 3 is the multiple regression equation in standard form and equation 4 is the standard least squares solution.
$$D = Q\beta' \tag{3}$$
$$\beta = (Q'Q)^{-1} Q'D \tag{4}$$
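The regression step can be sketched with NumPy. The helper names are hypothetical, and `numpy.linalg.lstsq` is used here in place of forming the inverse of Q'Q explicitly; for a Q of full column rank it gives the same least squares solution as equation 4, with no intercept term.

```python
import numpy as np

def upper_off_diagonal(D):
    """Stack the elements above the diagonal of a square matrix, row by row."""
    D = np.asarray(D, dtype=float)
    return D[np.triu_indices(D.shape[0], k=1)]

def fit_weights(Q, d):
    """Least squares weights beta such that Q @ beta approximates d."""
    Q = np.asarray(Q, dtype=float)
    d = np.asarray(d, dtype=float)
    beta, _, rank, _ = np.linalg.lstsq(Q, d, rcond=None)
    if rank < Q.shape[1]:
        raise ValueError("Q is rank deficient; the weights are not unique")
    return beta
```

The row order produced by `upper_off_diagonal` is (0,1), (0,2), ..., (1,2), ..., which has to match the pair order used for the rows of Q for the regression to be meaningful.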
Fourth, the procedure of equation 1 was applied to all machine data, images 1-32, denoted M, and multiplied by the vector of weights, β, or
$$V = M\beta' \tag{5}$$
where,
V is the final vector converted to an off-diagonal matrix for submission to MDS, and
M is the 496×6 matrix from the procedure of equation 1.
V, submitted to MDS and scaled, thus results in Φ' as displayed in FIG. 7.
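The final step can be sketched as below, under two labeled assumptions: classical (Torgerson) metric MDS is substituted for the ALSCAL procedure named in the text, and the weighted vector V is treated as a set of squared distances. All function names are hypothetical.

```python
import numpy as np

def vector_to_symmetric(v, n):
    """Place a pair vector above the diagonal and mirror it below."""
    S = np.zeros((n, n))
    S[np.triu_indices(n, k=1)] = v
    return S + S.T

def classical_mds(sq_dist, r=2):
    """Classical MDS coordinates from a matrix of squared distances.

    Stand-in for ALSCAL; an assumption, not the procedure used in the text.
    """
    n = sq_dist.shape[0]
    J = np.eye(n) - np.ones((n, n)) / n      # centering matrix
    B = -0.5 * J @ sq_dist @ J               # double-centered Gram matrix
    eigvals, eigvecs = np.linalg.eigh(B)
    order = np.argsort(eigvals)[::-1][:r]    # r largest eigenvalues
    lam = np.clip(eigvals[order], 0.0, None) # guard against small negatives
    return eigvecs[:, order] * np.sqrt(lam)

def two_domain_ordering(M, beta, n, r=2):
    """Weight the machine distances (V = M beta) and scale the result."""
    v = np.asarray(M, dtype=float) @ np.asarray(beta, dtype=float)
    return classical_mds(vector_to_symmetric(v, n), r)
```

For the example in the text, M would be the 496×6 matrix, `n` would be 32, and the returned coordinates would play the role of the ordering Φ'. If some weights are negative, entries of V can be negative as well, in which case classical MDS is only an approximation.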
Conclusion
In conclusion, the Two Domain Method, as disclosed herein, is effective simply because it reduces the intense machine activity associated with pattern matching to the simple operations of ratio scale value relations. Moreover, the scaling theory underlying the method is easily transferable to operations involving classifications among higher dimensions. Indeed, multi-dimensional scaling has, for some time, been more often used to record human judgments in higher dimensions for a variety of marketing applications. See P. E. Green and F. J. Carmone, "Multi-dimensional Scaling: An Introduction and Comparison of Nonmetric Unfolding Techniques," Journal of Marketing Research, Vol. 6, 1969, pgs. 330-41. Finally, by using replicated multi-dimensional scaling methods, the opinions of multiple experts (as opposed to the single expert used in this application) may be combined in the creation of Φ.
The Two Domain Method is also applicable to image classification systems that routinely use Bayesian methods. In this case, the operations of the Bayesian classifiers would use, as their inputs, the dissimilarity values output from multi-dimensional scaling matrix transforms, ignoring the plotted values that are derived from the dissimilarity values anyway. Along these same lines, the Two Domain Method may facilitate neural net pattern classification, both by making the net more efficient due to the reduction of information that must be submitted (dissimilarities or Euclidean distances rather than vectors of pixel values), and by the increased rigor of the training set expression that reduces noise when particular aspects of patterns are judged, rather than patterns as a whole.
Finally, the Two Domain Method may be used in the searching of large databases of images, where image representations are stored as feature components. In this application, the method would be applied to image classes iteratively, by segregating and mapping successively smaller classes of imagery. This application may be critical to locating desired sets of images that cannot be described linguistically due either to intellectual or economic constraints.
There has thus been shown and described a novel general method of pattern classification using the Two Domain Theory which fulfills all the objects and advantages sought therefor. Many changes, modifications, variations and other uses and applications of the subject invention will, however, become apparent to those skilled in the art after considering this specification and the accompanying drawings which disclose the preferred embodiments thereof. All such changes, modifications, variations and other uses and applications which do not depart from the spirit and scope of the invention are deemed to be covered by the invention, which is to be limited only by the claims which follow.

Claims (19)

What is claimed is:
1. A method for automatic classification of a collection C of patterns using the judgments of human experts on a plurality of sample patterns, said method comprising the steps of:
(a) selecting a set of sample patterns;
(b) manually comparing members of said set of sample patterns to determine the degree of dissimilarity of each member of said set with respect to other members of said set;
(c) producing an ordering Φ of said members of said set by their degree of dissimilarity in an n-dimensional space by means of multi-dimensional scaling to produce a real-valued ordering Φ of said sample patterns;
(d) sensing the collection C of patterns to produce a signal S representing said patterns;
(e) processing the signal S to produce a plurality of machine derived signatures representing distributions of primitive features of interest;
(f) calculating the spatial distance among pairs of said patterns from said machine derived signatures to produce a matrix M of interpoint distances; and
(g) creating a mapping of the ordering Φ on the matrix M by multiple regression;
whereby said collection of patterns is organized into sets of similar patterns.
2. The method defined in claim 1, wherein said patterns are images.
3. The method defined in claim 1, wherein said sample patterns are selected from said collection of patterns.
4. The method defined in claim 3, wherein said sample patterns are selected at random from said collection of patterns so as to be representative of said collection.
5. The method defined in claim 1, wherein each member of the set of sample patterns is manually compared as a pair with every other member of said set to determine the degree of dissimilarity of each pair.
6. The method defined in claim 5, wherein said comparing step includes the steps of manually marking a line, for each pair of sample patterns, which indicates, on an arbitrary scale from dissimilar to similar, the degree of dissimilarity of such pair; and sensing the line to produce a signal representative of the position of the mark on the line.
7. The method defined in claim 1, wherein said n-dimensional space is a Euclidean space.
8. The method defined in claim 1, wherein said step of sensing said collection C of patterns includes the step of digitizing each pattern and storing the digitized values.
9. The method defined in claim 1, wherein said signal S processing step includes the step of producing a histogram for each of said primitive features.
10. The method defined in claim 9, wherein said signal S processing step further includes the step of converting the feature histograms for each pattern into Lorenz information measures.
11. The method defined in claim 10, wherein said calculating step includes the step of calculating the Euclidean distance among pairs of said patterns over the Lorenz information measures to produce said matrix M.
12. The method defined in claim 11, wherein said step of creating a mapping includes the step of creating a linear mapping of the ordering Φ on the matrix M.
13. The method defined in claim 1, wherein said step of creating a mapping includes the steps of regressing the ordering Φ with the sample of matrix M corresponding to the sample manually compared to obtain a matrix of weights β by multiple regression and multiplying the matrix M by the matrix Φ.
14. The method defined in claim 13, further comprising the step of submitting the results of the matrix multiplication to multi-dimensional scaling to produce the final ordering Φ', consisting of patterns segregated into classes in an n-dimensional space.
15. A method for synthesizing human judgement measurements and machine derived measurements with respect to a collection C of patterns, said method comprising the steps of:
(a) selecting from the collection C of patterns a sample set comprising a plurality of sample patterns;
(b) forming pairs of patterns from said sample set by pairing each sample pattern with at least one other sample pattern;
(c) determining, using the subjective judgement of at least one human, a relative degree of dissimilarity between the patterns of each said pair;
(d) sensing the collection C of patterns to produce a signal S representing each pattern of said collection;
(e) extracting machine derived measurements of selected features from signal S for each pattern of collection C to create a set X of said machine derived feature measurements;
(f) selecting from the set X of machine derived feature measurements the subset Y of machine derived feature measurements corresponding to the set of sample patterns of step (a);
(g) processing the results of steps (c) and (f) to produce a matrix of weights relating the human judgement measurements with the machine derived feature measurements for the set of sample patterns; and
(h) applying the weights from step (g) to the machine derived feature measurements for the set X, whereby, the human judgement measurements and the machine measurements are related for the entire collection C of patterns.
16. The method of claim 15, wherein said set of sample patterns are selected so as to be representative of said collection.
17. The method of claim 15, wherein the sensing step includes digitizing and storing the patterns.
18. The method of claim 15, wherein the machine derived features comprise one or more primitive measurements.
19. The method of claim 15 comprising the further step of producing an ordering consisting of patterns segregated into classes in a n-dimensional space.
US07/587,922 1990-09-25 1990-09-25 General method of pattern classification using the two domain theory Expired - Fee Related US5181259A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
US07/587,922 US5181259A (en) 1990-09-25 1990-09-25 General method of pattern classification using the two domain theory

Publications (1)

Publication Number Publication Date
US5181259A true US5181259A (en) 1993-01-19

Family

ID=24351726

Family Applications (1)

Application Number Title Priority Date Filing Date
US07/587,922 Expired - Fee Related US5181259A (en) 1990-09-25 1990-09-25 General method of pattern classification using the two domain theory

Country Status (1)

Country Link
US (1) US5181259A (en)

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4129854A (en) * 1976-10-25 1978-12-12 Hitachi, Ltd. Cell classification method
US4307376A (en) * 1976-12-09 1981-12-22 Geometric Data Corporation Pattern recognition system for generating hematology profile
US4326259A (en) * 1980-03-27 1982-04-20 Nestor Associates Self organizing general pattern class separator and identifier
US4618988A (en) * 1984-07-25 1986-10-21 Fingermatrix, Inc. Matcher
US4850024A (en) * 1984-04-05 1989-07-18 Hitachi, Ltd. Method and apparatus for classifying white blood cells

Non-Patent Citations (4)

* Cited by examiner, † Cited by third party
Title
"Automatic Image Classification by Psychometric Mapping," Technical Report No. 87-2, M. E. Rorvig, R. Helfer & S. Fitzpatrick, Project ICON Image Scaling Laboratory, The University of Texas at Austin, Austin, TX 1987.
"The Two Domain Theory of Image Collection Searching," Project Icon Working Paper No. 86-1, M. E. Rorvig, Project ICON Image Scaling Laboratory, The University of Texas at Austin, Austin, TX 1986.
Automatic Image Classification by Psychometric Mapping, Technical Report No. 87 2, M. E. Rorvig, R. Helfer & S. Fitzpatrick, Project ICON Image Scaling Laboratory, The University of Texas at Austin, Austin, TX 1987. *
The Two Domain Theory of Image Collection Searching, Project Icon Working Paper No. 86 1, M. E. Rorvig, Project ICON Image Scaling Laboratory, The University of Texas at Austin, Austin, TX 1986. *

Cited By (150)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5696844A (en) * 1991-05-14 1997-12-09 Matsushita Electric Industrial Co., Ltd. Outline pattern data extraction device for extracting outline pattern of a pattern distribution in a multi-dimensional feature vector space and its applications
US5422961A (en) * 1992-04-03 1995-06-06 At&T Corp. Apparatus and method for improving recognition of patterns by prototype transformation
US5325445A (en) * 1992-05-29 1994-06-28 Eastman Kodak Company Feature classification using supervised statistical pattern recognition
US5425110A (en) * 1993-04-19 1995-06-13 Xerox Corporation Method and apparatus for automatic language determination of Asian language documents
US5444797A (en) * 1993-04-19 1995-08-22 Xerox Corporation Method and apparatus for automatic character script determination
US5537488A (en) * 1993-09-16 1996-07-16 Massachusetts Institute Of Technology Pattern recognition system with statistical classification
US5703964A (en) * 1993-09-16 1997-12-30 Massachusetts Institute Of Technology Pattern recognition system with statistical classification
US5572604A (en) * 1993-11-22 1996-11-05 Lucent Technologies Inc. Method for pattern recognition using prototype transformations and hierarchical filtering
US6434490B1 (en) 1994-09-16 2002-08-13 3-Dimensional Pharmaceuticals, Inc. Method of generating chemical compounds having desired properties
US20030033088A1 (en) * 1994-09-16 2003-02-13 Agrafiotis Dimitris K. Method of generating chemical compounds having desired properties
US5740269A (en) * 1994-09-20 1998-04-14 Neopath, Inc. Method and apparatus for robust biological specimen classification
WO1996009603A1 (en) * 1994-09-20 1996-03-28 Neopath, Inc. Method and apparatus for robust biological specimen classification
US5872865A (en) * 1995-02-08 1999-02-16 Apple Computer, Inc. Method and system for automatic classification of video images
US5835627A (en) * 1995-05-15 1998-11-10 Higgins; Eric W. System and method for automatically optimizing image quality and processing time
US5694484A (en) * 1995-05-15 1997-12-02 Polaroid Corporation System and method for automatically processing image data to provide images of optimal perceptual quality
US7188055B2 (en) 1996-11-04 2007-03-06 Johnson & Johnson Pharmaceutical Research, & Development, L.L.C. Method, system, and computer program for displaying chemical data
US20030195897A1 (en) * 1996-11-04 2003-10-16 Agrafiotis Dimitris K. Method, system and computer program product for non-linear mapping of multi-dimensional data
US6295514B1 (en) 1996-11-04 2001-09-25 3-Dimensional Pharmaceuticals, Inc. Method, system, and computer program product for representing similarity/dissimilarity between chemical compounds
US6571227B1 (en) 1996-11-04 2003-05-27 3-Dimensional Pharmaceuticals, Inc. Method, system and computer program product for non-linear mapping of multi-dimensional data
US20030014191A1 (en) * 1996-11-04 2003-01-16 3-Dimensional Pharmaceuticals, Inc. System, method and computer program product for identifying chemical compounds having desired properties
US7117187B2 (en) 1996-11-04 2006-10-03 Johnson & Johnson Pharmaceutical Reseach & Develpment, L.L.C. Method, system and computer program product for non-linear mapping of multi-dimensional data
US6453246B1 (en) * 1996-11-04 2002-09-17 3-Dimensional Pharmaceuticals, Inc. System, method, and computer program product for representing proximity data in a multi-dimensional space
US6421612B1 (en) 1996-11-04 2002-07-16 3-Dimensional Pharmaceuticals Inc. System, method and computer program product for identifying chemical compounds having desired properties
WO1999034316A3 (en) * 1997-12-29 1999-10-28 Jeff B Glickman Energy minimization for classification, pattern recognition, sensor fusion, data compression, network reconstruction and signal processing
US6968342B2 (en) * 1997-12-29 2005-11-22 Abel Wolman Energy minimization for data merging and fusion
WO1999034316A2 (en) * 1997-12-29 1999-07-08 Glickman Jeff B Energy minimization for classification, pattern recognition, sensor fusion, data compression, network reconstruction and signal processing
US20050185848A1 (en) * 1997-12-29 2005-08-25 Glickman Jeff B. Energy minimization for classification, pattern recognition, sensor fusion, data compression, network reconstruction and signal processing
US7912290B2 (en) 1997-12-29 2011-03-22 Glickman Jeff B Energy minimization for classification, pattern recognition, sensor fusion, data compression, network reconstruction and signal processing
US20050180638A1 (en) * 1997-12-29 2005-08-18 Glickman Jeff B. Energy minimization for classification, pattern recognition, sensor fusion, data compression, network reconstruction and signal processing
US20020002555A1 (en) * 1997-12-29 2002-01-03 Wolman Abel G. Energy minimization for data merging and fusion
US20050175244A1 (en) * 1997-12-29 2005-08-11 Glickman Jeff B. Energy minimization for classification, pattern recognition, sensor fusion, data compression, network reconstruction and signal processing
US7702155B2 (en) 1997-12-29 2010-04-20 Glickman Jeff B Energy minimization for classification, pattern recognition, sensor fusion, data compression, network reconstruction and signal processing
US7174048B2 (en) 1997-12-29 2007-02-06 Glickman Jeff B Energy minimization for classification, pattern recognition, sensor fusion, data compression, network reconstruction and signal processing
US6993186B1 (en) 1997-12-29 2006-01-31 Glickman Jeff B Energy minimization for classification, pattern recognition, sensor fusion, data compression, network reconstruction and signal processing
US7272262B2 (en) 1997-12-29 2007-09-18 Glickman Jeff B Energy minimization for classification, pattern recognition, sensor fusion, data compression, network reconstruction and signal processing
US6571603B1 (en) * 1998-05-27 2003-06-03 California Institute Of Technology Method of resolving analytes in a fluid
US20070276640A1 (en) * 1998-06-01 2007-11-29 Weyerhaeuser Company Methods for classification of somatic embryos
US9053353B2 (en) 1998-06-01 2015-06-09 Weyerhaeuser Nr Company Image classification of germination potential of somatic embryos
US20070269096A1 (en) * 1998-06-01 2007-11-22 Weyerhaeuser Company Methods for classification of somatic embryos
US20080052056A1 (en) * 1998-06-01 2008-02-28 Weyerhaeuser Company Methods for classification of somatic embryos
US20040128624A1 (en) * 1998-09-11 2004-07-01 Sbc Technology Resources, Inc. System and methods for an architectural framework for design of an adaptive, personalized, interactive content delivery system
US6392649B1 (en) * 1998-10-20 2002-05-21 Sony Corporation Method and apparatus for updating a multidimensional scaling database
US7526731B2 (en) 1999-05-27 2009-04-28 At&T Labs, Inc. Method for integrating user models to interface design
US20090177983A1 (en) * 1999-05-27 2009-07-09 At&T Labs, Inc. (Formerly Known As Sbc Technology Resources, Inc.) Method for integrating user models to interface design
US8103961B2 (en) 1999-05-27 2012-01-24 At&T Labs, Inc. Method for integrating user models to interface design
US7086007B1 (en) 1999-05-27 2006-08-01 Sbc Technology Resources, Inc. Method for integrating user models to interface design
US7224790B1 (en) 1999-05-27 2007-05-29 Sbc Technology Resources, Inc. Method to identify and categorize customer's goals and behaviors within a customer service center environment
US20110022963A1 (en) * 1999-05-27 2011-01-27 At&T Labs, Inc. Method for integrating user models to interface design
US7836405B2 (en) 1999-05-27 2010-11-16 At&T Labs, Inc. Method for integrating user models to interface design
US7382909B1 (en) 1999-11-04 2008-06-03 Mpb Meltec Patent-Und Beteiligungsgesellschaft Mbh Method for the automatic analysis of microscope images
WO2001036939A2 (en) * 1999-11-04 2001-05-25 Meltec Multi-Epitope-Ligand-Technologies Gmbh Method for the automatic analysis of microscope images
WO2001036939A3 (en) * 1999-11-04 2001-11-01 Meltec Multi Epitope Ligand Te Method for the automatic analysis of microscope images
US6671627B2 (en) 2000-02-29 2003-12-30 3-D Pharmaceuticals, Inc. Method and computer program product for designing combinatorial arrays
US20020196277A1 (en) * 2000-03-21 2002-12-26 Sbc Properties, L.P. Method and system for automating the creation of customer-centric interfaces
US7907719B2 (en) 2000-03-21 2011-03-15 At&T Labs, Inc. Customer-centric interface and method of designing an interface
US7379537B2 (en) 2000-03-21 2008-05-27 At&T Knowledge Ventures, L.P. Method and system for automating the creation of customer-centric interfaces
US20030156133A1 (en) * 2000-03-21 2003-08-21 Sbc Properties, L.P. Interface and method of designing an interface
US7139369B2 (en) 2000-03-21 2006-11-21 Sbc Properties, L.P. Interface and method of designing an interface
US7076049B2 (en) 2000-03-21 2006-07-11 Sbc Technology Resources, Inc. Method of designing a telecommunications call center interface
US8131524B2 (en) 2000-03-21 2012-03-06 At&T Intellectual Property I, L.P. Method and system for automating the creation of customer-centric interfaces
US7039621B2 (en) 2000-03-22 2006-05-02 Johnson & Johnson Pharmaceutical Research & Development, L.L.C. System, method, and computer program product for representing object relationships in a multidimensional space
US7139739B2 (en) 2000-04-03 2006-11-21 Johnson & Johnson Pharmaceutical Research & Development, L.L.C. Method, system, and computer program product for representing object relationships in a multidimensional space
US7146347B1 (en) * 2000-08-03 2006-12-05 National Instruments Corporation System and method for automatically creating a prototype to perform a process
US20020029114A1 (en) * 2000-08-22 2002-03-07 Lobanov Victor S. Method, system, and computer program product for determining properties of combinatorial library products from features of library building blocks
US20050153364A1 (en) * 2000-08-22 2005-07-14 Lobanov Victor S. Method, system, and computer program product for determining properties of combinatorial library products from features of library building blocks
US6834239B2 (en) 2000-08-22 2004-12-21 Victor S. Lobanov Method, system, and computer program product for determining properties of combinatorial library products from features of library building blocks
US20020045991A1 (en) * 2000-09-20 2002-04-18 Lobanov Victor S. Method, system, and computer program product for encoding and building products of a virtual combinatorial library
US7054757B2 (en) 2001-01-29 2006-05-30 Johnson & Johnson Pharmaceutical Research & Development, L.L.C. Method, system, and computer program product for analyzing combinatorial libraries
US20020143476A1 (en) * 2001-01-29 2002-10-03 Agrafiotis Dimitris K. Method, system, and computer program product for analyzing combinatorial libraries
US20030026409A1 (en) * 2001-07-31 2003-02-06 Sbc Technology Resources, Inc. Telephone call processing in an interactive voice response call management system
US7453994B2 (en) 2002-01-30 2008-11-18 At&T Labs, Inc. Sequential presentation of long instructions in an interactive voice response system
US8036348B2 (en) 2002-01-30 2011-10-11 At&T Labs, Inc. Sequential presentation of long instructions in an interactive voice response system
US7305070B2 (en) 2002-01-30 2007-12-04 At&T Labs, Inc. Sequential presentation of long instructions in an interactive voice response system
US20030143981A1 (en) * 2002-01-30 2003-07-31 Sbc Technology Resources, Inc. Sequential presentation of long instructions in an interactive voice response system
US20080089491A1 (en) * 2002-01-30 2008-04-17 At&T Labs, Inc. Sequential presentation of long instructions in an interactive voice response system
US6914975B2 (en) 2002-02-21 2005-07-05 Sbc Properties, L.P. Interactive dialog-based training method
US8023636B2 (en) 2002-02-21 2011-09-20 Sivox Partners, Llc Interactive dialog-based training method
US20050170326A1 (en) * 2002-02-21 2005-08-04 Sbc Properties, L.P. Interactive dialog-based training method
US20030156706A1 (en) * 2002-02-21 2003-08-21 Koehler Robert Kevin Interactive dialog-based training method
US20040002163A1 (en) * 2002-04-15 2004-01-01 Ventana Medical Systems, Inc. Automated high volume slide staining system
US20050250211A1 (en) * 2002-04-15 2005-11-10 Kurt Reinhardt Automated high volume slide processing system
US11249095B2 (en) 2002-04-15 2022-02-15 Ventana Medical Systems, Inc. Automated high volume slide processing system
US11092611B2 (en) 2002-04-15 2021-08-17 Ventana Medical Systems, Inc. Automated high volume slide processing system
US10302665B2 (en) 2002-04-15 2019-05-28 Ventana Medical Systems, Inc. Automated high volume slide processing system
US9528918B2 (en) 2002-04-15 2016-12-27 Ventana Medical Systems, Inc. Automated high volume slide processing system
US8663991B2 (en) 2002-04-15 2014-03-04 Ventana Medical Systems, Inc. Automated high volume slide processing system
US20080038836A1 (en) * 2002-04-15 2008-02-14 Kurt Reinhardt Automated high volume slide staining system
US7303725B2 (en) 2002-04-15 2007-12-04 Ventana Medical Systems, Inc. Automated high volume slide staining system
US7468161B2 (en) 2002-04-15 2008-12-23 Ventana Medical Systems, Inc. Automated high volume slide processing system
US8048373B2 (en) 2002-04-15 2011-11-01 Ventana Medical Systems, Inc. Automated high volume slide staining system
US20050186114A1 (en) * 2002-04-15 2005-08-25 Kurt Reinhardt Automated high volume slide processing system
US20040230586A1 (en) * 2002-07-30 2004-11-18 Abel Wolman Geometrization for pattern recognition, data analysis, data merging, and multiple criteria decision making
US20070198553A1 (en) * 2002-07-30 2007-08-23 Abel Wolman Geometrization for pattern recognition, data analysis, data merging, and multiple criteria decision making
US20110093482A1 (en) * 2002-07-30 2011-04-21 Abel Wolman Geometrization For Pattern Recognition Data Analysis, Data Merging And Multiple Criteria Decision Making
US7885966B2 (en) 2002-07-30 2011-02-08 Abel Wolman Geometrization for pattern recognition, data analysis, data merging, and multiple criteria decision making
US8055677B2 (en) 2002-07-30 2011-11-08 Abel Gordon Wolman Geometrization for pattern recognition data analysis, data merging and multiple criteria decision making
US7222126B2 (en) 2002-07-30 2007-05-22 Abel Wolman Geometrization for pattern recognition, data analysis, data merging, and multiple criteria decision making
US8412723B2 (en) 2002-07-30 2013-04-02 Abel Wolman Geometrization for pattern recognition, data analysis, data merging, and multiple criteria decision making
US20040076313A1 (en) * 2002-10-07 2004-04-22 Technion Research And Development Foundation Ltd. Three-dimensional face recognition
US8155400B2 (en) 2002-10-07 2012-04-10 Technion Research & Development Foundation L' Facial recognition and the open mouth problem
US20050180613A1 (en) * 2002-10-07 2005-08-18 Michael Bronstein Facial recognition and the open mouth problem
WO2004032061A3 (en) * 2002-10-07 2004-05-06 Technion Res & Dev Foundation Three dimensional face recognition
WO2004032061A2 (en) * 2002-10-07 2004-04-15 Technion Research & Development Foundation Ltd. Three dimensional face recognition
US7421098B2 (en) 2002-10-07 2008-09-02 Technion Research & Development Foundation Ltd. Facial recognition and the open mouth problem
US20060251298A1 (en) * 2002-10-07 2006-11-09 Technion Research & Development Foundation Ltd. Three-dimensional face recognition
US20080292147A1 (en) * 2002-10-07 2008-11-27 Technion Research & Development Foundation Ltd. Facial recognition and the open mouth problem
US6947579B2 (en) * 2002-10-07 2005-09-20 Technion Research & Development Foundation Ltd. Three-dimensional face recognition
US7623687B2 (en) 2002-10-07 2009-11-24 Technion Research & Development Foundation Ltd. Three-dimensional face recognition
US7228658B2 (en) 2003-08-27 2007-06-12 Weyerhaeuser Company Method of attaching an end seal to manufactured seeds
US20050045089A1 (en) * 2003-08-27 2005-03-03 Edwin Hirahara Method of attaching an end seal to manufactured seeds
US20050069176A1 (en) * 2003-09-30 2005-03-31 Toland Mitchell R. General method of classifying plant embryos using a generalized Lorenz-Bayes classifier
US8691575B2 (en) 2003-09-30 2014-04-08 Weyerhaeuser Nr Company General method of classifying plant embryos using a generalized Lorenz-Bayes classifier
US20050105795A1 (en) * 2003-11-19 2005-05-19 Rita Singh Classification in likelihood spaces
US7305132B2 (en) 2003-11-19 2007-12-04 Mitsubishi Electric Research Laboratories, Inc. Classification in likelihood spaces
US20050108937A1 (en) * 2003-11-25 2005-05-26 Edwin Hirahara Method and system of manufacturing artificial seed coats
US20050114919A1 (en) * 2003-11-25 2005-05-26 Carlson William C. Combination end seal and restraint
US7131234B2 (en) 2003-11-25 2006-11-07 Weyerhaeuser Co. Combination end seal and restraint
US20050114918A1 (en) * 2003-11-25 2005-05-26 Edwin Hirahara System and method of embryo delivery for manufactured seeds
US20050108935A1 (en) * 2003-11-25 2005-05-26 Edwin Hirahara Method and system of manufacturing artificial seed coats
US20050108929A1 (en) * 2003-11-25 2005-05-26 Edwin Hirahara Method and system for creating manufactured seeds
US7555865B2 (en) 2003-11-25 2009-07-07 Weyerhaeuser Nr Company Method and system of manufacturing artificial seed coats
US7603807B2 (en) 2003-11-26 2009-10-20 Weyerhaeuser Nr Company Vacuum pick-up device with mechanically assisted release
US20050132436A1 (en) * 2003-12-11 2005-06-16 Carlson William C. Multi-embryo manufactured seed
US7356965B2 (en) 2003-12-11 2008-04-15 Weyerhaeuser Co. Multi-embryo manufactured seed
US7591287B2 (en) 2003-12-18 2009-09-22 Weyerhaeuser Nr Company System and method for filling a seedcoat with a liquid to a selected level
US20050133528A1 (en) * 2003-12-18 2005-06-23 Edwin Hirahara System and method for filling a seedcoat with a liquid to a selected level
US20060098803A1 (en) * 2003-12-18 2006-05-11 Sbc Knowledge Ventures, L.P. Intelligently routing customer communications
US20050135595A1 (en) * 2003-12-18 2005-06-23 Sbc Knowledge Ventures, L.P. Intelligently routing customer communications
US7751552B2 (en) 2003-12-18 2010-07-06 At&T Intellectual Property I, L.P. Intelligently routing customer communications
US7027586B2 (en) 2003-12-18 2006-04-11 Sbc Knowledge Ventures, L.P. Intelligently routing customer communications
US20050222828A1 (en) * 2004-04-02 2005-10-06 Ehtibar Dzhafarov Method for computing subjective dissimilarities among discrete entities
US7568309B2 (en) 2004-06-30 2009-08-04 Weyerhaeuser Nr Company Method and system for producing manufactured seeds
US20060032121A1 (en) * 2004-06-30 2006-02-16 Edwin Hirahara Method and system for producing manufactured seeds
US20060064930A1 (en) * 2004-09-27 2006-03-30 Carlson William C Manufactured seed having a live end seal coating
US7547488B2 (en) 2004-12-15 2009-06-16 Weyerhaeuser Nr Company Oriented strand board panel having improved strand alignment and a method for making the same
US20060127634A1 (en) * 2004-12-15 2006-06-15 Dimakis Alkiviades G Oriented strand board panel having improved strand alignment and a method for making the same
US10900982B2 (en) 2005-04-27 2021-01-26 Ventana Medical Systems, Inc. Automated high volume slide processing system
US11815518B2 (en) 2005-04-27 2023-11-14 Ventana Medical Systems, Inc. Automated high volume slide processing system
US20070000169A1 (en) * 2005-06-30 2007-01-04 Hartle Jeffrey E Method to improve plant somatic embryo germination from manufactured seed
US7654037B2 (en) 2005-06-30 2010-02-02 Weyerhaeuser Nr Company Method to improve plant somatic embryo germination from manufactured seed
US20070067212A1 (en) * 2005-09-21 2007-03-22 Eric Bonabeau System and method for aiding product design and quantifying acceptance
US8423323B2 (en) * 2005-09-21 2013-04-16 Icosystem Corporation System and method for aiding product design and quantifying acceptance
US10184862B2 (en) 2008-11-12 2019-01-22 Ventana Medical Systems, Inc. Methods and apparatuses for heating slides carrying specimens
US10429280B2 (en) 2008-11-12 2019-10-01 Ventana Medical Systems, Inc. Methods for heating microscope slides carrying specimens
US10520403B2 (en) 2008-11-12 2019-12-31 Ventana Medical Systems, Inc. Apparatuses for heating microscope slides carrying specimens
US11493410B2 (en) 2008-11-12 2022-11-08 Ventana Medical Systems, Inc. Methods for heating microscope slides carrying specimens
CN101799926A (en) * 2010-05-05 2010-08-11 福州大学 Automatically quantitative analysis system of Ki-67 immune-histochemical pathological image
US8874610B2 (en) 2011-12-06 2014-10-28 International Business Machines Corporation Pattern-based stability analysis of complex data sets
US10794805B2 (en) 2013-12-13 2020-10-06 Ventana Medical Systems, Inc. Automated histological processing of biological specimens and associated technology
US11614387B2 (en) 2013-12-13 2023-03-28 Ventana Medical Systems, Inc. Automated histological processing of biological specimens and associated technology

Similar Documents

Publication Publication Date Title
US5181259A (en) General method of pattern classification using the two domain theory
Agarwal et al. Learning to detect objects in images via a sparse, part-based representation
Torralba et al. Depth estimation from image structure
Carson et al. Blobworld: Image segmentation using expectation-maximization and its application to image querying
Bacus et al. Leukocyte pattern recognition
Oliva et al. Scene-centered description from spatial envelope properties
CN109952614A (en) The categorizing system and method for biomone
US20040054499A1 (en) System and method for identifying an object
CN110659665A (en) Model construction method of different-dimensional features and image identification method and device
Zhu et al. Segmentation assisted food classification for dietary assessment
CN109255289A (en) A kind of across aging face identification method generating model based on unified formula
Yang et al. Pathminer: a web-based tool for computer-assisted diagnostics in pathology
Fountain et al. Efficient rotation invariant texture features for content-based image retrieval
Lessmann et al. A method for linking computed image features to histological semantics in neuropathology
CN116580394A (en) White blood cell detection method based on multi-scale fusion and deformable self-attention
Salih et al. An improved content based image retrieval technique by exploiting bi-layer concept
Pun et al. Statistical structuring of pictorial databases for content-based image retrieval systems
Martínez et al. A new approach to object-related image retrieval
Hoogenboom et al. Face detection using local maxima
Rorvig et al. A new machine classification method applied to human peripheral blood leukocytes
Yunqi et al. Feature Description and Image Retrieval Based on Visual Attention Model.
Balas et al. Receptive field structures for recognition
Sanghavi et al. Content based image retrieval (cbir) system for diagnosis of blood related diseases
CN116612335B (en) Few-sample fine-granularity image classification method based on contrast learning
CN109543696A (en) A kind of image-recognizing method neural network based and its application

Legal Events

Date Code Title Description
AS Assignment

Owner name: NATIONAL AERONAUTICS AND SPACE ADMINISTRATION, THE

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST.;ASSIGNOR:RORVIG, MARK E.;REEL/FRAME:005459/0961

Effective date: 19900913

FPAY Fee payment

Year of fee payment: 4

FPAY Fee payment

Year of fee payment: 8

REMI Maintenance fee reminder mailed
LAPS Lapse for failure to pay maintenance fees
STCH Information on status: patent discontinuation

Free format text: PATENT EXPIRED DUE TO NONPAYMENT OF MAINTENANCE FEES UNDER 37 CFR 1.362

FP Lapsed due to failure to pay maintenance fee

Effective date: 20050119