This is an old revision of the document!
Classification
Classification or cluster analysis groups similar objects in distinct clusters (in the EM Field usually called classes) while minimizing the variance (of a certain parameter) within one cluster/class. During image processing, classification is used to find similar 2D images within a dataset.
Usage
A prerequisite for a feasible classification is a complexity reduction of the given input dataset. This can be achieved by performing a principle component analysis (PCA). The PCA can be carried out either internally by the classification-logic or beforehand by the respective PCA logic. Additionally, the user has to define the input set and the number of expected classes/clusters. Now, the logic will split the dataset according to the information provided by the PCA into as many classes/clusters as determined aiming for minimal variance within each class/cluster.
Example
Modes
PCA
This classification mode uses an internal/external PCA to reduce the dataset's complexity before splitting it into a defined number of classes.
Parameters | Description |
---|---|
Eigen images location | |
→ Number of eigen images | |
Split up method | Determine, whether large classes should be slip up into smaller classes due to their number of containing objects (size) or due to their high internal variance of the cross-correlation-coefficients (cccVariance) |
Number of classes | Number of resulting classes/clusters |
Remove duplicated images | ![]() |
Input | Description |
---|---|
Input | Stack of input images |
Pre. Eigen Images | (Only available, if Eigen images location = extern) Stack of Eigen images with the sum of all images and a mask as the last two images |
Output | Description |
---|---|
output | Stack of all images with added/altered classID header information |
sums | Stack of one image per class/cluster, which represents the average of all images within that class/cluster |
Output | Description |
---|---|
ClassID | ID of the class/cluster, the image belongs to |
Concept
Once clusters/classes of images are found, they can be averaged (see SumByClassNumber) to improve the signal-to-noise-ration dramatically.1)