Data Levelling
A: In the PCA calculation the default is to use multivariate data. This means that there must be a valid entry for every variable used in the PCA calculation. This error message indicates that there are no samples that contain a valid entry for all of variables used in the calculation. To check whether the samples you wish to use are multivariate or not use the “Apply Attributes to all Rows (Selected Data Not Null)” option under the Validate menu. You must select an attribute colour first in the Attribute Manager which will then be applied to all of the data points that contain a valid data entry for all the variables selected in the Select Variables dialog. For more information view the “Identify Multivariate Samples” training video which will enable you see which samples are multivariate for the selected elements and which ones are not.
A: This can happen when using poor quality data which contains a lot of samples with the same value for a variable or with data that is heavily quantized.
A: ioGAS has a column limit of 253.
A: An overview of the mechanism used in Gauss levelling calculation can be found in the Help file under the Data Levelling section. A more detailed explanation is as follows:
- The data is ranked.
- The rank value is made into a p value using p = (Rank + 0.5)/size
- The inverse cumulative probability (z) is calculated using this implementation http://commons.apache.org/math/apidocs/org/apache/commons/math/distribution/NormalDistributionImpl.html
- These ‘z’ values are placed in a new column in ioGAS.
This is done one group at a time, and each group has no effect on any other. Small groups are ignored, so the output remains null for these groups.