stats
Usage:
stats [<item> ...]
Calculates statistics on the data in the table. This filter turns the table sideways, so that each row of the output corresponds to a column of the input. The columns of the output table contain statistical items such as mean, standard deviation etc corresponding to each column of the input table.
By default the output table contains columns for the following items:
Name
: Column name
Mean
: Average
StDev
: Population Standard deviation
Minimum
: Numeric minimum
Maximum
: Numeric maximum
NGood
: Number of non-blank cells
However, the output may be customised by supplying one or more
<item>
headings. These may be selected
from the above as well as the following:
NBad
: Number of blank cells
Variance
: Population Variance
SampStDev
: Sample Standard Deviation
SampVariance
: Sample Variance
MedAbsDev
: Median Absolute Deviation
ScMedAbsDev
: Median Absolute Deviation * 1.4826
Skew
: Gamma 1 skewness measure
Kurtosis
: Gamma 2 peakedness measure
Sum
: Sum of values
MinPos
: Row index of numeric minimum
MaxPos
: Row index of numeric maximum
Cardinality
: Number of distinct values in column; values >100 ignored
Median
: Middle value in sequence
Quartile1
: First quartile
Quartile2
: Second quartile
Quartile3
: Third quartile
Q.25
: First quartile
Q.625
: Fifth octile
Any parameters of the input table are propagated to the output one.
Note that quantile calculations (including median and quartiles) can be expensive on memory. If you want to calculate quantiles for large tables, it may be wise to reduce the number of columns to only those you need the quantiles for earlier in the pipeline. No interpolation is performed when calculating quantiles.