Scagnostics

Scatterplot matrix of the scagnostics measures for the 91 scatterplots formed by pairing the 14 variables of the Boston Housing data set

Scagnostics (scatterplot diagnostics) is a set of measures that characterize properties of the point cloud in a scatter plot. The term and the idea were coined by John Tukey and Paul Tukey, who did not publish them; the approach was later elaborated by Wilkinson, Anand, and Grossman. The following nine measures are considered:[1][2]

  1. For the outliers in the data:
    1. outlying
  2. For the density of data points:
    1. skewed
    2. clumpy
    3. sparse
    4. striated
  3. For the shape of the point cloud:
    1. convex
    2. skinny
    3. stringy
  4. For trends in the data:
    1. monotonic
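
As an illustration of how one of these measures can be computed, the trend measure in the list above is commonly defined as the squared Spearman rank correlation of the two plotted variables, giving a value between 0 and 1. The following is a minimal sketch under that assumption, not the reference implementation. The function names (`average_ranks`, `pearson`, `monotonic_score`) are hypothetical and chosen for illustration only. The other eight measures rely on geometric graphs built over the points and are not attempted here.

```python
from math import sqrt


def average_ranks(values):
    """Return 1-based ranks; tied values share the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with the value at position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def pearson(a, b):
    """Pearson correlation; assumes neither sequence is constant."""
    n = len(a)
    mean_a = sum(a) / n
    mean_b = sum(b) / n
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    return cov / sqrt(var_a * var_b)


def monotonic_score(xs, ys):
    """Squared Spearman rank correlation (assumed definition of the measure)."""
    return pearson(average_ranks(xs), average_ranks(ys)) ** 2


# A strictly increasing but non-linear relation still scores as fully monotonic.
increasing = monotonic_score([1, 2, 3, 4, 5], [1, 4, 9, 16, 25])
# A symmetric rise and fall has no overall monotonic trend.
peaked = monotonic_score([1, 2, 3, 4, 5], [1, 2, 3, 2, 1])
```

Because the score is computed on ranks rather than raw values, it is unaffected by any strictly increasing transformation of either variable, and squaring makes increasing and decreasing trends score alike.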

References

  1. ^ Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).
  2. ^ Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).