HairEyeColor {base} | R Documentation |
Distribution of hair and eye color and sex in 592 statistics students.
data(HairEyeColor)
A 3-dimensional array resulting from cross-tabulating 592 observations on 3 variables. The variables and their levels are as follows:
No | Name | Levels |
1 | Hair | Black, Brown, Red, Blond |
2 | Eye | Brown, Blue, Hazel, Green |
3 | Sex | Male, Female |
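The structure listed in the table can be checked directly from the R prompt. This is a minimal sketch that uses only base functions and assumes the data set is already available in the session (in older versions, after calling data(HairEyeColor)):

```r
# Inspect the 3-dimensional array described in the table above.
dim(HairEyeColor)        # number of levels of Hair, Eye and Sex
dimnames(HairEyeColor)   # variable names with their levels
sum(HairEyeColor)        # total number of cross-tabulated observations
```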
This data set is useful for illustrating various techniques for the analysis of contingency tables, such as the standard chi-squared test or, more generally, log-linear modelling, and graphical methods such as mosaic plots, sieve diagrams or association plots.
Snee, R. D. (1974), Graphical display of two-way contingency tables. The American Statistician, 28, 9–12.
Friendly, M. (1992), Graphical methods for categorical data. SAS User Group International Conference Proceedings, 17, 190–200. http://hotspur.psych.yorku.ca/SCS/sugi/sugi17-paper.html
Friendly, M. (1992), Mosaic displays for loglinear models. Proceedings of the Statistical Graphics Section, American Statistical Association, pp. 61–68. http://hotspur.psych.yorku.ca/SCS/Papers/asa92.html
chisq.test, loglin, mosaicplot
data(HairEyeColor)
## Full mosaic
mosaicplot(HairEyeColor)
## Aggregate over sex:
x <- apply(HairEyeColor, c(1, 2), sum)
x
mosaicplot(x, main = "Relation between hair and eye color")
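
The chi-squared test and log-linear modelling mentioned earlier can be tried on the same data. This is an illustrative sketch, not part of the original examples. The object names hair_eye, ct and fit are hypothetical, and the particular log-linear model (Sex independent of the joint Hair and Eye distribution) is chosen here only as an example:

```r
## Collapse over sex to obtain the 4 x 4 hair-by-eye table.
hair_eye <- apply(HairEyeColor, c(1, 2), sum)

## Pearson chi-squared test of independence of hair and eye color.
ct <- chisq.test(hair_eye)
ct
ct$expected   # expected counts under independence

## Log-linear model [Hair Eye][Sex]: sex is assumed independent of the
## joint hair/eye distribution (an illustrative choice of model).
fit <- loglin(HairEyeColor, margin = list(c(1, 2), 3),
              fit = TRUE, print = FALSE)
fit$lrt       # likelihood-ratio statistic
fit$df        # residual degrees of freedom
pchisq(fit$lrt, fit$df, lower.tail = FALSE)   # approximate p-value
```

The margin argument lists the marginal tables that the model fits. A different model can be specified by changing that list, for example list(1, 2, 3) for mutual independence.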