File:Anscombe with text.svg

Summary

This graphic represents the four datasets defined by Francis Anscombe for which some of the usual statistical properties (mean, variance, correlation and regression line) are the same, even though the datasets are different.

Property                                     Value
Mean of each x variable                      9.0
Variance of each x variable                  11.0
Mean of each y variable                      7.5
Variance of each y variable                  4.12
Correlation between each x and y variable    0.816
Regression line                              y = 3 + 0.5x
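
The values in the table can be checked against the anscombe data frame that ships with R's datasets package. The following is only a minimal sketch; the helper name quartet_summary is hypothetical and introduced purely for illustration.

```r
# Summarise each of the four Anscombe pairs (x1, y1) ... (x4, y4).
# quartet_summary is a hypothetical helper name, not part of R.
quartet_summary <- function(d = datasets::anscombe) {
  rows <- lapply(1:4, function(i) {
    x <- d[[paste0("x", i)]]
    y <- d[[paste0("y", i)]]
    fit <- lm(y ~ x)  # ordinary least-squares regression of y on x
    data.frame(
      set       = i,
      mean_x    = mean(x),
      var_x     = var(x),   # sample variance (n - 1 denominator)
      mean_y    = mean(y),
      var_y     = var(y),
      cor_xy    = cor(x, y),
      intercept = unname(coef(fit)[1]),
      slope     = unname(coef(fit)[2])
    )
  })
  do.call(rbind, rows)
}

print(round(quartet_summary(), 3))
```

Each row of the result describes one dataset, so the near-identical columns can be compared directly with the table above.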

The graphic was created by User:Schutz for Wikipedia on 13 June 2006, using the R statistical project. The program that generated the graphic is given below; it is based on the example provided with the help page of the R dataset anscombe (accessible using the command help(anscombe)), and was slightly modified to improve the result. The graph was exported in postscript format, converted to SVG using the pstoedit command, and the layout was slightly modified using Inkscape before upload.

The SVG code is valid. This diagram was created with R. Because most of its text has been converted to paths, the file may be unusually large and its text cannot easily be translated.

This diagram is a retouched picture, which means that it has been digitally altered from its original version, Anscombe.svg. Modifications: text added from Anscombe's original publication. Modifications made by Papa Lima Whiskey.

References:

  • Anscombe, Francis J. (1973) Graphs in statistical analysis. American Statistician, 27, 17–21.
  • R Development Core Team. R: A Language and Environment for Statistical Computing. R Foundation for Statistical Computing. Vienna, Austria. 2006. ISBN 3-900051-07-0. http://www.R-project.org
 postscript("anscombe.ps")
 par(las=1)
 


Note: The original dataset is also available via library(alr3) and data(anscombe). See ?anscombe for the order in which the variables should be regressed (plotted). A simpler way to run the regressions mentioned above is to use code similar to:

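# car provides scatterplot(); alr3 is the package named above as a source of the data
require(car)
require(alr3)
# Plot y1 against x1 with a least-squares line, a smoother (span 0.5) and marginal boxplots
scatterplot(y1~x1, reg.line=lm, smooth=TRUE, labels=FALSE, boxplots='xy', span=0.5, data=anscombe)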
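
The call above covers only the first pair (y1 against x1). As a rough sketch of how all four regressions could be fitted and drawn in one figure with base R alone, loosely following the loop idea of the example on the help(anscombe) page: the function name plot_quartet, the axis limits and the colours are assumptions made for illustration, and this is not the program that produced this file.

```r
# Fit and plot all four Anscombe regressions in a 2 x 2 grid.
# plot_quartet is a hypothetical helper; limits and colours are arbitrary choices.
plot_quartet <- function(d = datasets::anscombe) {
  old <- par(mfrow = c(2, 2), las = 1, mar = c(4, 4, 1, 1))
  on.exit(par(old))  # restore the previous graphics settings on exit
  fits <- lapply(1:4, function(i) {
    # Build the formula y_i ~ x_i for the i-th dataset
    f <- as.formula(paste0("y", i, " ~ x", i))
    fit <- lm(f, data = d)
    # Shared axis limits make the four panels directly comparable
    plot(f, data = d, pch = 19, xlim = c(3, 19), ylim = c(3, 13))
    abline(fit, col = "blue")  # least-squares regression line
    fit
  })
  invisible(fits)
}

fits <- plot_quartet()
sapply(fits, coef)  # one column of (intercept, slope) per dataset
```

Returning the fitted models invisibly keeps the plotting call quiet while still allowing the coefficients to be inspected afterwards.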

Licensing

The R project is licensed under the GPL; since this image is a derived work, it is also licensed under the GPL.

This work is free software; you can redistribute it and/or modify it under the terms of the GNU General Public License as published by the Free Software Foundation; either version 2 of the License, or any later version. This work is distributed in the hope that it will be useful, but without any warranty; without even the implied warranty of merchantability or fitness for a particular purpose. See version 2 and version 3 of the GNU General Public License for more details.

Category:GPL Category:Regression analysis Category:Media missing infobox template