1.6.9.6: References and reference cards
- Page ID
- 32684
\( \newcommand{\vecs}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)
\( \newcommand{\vecd}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash {#1}}} \)
\( \newcommand{\id}{\mathrm{id}}\) \( \newcommand{\Span}{\mathrm{span}}\)
( \newcommand{\kernel}{\mathrm{null}\,}\) \( \newcommand{\range}{\mathrm{range}\,}\)
\( \newcommand{\RealPart}{\mathrm{Re}}\) \( \newcommand{\ImaginaryPart}{\mathrm{Im}}\)
\( \newcommand{\Argument}{\mathrm{Arg}}\) \( \newcommand{\norm}[1]{\| #1 \|}\)
\( \newcommand{\inner}[2]{\langle #1, #2 \rangle}\)
\( \newcommand{\Span}{\mathrm{span}}\)
\( \newcommand{\id}{\mathrm{id}}\)
\( \newcommand{\Span}{\mathrm{span}}\)
\( \newcommand{\kernel}{\mathrm{null}\,}\)
\( \newcommand{\range}{\mathrm{range}\,}\)
\( \newcommand{\RealPart}{\mathrm{Re}}\)
\( \newcommand{\ImaginaryPart}{\mathrm{Im}}\)
\( \newcommand{\Argument}{\mathrm{Arg}}\)
\( \newcommand{\norm}[1]{\| #1 \|}\)
\( \newcommand{\inner}[2]{\langle #1, #2 \rangle}\)
\( \newcommand{\Span}{\mathrm{span}}\) \( \newcommand{\AA}{\unicode[.8,0]{x212B}}\)
\( \newcommand{\vectorA}[1]{\vec{#1}} % arrow\)
\( \newcommand{\vectorAt}[1]{\vec{\text{#1}}} % arrow\)
\( \newcommand{\vectorB}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)
\( \newcommand{\vectorC}[1]{\textbf{#1}} \)
\( \newcommand{\vectorD}[1]{\overrightarrow{#1}} \)
\( \newcommand{\vectorDt}[1]{\overrightarrow{\text{#1}}} \)
\( \newcommand{\vectE}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash{\mathbf {#1}}}} \)
\( \newcommand{\vecs}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)
\( \newcommand{\vecd}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash {#1}}} \)
\(\newcommand{\avec}{\mathbf a}\) \(\newcommand{\bvec}{\mathbf b}\) \(\newcommand{\cvec}{\mathbf c}\) \(\newcommand{\dvec}{\mathbf d}\) \(\newcommand{\dtil}{\widetilde{\mathbf d}}\) \(\newcommand{\evec}{\mathbf e}\) \(\newcommand{\fvec}{\mathbf f}\) \(\newcommand{\nvec}{\mathbf n}\) \(\newcommand{\pvec}{\mathbf p}\) \(\newcommand{\qvec}{\mathbf q}\) \(\newcommand{\svec}{\mathbf s}\) \(\newcommand{\tvec}{\mathbf t}\) \(\newcommand{\uvec}{\mathbf u}\) \(\newcommand{\vvec}{\mathbf v}\) \(\newcommand{\wvec}{\mathbf w}\) \(\newcommand{\xvec}{\mathbf x}\) \(\newcommand{\yvec}{\mathbf y}\) \(\newcommand{\zvec}{\mathbf z}\) \(\newcommand{\rvec}{\mathbf r}\) \(\newcommand{\mvec}{\mathbf m}\) \(\newcommand{\zerovec}{\mathbf 0}\) \(\newcommand{\onevec}{\mathbf 1}\) \(\newcommand{\real}{\mathbb R}\) \(\newcommand{\twovec}[2]{\left[\begin{array}{r}#1 \\ #2 \end{array}\right]}\) \(\newcommand{\ctwovec}[2]{\left[\begin{array}{c}#1 \\ #2 \end{array}\right]}\) \(\newcommand{\threevec}[3]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \end{array}\right]}\) \(\newcommand{\cthreevec}[3]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \end{array}\right]}\) \(\newcommand{\fourvec}[4]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \\ #4 \end{array}\right]}\) \(\newcommand{\cfourvec}[4]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \\ #4 \end{array}\right]}\) \(\newcommand{\fivevec}[5]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \\ #4 \\ #5 \\ \end{array}\right]}\) \(\newcommand{\cfivevec}[5]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \\ #4 \\ #5 \\ \end{array}\right]}\) \(\newcommand{\mattwo}[4]{\left[\begin{array}{rr}#1 \amp #2 \\ #3 \amp #4 \\ \end{array}\right]}\) \(\newcommand{\laspan}[1]{\text{Span}\{#1\}}\) \(\newcommand{\bcal}{\cal B}\) \(\newcommand{\ccal}{\cal C}\) \(\newcommand{\scal}{\cal S}\) \(\newcommand{\wcal}{\cal W}\) \(\newcommand{\ecal}{\cal E}\) \(\newcommand{\coords}[2]{\left\{#1\right\}_{#2}}\) \(\newcommand{\gray}[1]{\color{gray}{#1}}\) \(\newcommand{\lgray}[1]{\color{lightgray}{#1}}\) \(\newcommand{\rank}{\operatorname{rank}}\) \(\newcommand{\row}{\text{Row}}\) \(\newcommand{\col}{\text{Col}}\) \(\renewcommand{\row}{\text{Row}}\) \(\newcommand{\nul}{\text{Nul}}\) \(\newcommand{\var}{\text{Var}}\) \(\newcommand{\corr}{\text{corr}}\) \(\newcommand{\len}[1]{\left|#1\right|}\) \(\newcommand{\bbar}{\overline{\bvec}}\) \(\newcommand{\bhat}{\widehat{\bvec}}\) \(\newcommand{\bperp}{\bvec^\perp}\) \(\newcommand{\xhat}{\widehat{\xvec}}\) \(\newcommand{\vhat}{\widehat{\vvec}}\) \(\newcommand{\uhat}{\widehat{\uvec}}\) \(\newcommand{\what}{\widehat{\wvec}}\) \(\newcommand{\Sighat}{\widehat{\Sigma}}\) \(\newcommand{\lt}{<}\) \(\newcommand{\gt}{>}\) \(\newcommand{\amp}{&}\) \(\definecolor{fillinmathshade}{gray}{0.9}\)There are oceans of literature about statistics, about R and about both. Below is a small selection of publications which are either mentioned in the text, or could be really useful (as we think) to readers of this book.
2em-2em1ex
Cleveland W. S. 1985. The elements of graphing data. Wandsworth Advanced Books and Software. 323 p.
Crawley M. 2007. R Book. John Whiley & Sons. 942 p.
Dalgaard P. 2008. Introductory statistics with R. 2 ed. Springer Science Business Media. 363 p.
Efron B. 1979. Bootstrap Methods: Another Look at the Jackknife. Ann. Statist. 7(1): 1–26.
Gonick L., Smith W. 1993. The cartoon guide to statistics. HarperCollins. 230 p.
Kaufman L., Rousseeuw P. J. 1990. Finding groups in data: an introduction to cluster analysis. Wiley-Interscience. 355 p.
Kimble G. A. 1978. How to use (and misuse) statistics. Prentice Hall. 290 p.
Li Ray. Top 10 data mining algorithms in plain English. URL: http://rayli.net/blog/data/top-10-data-mining-algorithms-in-plain-english/
Li Ray. Top 10 data mining algorithms in plain R. URL: http://rayli.net/blog/data/top-10-data-mining-algorithms-in-plain-r/
Marriott F. H. C. 1974. The interpretation of multiple observations. Academic Press. 117 p.
McKillup S. 2011. Statistics explained. An introductory guide for life scientists. Cambridge University Press. 403 p.
Murrell P. 2006. R Graphics. Chapman & Hall/CRC. 293 p.
Petrie A., Sabin C. 2005. Medical statistics at a glance. John Wiley & Sons. 157 p.
R Development Core Team. R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria.
Rowntree D. 2000. Statistics without tears. Clays. 195 p.
Sokal R. R., Rolf F. J. 2012. Biometry. The principles and practice of statistics in biological research. W.H. Freeman and Company. 937 p.
Sprent P. 1977. Statistics in Action. Penguin Books. 240 p.
Tukey J. W. 1977. Exploratory Data Analysis. Pearson. 688 p.
Venables W. N., Ripley B. D. 2002. Modern applied statistics with S. 4th ed. Springer. 495 p.
Happy Data Analysis!
And just a reminder: if you use R and like it, do not forget to cite it. Run citation()
command to see how.
Reference cards are attached to the very end of the book. They have a different page format, more suitable for printing. The first one was is actually one-page “cheatsheet”; we recommend to print is and use while you learn R.
-
There is however the
SOAR
package which overrides this behavior.↩ -
If you do not use these managers or centers, it is recommended to regularly update your R, at least once a year.↩
-
There is command
Xpager()
in theasmisc.r
collection of commands, it allows to see help in the separate window even if you work in terminal.↩ -
Within parentheses immediately after example, we are going to provide comments.↩
-
By the way, on Linux systems you may exit R also with
Ctrl+D
key, and on Windows withCrtl+Z
key.↩ -
Usually, small exercises are boldfaced.↩
-
By the way, if you want the Euler number, \(e\), type
exp(1)
.↩ -
And also like editor which is embedded into R for Windows or into RmacOS GUI, or the editor from
rite
R package, but not office software like MS Word or Excel!↩ -
Yet another possibility is to set working directory in preferences (this is quite different between operating systems) but this is not the best solution because you might (and likely will) want different working directories for different tasks.↩
-
There is
rio
package which can determine the structure of data.↩ -
Again, download it from Internet to
data
subdirectory first. Alternatively, replace subdirectory with URL and load it into R directly—of course, after you check the structure.↩ -
On macOS, type
Enter
twice.↩ -
With commands
dput()
anddget()
, R also saves and loads textual representations of objects.↩ -
This is a bit similar to the joke about mathematician who, in order to boil the kettle full with water, would empty it first and therefore reduce the problem to one which was already solved!↩
-
If, by chance, it started and you have no idea how to quit, press uppercase
ZQ
.↩ -
Within nano, use
Ctrl+O
to save your edits andCtrl+X
to exit.↩ -
Does not work on graphical macOS.↩
-
Under graphical macOS, this command is not accessible, and you need to use application menu.↩
-
You can also use
savehistory()
command to make a “starter” script.↩ -
On Windows and macOS, this will open internal editor; on Linux, it is better to set
editor
option manually, e.g.,file.edit("hello.r", editor="geany")
.↩ -
The better term is generic command.↩
-
Cleveland W. S., McGill R. 1985. Graphical perception and graphical methods for analyzing scientific data. Science. 229(4716): 828–833.↩
-
lattice
came out of later ideas of W.S. Cleveland, trellis (conditional) plots (see below for more examples).↩ -
ggplot2
is now most fashionable R graphic system. Note, however, that it is based on the different “ideology” which related more with SYSTAT visual statistic software and therefore is alien to R.↩ -
By the way, both PDF and SVG could be opened and edited with the freely available vector editor Inkscape.↩
-
Collection
gmoon.r
has game-like commandMiney()
, based onlocator()
; it partly imitates the famous “minesweeper” game.↩ -
In the case of our
eggs
data frame, the command of second style would beplot(eggs[, 1:2])
orplot(eggs$V1, eggs$V2)
, see more explanations in the next chapter.↩ -
Another variant is to use high-level
scatter.smooth()
function which replacesplot()
. Third alternative is a cubic smoothersmooth.spline()
which calculates numbers to use withlines()
.↩ -
Discrete measurement data are in fact more handy to computers: as you might know, processors are based on 0/1 logic and do not readily understand non-integral, floating numbers.↩
-
For unfamiliar words, please refer to the glossary in the end of book.↩
-
By default,
Ls()
does not output functions. If required, this behavior could be changed withLs(exclude="none")
.↩ -
In fact, columns of data frames might be also matrices or other data frames, but this feature is rarely useful.↩
-
There is also
hexbin
package which used hexagonal shapes and color shading.↩ -
Package
DescTools
has the handyMode()
function to calculate mode.↩ -
While it is possible to run here a cycle using
for
operator,apply
-like functions are always preferable.↩ -
In the book, we include minimum and maximum into quartiles.↩
-
Note that these options must be set a priori, before you run the test. It is not allowed to change alternatives in order to find a better p-values.↩
-
Look also into the end of this chapter.↩
-
There is a workaround though, robust rank order test, look for the function
Rro.test()
in theasmisc.r
.↩ -
Bennett C.M., Wolford G.L., Miller M.B. 2009. The principled control of false positives in neuroimaging. Social cognitive and affective neuroscience 4(4): 417–422, https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2799957/↩
-
Like it is implemented in the
ARTool
package; there also possible to use multi-way nonparametric designs.↩ -
Fisher R.A. 1971. The design of experiments. 9th ed. P. 11.↩
-
Mendel G. 1866. Versuche über Pflanzen-Hybriden. Verhandlungen des naturforschenden Vereines in Brünn. Bd. 4, Abhandlungen: 12. http://biodiversitylibrary.org/page/40164750↩
-
Yates F. 1934. Contingency tables involving small numbers and the \(\chi^2\) test. Journal of the Royal Statistical Society. 1(2): 217–235.↩
-
There are, however, advanced techniques with the goal to understand the difference between causation and correlation: for example, those implemented in
bnlearn
package.↩ -
Function
Cladd()
is applicable only to simple linear models. If you want confidence bands in more complex cases, check theCladd()
code to see what it does exactly.↩ -
Fisher R.A. 1936. The use of multiple measurements in taxonomic problems. Annals of Eugenics. 7(2): 179–188.↩
-
Package
Boruta
is especially god for all relevant feature selection.↩ -
For example, “Encyclopedia of Distances” (2009) mentions about 1,500!↩
-
Emphasis mine.↩
-
With command
source("http://ashipunov.info/r/gmoon.r")
.↩ -
To know which symbols are available, run
demo(Hershey)
.↩ -
Linux users might want to add option
editor=
.↩ -
Package
lintr
containslint()
command which checks R scripts.↩ -
There is, by the way, a life-hack for lazy reader: all plots which you need to make yourself are actually present in the output PDF file.↩
-
Among text editors, Geany is one of the most universal, fast, free and works on most operation systems.↩
-
Thompson D. W. 1945. On growth and form. Cambridge, New York. 1140 pp.↩
-
Rohlf F.J. tpsDig. Department of Ecology and Evolution, State University of New York at Stony Brook. Freely available at http://life.bio.sunysb.edu/morph/↩
-
Actually,
geomorph
package is capable to digitize images withdigitize2d()
function but it works only with JPEG images.↩