Statistics

Statistics

List View
This is a published deck. Feel free look around, review cards, or make changes, but you'll need to clone it to save your progress.
What is the formula for a mean-derivation score?

xxˉ x - \bar{x}
New
What is the formula for a z-score?

z=xxˉs z = {x - \bar{x} \over s}
New
Which type of t-test would you use if you only wanted to see if two normal distributions were different?

a two-sided t-test, OR
a two-tailed t-test
New
How does a linear transformation affect the relationship between values?

The relationship between values does not change because the process changes each value in the same way.
New
What happens to the intercept if you center the independent variable in a linear model?

It becomes the predicted value for the mean of the independent variable.
New
What is a mean-derivation score?

It is a form of centering where each value in a distribution is subtracted from the mean.
xxˉ \mathrm{x} - \bar{x}
Values above the mean are positive and values below the mean are negative.
New
Why is it useful to have the intercept be the mean?

Because in many cases, an independent variable value of zero doesn't make sense and focusing on values above and below the mean is more productive.
New
What is the most common example of centering in statistical transformations?

subtracting out the mean of a distribution, or a mean-derivation score
New
What type of transformation is centering and why is it used?

It is the application of a linear transformation to give a particular meaning to the value of 0.
New
What is a linear transformation?

It is a transformation involving addition, subtraction, multiplication, or division using a constant value.
New
The intercept of a null model is equivalent to what parameter of which variable?

xˉ \bar{\mathrm{x}}
of the dependent variable
New
What does a given
R2 \mathrm{R}^2
value imply?

That much of the variation in the
y \mathrm{y}
distribution can be explained by
x \mathrm{x}
. Conversely, that
1R2 1 - \mathrm{R}^2
of the
y \mathrm{y}
distribution is not explained by
y \mathrm{y}
. This is also called its effect size.
New
Why is
R2 \mathrm{R}^2
useful?

It allows us to compare the magnitude of residuals (SSE) of models in a standardized manner.
New
What does
R2 \mathrm{R}^2
measure?

It measures how well a model fits the data.
New
What is the formula for
R2 \mathrm{R}^2
?

R2=1SSE1SSE0 R^2 = 1 - {{SSE_{1}\over {SSE}_{0}}}
New
Why is a null model useful?

It lets you compare a model to a standard which allows us to calculate a standardized measure of fit (
R2 \mathrm{R}^2
).
New
Why are squared errors useful?

It removes negative values and prevents them from canceling out positive values in a sum operation.
New
What is a null model, or zero-slope model?

It is a model that assumes there is no relationship between the two variables.
New
Why is it important for residuals to have the same variance throughout the range of
x \mathrm{x}
values?

Because otherwise would imply that the effect or slope is stronger for certain intervals of
x \mathrm{x}
and a linear regression model cannot tell us how or why that strength of relationship changes.
New
Why is it important for consecutive residuals to be independent from one another?

Because it lowers the accuracy of the model and implies that there is some other linear relationship that the model is not capturing.
New
What can you do if there is a clear relationship between two variables but they are not linear?

Apply a non-linear transformation to the independent or dependent variables; such as the log or square root.
New
What four assumptions must be true about the data in order to safely interpret a linear regression model?

  • the two factors have a linear relationship
  • the residuals are independent from each other
  • the residuals should also be normally distributed
  • the residuals must have constant variance throughout the regression line
New
What is the technical term for the the state in which the residuals of a linear regression model have constant variance?

homoscedasticity
New
In what way does adding a residual error term to the linear regression model,
y=b0+b1x+e \mathrm{y} = \mathrm{b}_0 + b _{1}x + e
, stochastic?

In the sense that it will not always return the same
y \mathrm{y}
-value for any given
x \mathrm{x}
New
In what way is the linear regression formula,
y=b0+b1x \mathrm{y} = \mathrm{b}_0 + b_{1}x
(without residuals), deterministic?

In the sense that it will always return the same
y \mathrm{y}
-value for any given
x \mathrm{x}
value.
New
What do residuals in a linear regression model represent?

the information left over after removing the effect of the independent variable, OR
the parts of
y \mathrm{y}
that cannot be explained by
x \mathrm{x}
New
What does it mean if a linear regression model predicts a dependent variable value for an independent variable value that is not possible?

It just means that the model can't be used for independent variable values outside of its trained range. That is normal.
New
In the context of linear regression models, what does the term extrapolation mean?

the prediction an independent variable outside the range of the trained model.
New
In the context of linear regression models, what does the term interpolation mean?

the prediction of an independent variable within the range of the trained model.
New
In the regression model formula
y=b0+b1x+e \mathrm{y} = \mathrm{b}_0 + b_{1}x + e
Which term is the residuals?

e e
New
In the regression model formula
y=b0+b1x+e \mathrm{y} = \mathrm{b}_0 + b_{1}x + e
Which term is the dependent variable?

y \mathrm{y}
New
In the regression model formula
y=b0+b1x+e \mathrm{y} = \mathrm{b}_0 + b_{1}x + e
Which term is the independent variable?

x \mathrm{x}
New
In the regression model formula
y=b0+b1x+e \mathrm{y} = \mathrm{b}_0 + b_{1}x + e
Which term is the slope?

b1 \mathrm{b}_1
New
In the regression model formula
y=b0+b1x+e \mathrm{y} = \mathrm{b}_0 + b_{1}x + e
Which term is the intercept?

b0 \mathrm{b}_0
New
What is the formula for a linear regression model?

y=b0+b1x+e \mathrm{y} = \mathrm{b}_0 + b_{1}x + e
New
What are the coefficients of a linear regression model?

the intercept and the slope
New
What is the intercept of a linear regression model?

It is the predicted
y \mathrm{y}
-value for
x=0 \mathrm{x} = 0
New
How is slope defined in terms of variables x and y?

The slope is the change in
y \mathrm{y}
for each unit in
x \mathrm{x}
.
New
If we believe that the variable
GPA \mathrm{GPA}
is affected by the variable
Family income \mathrm{Family\ income}
;
is
Family income \mathrm{Family\ income}
dependent or independent?

independent
New
If we believe that the variable
GPA \mathrm{GPA}
is affected by the variable
Family income \mathrm{Family\ income}
;
is
GPA \mathrm{GPA}
dependent or independent?

dependent
New
What word describes analyses that consider the relationship between two variables?

bivariate
New
What word describes analyses that only consider one variable?

univariate
New
When presenting a boxplot, what detail needs to be specified and reported?

the interpretation of the whiskers; i.e., they can represent the minimum and maximum values or they can represent an additional 1.5 times the IQR.
New
What point does the edge of the left or lower whiskers of a boxplot usually represent?

the first quartile minus 1.5 times the interquartile range
New
What point does the edge of the right or upper whiskers of a boxplot usually represent?

the third quartile plus 1.5 times the interquartile range
New
What is the interquartile range?

the difference between the third quartile and the first quartile
New
What percentage of data points does the area of a boxplot cover?

the middle 50 percent
New
What percentage of data points are higher than the right or top edge of a box plot?

the top 25 percent
New
What percentage of data points are lower than the right or top edge of a box plot?

the bottom 75 percent
New
What percentage of data points are higher than the left or bottom edge of a box plot?

the top 75 percent
New
What percentage of data points are lower than the left or bottom edge of a box plot?

the lower 25 percent
New
What does the symbol
s \mathrm{s}
represent in statistics?

It represents the observed standard deviation of a distribution.
New
What symbol is used to show the standard deviation of an observed distribution?

s \mathrm{s}
New
What does the symbol
xˉ \bar{\mathrm{x}}
represent?

the observed mean of a distribution
New
What symbol is used to represent the observed mean of a distribution?

xˉ \bar {\mathrm{x}}
New
What does the symbol
σ \sigma
represent in statistics?

the theoretical standard deviation of a population
New
What symbol is used to show the theoretical standard deviation of a population?

σ \sigma
New
What does the symbol
μ \mu
represent in statistics?

the theoretical mean of a population
New
What symbol is used to show the theoretical mean of a population?

μ \mu
New
Why would we use the median instead of the mean, or vice versa?

The median is less affected by extreme values whereas the mean is easily pulled higher or lower.
New
What should be reported when we write about a normal distribution?

the mean, standard deviation, and number of observations
New
What percentage of the middle section of a normal distribution is within 2 times the standard deviation of the mean?

95%
New
What percentage of the middle section of a normal distribution is within 1 standard deviation of the mean?

68%
New
What is the formula for a mean of a distribution?

xˉ=xN \bar x = {\sum{x} \over N}
New
What does this formula describe?
xˉ=xN \bar x = {\sum{x} \over N}

the mean of a distribution
New
What are the two parameters of a normal distribution?

the mean and the standard deviation
New
What does discrete mean in statistics?

It means there is a limited number of responses or outcomes.
New
This is an example of what kind of distribution?

a discrete uniform distribution
New
The output of rolls of a dice would be an example of which kind of distribution?

a discrete uniform distribution
New
How is a discrete uniform distribution different from a discrete normal distribution?

In a discrete uniform distribution, all elements of a finite set are equally probable; for example, a coin toss or dice roll. In a discrete normal distribution, elements are centered around the mean.
New
Keyboard shortcuts
Press
?
to toggle this panel.
Markdown formatting
You can format text using markdown.
H1
# Heading 1
H2
## Heading 2
H3
### Heading 3
New Side
---
Shift
Bold
**bold**
Ctrl
b
Italic
_italic_
Ctrl
i
Strikethrough
~~strikethrough~~
Highlight
==highlighted text==
Ctrl
h
Link
[link text](my-site.com)
https://my-site.com
Reference
[[card title|id]]
[[id]]
[[id:embed]]
@
Media
![title](Media URL)
Ctrl
o
List
- list item
Ctrl
.
Ordered List
1. list item
Quote
> quote
Definition List
term
: definition
Ctrl
d
Hidden text
{{text}}
Ctrl
Shift
l
Hidden text group
{{1::text}}
Ctrl
1
...
9
Code
`code`
Code Fence
```clj
(defn foo)
```
Ctrl
Shift
f
Inline Latex
$\LaTeX$
Display Latex
$$
\LaTeX
$$
Superscript
20^th^
Subscript
H~2~0
Ruby characters
{振り仮名}(ふがな)
Tag
#my-new-tag
Editor
These shortcuts are usable within the editor.
Save and exit
Ctrl
Enter
Exit without saving
Esc
Open reference
Ctrl
Click
New card (from selection)
Ctrl
n
Deck page
These shortcuts are usable on the deck page.
Add a card
n
Review cards that are due
r
Learn new cards
l
List view (select mode)
Shortcuts to move around, select, and edit cards on the list view.
Move selection
j
k
Move card up
Ctrl
k
/
Ctrl
Move card down
Ctrl
j
/
Ctrl
Cancel selection
Esc
Edit card
e
Preview card
Space
Delete card
d
Notebook view (select mode)
Shortcuts to move around, select, and edit cards on the notebook view.
Move selection
j
k
Move card up
Ctrl
k
/
Ctrl
Move card down
Ctrl
j
/
Ctrl
Cancel selection
Esc
Edit card
e
Preview card
Space
Delete card
d
Add card below
o
Add card above
Shift
o
Notebook view (edit mode)
Shortcuts to use while editing a card in notebook view.
Add a new side
Ctrl
-
Split card at cursor
Ctrl
Shift
Save and add a new card
Ctrl
Save and finish editing.
Esc
Deck page (column view)
Shortcuts to move around, select, and edit cards on the column view.
Move selection
j
k
Move card up
Ctrl
k
/
Ctrl
Move card down
Ctrl
j
/
Ctrl
Cancel selection
Esc
Edit card
e
Delete card
d
Review / Learn page
These shortcuts are usable on the review or 'learn new cards' pages.
Show next side of card
Space
j
Hide last side
k
Mark as "remembered"
Space
Skip sides and mark as "remembered"
Shift
Space
Mark as "forgot"
f
Archive card
a
Toggle Reverse Reviews
v
Edit card
e
Delete card
d
Replay audio
r
Pause/play audio
p
Miscellaneous
Various other shortcuts.
Exit modal
Esc
Save editor
Ctrl
Show this panel
?