t-tests

class: center, middle, inverse, title-slide

.title[
# t-tests 
]
.subtitle[
## Psychological Research Methods: Data Management & Analysis 
]
.author[
### Monica Truelove-Hill
]
.institute[
### Department of Clinical Psychology The University of Edinburgh
]

---

## This Week's Key Topics

+ Types of `$t$`-tests and their assumptions

+ Interpreting and reporting the results of a `$t$`-test

+ Computing and interpreting effect sizes for `$t$`-tests

+ Conducting a power analysis for a `$t$`-test

---
## Differences between two means

** Are these means different? (Yes/No/It Depends)**

.pull-left[
![](Week04_ttests_lecture_files/figure-html/unnamed-chunk-2-1.svg)

]

.pull-right[

![](Week04_ttests_lecture_files/figure-html/unnamed-chunk-3-1.svg)

]

---

## Differences between two means

** Are these means different? (Yes/No/It Depends)**

.pull-left[
![](Week04_ttests_lecture_files/figure-html/unnamed-chunk-4-1.svg)

]

.pull-right[

![](Week04_ttests_lecture_files/figure-html/unnamed-chunk-5-1.svg)

]

---
## The `$t$`-test

+ `$t$`-tests take account of both the mean and the variance of the data to determine whether two means are significantly different from each other.

+ `$t$`-tests produce a `$t$`-statistic that reflects the standardised difference between two mean values

+ Generally used to test the difference between two means

+ **One-sample `$t$`-tests** test the difference between a sample mean and a known mean
  
  + **Independent-sample `$t$`-tests** test the difference between two separate sample means
  
  + **Paired-samples `$t$`-tests** test the difference in a single sample at two points in time or across different conditions
  
--

+ Because of this, all `$t$`-tests require:

+ A continuous dependent variable
  
  + A categorical (binary) independent variable

---
## The `$t$`-test

+ Running a `$t$`-test involves:

+ Computing `$t$`
  
  + Calculating the probability of obtaining our value of `$t$` if the null were true

+ Using this probability to make a decision whether to reject the null hypothesis
  
---
## Calculating `$t$`

+ Although there are 3 types of `$t$`-tests, they generally involve some form of the following calculation:

`$$t = \frac{Observed\ Mean(s) - What\ we\ expect\ if\ null\ is\ true}{Some\ combination\ of\ the\ SE}$$`

---
## One-sample `$t$`-test

`$$t = \frac{\bar{x}-\mu_0}{SE_\bar{x}}$$`
+ Compare a sample mean ( `$\bar{x}$` ) and a known mean ( `$\mu_0$` )

+ Used to check whether your sample's mean is significantly different from a known value

+ **Example research questions:**

+ Are speeds along Road A significantly higher than the speed limit?
  
  + Do a beauty product's results last longer than 3 months?

---
## Independent-samples `$t$`-test

`$$t = \frac{(\bar{x_1}-\bar{x_2})-(\mu_1 - \mu_2)}{\sqrt{\frac{s_p^2}{n_1}+\frac{s_p^2}{n2}}}$$`
`$s_p^2$`: a pooled variance estimate that weights variance by sample size

+ Compares the mean of group 1 ( `$\bar{x_1}$` ) to the mean of group 2 ( `$\bar{x_2}$` )

+ Used to determine whether there is a difference between two independently measured groups

+ **Example research questions:**

+ Are there differences in symptoms between the treatment group and the group that received a placebo?
  
  + Do inmates who are put in solitary confinement exhibit more aggressive behaviour than those who are not?
  
  + Are there differences in TBI rate between boxers and Muay Thai fighters?
  
---
## Paired-samples `$t$`-test

`$$t = \frac{\bar{d}-\mu_d}{SE_\bar{d}}$$`

+ where:

+ `$\bar{d}$`: the mean of the individual difference scores (e.g. `$d_i = x_{i1} - x_{i2}$`)

+ `$\mu_d$`: The hypothesised null mean difference

+ `$SE_\bar{d} = \frac{sd_\bar{d}}{\sqrt{n}}$`

+ Compares the mean of the same group of participants across two timepoints/conditions

+ **Example research questions:**

+ Does time of day affect one's attention span?
  
  + Is a new fitness routine effective in improving one's cardiovascular health?

---
## The `$t$` distribution

.pull-left[
+ The `$t$`-distribution is the null distribution against which our calculated `$t$`-value is compared.

+ The shape of the `$t$`-distribution depends on the degrees of freedom ( `$df$` ) of our data.
]

.pull-right[

![](Week04_ttests_lecture_files/figure-html/unnamed-chunk-6-1.svg)

]

---
## Degrees of freedom

.pull-left[

+ The number of values within the data that are free to vary

+ df = items within dataset - parameters to be estimated

]

???

Imagine we have a set of 3 values and we know that the mean is 4. In this case, the parameter we wish to estimate is the mean, so there is 1 parameter. We have 3 values within our dataset. In this case, the degrees of freedom in this set is 2, because 3-1 is 2. To understand what this means in terms of 'free to vary', if we know the mean is 4, Value 1 could be anything...it could be 3 or 8, or 12. Value 2 could be the same, any value at all. But the third value is fixed, because it has to be whatever makes the mean equal to 4. So Value 1 and 2 are free to vary, but the 3rd value has a constraint, which is that it must be equal to whatever value makes the mean equal to 4.

---

## Degrees of freedom

.pull-left[

+ The number of values within the data that are free to vary

+ df = items within dataset - parameters to be estimated

+ The number of parameters we estimate depends on the type of `$t$`-test

]

.pull-right[
+ **One-sample `$t$`-test:** we estimate the mean of our data 
 + `$df = n - 1$`
 
+ **Independent samples `$t$`-test:** we estimate the mean of each of our two groups 
 + `$df = n - 2$`
 
+ **Paired-samples `$t$`-test:** we estimate the mean difference between observations 
 + `$df = n - 1$`
]

---
## The `$t$` distribution

.pull-left[
+ The `$t$`-distribution is the null distribution against which our calculated `$t$`-value is compared.

+ The shape of the `$t$`-distribution depends on the degrees of freedom ( `$df$` ) of our data.

+ Consider how the shape of the distribution changes as `$df$` increases

**Test your Understanding:** What does the change in shape suggest?

]

.pull-right[

![](Week04_ttests_lecture_files/figure-html/unnamed-chunk-7-1.svg)

]

---
## Putting it all together

.pull-left[
Basically:

1) Calculate your `$t$`-statistic

2) Identify the proper null distribution using `$df$`

3) Using this distribution, compute the probability of getting a `$t$`-statistic at least as extreme as yours

]

.pull-right[
![](Week04_ttests_lecture_files/figure-html/unnamed-chunk-8-1.svg)
]

---
class: center, inverse, middle

## Questions?

---

### Conducting a `$t$`-test...or really, most inferential statistical tests

1. State your hypothesis

2. Conduct a power analysis

3. Check your data (visualisations/descriptives)

4. Check assumptions

5. Run the test

6. Calculate the effect size/confidence intervals

7. Interpret results

8. Report

---

## A note about hypotheses...

+ Hypotheses may be directional, or make a prediction about the expected direction of the effect
  
  + Treatment X will **reduce** symptoms of anxiety
  
  + Test scores will **improve** as hour of study increase

+ Hypotheses can also be nondirectional. In this case, an effect is predicted, but the direction is not specified.

+ Treatment X will **have an effect** on symptoms of anxiety
  
  + Test scores will **be associated with** hours of study

+ Directional hypotheses can be tested with one-tailed tests, while nondirectional hypotheses can be tested with two-tailed tests

---
## One-Tailed vs Two-Tailed Tests

.pull-left[
.center[**One-Tailed**: hypothesis is *directional*

![](Week04_ttests_lecture_files/figure-html/unnamed-chunk-9-1.svg)

![](Week04_ttests_lecture_files/figure-html/unnamed-chunk-10-1.svg)
]]

.pull-left[
.center[**Two-Tailed**: hypothesis is *nondirectional*]

![](Week04_ttests_lecture_files/figure-html/unnamed-chunk-11-1.svg)
]

---
## Assumptions of `$t$`-tests

+ **Normality:** Dependent data should be normally distributed
  
--

.pull-left[
.center[**Normally Distributed**]
![](Week04_ttests_lecture_files/figure-html/unnamed-chunk-12-1.svg)
]

.pull-right[
.center[**Not so normally distributed**]
![](Week04_ttests_lecture_files/figure-html/unnamed-chunk-13-1.svg)![](Week04_ttests_lecture_files/figure-html/unnamed-chunk-13-2.svg)
]
---
## Assumptions of `$t$`-tests

+ **Normality:** Dependent data should be normally distributed
  
+ **Independence:** Observations should be sampled independently

+ Independent-samples `$t$`-tests also require **Homogeneity of Variance:** Equal variance between the two groups

.pull-left[
.center[**Equal Variance**]
![](Week04_ttests_lecture_files/figure-html/unnamed-chunk-14-1.svg)
]

.pull-right[
.center[**Not So Equal Variance**]
![](Week04_ttests_lecture_files/figure-html/unnamed-chunk-15-1.svg)
]

---

## Effect size - Cohen's `$d$`

+ Cohen's `$d$` is the standardized difference between the means

+ in other words, how many standard deviations separate the two values?

`$$d = \frac{\bar{x}_1 -\bar{x}_2}{s_{pooled}}$$`

.pull-left[
+ `$\bar{x}_1$` = mean of group 1
+ `$\bar{x}_2$` = mean of group 2
+ `$s_{pooled}$` = pooled standard deviation
]

.pull-right[

`$$s_{pooled} = \sqrt{\frac{(n_1-1)s_1^2\ + (n_2-1)s_2^2}{n_1+n_2-2}}$$`

]

---

## Interpretation of Cohen's `$d$`

+ Proposed by Cohen (1998):

---
exclude: true
## Effect size - Cohen's `$d$`

+ Works best for larger samples...for smaller samples (<50), it overinflates results **MONICA, VERIFY THIS**

---
## Checking Power

+ Recall that power usually involves some combination of the following values:
  
  + `$\alpha$`
  + Effect Size
  + Power
  + Sample Size
  
+ In almost all cases, we will set `$\alpha = .05$` and `$Power = .8$`

+ For `$t$`-tests, we'll use **Cohen's `$d$` ** as our effect size

---
class: middle, inverse, center

## Questions?

---
## Running an Independent-Samples `$t$`-Test

**Step 1: State Your Hypotheses**

.pull-left[.center[ Null Hypothesis ]
`$$H_0: \bar{x_1} = \bar{x_2}$$`
]

.pull-right[.center[ Alternative Hypothesis ]
 Nondirectional

`$$H_1: \bar{x_1} \neq \bar{x_2}$$`

Directional

`$$H_1: \bar{x_1} > \bar{x_2}$$`
`$$H_1: \bar{x_1} < \bar{x_2}$$`
]

> **Test Your Understanding:** If your research question is 'Are there differences in systolic blood pressure between the treatment group and the group that received a placebo?', what is your null hypothesis? What is your alternative hypothesis?

> Is this a directional or nondirectional hypothesis? Do we need a one-tailed or two-tailed test?

---
## Running an Independent-Samples `$t$`-Test

**Step 2: Conduct a Power Analysis**

+ [WebPower](https://webpower.psychstat.org/wiki/models/index)

+ Let's check the sample required to achieve 80% power to detect a moderate effect ( `$d$` = .5) with `$\alpha$` = .05.

+ Remember, our research question is 'Are there differences in systolic blood pressure between the treatment group and the group that received a placebo?'

---
## Running an Independent-Samples `$t$`-Test

**Step 3: Check your data**

+ Compute descriptive statistics

+ Look at relevant plots
  
--

+ Let's do this in SPSS using [these data](https://mtruelovehill.github.io/PRM/Data/BPdat.csv).

---
## Running an Independent-Samples `$t$`-Test

**Step 4: Check Assumptions**

+ Normality
  
  + Have a look at the histograms & QQ-plots
  + Check skewness and kurtosis values
  + **Do not** rely on tests of normality...these may not provide accurate results

+ Independence of Observations
 
  + Consider study design

+ Homogeneity of Variance
 
  + Typical suggestion is to use Levene's test; however, this may also be problematic (see [Delacre et al., 2017](https://rips-irsp.com/articles/10.5334/irsp.82))
  + Instead, may default to Welch's `$t$`-test

---
## Running an Independent-Samples `$t$`-Test

**Step 5: Run the test**

**Step 6: Calculate the effect size/confidence intervals**

Let's continue in SPSS...

---
## Running an Independent-Samples `$t$`-Test

**Step 7: Interpret results**

![](images/BPDesc.png)

---
## Running an Independent-Samples `$t$`-Test

**Step 7: Interpret results**

![](images/BPttest.png)

.center[
![](Week04_ttests_lecture_files/figure-html/unnamed-chunk-18-1.svg)
]

---
## Running an Independent-Samples `$t$`-Test

**Step 7: Interpret results**

.pull-left[
![](images/BPeffect.png)
]

.pull-right[

]

---
## Running an Independent-Samples `$t$`-Test

**Step 8: Report**

+ Type of test conducted

+ Variables tested

+ Descriptive data

+ Test results:
  + Test statistic ( `$t$`)
  + Degrees of freedom
  + `$p$`-value
  + Effect size
  + Confidence intervals

+ Brief interpretation (NO DISCUSSION)

---
## Running an Independent-Samples `$t$`-Test

**Step 8: Report**

We conducted a **two-tailed independent-samples `$t$`-test** to determine the effect of treatment on systolic blood pressure (SBP) . There was a significant effect of treatment, `$t$`(128) = -3.16, `$p$` = .002, 95% CI = [-4.68, -1.08], `$d$` = -.55 . Specifically, SBP in the treatment group was lower ( `$M$` = 136.97, `$SD$` = 4.86) than SBP in the placebo group ( `$M$` = 139.85, `$SD$` = 5.50).

+ Note number of decimal places and use of italics and brackets

+ In the Statistical Analysis portion of the methods, you should specify `$\alpha$`

---
## Running an Independent-Samples `$t$`-Test

**Step 8: Report**

.pull-left[

+ Figures are useful in helping readers visualise your results.

+ A **boxplot** is especially good for demonstrating results when you have a continuous DV and a categorical IV

]

.pull-right[
![](Week04_ttests_lecture_files/figure-html/unnamed-chunk-20-1.svg)

]

---
class: center, inverse, middle

## Questions?

---
## Running a paired-samples `$t$`-test

**Step 1: State Your Hypotheses**

.pull-left[.center[ Null Hypothesis ]
`$$H_0: \bar{d} = \mu_d$$`
]

.pull-right[.center[ Alternative Hypothesis ]
 Nondirectional

`$$H_1: \bar{d} \neq \mu_d$$`

Directional

`$$H_1: \bar{d} > \mu_d$$`
`$$H_1: \bar{d} < \mu_d$$`
]

> **Test Your Understanding:** If your research question is 'Does Medication X change blood pressure in participants over time?', is this a directional or nondirectional hypothesis? Do we need a one-tailed or two-tailed test?

---
## Running a paired-samples `$t$`-test

**Step 2: Conduct a Power Analysis**

+ Research Question: Does Medication X decrease blood pressure in participants over time?

+ [WebPower](https://webpower.psychstat.org/wiki/models/index)

+ As before, let's check the sample required to achieve 80% power to detect a moderate effect ( `$d$` = .5) with `$\alpha$` = .05.

---
## Running a paired-samples `$t$`-test

**Step 3: Check your data**

+ Let's use [this sample data](https://mtruelovehill.github.io/PRM/PRMcontent/Data/BPpairedDat.csv)

---
## Running a paired-samples `$t$`-test

**Step 4: Check Assumptions**

+ Normality of difference scores
  
  + Have a look at the histograms & QQ-plots
  + Check skewness and kurtosis values
  + May also run statistical tests of normality...but this is not recommended

+ Independence (of participants rather than observations)
 
  + Consider study design
  
--

> **Test your Critical Thinking:** Why do you not need to check for homogeneity of variance in a paired-samples `$t$`-test?

---
## Running a paired-samples `$t$`-test

**Step 5: Run the test**

**Step 6: Calculate the effect size/confidence intervals**

Again, let's continue in SPSS...

---
## Running a paired-samples `$t$`-test

**Step 7: Interpret results**

![](images/pBPtest.png)

.center[
![](Week04_ttests_lecture_files/figure-html/unnamed-chunk-22-1.svg)
]

---
## Running a paired-samples `$t$`-test

**Step 7: Interpret results**

.pull-left[
![](images/pBPeffect.png)
]

.pull-right[

]

---
## Running a paired-samples `$t$`-test

**Step 8: Report**

We conducted a **two-tailed paired-samples `$t$`-test** to determine the effect of treatment on systolic blood pressure (SBP) . There was a significant effect of treatment, `$t$`(128) = -6.32, `$p$` < .001, 95% CI = [-12.05, -6.18], `$d$` = -1.07 . Specifically, SBP was lower before the medication was administered ( `$M$` = 138.60, `$SD$` = 6.67) than after taking the medication ( `$M$` = 147.71, `$SD$` = 8.48).

---

class: center, inverse, middle

## Questions?