Sampling

Based on Chapter 7 of ModernDive. Code for Quiz 11.

Distill is a publication format for scientific and technical writing, native to the web.

Learn more about using Distill at https://rstudio.github.io/distill.

  1. Load the R packages we will use.
  1. Quiz Questions.

Question:

*Fill in the blanks

Modify the code for comparing different sample sizes from the virtual bowl

Segment 1: sample size = 28

1.a) Take 1150 samples of size of 28 instead of 1000 replicates of size 25 from the bowl dataset. Assign the output to virtual_samples_28

1.b) Compute resulting 1150 replicates of proportion red

1.c) Plot distribution of virtual_prop_red_28 via a histogram

use labs to


Segment 2: sample size = 53

2.a) Take 1150 samples of size of 53 instead of 1000 replicates of size 50. Assign the output to virtual_samples_53

2.b) Compute resulting 1150 replicates of proportion red

*create variable red equal to the sum of all the red balls

2.c) Plot distribution of virtual_prop_red_53 via a histogram use labs to


Segment 3: sample size = 113

3.a) Take 1150 samples of size of 118 instead of 1000 replicates of size 50. Assign the output to virtual_samples_118

3.b) Compute resulting 1150 replicates of proportion red

3.c) Plot distribution of virtual_prop_red_118 via a histogram use labs to


Calculate the standard deviations for your three sets of 1120 values of prop_red using the standard deviation

n = 28

# A tibble: 1 × 1
      sd
   <dbl>
1 0.0903

n = 53

# A tibble: 1 × 1
      sd
   <dbl>
1 0.0661

n = 118

# A tibble: 1 × 1
      sd
   <dbl>
1 0.0430

The distribution with sample size, n = 118, has the smallest standard deviation (spread) around the estimated proportion of red balls.