It’s almost time to wrap up the course! In this three part assignment you get to practice what we learned this week, try something new, and get creative!

Getting started

By now you should be familiar with instructions for getting started with a new assignment in RStudio Cloud and setting up your git configuration. If not, you can refer to one of the earlier assignments.

Part 1 - Bootstrapping the GSS

In this part we continue our exploration of the 2016 GSS dataset from last week.

Click here to download the data. The file is called gss2016.csv.
Create a data folder in your project – in the Files pane, click on New Folder.
Navigate to the data folder you just created and upload the gss2016.csv file.
Note that even though you made a change in your files by adding the data, gss2016.csv does not appear in your Git pane. This is because it’s being ignored by git.

Then, read in the data using the following:

gss <- read_csv("data/gss2016.csv", 
                na = c("", "Don't know",
                       "No answer", "Not applicable"),
                guess_max = 2867) %>%
  select(harass5, emailmin, emailhr, educ, born, polviews, advfront)

Remember that the GSS asked respondents how many hours and minutes they spend on email weekly. The responses to these questions are recorded in the emailhr and emailmin variables. For example, if the response is 2.5 hrs, this would be recorded as emailhr = 2 and emailmin = 30.

⊕Yes, this exercise is a repeat of what you did last week!

Create a new variable called email that combines these two variables to reports the number of minutes the respondents spend on email weekly.
Filter the data for only those who have non NA entries for email. Do not overwrite the data frame (you’ll need the full data later). Instead save the resulting data frame with a new name.
Describe how bootstrapping can be used to estimate the mean amount of time all Americans spend on email weekly.

In the following questions you will use the infer package to construct intervals rather than writing for loops.

Calculate a 95% bootstrap confidence interval for the mean amount of time Americans spend on email weekly. Interpret this interval in context of the data, reporting its endpoints in “humanized” units (e.g. instead of 108 minutes, report 1 hr and 8 minutes). If you get a result that seems a bit odd, discuss why you think this might be the case.
Would you expect a 99% confidence interval to be wider or narrower than the interval you calculated above? Explain your reasoning.
Using the bootstrap distribution from the previous Exercise 4, calculate a 99% bootstrap confidence interval for the mean amount of time Americans spend on email weekly. Once again, use humanized units.
And finally, construct and interpret a 90% confidence interval for the median amount of time Americans spend on email weekly. Once again, use humanized units.
What does the “90%” mean in your interpretation of the above interval?

Part 2 - You gotta pick a package or two

But really, one is enough. Pick a package from the list below, and use it to do something. If you want to use a package not on this list, that’s also ok, but run it by me first by posting a question about it on Pizza (so that I can confirm it’s not one we introduced in the class so far, the goal is to work with a new package).

⊕Remember, you install the package in the Console, not in your R Markdown document since you don’t want to keep reinstalling it every time you knit the document.

Your task is to install the package you pick. Depending on where the package comes from, how you install the package differs: - If the package is on CRAN (Comprehensive R Archive Network), you can install it with install.packages. - If the package is only on Github (most likely because it is still under development), you need to use the install_github function. See above for details.

Then, load the package. Regardless of how you installed the package you can load it with the library function.

Finally, do something with the package. It doesn’t have to be complicated. In fact, keep it simple. The goal is to try to read and understand the package documentation to be able to carry out a simple task.

Which package are you using? State the name of the package, whether it was on CRAN or GitHub, and include the code for loading it.
What are you doing with the package? Give me a brief narrative including code and output.

Packages on CRAN

These packages can be installed with:

install.packages("PACKAGENAME")

The package manuals are linked below, however developers of the packages might have additional information on the GitHub repo of the package.

cowsay:
- Allows printing of character strings as messages/warnings/etc. with ASCII animals, including cats, cows, frogs, chickens, ghosts, and more.
- https://cran.r-project.org/web/packages/cowsay/vignettes/cowsay_tutorial.html
babynames:
- US Baby Names 1880-2015
- https://cran.r-project.org/web/packages/babynames/babynames.pdf
Lahman:
- Provides the tables from the ‘Sean Lahman Baseball Database’ as a set of R data.frames. It uses the data on pitching, hitting and fielding performance and other tables from 1871 through 2015, as recorded in the 2016 version of the database.
- https://cran.r-project.org/web/packages/Lahman/Lahman.pdf
praise:
- https://cran.r-project.org/web/packages/praise/praise.pdf
- Build friendly R packages that praise their users if they have done something good, or they just need it to feel better.
ggimage:
- Supports image files and graphic objects to be visualized in ‘ggplot2’ graphic system.
- https://cran.r-project.org/web/packages/ggimage/vignettes/ggimage.html
suncalc:
- R interface to ‘suncalc.js’ library, part of the ‘SunCalc.net’ project http://suncalc.net, for calculating sun position, sunlight phases (times for sunrise, sunset, dusk, etc.), moon position and lunar phase for the given location and time.
- https://cran.r-project.org/web/packages/suncalc/suncalc.pdf
ttbbeer
- An R data package of beer statistics from U.S. Department of the Treasury, Alcohol and Tobacco Tax and Trade Bureau (TTB)
- https://cran.r-project.org/web/packages/ttbbeer/ttbbeer.pdf

Packages on GitHub only