CSSS 508, Lecture 1
Syllabus and Introduction to R, RStudio, and RMarkdown
Michael Pearce
(based on slides from Chuck Lanfear)
March 29, 2022
1 / 54

Today, we will do:

Introductions
Syllabus
Lecture 1: Introduction to R, RStudio, and RMarkdown

2 / 54

Introductions

We'll go around the room and each share our:

Name and preferred pronouns
Program and year
Experience with programming (in R or generally)
Something fun you did over Spring Break

3 / 54

Syllabus

The syllabus (as well as lots of other information) can be found on our course website:

https://pearce790.github.io/CSSS508

Feel free to follow along online as I run through the syllabus!

4 / 54

Course Goals

This course is intended to give students a foundational understanding of programming in the statistical language R. General topics include:

Exploring data with graphics and summaries

5 / 54

Course Goals

This course is intended to give students a foundational understanding of programming in the statistical language R. General topics include:

Exploring data with graphics and summaries
Cleaning, preparing, and linking data for analyses

5 / 54

Course Goals

This course is intended to give students a foundational understanding of programming in the statistical language R. General topics include:

Exploring data with graphics and summaries
Cleaning, preparing, and linking data for analyses
Foundational programming skills such as functions and loops

5 / 54

Course Goals

This course is intended to give students a foundational understanding of programming in the statistical language R. General topics include:

Exploring data with graphics and summaries
Cleaning, preparing, and linking data for analyses
Foundational programming skills such as functions and loops
Organizing projects and creating reproducible research

We will cover almost no statistics here, but I hope you'll leave being able to focus on statistics instead of coding in future CSSS or STAT courses!

5 / 54

Logistics

Sessions:

Lecture: Wednesdays, 3:30-5:20 (Savery 117) -- Interactive sessions in which we'll learn key skills, concepts, and principles
Lab: Mondays, 3:30-5:20 (Savery 117) -- Optional and mostly unstructured sessions to work on homework and review
Office Hours: Tuesdays, 9-10am and 3-4pm (on Zoom; link on Canvas)

Course Website: https://pearce790.github.io/CSSS508

Contact: Feel free to email me at [mpp790 at uw dot edu]

6 / 54

Schedule

Week 1: Introduction to R, RStudio, and RMarkdown
Week 2: Visualizing Data
Week 3: Manipulating and Summarizing Data
Week 4: Understanding R Data Structures
Week 5: Importing, Exporting, and Cleaning Data
Week 6: Using Loops
Week 7: Writing Functions
Week 8: Working with Text Data
Week 9: Working with Geographical Data
Week 10: Reproducibility and Model Results

This course will have no meeting during final exam week.

7 / 54

Prereqs, Materials, and Texts

Prerequisites: None

Materials: All course materials are provided on the course website. This includes:

These slides and the code used to generate them.
An R script for the slides to follow along in class.
Homework instructions and/or templates
Useful links to other resources.

Laptops: It's helpful to bring a laptop to class. If you don't have one, you can use the lab computers or borrow one for free from the UW Student Technology Loan Program.

Textbooks: This course has no textbook. However, the website has links to a few texts which I have found useful!

8 / 54

Grading

Final grade: C/NC, 60% to get Credit.

Homework (75%; assessed by peers): 8 total homeworks; assessed on a 0-3 point rubric. Assigned after lectures and due before the following lecture.
Peer Grading (25%; assessed by the instructor): One per homework, assessed on a binary "good"/"not good" scale. Due before the following lab.

Assignment/peer grading instructions and deadlines can be found on the Homework page of the course website. All homework will be turned in on Canvas.

9 / 54

Ugh, peer grading?

Yes, because:

You will write your reports better knowing others will see them
You learn alternate approaches to the same problem
You will have more opportunities to practice and have the material sink in

10 / 54

Ugh, peer grading?

Yes, because:

You will write your reports better knowing others will see them
You learn alternate approaches to the same problem
You will have more opportunities to practice and have the material sink in

How to peer review:

Leave constructive comments: You'll only get the point if you write at least 1 full paragraph that includes
- Any key issues from the assignment and,
- Points out something positive in your peer's work.
Email me if you would like your assignment to be regraded or provide feedback if no peer review was given.

10 / 54

Academic Integrity

Academic integrity is essential to this course and to your learning. Violations of the academic integrity policy include but are not limited to:

Copying from a peer
Copying from an online resource
Using resources from a previous iteration of the course.

11 / 54

Academic Integrity

Academic integrity is essential to this course and to your learning. Violations of the academic integrity policy include but are not limited to:

Copying from a peer
Copying from an online resource
Using resources from a previous iteration of the course.

I hope you will collaborate with peers on assignments and use Internet resources when questions arise to help solve issues. The key is that you ultimately submit your own work.

11 / 54

Academic Integrity

Academic integrity is essential to this course and to your learning. Violations of the academic integrity policy include but are not limited to:

Copying from a peer
Copying from an online resource
Using resources from a previous iteration of the course.

I hope you will collaborate with peers on assignments and use Internet resources when questions arise to help solve issues. The key is that you ultimately submit your own work.

Anything found in violation of this policy will be automatically given a score of 0 with no exceptions. If the situation merits, it will also be reported to the UW Student Conduct Office, at which point it is out of my hands. If you have any questions about this policy, please do not hesitate to reach out and ask.

11 / 54

Classroom Environment

I am absolutely committed to fostering a friendly and inclusive classroom environment in which all students have an equal opportunity to learn and succeed.

12 / 54

Classroom Environment

I am absolutely committed to fostering a friendly and inclusive classroom environment in which all students have an equal opportunity to learn and succeed.

Names & Pronouns: Everyone should be addressed respectfully and correctly. Feel free to send me your preferred name/pronouns anytime.

12 / 54

Classroom Environment

I am absolutely committed to fostering a friendly and inclusive classroom environment in which all students have an equal opportunity to learn and succeed.

Names & Pronouns: Everyone should be addressed respectfully and correctly. Feel free to send me your preferred name/pronouns anytime.
Covid: Covid creates unique circumstances for each of us, which may limit your ability to fully participate in this course. You never need to apologize to me for anything pandemic related. Let me know how I can help!

12 / 54

Classroom Environment

I am absolutely committed to fostering a friendly and inclusive classroom environment in which all students have an equal opportunity to learn and succeed.

Names & Pronouns: Everyone should be addressed respectfully and correctly. Feel free to send me your preferred name/pronouns anytime.
Covid: Covid creates unique circumstances for each of us, which may limit your ability to fully participate in this course. You never need to apologize to me for anything pandemic related. Let me know how I can help!
Accessibility & Accomodations: See course website for information on health, disability, and religious accomodations.

12 / 54

Classroom Environment

I am absolutely committed to fostering a friendly and inclusive classroom environment in which all students have an equal opportunity to learn and succeed.

Names & Pronouns: Everyone should be addressed respectfully and correctly. Feel free to send me your preferred name/pronouns anytime.
Covid: Covid creates unique circumstances for each of us, which may limit your ability to fully participate in this course. You never need to apologize to me for anything pandemic related. Let me know how I can help!
Accessibility & Accomodations: See course website for information on health, disability, and religious accomodations.
Feedback: I encourage feedback at any point in the quarter. I will also send out a mid-quarter evaluation around Week 5.

12 / 54

Classroom Environment

I am absolutely committed to fostering a friendly and inclusive classroom environment in which all students have an equal opportunity to learn and succeed.

Names & Pronouns: Everyone should be addressed respectfully and correctly. Feel free to send me your preferred name/pronouns anytime.
Covid: Covid creates unique circumstances for each of us, which may limit your ability to fully participate in this course. You never need to apologize to me for anything pandemic related. Let me know how I can help!
Accessibility & Accomodations: See course website for information on health, disability, and religious accomodations.
Feedback: I encourage feedback at any point in the quarter. I will also send out a mid-quarter evaluation around Week 5.
Getting Help: If you ever find yourself struggling, know I'm here to help! Try chatting after class, email, or office hours.

12 / 54

Asking Questions

Don't ask like this:

tried lm(y~x) but it iddn't work wat do

13 / 54

Asking Questions

Don't ask like this:

tried lm(y~x) but it iddn't work wat do

Instead, ask like this:

y <- seq(1:10) + rnorm(10)
x <- seq(0:10)
model <- lm(y ~ x)

Running the block above gives me the following error, anyone know why?

Error in model.frame.default(formula = y ~ x, 
drop.unused.levels = TRUE) : variable lengths differ 
(found for 'x')

I may send out your question (anonymously) and my answer to the course mailing list!

13 / 54

Questions?14 / 54

Lecture 1: Introduction to R, RStudio, and RMarkdown15 / 54

A Note on Slide Formatting

Bold and Italics indicate important terms!

16 / 54

A Note on Slide Formatting

Bold and Italics indicate important terms!

Code represents R code you could use to perform actions. For example: "Press Ctrl-P to open the print dialogue."

16 / 54

A Note on Slide Formatting

Bold and Italics indicate important terms!

Code represents R code you could use to perform actions. For example: "Press Ctrl-P to open the print dialogue."

Code chunks that span the page represent actual R code embedded in the slides.

# Sometimes important stuff is highlighted!
7 * 49

## [1] 343

16 / 54

Why R?

R is a programming language built for statistical computing.

If one already knows Stata or similar software, why use R?

17 / 54

Why R?

R is a programming language built for statistical computing.

If one already knows Stata or similar software, why use R?

R is free.

17 / 54

Why R?

R is a programming language built for statistical computing.

If one already knows Stata or similar software, why use R?

R is free.
R has a very large community.

17 / 54

Why R?

R is a programming language built for statistical computing.

If one already knows Stata or similar software, why use R?

R is free.
R has a very large community.
R can handle virtually any data format.

17 / 54

Why R?

R is a programming language built for statistical computing.

If one already knows Stata or similar software, why use R?

R is free.
R has a very large community.
R can handle virtually any data format.
R makes replication easy.

17 / 54

Why R?

R is a programming language built for statistical computing.

If one already knows Stata or similar software, why use R?

R is free.
R has a very large community.
R can handle virtually any data format.
R makes replication easy.
R is a language so it can do everything.

17 / 54

Why R?

R is a programming language built for statistical computing.

If one already knows Stata or similar software, why use R?

R is free.
R has a very large community.
R can handle virtually any data format.
R makes replication easy.
R is a language so it can do everything.
R skills transfer to other languages like Python and Julia.

17 / 54

R Studio

R Studio is a "front-end" or integrated development environment (IDE) for R that can make your life easier.

18 / 54

R Studio

R Studio is a "front-end" or integrated development environment (IDE) for R that can make your life easier.

We'll show RStudio can...

18 / 54

R Studio

R Studio is a "front-end" or integrated development environment (IDE) for R that can make your life easier.

We'll show RStudio can...

Organize your code, output, and plots

18 / 54

R Studio

R Studio is a "front-end" or integrated development environment (IDE) for R that can make your life easier.

We'll show RStudio can...

Organize your code, output, and plots
Auto-complete code and highlight syntax

18 / 54

R Studio

R Studio is a "front-end" or integrated development environment (IDE) for R that can make your life easier.

We'll show RStudio can...

Organize your code, output, and plots
Auto-complete code and highlight syntax
Help view data and objects

18 / 54

R Studio

R Studio is a "front-end" or integrated development environment (IDE) for R that can make your life easier.

We'll show RStudio can...

Organize your code, output, and plots
Auto-complete code and highlight syntax
Help view data and objects
Enable easy integration of R code into documents with R Markdown

18 / 54

R Studio

R Studio is a "front-end" or integrated development environment (IDE) for R that can make your life easier.

We'll show RStudio can...

Organize your code, output, and plots
Auto-complete code and highlight syntax
Help view data and objects
Enable easy integration of R code into documents with R Markdown

It can also...

Manage git repositories
Run interactive tutorials
Handle other languages like C++, Python, SQL, HTML, and shell scripting

18 / 54

Selling You on R Markdown