In STA 112, you learned about the simple linear regression model:
\[Y_i = \beta_0 + \beta_1 X_i + \varepsilon_i\]
Question: What assumptions does this model make?
A new question
In STA 112, you learned about the simple linear regression model:
\[Y_i = \beta_0 + \beta_1 X_i + \varepsilon_i\]
Question: How important is it that \(\varepsilon_i \sim N(0, \sigma^2)\)? Does it matter if the errors are not normal?
Activity
\[Y_i = \beta_0 + \beta_1 X_i + \varepsilon_i\]
Activity: With a neighbor, brainstorm how you could use simulation to assess the importance of the normality assumption (you do not need to write code!).
How would you simulate data?
What result would you measure for each run of the simulation?
Activity
\[Y_i = \beta_0 + \beta_1 X_i + \varepsilon_i\]
How would you study the importance of the normality assumption?
Simulating data
To start, simulate data for which the normality assumption holds:
Develop computing skills to work with data and answer statistical questions
Emphasize reproducibility and good coding practices
Introduce other important computing tools for statistics and data science (Python, SQL, Git)
What this course isn’t:
An exhaustive list of R or Python functions
A computer science course
A deep dive into how R actually works
Tentative topics
Simulation
Intro to Python
Data wrangling and manipulation
Intro to SQL
Version control and reproducibility
Working with text data
Time permitting: select advanced topics
Course components
Component
Weight
Homework
50%
Midterm exam
10%
Final exam
20%
Project
20%
Diversity and inclusion
In this class, we will embrace diversity of age, background, beliefs, ethnicity, gender, gender identity, gender expression, national origin, neurotype, race, religious affiliation, sexual orientation, and other visible and non-visible categories. The university and I do not tolerate discrimination.
You deserve to be addressed in the manner you prefer. To guarantee that I address you properly, you are welcome to tell me your pronoun(s) and/or preferred name at any time, either in person or via email.