Week 1: Getting Organized
You already have an account on the Macalester RStudio server <rstudio.macalester.edu>. Login with your Macalester email credentials.
In this brief session, we will:
- Get everyone started with using the software for the course.
- Install many of the R packages that will be needed for the course.
- Create a GitHub account and “clone” (using SSH) the repository at https://github.com/dtkaplan/Comp-110-Assignments-Spring-2017 to give you the files you will be modifying as you work in the course. Once you have done so, please fill in this form so that I will know your account name.
- Clone (again) the repository you created in (3) into RStudio as a “project”" to hold your work in the course.
- Open the
Week-1-project.Rmd
file in the RStudio editor. You’ll put some “markdown” content about a historical event or person into that document, then compile it to HTML, and update your GitHub repository with the new file.
Relevant links:
————–
Week 2: Data Infrastructure
- Readings: Data Computing Chapters 1 through 4
- Week 2 online tutorial Don’t do this in class! Do it afterwards.
- Short-answer questions from book: file
Week-2-short-answer.Rmd
- 1.1, 1.2, 1.4, 1.5
- 2.2, 2.3, 2.4, 2.6, 2.7, 2.8, 2.8, 2.9, 2.10
- 3.1, 3.2, 3.3, 3.4, 3.5, 3.6
- 4.1, 4.2, 4.3, 4.4, 4.5
- In-class activity:
- Notes from class: tidy data : Tidy data paper : R parts of speech : highway fatality data
- In-class activity:
- In-class links:
———-
Week 3: Data summaries and graphics
- Readings: Data Computing chapters 5, 6, & 7
- Week 3 online tutorials: Chapter 3, Chapter 4, Chapter 5, Chapter 6, Chapter 7
- Short-answer questions from book: File
Week-3-short-answer.Rmd
- 5.1, 5.2
- 6.5, 6.6, 6.7, 6.8
- 7.1, 7.4, 7.5, 7.7, 7.11, 7.12
- In-class activity
- Notes from class:
———-
Week 4: Data Verbs
- Readings: Data Computing chapters 8 & 9
- Week 4 online tutorials: Chapter 8, Chapter 9
- Short-answer questions: from book. File
Week-4-short-answer.Rmd
- In-class activity
- Notes from class:
————
Week 5: Combining data
- Readings: Data Computing chapters 10 & 11
- Short-answer questions from book: File
Week-5-short-answer.Rmd
- 10.1, 10.2, 10.3
- 11.1, 11.3, 11.4, 11.5
- In-class activity:
- Week 5 links:
————–
Week 6: Collective properties of data: Stats
- Readings: from Data Computing Chapters 12, 13 and 14
- Networks: comcast complaints example, immigration, movie genres
- Short-answer questions from the book: File
Week-6-short-answer.Rmd
- 12.1, 12.2, 12.3
- 13.1, 13.2, 13.3, 13.4
- 14.1, 14.3, 14.7, 14.10
- In-class activity:
Spring break
————
Week 7: Scraping and cleaning data
- Readings: Data Computing chapters 15 & 16
- Short-answer questions from the book: File
Week-7-short-answer.Rmd
- 15.1, 15.2
- 16.1, 16.2, 16.3
- In-class activity:
- Additional Resources:
- In-class links:
————-
Week 8: Machine learning
- Readings: Data Computing chapter 17
- Short-answer questions from the book: File
Week-7-short-answer.Rmd
- In-class course evaluation. Note that the Google Form is not managed by Macalester, so no information about your name will be collected.
- A data science process and how it aligns with the topics covered in this course.
- Examples: Selling car seats, Girl or boy, Flowers, Countries, Cars
- Introduction to modeling online course. This is the first chapter of an online course being developed as I update the Math 155 (Intro to Statistical Modeling) materials. The course differs from the current Math 155 in that it is based on machine-learning concepts rather than the traditional regression concepts.
- In-class activity:
————–
Final Exam
This is a short course, so the exam is similarly short. Do it in the 10 days following the last class session.