Advanced Programming for Data Science

Resource Link | Description
Syllabus | Quick access to all important course information.
Moodle | Grades and quizzes will be available on the course Moodle.
Slack | Main communication channel for the course.
Spinelli Center | The Spinelli Center offers drop-in tutoring hours in Sabin-Reed 301 or on Zoom.
Office Hours | Sign up for a slot in office hours here.

Overview

Info | Value
Who | Dr. Jared Joseph
What | SDS 270: Advanced Programming for Data Science
When | Mon/Wed/Fri 9:25-10:40
Where | Sabin-Reed 220

Schedule

Below is the tentative schedule for the course. We will try to keep to it, but unanticipated situations (and Mountain Day) may require adjustments. Each row is a class meeting, listing the readings and assignments due that day.

Week | Date | Topic | Readings | Due
Week 1 | 1/27/2023 (Fri)
  1. Roadmap
  1. Class Syllabus
  1. Update R/RStudio
Week 2 | 1/30/2023 (Mon)
  1. Intro Git
  1. Bryan, J. (2018). Excuse Me, Do You Have a Moment to Talk About Version Control? American Statistician, 72(1), 20-27.
  1. Complete all Install Guides
Week 2 | 2/1/2023 (Wed)
  1. Objects in R
  1. Wickham, H. (2020). 3 Vectors. In Advanced R. Chapman & Hall.
Week 2 | 2/3/2023 (Fri)
  1. Lab 1
Week 3 | 2/6/2023 (Mon)
  1. Functions
  1. Wickham, H. (2020). 6 Functions. In Advanced R. Chapman & Hall.
Week 3 | 2/8/2023 (Wed)
  1. Debugging & Flow
  1. Bryan, J., & Hester, J. (2021). Chapter 11 Debugging R code. In What They Forgot to Teach You About R.
  2. Grolemund, G., & Wickham, H. (2017). 8 Conditions. In R for Data Science. O’Reilly.
  1. Lab 1
Week 3 | 2/10/2023 (Fri)
  1. Lab 2
Week 4 | 2/13/2023 (Mon)
  1. Iteration
  1. [21.1-21.3 Only] Wickham, H., & Grolemund, G. (2017). 21 Iteration. In R for Data Science. O’Reilly.
Week 4 | 2/15/2023 (Wed)
  1. Apply & Lists
  1. Lab 2
Week 4 | 2/17/2023 (Fri)
  1. Lab 3, Quiz 1 Open
Week 5 | 2/20/2023 (Mon)
  1. Web Scraping
  1. Irizarry, R. A. (2022). Chapter 24 Web scraping. In Introduction to Data Science.
  2. Zimmer, M. (2010). But the data is already public: On the ethics of research in Facebook. Ethics and Information Technology, 12(4), 313-325.
Week 5 | 2/22/2023 (Wed)
  1. RegEx
  1. Lab 3
Week 5 | 2/24/2023 (Fri)
  1. Lab 4
  1. Quiz 1 Due
Week 6 | 2/27/2023 (Mon)
  1. Parallel
  1. Jones, M. (2017). Quick Intro to Parallel Computing in R.
  2. Peng, R. D. (2022). 22 Parallel Computation. In R Programming for Data Science.
Week 6 | 3/1/2023 (Wed)
  1. PDF Data Extraction
  1. rOpenSci. (n.d.). Introduction to tabulizer.
  1. rOpenSci. (n.d.). Using the Tesseract OCR engine in R.
Week 6 | 3/3/2023 (Fri)
  1. Lab 5, Quiz 2 Open
Week 7 | 3/6/2023 (Mon)
  1. Bash
  1. Irizarry, R. A. (2022). Chapter 39 Organizing with Unix. In Introduction to Data Science.
  1. Recap Survey
Week 7 | 3/8/2023 (Wed)
  1. Remote Servers
  1. Tanner, A. (2022, September 18). Working with remote computers.
  1. Lab 5
Week 7 | 3/10/2023 (Fri)
  1. RECAP/MSA
  1. Quiz 2 Due
Week 8 | 3/13/2023 (Mon) | No Class
Week 8 | 3/15/2023 (Wed) | No Class
Week 8 | 3/17/2023 (Fri) | No Class
Week 9 | 3/20/2023 (Mon)
  1. Package Creation
  1. [Skim] Wickham, H., & Bryan, J. (2023). 2 The Whole Game. In R Packages (2nd ed.). O’Reilly.
  1. Final Project Ideas
Week 9 | 3/22/2023 (Wed)
  1. Project Management
  1. Quickstart for GitHub Issues. (n.d.). GitHub Docs.
  1. Final Project Interest Form
Week 9 | 3/24/2023 (Fri)
  1. Team Formation
  1. Final Project Ranking
Week 10 | 3/27/2023 (Mon)
  1. Adv Git
  1. Community, T. T. W. (2022a). Git Branches. In The Turing Way: A handbook for reproducible, ethical and collaborative research.
  2. Community, T. T. W. (2022b). Merging Branches in Git. In The Turing Way: A handbook for reproducible, ethical and collaborative research.
  3. Community, T. T. W. (2022c). Retrieving and Comparing Versions. In The Turing Way: A handbook for reproducible, ethical and collaborative research.
Week 10 | 3/29/2023 (Wed)
  1. Package Documentation
  1. Wickham, H., & Bryan, J. (2023). 17 Function documentation. In R Packages (2nd ed.). O’Reilly.
  2. Wickham, H., & Bryan, J. (2023). 19 Other markdown files. In R Packages (2nd ed.). O’Reilly.
Week 10 | 3/31/2023 (Fri)
  1. Lab 6
Week 11 | 4/3/2023 (Mon)
  1. Adv Functions/S3
  1. A Simple Guide to S3 Methods
Week 11 | 4/5/2023 (Wed)
  1. Finals Work Time 1
  1. Lab 6
Week 11 | 4/7/2023 (Fri)
  1. Lab 7, Quiz 3 Open
Week 12 | 4/10/2023 (Mon)
  1. Testing
  1. Wickham, H., & Bryan, J. (2023). 14 Testing basics. In R Packages (2nd ed.). O’Reilly.
  2. Wickham, H., & Bryan, J. (2023). 15 Designing your test suite. In R Packages (2nd ed.). O’Reilly.
Week 12 | 4/12/2023 (Wed)
  1. Benchmarking
  1. Wickham, H. (2020). 23 Measuring performance. In Advanced R. Chapman & Hall.
  1. Lab 7
Week 12 | 4/14/2023 (Fri)
  1. Lab 8
  1. Quiz 3 Due
Week 13 | 4/17/2023 (Mon)
  1. Package Vignettes/Reprex
  1. Wickham, H., & Bryan, J. (2023). 18 Vignettes. In R Packages (2nd ed.). O’Reilly.
Week 13 | 4/19/2023 (Wed)
  1. Package Website
  1. Wickham, H., & Bryan, J. (2023). 20 Website. In R Packages (2nd ed.). O’Reilly.
  1. Lab 8
Week 13 | 4/21/2023 (Fri)
  1. Lab 9
Week 14 | 4/24/2023 (Mon)
  1. Student’s Choice
Week 14 | 4/26/2023 (Wed)
  1. Student’s Choice
  1. Lab 9
Week 14 | 4/28/2023 (Fri)
  1. Finals Work Time 2, Quiz 4 Open
Week 15 | 5/1/2023 (Mon)
  1. Finals Work Time 3
  1. Final Standards Draft
Week 15 | 5/3/2023 (Wed)
  1. Finals Showcase
  1. Final Presentation
Week 15 | 5/5/2023 (Fri) | No Class
  1. Quiz 4 Due
Week 16 | 5/8/2023 (Mon)
  1. Quiz 5 Open
  1. Final Project Due
Week 16 | 5/10/2023 (Wed) | No Class
  1. Quiz 5 Due