Provides intermediate to advanced python programming for data science with the aim of preparing students for more advanced courses in data science and to enable practical contributions to software development and data science projects in a commercial setting. Covers object-oriented design patterns using Python, including encapsulation, composition, and inheritance. Through case studies, the course will survey key concepts in data science with an emphasis on machine-learning (classification, clustering, deep learning), data visualization, and natural language processing.
Accompanied by DS2501: Lab for DS2500 (1.000 credits) in which students will practice the programming techniques discussed in lecture through hands-on experimentation.
Topics |
Data Science techniques: data visualization, fundamental data statistics, linear regression, correlation and covariance, scaling and normalization, k-means clustering, k-nearest neighbor classification, cross-validation, decision trees, polynomial regression.
Intermediate Programming: loops, functions, files, Python data structures, classes and objects, command-line, code design and quality, unit testing.
Python Libraries and Tools: Jupyter Notebooks, Pandas, NumPy, Scikit, Seaborn, Geopandas.
|
Syllabus
|
Download |
Piazza
|
https://piazza.com/northeastern/spring2024/ds2500 |
Lecture Questions
|
https://bit.ly/ds2500_lecture_q
|
Grading Guidelines
|
Code quality & Visualizations
|
Gradescope
|
https://www.gradescope.com/courses/679045 |
TA Appreciation
|
Feedback form |
Laney's Office Hours begin Tuesday, January 9th. We do not hold office hours on university holidays. (TA office hours begin on Tuesday, January 16th -- details on the TA page.)
laneys@northeastern.edu | |
Web | https://northeastern.edu/home/laney |
Zoom (for OH) | https://northeastern.zoom.us/my/laney |
Office Hours | T 3:30-6pm, F 12-1pm (schedule time during OH or just drop in. Meserve 313, or my Zoom. Appointments take priority, though!)
If the Tues and Fri office hours get booked quickly, or if they don't work for you, please just email me and we'll set up something else. |
DS2500-01 (Lecture) | TF 9:50-11:30am
CG 097 |
DS2500-02 (Lecture) | TF 1:35-3:15pm
CG 097 |
DS2501-01 & -03 (Lab) | M 8:00-9:40am
WVH 210A (both sections meet in 210A) TAs: Ivan, Vidhi, Roland |
DS2501-04 (Lab) | M 9:50-11:30am
WVH 210A TAs: Shivani, Matthew |
DS2501-05 (Lab) | M 9:50-11:30am
WVH 210B TAs: Laura, Matthew |
DS2501-06 (Lab) | M 11:45am-1:25pm
WVH 210A TAs: Seamus, Naman |
DS2501-07 (Lab) | M 11:45am-1:25pm
WVH 210B TAs: Varun, Naman |
DS2501-08 (Lab) | M 1:35-3:15pm
WVH 210A TAs: Keegan, Alina |
DS2501-09 (Lab) | M 1:35-3:15pm
WVH 210B TAs: Kenichi, Alina |
DS2501-10 (Lab) | M 3:25-5:05pm
WVH 210A TAs: Pratyush, Neha, Daniel |
DS2501-13 (Lab) | M 3:25-5:05pm
WVH 210B TAs: Harsh, Neha |
DS2500 Videos - Python Review | |
DS2000 Videos - Basic Python | |
DS2000 Videos - Functions, Scope, Data Structures | |
DS2000 Videos - Classes & Objects; Data Science Techniques | |
Additional Python Resources |