submissions

your homeworks

homework 1

  • review and data exploration
  • assigned: september 9
  • due date: september 16

homework 2

  • market basket analysis and association rules
  • assigned: september 16
  • due date: september 23

homework 3

  • mastering map reduce
  • assigned: september 23
  • due date: october 7

homework 4

  • clustering and parameter estimation
  • assigned: october 7
  • due date: october 21

homework 5

  • bayes rules and ml toolboxes
  • assigned: october 21
  • due date: october 28

homework 6

  • machine learning toolboxes
  • assigned: october 28
  • due date: november 18

homework 7

  • classifier evaluation metrics
  • assigned: november 18
  • due date: november 25

homework 8

  • the logistic regression
  • assigned: november 25
  • due date: december 2


your project



in-class colabs

week 1 colabs

  • In-Class
    Introduction to Google Cloud Platform
    Industry Practice of Cloud Computing

week 2 colabs

  • In-Class
    Introducing Docker and Docker Containers
    Industry Practice of Containerization

week 3 colabs

  • In-Class
    Counting Shakespeare's Words
    Basic Spark and RDD's
  • Setup
    Setup for Homework 3 - PySpark
    Spark Data Ingestion

week 4 colabs

  • In-Class
    Multi-source Joins
    Joining from multiple data sources with text

week 5 colabs

  • In-Class
    Principle component analysis
    Practice with power iterations to identify principle components

week 6 colabs

  • In-Class
    Anomaly Detection
    Colab using Naïve Bayes to predict outliers

week 11 colabs

  • In-Class
    Mining Structured Data and Images
    Manually coding logistic regression optimization

week 12 colabs

  • In-Class
    Using Keras and Traditional Text Mining
    Fun diffusion models, the MNIST convolutional network, and TF/IDF practice