SDSU CS 649 Big Data: Tools and Methods
Spring Semester, 2021
Lecture Notes
DCS
To Course Web Site
San Diego State University -- This page last updated 29-Apr-21

This page contains links to lecture notes for the CS 649 Big Data: Tools and Methods course. This page will be updated as more notes become available.

Lecture Notes By Topic
  1. Course Introduction
  2. Big Data Introduction
  3. Python
  4. SciPy
  5. Panda Data Structure zipped Jupyter Notebook
  6. DataFrame zipped Jupyter Notebook
  7. Statistics, Sampling, Bloom
  8. Plots
  9. Data Manipulation
  10. Dask
  11. Regression
  12. ScikitLearn, Bayes
  13. Clustering
  14. Spark Intro
  15. Spark 2
  16. Assignment 1 Comments
  17. Spark Reminder
  18. Running Spark
  19. Running Spark, Partitions
  20. Spark ML
  21. Spark Clustering, Deep Learning
  22. NoSQL, Cassandra
  23. Exam Comments
  24. Scaling, NoSQL, Streaming, End Remarks

Lecture Video By Date
Tuesday Thursday
Jan 21 Course Intro
Jan 26 Big Data Intro Jan28 Python
Feb 2 Panda Series Feb 4 Panda Dataframe
Feb 9 Statistics, Sampling Feb 11 Sampling, Bloom
Feb 16 Data Manipulation Feb 18 Data Manipulation, Plots
Feb 23 Dask, Regression Feb 25 Regression
Mar 2 Bayes Mar 4 Clustering
Mar 9 Spark Mar 11 Spark 2
Mar 16 Assignment 1 Comments Mar 18 Exam Questions
Mar 23 More Exam Questions Mar 25 Spark Reminder
Mar 30 No Class Apr 1 Running Spark
Apr 6 Running Spark, Partitions Apr 8 Spark ML
Apr 13 Spark ML, Spark Clustering Apr 15 No Class
Apr 20 Deep Learning, NoSQL Apr 22 Project Presentations
Apr 27 Apr 29
May 4 May 6 Last Class
May 11 May 13 Project Due