Course Syllabus

Overview

Students should watch the Udacity course videos according to the following schedule. Students are encouraged to work through the lab sessions on their own as early as possible, since some homework assignments may cover lab material that is scheduled later than the homework. For the online video lectures, CS/CSE students should go to Udacity, while Analytics students should use edX instead; please check the details on Canvas.

Schedule

| Week # | Dates | In-class lesson | Video lessons | Lab | Deliverable Due |
|---|---|---|---|---|---|
| 1 | 1/7/2020 | Intro to the BDH class (Jeff) | 1. Intro to Big Data Analytics | | |
| 1 | 1/9/2020 | Predictive modeling (Jimeng) | 2. Course Overview | | |
| 2 | 1/14/2020 | Lab: Scala Basic (TA) | 3. Predictive Modeling | Scala Basic | |
| 2 | 1/16/2020 | Lab: Hadoop & HDFS Basic (TA) | | Hadoop & HDFS Basic | HW1 Due (1/19/2020) |
| 3 | 1/21/2020 | Project: Prediction in ICU for Mortality/Sepsis (Yanbo) | 4. MapReduce & HBase | | |
| 3 | 1/23/2020 | Lab: Hadoop Pig & Hive (Su Young) | | Hadoop Pig & Hive | |
| 4 | 1/28/2020 | Project: Chest X-Ray/NLP (Siddarth) | 5. Classification evaluation metrics | | |
| 4 | 1/30/2020 | Project: Drug Discovery (Tianfan) | 6. Classification ensemble methods | | HW2 Due (2/2/2020) |
| 5 | 2/4/2020 | Lab: Spark Basic, Spark SQL (Wendi) | 7. Phenotyping | Spark Basic, Spark SQL | |
| 5 | 2/6/2020 | Lab: Spark Application & Spark MLlib (Wendi) | 8. Clustering 9. Spark | Spark Application & Spark MLlib | |
| 6 | 2/11/2020 | Project: Tensor Factorization (Ari) | 9. Spark | | |
| 6 | 2/13/2020 | Project: Sleep Data (Irfan) | | | HW3 Due & Project Group Formation Due & Project Requirements Release (2/16/2020) |
| 7 | 2/18/2020 | Lecture: Tensor Factorization (Ari) | 10. Medical ontology | | |
| 7 | 2/20/2020 | Lab: NLP (Charity) | | NLP Lab by Charity Hilton | |
| 8 | 2/25/2020 | Joyce Ho (Emory) - The Automation of Evidence Matching and Systematic Reviews Using Web-Based Medical Literature | 11. Graph analysis | | |
| 8 | 2/27/2020 | Alaa Aljiffry (CHOA) - Big Data in Pediatric Cardiac ICU | | | Project Proposal Due (3/1/2020) |
| 9 | 3/3/2020 | | 12. Dimensionality Reduction 13. Patient similarity | | |
| 9 | 3/5/2020 | Lecture: Deep Learning (Adversarial Examples for Electronic Health Records - Sungtae) | 14. DNN | Deep Learning Lab by Sungtae An | HW4 Due (3/8/2020) |
| 10 | 3/10/2020 | Lab: Deep Learning (Sungtae) | 15. CNN | Deep Learning Lab | |
| 10 | 3/12/2020 | Lab: Deep Learning (Sungtae) | 16. RNN | | |
| 11 | 3/17/2020 | Spring break | | | |
| 11 | 3/19/2020 | Spring break | | | HW5 Due (3/29/2020) |
| 12 | 3/24/2020 | (cancelled) Diyi Yang (GT) - NLP in Healthcare | | | |
| 12 | 3/26/2020 | (cancelled) Gari Clifford (GT/Emory) - How Big is Big? Pitfalls and Opportunities in ML for Medical Data | | | |
| 13 | 3/31/2020 | (cancelled) Jon Duke - Precision Medicine at Georgia Tech: Introduction to the Health Data Analytics Platform | | | |
| 13 | 4/2/2020 | (cancelled) Omar Inan (GT) | | | Project Draft Due (4/12/2020) |
| 14 | 4/7/2020 | | | | |
| 14 | 4/9/2020 | | | | |
| 15 | 4/14/2020 | | | | |
| 15 | 4/16/2020 | Final Exam (4/16/2020 on-campus cancelled); Final Exam (4/18-4/20 online cancelled) | | | |
| 16 | 4/21/2020 | | | | |
| 16 | 4/23/2020 | | | | Final Project with code, presentation, and the final paper (4/26/2020) |

Previous Guest Lectures

See the RESOURCE section.