데이터사이언스 원리와 응용
Principles and Application of Data Science
Min-hwan Oh, Seunggeun Lee, Jaejin Lee, Hyung-Sin Kim, Sang Kyun Cha, Hyunwoo Park, Sanghack Lee, Joonseok Lee
Goals
Combining computing, machine learning algorithms, and inferential thinking based on probability/statistics, data science provides a new paradigm for deriving insights and new knowledge from data and solving real-world problems. The purpose of this course is to introduce core principles and techniques of data science such as basic theoretical knowledge and analytical skills necessary for students of various majors to solve data-driven problems. In addition, students will learn data-centric thinking and decision-making based on data.
Content
Statistics & Computing
Date | Topic | Instructor |
---|---|---|
Introduction· Course Logistics · What is Data Science? | Min-hwan Oh Sang Kyung Cha | |
Statistics for Data Science· Probability, Random variables, Expectation & Variance | Seunggeun Lee | |
Statistics for Data Science· Bias-variance tradeoff, MSE | Seunggeun Lee | |
Computing· Big O (time & space complexity) | Jaejin Lee | |
Computing· Searching | Jaejin Lee | |
Computing· Sorting | Jaejin Lee | |
Computing· Data structures: Array, Linked list, Queue, Stack | Hyung-Sin Kim | |
Computing· Data structures: Binary Search Tree | Hyung-Sin Kim | |
Computing· Data structures: Trees | Hyung-Sin Kim | |
Computing· Data structures: Graph | Hyung-Sin Kim |
Database & Visualization
Date | Topic | Instructor |
---|---|---|
Database· Intro to database | Sang Kyun Cha | |
Database· SQL | Sang Kyun Cha | |
Database· Graph database | Sang Kyun Cha | |
Midterm Exam | ||
Visualization· Visualization I | Hyunwoo Park | |
Visualization· Visualization II | Hyunwoo Park | |
Visualization· Visualization III | Hyunwoo Park |
Machine Learning
Date | Topic | Instructor |
---|---|---|
· Introduction to ML, Linear regression | Sanghack Lee | |
· Logistic Regression | Sanghack Lee | |
· Decision Trees, Random forests | Sanghack Lee | |
· Overfitting, Regularizaion | Sanghack Lee | |
· Nearest Neighbor Classifiers | Joonseok Lee | |
· Gradient descent, Cross validation | Joonseok Lee | |
· Neural networks, Backpropagation | Joonseok Lee | |
· Deep learning, Convolutional neural networks (CNN) | Joonseok Lee | |
· Unsupervised learning: Clustering | Min-hwan Oh | |
· Dimension reduction, PCA | Min-hwan Oh | |
· Online decision making, Multi-armed bandits | Min-hwan Oh | |
· Reinforcement Learning | Min-hwan Oh | |
Final Exam |
Grading Policy
- : Assignment 35%, Mid-term 30%, Final exam 30%, Attendance 5%