데이터사이언스 원리와 응용

Principles and Application of Data Science
Min-hwan Oh, Seunggeun Lee, Jaejin Lee, Hyung-Sin Kim, Sang Kyun Cha, Hyunwoo Park, Sanghack Lee, Joonseok Lee

Goals

Combining computing, machine learning algorithms, and inferential thinking based on probability/statistics, data science provides a new paradigm for deriving insights and new knowledge from data and solving real-world problems. The purpose of this course is to introduce core principles and techniques of data science such as basic theoretical knowledge and analytical skills necessary for students of various majors to solve data-driven problems. In addition, students will learn data-centric thinking and decision-making based on data.

Content

Statistics & Computing

DateTopicInstructor
Introduction· Course Logistics
· What is Data Science?
Min-hwan Oh
Sang Kyung Cha
Statistics for Data Science· Probability, Random variables, Expectation & VarianceSeunggeun Lee
Statistics for Data Science· Bias-variance tradeoff, MSESeunggeun Lee
Computing· Big O (time & space complexity)Jaejin Lee
Computing· SearchingJaejin Lee
Computing· SortingJaejin Lee
Computing· Data structures: Array, Linked list, Queue, StackHyung-Sin Kim
Computing· Data structures: Binary Search TreeHyung-Sin Kim
Computing· Data structures: TreesHyung-Sin Kim
Computing· Data structures: GraphHyung-Sin Kim

Database & Visualization

DateTopicInstructor
Database· Intro to databaseSang Kyun Cha
Database· SQLSang Kyun Cha
Database· Graph databaseSang Kyun Cha
Midterm Exam
Visualization· Visualization IHyunwoo Park
Visualization· Visualization IIHyunwoo Park
Visualization· Visualization IIIHyunwoo Park

Machine Learning

DateTopicInstructor
· Introduction to ML, Linear regressionSanghack Lee
· Logistic RegressionSanghack Lee
· Decision Trees, Random forestsSanghack Lee
· Overfitting, RegularizaionSanghack Lee
· Nearest Neighbor ClassifiersJoonseok Lee
· Gradient descent, Cross validationJoonseok Lee
· Neural networks, BackpropagationJoonseok Lee
· Deep learning, Convolutional neural networks (CNN)Joonseok Lee
· Unsupervised learning: ClusteringMin-hwan Oh
· Dimension reduction, PCAMin-hwan Oh
· Online decision making, Multi-armed banditsMin-hwan Oh
· Reinforcement LearningMin-hwan Oh
Final Exam

Grading Policy

  • : Assignment 35%, Mid-term 30%, Final exam 30%, Attendance 5%