wanirepo / Stats_2024Spring

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

2024 Spring "Biostats and Big Data" at SKKU GBME


  • Lecturer: Choong-Wan Woo, Ph.D. Assistant professor (GBME).
  • Office: N-center, 86335
  • Web: Cocoan lab
  • E-mail: please send the message through iCampus or send me an email to choongwan.woo at gmail dot com
  • Class: (online) Mon 1:30-2:45, (offline) Wed 12:00-1:15 (flipped class)

What are the aims of this course?

Data are everywhere. Data science already became a key element in many research and industrial areas. The primary aim of this course is to learn basic concepts and skills for data analysis, including concepts of random variables, sampling distributions, hypothesis testing, linear modeling, data visualization, etc., preparing you for your life after graduation in this data-are-everywhere age. However, this is not the only aim of this class. Statistics goes beyond just data analysis, I believe. Statistics is a way of thinking and reasoning. It helps with scientific reasoning and critical thinking. Thus, I hope this class could provide you with some useful tips for scientific thinking and reasoning.

Course format (flipped classroom) and expectations

This class is designed in a flipped classroom format, which is a new way of teaching and learning. Different from the traditional learning environment (passively listening to the lecture in the class and doing homework at home), in the flipped classroom, you will listen to the lecture at home and do homework and practice in the classroom. I personally experienced this format of learning during my Ph.D. (for the Machine Learning class) and deeply enjoyed it. I found the flipped classroom helped students stay engaged and provided a good environment for hands-on experience.

For each week, we will use one class for watching pre-recorded video lectures (from the last few years' work) and the other class for the activities. It is crucial to watch the video lectures to participate in the class activity. Thus I might give you a quiz for each class.

Textbooks

Main textbook:

"Stats: Data and Models" by De Veaux, Velleman, and Bock

Supplements (not required):

"Statistical Thinking for the 21st Century" by Russ Poldrack Link
"Seeing theory" Link
"The Seven Pillars of Statistical Wisdom" by Stephen M. Stigler Amazon Link
"History of Statistics Reading Group" (CMU) Link

Softwares

Teaching how to use statistics packages or programming is not the main focus of this class. This class is more about statistical methods and theories. However, software packages and computer programming are actually essential in learning statistics. Therefore, I will provide some lectures about software packages for statistical analysis (e.g., JAMOVI). In addition, I might use Matlab, Python, or R sometimes. You can download Matlab through SKKU. Python and R are open-source programming languages.

TAs

TBD

Evaluation

Absolute evaluation will be used for this course.

  1. Attendance (40%)
  2. Participation (including quiz) (35%)
  3. Final exam (25%)

Schedule (TBA)

PR: pre-recording | OF: offline | A: activity | V: video

Week Video lectures Class Chapters or papers
Week 1
3/4 (PR) V01: Data Ch.1-2
3/6 (OF) Course overview
Week 2
3/11 (PR) V02: Data visualization Ch.2-3
(PR) V03: Comparing distribution Ch.4
3/13 (OF) A01: Cognitive Errors and Statistical thinking Tversky & Kahneman, 1974
Week 3
3/18 (PR) V04: Normal model Ch.5
(PR) V05: Scatterplots and correlation Ch.6
3/20 (PR) V06: Linear regression Ch.7
Week 4
3/25 (PR) V07: More about regression, re-expressing data Ch.8-9
3/27 (OF) A02: Boxplot
Week 5
4/1 (PR) V08: Sampling Ch.8-9
(PR) V09: Design Experiment Ch.11-12
4/3 (OF) A03: Normal Probability Plot
Week 6
4/8 (PR) V10: Probability and Bayes theorem Ch.13-14
(PR) V11: Random variables Ch.15
4/10 국회의원 선거
Week 7
4/15 (PR) V12: Probability models Ch.16
4/17 (OF) A04: Regression to the mean
Week 8
4/22-4/24 Midterm
Week 9
4/29 (PR) V13: Softwares and programming languages
5/1 (PR) V14: Sampling distribution, central limit theorem Ch.17
Week 10
5/6 (PR) V15: Confidence interval for proportions Ch.18
(PR) V16: Hypothesis testing, P-values Ch.19
5/8 (OF) A05: Bayes theorem and random variables
Week 11
5/13 (PR) V17: Inferences about means Ch.20
(PR) V18: More about tests and intervals Ch. 21
5/15 석가탄신일
Week 12
5/20 (PR) V19:Comparing groups Ch. 22
(PR) V20: Paired t-test Ch. 23
5/22 (OF) A06: Probability models
Week 13
5/27 (PR) V21: Comparing counts Ch. 24
(PR) V22: Inferences about regression Ch. 25
5/29 (OF) A07: Central limit theorem
Week 14
6/3 (PR) V23: Analysis of Variance (ANOVA) Ch. 26
(PR) V24: Multifactor ANOVA Ch. 27
6/5 (OF) A8: t-test
Week 15
6/10 (PR) V25: Multiple regression Ch.28
(PR) V26: Multiple regression wisdom Ch. 29
6/12 (OF) A9: chi-square and regression
Week 16
6/19 Final

Note.

About