From Data to Decisions: Measurement, Uncertainty, Analysis, and Modeling

CHE 379/384, Chris Mack, The University of Texas at Austin

 

Data Sets for use in this class:

Univariate data sets: Data_Sets_1.xlsx

Bivariate and multivariate data sets: Data_Sets_2.xlsx

Time series data sets: Data_Sets_3.xlsx

More bivariate and multivariate data sets: Data_Sets_4.xlsx

Data sets for Design of Experiments: Design of Experiments.xlsx

Body Fat.xlsx, bodyfat-reduced.csv, Model Building - Used Car Value.xlsx

 

Course materials, by class date for the Fall 2018 semester


Thursday, August 30

Lecture 0: Introduction (10 min) - hardcopy of the slides: Introduction.pdf

Getting started with R:
Install R on your computer by going here.
Install RStudio on your computer by going here (there is a free version).
A script to use as an Introduction: Introduction demo.R
Some training videos: https://stat.utexas.edu/videos/r

Tuesday, September 4

Lecture 1: The Knowledge Hierarchy (22 min) - hardcopy of the slides: Lecture1.pdf

Read this this document: The Knowledge Hierarchy and the Data to Decision Process

Lecture 2: Data and Measurement (24 min) - hardcopy of the slides: Lecture2.pdf

Lecture 3: Data Example (11 min) - hardcopy of the slides: Lecture3.pdf

Thursday, September 6

Homework #1 (due in one week): HW1_Linear Regression in Excel.pdf; HW1_Data.xls

Homework #1 Solution: HW1_solutions.xlsx

Lecture 4: Process Modeling (25 min) - hardcopy of the slides: Lecture4.pdf

Reading: NIST Statistics Handbook

Lecture 5: Regression Review, part 1 (26 min) - hardcopy of the slides: Lecture5.pdf

Lecture 6: Regression Review, part 2 (31 min) - hardcopy of the slides: Lecture6.pdf

Class Summary Notes: Least-Squares Regression

Reading on matrix formulation for OLS: Penn State Course

Preview - Some fun with R: descriptive statistics demo.R

Tuesday, September 11

Lecture 7: Appendix: Matrix Math (13 min) - hardcopy of the slides: Lecture7.pdf

Lecture 8: Regression Review, part 3 (14 min) - hardcopy of the slides: Lecture8.pdf

Lecture 9: Linear Regression in Excel (19 min) - Excel spreadsheet: Lecture9.xlsx

Lecture 10: What is the Distribution of the Residuals? (16 min) - hardcopy of the slides: Lecture10.pdf

Bonus Lecture: Fun with Histograms (5 min)

Bonus Lecture: Linear Regression in R (10 min) - Linear Regression.R

Anscombe's 1973 paper, Graphs in Statistical Analysis, and a spreadsheet of the data from that paper: Anscombe_Data.xls

Thursday, September 13

Homework #2 (due one week from today): HW2_QQ Plots.pdf

Homework #2 Excel Solution:  HW2_solutions.xlsx

Lecture 11: Q-Q and Normal Probability Plots (18 min) - hardcopy of the slides: Lecture11.pdf

Reading - Normal Probability Plots: http://www.itl.nist.gov/div898/handbook/eda/section3/normprpl.htm

Lecture 12: Normal Probability Plots in Excel (19 min): Lecture12.xlsx, Lecture12b.xlsx

Lecture 13: Testing for Skewness (16 min) - hardcopy of the slides: Lecture13.pdf

Lecture 14: Testing for Kurtosis (14 min) - hardcopy of the slides: Lecture14.pdf

Tuesday, September 18

Lecture 15: Performing Moment Tests in Excel (16 min): Lecture15.xlsx

Lecture 16: Shipario-Wilk Test for Normality (11 min) - hardcopy of the slides: Lecture16.pdf

Q-Q plots, moment testing, and normality testing in R:
NBS Weight Data.csv
, qqplot moment normality demo.R

Lecture 17: Testing for Outliers, part 1 (26 min) - hardcopy of the slides: Lecture17.pdf

Lecture 18: Testing for Outliers, part 2 (18 min) - hardcopy of the slides: Lecture18.pdf

Reading: Procedures for Detecting Outlying Observations in Samples_Grubbs_1969.pdf

Tables of critical values for the Grubbs' Outlier Tests: Grubbs Test Critical Values.pdf

Thursday, September 20

Homework #3 (due one week from today): HW3_Moments_Outliers.pdf

Homework #3 Excel Solution:  HW3_solutions.xlsx

Lecture 19: Final Thoughts on Outliers (29 min) - hardcopy of the slides: Lecture19.pdf

Lecture 20: Performing Outlier Tests in Excel and R (19 min): Lecture20.xlsx, Outliers demo.R

Lecture 21: Leverage in Regression (22 min) - hardcopy of the slides: Lecture21.pdf

Tuesday, September 25

Lecture 22: Influence in Regression (20 min) - hardcopy of the slides: Lecture22.pdf

Lecture 23: Leverage and Influence in Excel and R (17 min):
Flow Rate Calibration.csv
, Influence demo.R, Lecture23.xlsx

Lecture 24: Heteroscedasticity: When Variance Varies (23 min) - hardcopy of the slides: Lecture24.pdf

Thursday, September 27

Homework #4 (due in one week): HW4_Influence_Scedasticity.pdf

Homework #4 Excel Solution:  HW4_solutions.xlsx

Lecture 25: Testing for Homoscedasticity in Excel and R (15 min): Lecture25.xlsx, Heteroscedasticity.R

Lecture 26: Correcting for Heteroscedasticity (16 min) - hardcopy of the slides: Lecture26.pdf

Lecture 27: Data Transformations in R (19 min): OLS and four Graphs.R, Box-Cox demo.R

Tuesday, October 2

Lecture 28: Weighted Regression (18 min) - hardcopy of the slides: Lecture28.pdf

Lecture 29: Weighted Regression in R (8 min) - Weighted regression demo.R

Excel Resource - the Real-Statistics Website: www.real-statistics.com

Lecture 30: Total Regression, part 1 (19 min) - hardcopy of the slides: Lecture30.pdf

Thursday, October 4

Homework #5 (due in one week): HW5_Weighted_Total_regression.pdf

Homework #5 Excel Solution:  HW5_solutions.xlsx

Lecture 31: Total Regression, part 2 (22 min) - hardcopy of the slides: Lecture31.pdf

Lecture 32: Total Regression, part 3 (18 min) - hardcopy of the slides: Lecture32.pdf

Lecture 33: Total Regression in R (19 min):
Total Regression_effective variance.R, Deming Regression.R

Tuesday, October 9

Lecture 34: The Wrong Model (22 min) - hardcopy of the slides: Lecture34.pdf

Lecture 35: The Wrong Model, part 2 (27 min) - hardcopy of the slides: Lecture35.pdf

Lecture 36: Goodness of Fit tests in R (11 min) - Regression Goodness of Fit.R

Thursday, October 11

Homework #6 (due in one week): HW6_Autoregression.pdf

Homework #6 Excel Solution:  HW6_solutions.xlsx

Lecture 37: Independence of Residuals (27 min) - hardcopy of the slides: Lecture37.pdf

Lecture 38: Residual Independence in Excel and R (18 min) - Lecture38.xlsx, Lag plot and Runs test.R

Lecture 39: Autocorrelation in Time Series (32 min) - hardcopy of the slides: Lecture39.pdf

Tuesday, October 16

Lecture 40: Time Series Autocorrelation in Excel and R (24 min):
Lecture40.xlsx, Durbin-Watson test.R

Tables of critical values for the Durbin-Watson Test: Durbin_Watson_tables.pdf

More Durbin-Watson critical values can be found here: web.stanford.edu/~clint/bench/dwcrit.htm

Lecture 41: Regression Review (17 min) - hardcopy of the slides: Lecture41.pdf

Review materials for Exam #1: Exam1_Review.pdf

Exam #1 Take-Home piece given out : Exam1_Take Home.pdf

Thursday, October 18

Exam #1 (In-Class piece): Exam1_In Class.pdf

Exam Solutions: Exam1_Take Home_solution.pdf, Exam1_In Class_solution.pdf

Tuesday, October 23

Homework #7 (due in one week): HW7_Multiple_Regression.pdf

Homework #7 Excel Solution:  HW7_solutions.xlsx

Homework #7 R Solution: HW7_Multiple_Regression_solution.pdf

Lecture 42: Multiple Regression (23 min) - hardcopy of the slides: Lecture42.pdf

Lecture 43: Comparing Models (17 min) - hardcopy of the slides: Lecture43.pdf

Lecture 44: Multiple Regression in Excel and R (21 min):
Lecture44.xlsx, Multiple Regression.R

Thursday, October 25

Review of Exam #1 solution

Writing Project (Due Nov. 15): Data to Decisions Writing Project.pdf

Lecture 45: Best Subset Regression (18 min) - hardcopy of the slides: Lecture45.pdf

Lecture 46: Best Subset Regression in R (19 min) - Best Subset Model.R

Lecture 47: Multicollinearity (18 min) - hardcopy of the slides: Lecture47.pdf

Tuesday, October 30

Homework #8 (due in one week): HW8_Multicollinearity.pdf
Paper explaining the data set for HW#8, click here.

Homework #8 R Solution: HW8_Multicollinearity_solution.pdf

Lecture 48: Standardized Variables (10 min) - hardcopy of the slides: Lecture48.pdf

Lecture 49: Multicollinearity in Excel and R (9 min): Lecture49.xlsx, Standardized Regression.R

Lecture 50: Detecting Multicollinearity (18 min) - hardcopy of the slides: Lecture50.pdf

Thursday, November 1

Lecture 51: Addressing Multicollinearity (14 min) - hardcopy of the slides: Lecture51.pdf

Lecture 52: Detecting Multicollinearity and Ridge Regression in R (24 min):
Detecting Multicollinearity.R, Ridge Regression.R

Lecture 53: Principal Component Analysis (25 min) - hardcopy of the slides: Lecture53.pdf

Tuesday, November 6

Homework #9 (due in one week): HW9_Logistic_Regression.pdf; HW9.xlsx

Homework #9 R Solution: HW9_Logistic_Regression_solutions.pdf

Lecture 54: Principal Component Analysis in R (16 min) - Principal Components.R

Lecture 55: Robust Estimation (20 min) - hardcopy of the slides: Lecture55.pdf

Lecture 56: Robust Regression (22 min) - hardcopy of the slides: Lecture56.pdf

Lecture 57: Robust Regression in R (11 min) - Robust Regression.R

Thursday, November 8

Lecture 58: Generalized Linear Modeling (19 min) - hardcopy of the slides: Lecture58.pdf

Lecture 59: Other Regression Topics (17 min) - hardcopy of the slides: Lecture59.pdf

Lecture 60: Generalized Linear Modeling in R (21 min): Generalized Linear Modeling.R

Tuesday, November 13

Lecture 61: Logistic Modeling Example -The sinking of the Titanic (40 min): Logistic Regression.R

Lecture 62: Model Building (26 min) - hardcopy of the slides: Lecture62.pdf

Lecture 63: Model Building in R (13 min): Model building.R

2016 Model Building Contest: Model Building Contest.pdf, BlackFridayTrain.csv

2018 Model Building Contest: Model Building Contest 2018.docx, Wine Quality.csv

Model Building Contest entries due Dec. 6 in class.

Thursday, November 15

Homework #10 (due on Nov. 27): HW10_DOE.pdf

Homework #10 Excel Solution: HW10_solutions.xlsx

Homework #10 R Solution: HW10_DOE_solutions.pdf

Lecture 64: Introduction to Design of Experiments (26 min) - hardcopy of the slides: Lecture64.pdf

Lecture 65: Regression Design (21 min) - hardcopy of the slides: Lecture65.pdf

Lecture 66: Simple Regression Design in R (11 min): DOE simple example.R, DOE Simple Example.xlsx

Tuesday, November 20

Lecture 67: Blocking in Experimental Design (21 min) - hardcopy of the slides: Lecture67.pdf

Lecture 68: Factorial Design of Experiments (30 min) - hardcopy of the slides: Lecture68.pdf

Lecture 69: Analysis of Covariance in R (13 min): DOE analysis of covariance.R
DOE Analysis of Covariance.xlsx

Lecture 70: Factorial Design in R (30 min): DOE Factorial Design.R, FactorialDesign.csv
Factorial Designs.xlsx

Thursday, November 22

No class - Happy Thanksgiving!

Tuesday, November 27

Lecture 71: Response Surface Modeling (20 min) - hardcopy of the slides: Lecture71.pdf

Lecture 72: Final Thoughts on Design of Experiments (18 min) - hardcopy of the slides: Lecture72.pdf

Lecture 73: Response Surface Modeling in R (14 min): DOE Response Surface.R

Thursday, November 29

Review materials for Exam #2: Exam2_Review.pdf

Exam #2 Take-Home Piece given out: Exam2_Take Home.pdf

Lecture 74: Bayesian Regression, part 1 (26 min) - hardcopy of the slides: Lecture74.pdf

Lecture 75: Bayesian Regression, part 2 (24 min) - hardcopy of the slides: Lecture75.pdf

Lecture 76: Bayesian Regression, part 3 (xx min) - hardcopy of the slides: Lecture76.pdf

Lecture 77: Bayesian Regression in R (xx min): (sorry - not completed!)

Tuesday, December 4

Exam #2 (In-Class Piece): Exam2_In Class.pdf

Exam Solutions: Exam2_Take Home - Solution.pdf, Exam2_In Class - Solution.pdf

Thursday, December 6

Model Building Contest Results! Here is my R-code solution: WineQuality.R

Lecture 78: Measurement Uncertainty - hardcopy of the slides: Lecture78.pdf

NIST Technical Note 1297 (1994): Guidelines for Evaluating and Expressing the Uncertainty of NIST Measurement Results

GUM: Guide to the Expression of Uncertainty in Measurement

Lecture 79: Propagation of Uncertainty - hardcopy of the slides: Lecture79.pdf

 

Final Exam: Exam_Final.pdf

Final Exam Solution: Exam_Final - Solution.pdf

 

Return to Data To Decisions home page.