Expected Learning Results
General Synthetic Description
The course provides notions on supervised learning techniques mainly in the framework of regression and classification approaches for both parametric (Linear Regression, logistic regression, discriminant analysis) and non-parametric (tree-based methods) approaches.
Basics of statistical learning (0.5 CFU). Estimation of Dependencies Based on Empirical Data. General Model of Learning from Examples. The problem of risk minimization. Assessing Model Accuracy, The Bias-Variance Trade Off. Regression vs Classification, Bayes Classifier, KNN. Learning Paradigms in Statistics Exploratory Data Analysis. Lab with R.
Linear regression (1.5 CFU). Introduction to Linear Regression Models. Estimating Model Parameters. Model Adequacy Checking. Assessing the Accuracy of the Coefficient Estimates. Properties of least-squares estimators. Confidence intervals and hypothesis testing. Multiple regression. Parameter estimation. Model assessment. Subset selection. Qualitative predictors. Extension of the linear model. Diagnostics for Multiple regression models. Lab with R.
Resampling methods (0.5 CFU). Basics of Sampling. Validation set approach. Leave-One-Out Cross-Validation and k-Fold Cross-Validation. Bias-Variance Trade-Off for k-Fold Cross-Validation. Cross-Validation on Classification Problems Lab with R.
Logistic regression (1.5 CFU). Logistic regression. Simple logistic regression. Parameter estimation and interpretation Diagnostic checking in Logistic Regression. Multiple logistic regression. Linear model selection criteria. Generalized Linear Models. Lab with R.
Generative Models for Classification (0.5 CFU). Linear discriminant analysis. Quadratic discriminant analysis. Naïve Bayes. Comparison among methods. Lab with R.
Tree-based methods (1.5 CFU). Basics of tree-based methods. Regression trees. Growing regression trees. Tree pruning. Classification trees Bagging, Random Forests Boosting. Lab with R.
1. James G., Witten D., Hastie T., Tibshirani R. An Introduction to Statistical Learning with Applications in R, Springer 2021
2. Hastie T., Tibshirani R., Friedman J. The Elements of Statistical Learning, Springer 2008.
3. Teaching material that will be made available during the lessons on Studium.
| Argomenti | Riferimenti testi | |
|---|---|---|
| 1 | Basics of statistical learning: Estimation of dependencies Based on Empirical Data, General Model of Learning from Examples, The problem of risk minimization, Assessing Model Accuracy, The Bias-Variance Trade Off, Regression vs Classification, Bayes Classifier, KNN, Learning Paradigms in Statistics Exploratory Data Analysis, Lab with R. | Slides, Textbook n.1, chap. 1-2. |
| 2 | Linear regression: Introduction to linear regression Models, Estimating Model Parameters, Model Adequacy Checking, Assessing the Accuracy of the Coefficient Estimates, Properties of least-squares estimators, Confidence intervals and hypothesis testing, Multiple regression, Parameter estimation, Model assessment, Subset selection, Qualitative predictors, Extension of the linear model, Diagnostics for Multiple regression models, Lab with R. | Slides, Textbook n.1, chapp. 3. |
| 3 | Resampling methods: Basics of Sampling, Validation set approach, Leave-One-Out Cross-Validation and k-Fold Cross-Validation, Bias-Variance Trade-Off for k-Fold Cross-Validation, Cross-Validation on Classification Problems, Lab with R. | Slides, Textbook n.1, chap. 5. |
| 4 | Logistic regression: Logistic regression, Simple logistic regression, Parameter estimation and interpretation, Diagnostic checking in Logistic Regression, Multiple logistic regression, Linear model selection criteria, Generalized Linear Models, Lab with R. | Slides, Textbook n.1, chapp. 4 and 6. |
| 5 | Generative Models for Classification: Linear discriminant analysis, Quadratic discriminant analysis, Naïve Bayes, Comparison among methods, Lab with R. | Slides, Textbook n.1, chap. 4. |
| 6 | Tree-based methods: Basics of tree-based methods, Regression trees, Growing regression trees, Tree pruning, Classification trees Bagging, Random Forests Boosting. Lab with R. | Slides, Textbook n.1, chap. 8 |
The purpose of the exam is to assess the attainment of the learning objectives. It involves an oral assessment featuring questions regarding the course content, as well as a discussion on a report detailing a practical data analysis conducted using the methodologies covered in the class and the R statistical software.
The final mark is based on the following criteria: