Elements of statistical learning with R
(TDAESLR)
Learning about probability distributions, hypothesis test, regression and classification by using R, the best open statical tool for data scientist. The theoretical part of the course is based on the famous "Probability and Statistics for Engineers and Scientists" by Sheldon M. Ross, while the practical one follows the exercises shown in "Understanding Statistics Using R" by R. Schumacker and S. Tomek.
Audience
Anyone with no prerequisites who would like to develop statistical analysis with R also as an introductory step to Machine Learning
Approaches (Objective)
R fundamentals
- Running R programs with R Studio
- Dataframes and lists
- Basic syntax with tidyverse
- Data wrangling with dplyr
- Data visualization with ggplot2
Probability and statistical theory
- Descriptive statistics
- Element of probability
- Random variables
- Distributions of sampling statistics
Statistical estimation and testing
- Distribution of sampling statistics
- Confidence intervals
- Hypothesis testing
Regression and analysis of variance
- Simple regression
- Analysis of residuals
- Polynomial regression
- Logistic regression teaser
- One-Way Analysis of Variance
- Two-Factor Analysis of Variance