Datasets
Download and inspect the datasets used throughout the course.❗
Data hygiene
Always check column names, missing values, and units before modeling.
Sample workflow
Dataset Inspection
r
data <- read.csv("dataset.csv")
str(data)
summary(data)
colSums(is.na(data))Glossary
Training set
Data used to fit the model.
Validation set
Data used to test model performance.
Target variable
The outcome being predicted.