Supervised Machine Learning
Label: variable we are predicting
Features: input variable describes our data
Example: particular instance of data
Labeled example: feature and label
Unlabeld example: feature but no label
Model: maps examples to predicted labels
Modeling Process
Training model on training set
Fit the model
Evaluate model on validation set
Estimate prediction error
Repeat
Record the result on test set
Assessment of the error
Metrics
True positive: correct
True negative: correct
False positive: identify as positive but its not
False negative: identify as negative but its not
Accuracy: correct classfication / total classifcation
Precision: TP/(TP+FP) correct/everything classfied as positive
Recall: TP/(TP+FN) correct/all actual positives
False positive rate: FP/(FP+TN)
F1: 2precisionrecall/(precision+recall)
Loss function
A loss function quantifies difference betwen actual response variable and the response variable predicted by your model