Supervised Learning
We will practice supervised learning with an example competition from Kaggle. The Titanic survival prediction is a getting started competition on Kaggle.
Preparation
Download the train.csv
and test.csv
from Kaggle.
Put the two files under current directory with those IPython Notebooks.
IPython Notebooks
- Preparation and Basic Statistics: online viwer, source
- Supervised Learning: online viwer, source
Outcome of This Tutorial
- Have a feel of what a data mining competition looks like in real life. Participating in those competitions is a good way to sharpen your skills and learn new things in the future.
- Build up the concept of supervised learning. Can recognize and transform real world problems into classification or regression format.
Outcome of Whole Tutorial Series and Homeworks
- Get to know state-of-the-art systems, platforms and workflows.
- Not afraid of self-learning any new algorithms and tools for data mining.