More Into Hadoop
Reinforce the concepts of Hadoop and MapReduce. Solve remaining problems from previous tutorials.
Slides
Q/A Sections
Tips will be added based on your problems.
How does start-all.sh know where to start the datanodes?
Check conf/slaves. It lists, one per line, the hosts on which a datanode is started.
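As a rough illustration of what the start scripts do with that file, the sketch below reads the contents of conf/slaves the way a host list is usually read: comments and blank lines are ignored, and every remaining entry is treated as a host. The function name read_slaves and the hostnames worker1 and worker2 are made up for this example; the real work is done by the shell scripts shipped with Hadoop, which connect to each listed host over ssh.

```python
def read_slaves(text):
    """Return the worker hostnames listed in the contents of a slaves file."""
    hosts = []
    for line in text.splitlines():
        # Drop anything after '#', then split what is left on whitespace.
        entry = line.split("#", 1)[0]
        hosts.extend(entry.split())
    return hosts


# Hypothetical contents of conf/slaves for a two-worker cluster.
example = """\
# worker nodes
worker1
worker2   # second datanode

"""

for host in read_slaves(example):
    print("would start a datanode on", host)
```

If a datanode does not come up on some machine, checking that its hostname appears in conf/slaves on the node where you run start-all.sh is a good first step.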
How do I remove storage that was not fully removed before?
You may mess up your VM sometimes. The ultimate solution is to delete it and start a new one. If you didn't remove the old VM's storage, Azure won't allow you to create the new VM.
If you have trouble finding where to remove it, check the notes from Wu Yang.
Wu Yang
Robin Lee
Outcome of This Tutorial
- Strengthen your understanding of each component of Hadoop.
- Strengthen your understanding of the MapReduce job workflow.
- Know the layout of the Hadoop directory and where to find resources.