"Tell me and I forget. Teach me and I remember. Involve me and I learn."
-Benjamin Franklin
I'm a big fan of practical learning, "implement as you learn" is my mantra for learning anything. Hadoop being open source gives the best opportunity for getting your hands dirty as you read about it. There are plenty of free resources online that you can refer to get started with and in this post, I'm going to list and refer some of the good ones I've come across.
Getting Started with Hadoop
Depending on your level of interest in learning and exploring Hadoop, you can enroll in any of the free online fundamental courses offered from Big Data University or watch video tutorials form edureka on YouTube. These two sources do not require a sign in from your corporate email id and give a basic overview on what Hadoop is? And of-course the documentation provided by Apache helps in understanding it detail, alternatively you can read the Yahoo Hadoop tutorial.
After you have read enough and feel like getting a hands on experience, you'll have two tracks to choose:
- Install it locally on your laptop
- Access it via a free Virtual Machine
The easy way of learning is of-course the VMs, they are available for different versions of Hadoop and will enable you to start quickly on trying out what's in the store for you? They do require a sign in from your corporate email ID but have a good knowledge base and tutorials followed by the exercises to practice what you learned. The good ones are Cloudera and Hortonworks. However, since they are a platform service provider company, you'll be hearing a sales pitch very often. Well, not bad at all to know for the resources they provide to the community. The only limitation I see with a VM is that you cannot perform a POC on your Company's in house live data source, unless you are flexible in taking a snapshot of it to the VM to play with.
On the other hand, you can perform Hadoop installation on your own laptop or a test server in your organization and setup things as you want. It is like doing it from scratch and it helps in learning in a better way but again it depends on what your interest is? Some of the good references for Hadoop installation are Apache installation guide and single node cluster setup from Michael G. Noll.
Happy learning!