The first download of the Hortonworks Hadoop Sandbox was corrupted; the second attempt worked fine.
Installation
I installed Oracle VirtualBox first. Then, in the Oracle VM VirtualBox Manager, I selected the File | Import Appliance... option, chose the HDP_2.4_virtualbox_v3.ova file, and clicked Next and then Import. A few seconds later, the box was installed, so I … Continue reading Hadoop Follow Up – Hortonworks HDP Sandbox
Hadoop
My First Foray into Hadoop
So I have a big dataset (1.7 billion rows) that I want to analyze. I figured, "Hey, Hadoop is all over this Big Data thing, I wonder if I can do a Proof of Concept?"
Compiling Hadoop on Windows (Ugh!)
So, first, I tried to follow some instructions on how to get the Hadoop source … Continue reading My First Foray into Hadoop