Orzota
Hive Tutorial for Beginners
Publish on
Hive is a data warehouse system for Hadoop that facilitates ad-hoc queries and the analysis of large datasets stored in Hadoop. Hive provides a SQL-like language called HiveQL. Due its SQL-like interface, Hive is increasingly becoming the technology of choice for using Hadoop. Objective The objective of this Hive tutorial is to get you up…
Categories: Blog
MapReduce Tutorial
Publish on
Objective We will learn the following things with this step-by-step MapReduce tutorial MapReduce programming with a column Writing a map function Writing a reduce function Writing the Driver class Prerequisites The following are the prerequisites for writing MapReduce programs using Apache Hadoop You should have the latest stable build of Hadoop (as of writing this…
Categories: Blog
Eclipse Setup for Hadoop Development
Publish on
Objectives We will learn the following things with this Eclipse Setup for Hadoop tutorial  Setting Up the Eclipse plugin for Hadoop Testing the running of Hadoop MapReduce jobs Prerequisites The following are the prerequisites for Eclipse setup for Hadoop program development using MapReduce and further extensions. You should have the latest stable build of Hadoop…
Categories: Blog
Single-Node Hadoop Tutorial
Publish on
Objectives We will learn the following things with this single-node Hadoop tutorial Setting Up Hadoop in Single-Node and Pseudo-Cluster Node modes. Test execution of Hadoop by running sample MapReduce programs provided in the Hadoop distribution. Prerequisites Platform Either Linux or Windows. Basic knowledge of Linux shell commands. Software Java™ 1.6.x must be installed. ssh must be…
Categories: Blog
Welcome
Publish on
We are very excited to launch the Orzota blog. We hope to provide articles on hadoop and related technologies which will hopefully prove useful. We will address both beginner topics for those just getting started on hadoop and also more advanced tips and techniques.
Categories: Blog