Hadoop History

Apache-Hadoop-History

Hadoop History Hadoop was created by Doug Cutting who had created the Apache Lucene(Text Search),which is origin in Apache Nutch(Open source search Engine).Hadoop is a part of Apache Lucene Project.Actually Apache Nutch was started in 2002 for working crawler and search system.Nutch Architecture would not  scale up to billions of pages on the web. In […]

Hadoop Hive Architecture

Hadoop Hive Architecture

Hadoop Hive Architecture  Hive is one of the most important component of Hadoop,In previous post we discussed about Hive Introduction.Now we have to know about Hadoop Hive Architecture. The above diagram shows the basic Hadoop Hive architecture. Primarily The diagram represents CLI (Command Line Interface),JDBC/ODBC and Web GUI (Web Graphical User Interface ).This represents when […]

Apache Tez Introduction

Apache Tez

Apache Tez Introduction In Simple words,Apache Tez is framework for YARN-based,Data processing applications in hadoop,In detailed manner Apache Tez is an extensible framework for building Yarn based,High data performance batch and interactive data applications in Hadoop and Apache Tez can handle TB to PB of data sets.Apache Tez is used by hadoop ecosystem such as […]

Introduction to Hive

Introduction to Hive

Introduction to Hive What is Hive  Hive is a data warehouse software which is used for facilitates querying and managing large data sets residing in distributed storage.Hive language almost look like SQL language called HiveQL.Hive is designed to enable easy data summarization.Hive also allows traditional map reduce programs to customize mappers and reducers when it […]

Hadoop Hive ORC File Format

Hadoop Hive ORC File Format

Hadoop Hive ORC File Format ORC File Format Full Form is Optimized Row Columnar File Format.ORC File format provides very efficient way to store relational data then RC file,By using ORC File format we can reduce the size of original data up to 75%.Comparing to Text,Sequence,Rc file formats ORC is better . Using ORC files improves […]