Hadoop History

Apache-Hadoop-History

Hadoop History Hadoop was created by Doug Cutting who had created the Apache Lucene(Text Search),which is origin in Apache Nutch(Open source search Engine).Hadoop is a part of Apache Lucene Project.Actually Apache Nutch was started in 2002 for working crawler and search system.Nutch Architecture would not  scale up to billions of pages on the web. In […]

Apache Tez Introduction

Apache Tez

Apache Tez Introduction In Simple words,Apache Tez is framework for YARN-based,Data processing applications in hadoop,In detailed manner Apache Tez is an extensible framework for building Yarn based,High data performance batch and interactive data applications in Hadoop and Apache Tez can handle TB to PB of data sets.Apache Tez is used by hadoop ecosystem such as […]

Introduction to Hive

Introduction to Hive

Introduction to Hive What is Hive  Hive is a data warehouse software which is used for facilitates querying and managing large data sets residing in distributed storage.Hive language almost look like SQL language called HiveQL.Hive is designed to enable easy data summarization.Hive also allows traditional map reduce programs to customize mappers and reducers when it […]

Hadoop Hive ORC File Format

Hadoop Hive ORC File Format

Hadoop Hive ORC File Format ORC File Format Full Form is Optimized Row Columnar File Format.ORC File format provides very efficient way to store relational data then RC file,By using ORC File format we can reduce the size of original data up to 75%.Comparing to Text,Sequence,Rc file formats ORC is better . Using ORC files improves […]

Apache Hive 10 Best Practices

Apache Hive 10 Best Practices

Apache Hive 10 Best Practices Apache hive is looks like Traditional SQL software used Hadoop to  give users the capability of performing SQL-like queries on it’s own language,HiveQL works very quickly and efficiently.Comparing to Traditional SQL HiveQL gives users additional query and analytical abilities which are not available in Traditional SQL. With APache Hive we can […]