Tag: hadoop

How to Create an External Table in Hive

December 9, 2020

External tables in Hive let you query data that Hive does not manage itself: the files stay in their existing location, and dropping the table removes only its metadata. Follow the steps in this tutorial to learn how to create, query, and drop external tables.

How to Create a Table in Hive

October 28, 2020

Apache Hive is a data warehousing tool used to perform queries and analyze structured data in Apache Hadoop. Use this clear-cut tutorial to learn how to create tables in Hive.

What is HDFS? Hadoop Distributed File System Guide

August 27, 2020

Want to learn more about HDFS? This guide explains what the Hadoop Distributed File System is, how it works, and when to use it. If you need an effective solution for big data, HDFS and Hadoop could be the answer.

How to Install Apache Hive on Ubuntu

June 23, 2020

This tutorial shows you how to install and configure Apache Hive and run basic commands. Hive lets you query data on your Hadoop cluster using familiar SQL-like syntax.

Hadoop vs Spark – Detailed Comparison

June 4, 2020

This article explains how Hadoop and Spark differ across multiple categories. Both tools are free, but the comparison goes deeper than cost. Learn how they compare on their own and whether they can work together.

What is Hadoop MapReduce and How Does it Work

June 2, 2020

MapReduce is a framework that processes large data sets in parallel across a cluster, splitting the work into map and reduce phases to produce a summarized output. This article explains how MapReduce works and lists its basic concepts.

Apache Hadoop Architecture Explained (with Diagrams)

May 25, 2020

Apache Hadoop is the go-to framework for storing and processing big data. This article provides clear-cut explanations, Hadoop architecture diagrams, and best practices for designing a Hadoop cluster.
