What are the most popular Hadoop tools/projects?

Posted on January 31, 2019

I have a question what are the most popular Hadoop tools/projects?

This textbox defaults to using Markdown to format your answer.

You can type !ref in this text area to quickly search our full set of tutorials, documentation & marketplace offerings and insert the link!

These answers are provided by our Community. If you find them useful, show some love by clicking the heart. If you run into issues leave a comment, or add your own answer to help others.

nishakale1122

January 31, 2019

Accepted Answer

Hive is an SQL-like language for data processing, which gets converted into a MapReduce job behind the scenes. Hive is popular because it is written using familiar SQL-like syntax. This is often confusing, because Hive doesn’t have all the controls of a relational database. but the query language is familiar.

Spark is popular, especially for data processing, analytics and distributed machine learning. Spark jobs can be written in Scala or Python. The latter makes Spark especially popular with data scientists and those from a statistics background.

With the speed of open source software development, this answer may be very different in a year’s time.

sivacynixit

September 3, 2019

the most popular Hadoop tools are:

Hadoop distributed File System
Hbase
Hive
Sqoop
Pig
NoSQL
GIS Tools
Spark: Apache Spark is an open-source distributed general-purpose cluster-computing framework. Spark provides an interface for programming entire clusters with implicit data parallelism and fault tolerance.
Flume
Ambari

Become a contributor for community

Get paid to write technical tutorials and select a tech-focused charity to receive a matching donation.

DigitalOcean Documentation

Full documentation for every DigitalOcean product.

Learn more

Resources for startups and AI-native businesses

The Wave has everything you need to know about building a business, from raising funding to marketing your product.

Learn more

Get our newsletter

Stay up to date by signing up for DigitalOcean’s Infrastructure as a Newsletter.

New accounts only. By submitting your email you agree to our Privacy Policy

The developer cloud

Scale up as you grow — whether you're running one virtual machine or ten thousand.

View all products

Get started for free

Get started

*This promotional offer applies to new accounts only.