HDFS

Hadoop Distributed File System, HDFS for short, is a Java-based distributed file system that allows to store large data sets (files which are in the range of terabytes and petabytes) reliably. HDFS is highly fault-tolerant and is designed to be deployed on low-cost hardware. It is the primary storage used by Hadoop applications. 

Learn more

First released 2005
Developed by Doug Cutting
Open-source Yes
Used by eBay, Facebook, Yahoo!

Interesting facts

HDFS is one of the two major components of Apache Hadoop (the second one is YARN).

7

Development by Synergize.digital

Sign up for updates
straight to your inbox