HDFS

Hadoop Distributed File System, HDFS for short, is a Java-based distributed file system that allows to store large data sets (files which are in the range of terabytes and petabytes) reliably. HDFS is highly fault-tolerant and is designed to be deployed on low-cost hardware. It is the primary storage used by Hadoop applications. 

Learn more

First released 2005
Developed by Doug Cutting
Open-source Yes
Used by eBay, Facebook, Yahoo!

Interesting facts

HDFS is one of the two major components of Apache Hadoop (the second one is YARN).

22

Development by Synergize.digital

Sign up for updates
straight to your inbox