Hadoop Distributed File System, HDFS for short, is a Java-based distributed file system that allows to store large data sets (files which are in the range of terabytes and petabytes) reliably. HDFS is highly fault-tolerant and is designed to be deployed on low-cost hardware. It is the primary storage used by Hadoop applications.
|Developed by||Doug Cutting|
|Used by||eBay, Facebook, Yahoo!|
HDFS is one of the two major components of Apache Hadoop (the second one is YARN).