What is a Data Lake?


A data lake is a centralized repository that stores data in its raw, native format. Data lakes are often built on storage systems such as HDFS or NoSQL databases, which hold the data until it is processed for analytics.
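
The idea of storing raw data first and processing it later can be illustrated with a minimal sketch. It assumes a local directory standing in for the lake's storage layer (not HDFS or a NoSQL database), and the function names `ingest_raw` and `read_events`, the source names, and the sample records are all hypothetical, chosen only for illustration.

```python
import json
import tempfile
from pathlib import Path


def ingest_raw(lake_dir: Path, source: str, name: str, payload: bytes) -> Path:
    """Store the payload byte-for-byte under lake_dir/source/name.

    Nothing is parsed or validated at write time; the data stays raw.
    """
    target = lake_dir / source / name
    target.parent.mkdir(parents=True, exist_ok=True)
    target.write_bytes(payload)
    return target


def read_events(lake_dir: Path, source: str) -> list:
    """Interpret the raw JSON-lines files of one source at read time."""
    records = []
    for path in sorted((lake_dir / source).glob("*.jsonl")):
        for line in path.read_text(encoding="utf-8").splitlines():
            if line.strip():
                records.append(json.loads(line))
    return records


def demo() -> list:
    # A temporary directory plays the role of the lake (an assumption).
    with tempfile.TemporaryDirectory() as tmp:
        lake = Path(tmp)
        # Structured and unstructured data land side by side, unchanged.
        ingest_raw(
            lake,
            "clickstream",
            "2024-01-01.jsonl",
            b'{"user": "a", "clicks": 3}\n{"user": "b", "clicks": 5}\n',
        )
        ingest_raw(lake, "images", "logo.bin", b"\x89PNG")
        # Structure is applied only when the data is needed for analysis.
        return read_events(lake, "clickstream")


if __name__ == "__main__":
    events = demo()
    print(sum(event["clicks"] for event in events))
```

In this sketch the write path never inspects the payload, so any kind of file can be stored; only the read path decides how to interpret it, which is one common way to describe the difference between a data lake and a store that enforces a schema on write.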