The data lake is one of the key buzzwords surrounding Big Data and Advanced Analytics. A data lake is a storage repository that holds raw data in its native format. That data can be structured (e.g., entire source data tables), semi-structured, or unstructured (e.g., photos, tweets).
Data lakes are linked to Advanced Analytics, as data scientists ideally leverage all data available in the organisation. Data lakes are similarly linked to Big Data: once you store all possible information in a single 'pool of data', you need special technologies, e.g. Hadoop or Spark, to access the relevant data smartly and quickly.
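
As a rough illustration of what "raw data in its native format" can mean in practice, the sketch below copies a structured, a semi-structured and an unstructured file into one repository without parsing or converting any of them. The folder layout (`raw/<source_system>/`), the function names `ingest` and `demo`, the source system name `crm` and the sample files are all assumptions made for this example. A real data lake would typically sit on distributed storage (for instance Hadoop's HDFS) rather than a local directory.

```python
import shutil
import tempfile
from pathlib import Path


def ingest(lake_root: Path, source_file: Path, source_system: str) -> Path:
    """Copy a file into the lake byte for byte, without parsing or converting it.

    The raw/<source_system>/ layout is a hypothetical convention for this sketch.
    """
    target_dir = lake_root / "raw" / source_system
    target_dir.mkdir(parents=True, exist_ok=True)
    target = target_dir / source_file.name
    shutil.copyfile(source_file, target)
    return target


def demo() -> dict:
    """Ingest three sample files and return what the lake holds afterwards."""
    with tempfile.TemporaryDirectory() as tmp:
        tmp_path = Path(tmp)
        inbox = tmp_path / "inbox"
        inbox.mkdir()
        lake = tmp_path / "lake"

        # Made-up sample payloads, one per kind of data named in the text.
        samples = {
            "customers.csv": b"id,name\n1,Ada\n",            # structured
            "tweet.json": b'{"user": "ada", "text": "hi"}',  # semi-structured
            "photo.jpg": b"\xff\xd8\xff\xe0 fake image",     # unstructured
        }
        for name, payload in samples.items():
            src = inbox / name
            src.write_bytes(payload)
            ingest(lake, src, source_system="crm")

        stored_dir = lake / "raw" / "crm"
        return {p.name: p.read_bytes() for p in sorted(stored_dir.iterdir())}


if __name__ == "__main__":
    for name, content in demo().items():
        print(name, len(content), "bytes")
```

The point of the design is that nothing is interpreted at ingestion time: the CSV, the JSON and the image bytes all land unchanged, and any schema is applied only later, when someone reads the data.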