Data Science & Developer Roadmaps with Chat & Free Learning Resources

The New Generation Data Lake

 Towards Data Science

The volumes of data used for Machine Learning projects are relentlessly growing. Data scientists and data engineers have turned to Data Lakes to store vast volumes of data and find meaningful…

Read more at Towards Data Science | Find similar documents

My Definition of Data Lake

 Analytics Vidhya

Unlike most of the similar articles, I’ll focus on explanation about this concept and obviously my subject: “my definition of data lake”. Not to mention about technology details. Please understand…

Read more at Analytics Vidhya | Find similar documents

What is a Data Lake?

 Towards Data Science

Both, Data Lakes and Data Warehouses are established terms when it comes to storing Big Data, but the two terms are not synonymous. A data lake is a large pool of raw data for which no use has yet…

Read more at Towards Data Science | Find similar documents

Data Lake And Quality Assurance

 Analytics Vidhya

A data lake is a centralized repository of data that allows you to store all your structured and unstructured data at any scale. You can store your data as-is, without having to first structure the…

Read more at Analytics Vidhya | Find similar documents

Data Lake: an asset or a liability?

 Towards Data Science

A Data Lake, as its name suggests, is a central repository of enterprise data that stores structured and unstructured data. The promise of a Data Lake is “to gain more visibility or put an end to…

Read more at Towards Data Science | Find similar documents

Do you really need a data lake?

 Towards Data Science

Data lake is an important component of any Data Strategy. What kind of problems will a data lake solve and how would it address Business Intelligence and Advanced Analytics issues?

Read more at Towards Data Science | Find similar documents

How to build a data lake from scratch — Part 1: The setup

 Towards Data Science

The complete tutorial of how to make use of popular technology to build a data lake and data engineering sandbox with docker-compose.

Read more at Towards Data Science | Find similar documents

From Data Lakes to Data Reservoirs

 Towards Data Science

It is amusing that when we talk about data the best analogy is typically rooted in water. This makes sense in order to fathom the idea of data — which comes in all shapes and sizes— people tend to…

Read more at Towards Data Science | Find similar documents

Lakehouse and the evolution of Data Lake

 Towards AI

Lakehouse's main goal is to bring the key features from data warehouses into the data lake model with the open-source storage layer Delta Lake.

Read more at Towards AI | Find similar documents

What is Data Lakehouse? 👀

 Analytics Vidhya

Data warehouses are systems that contain relational data from the past, where we perform data transformations or data cleaning with ETLs. Data warehouses commonly used to find answers to existing…

Read more at Analytics Vidhya | Find similar documents

What is a Data Lake? It is not a Data Swamp

 Towards Data Science

At work, I am currently building a data lake on the Google Cloud Platform. While working, you really realize how much data a medium-sized company can already have. I work in the energy sector. In…

Read more at Towards Data Science | Find similar documents

How to build a data lake from scratch — Part 2: Connecting the components

 Towards Data Science

The complete tutorial of how to make use of popular technology to build a data lake and data engineering sandbox with docker-compose. Part 2.

Read more at Towards Data Science | Find similar documents