AI-powered search & chat for Data / Computer Science Students

The New Generation Data Lake

 Towards Data Science

The volumes of data used for Machine Learning projects are relentlessly growing. Data scientists and data engineers have turned to Data Lakes to store vast volumes of data and find meaningful…

My Definition of Data Lake

 Analytics Vidhya

Unlike most similar articles, I’ll focus on explaining the concept itself and, of course, my subject: “my definition of data lake”. I won’t go into technology details. Please understand…

What is a Data Lake?

 Towards Data Science

Both Data Lakes and Data Warehouses are established terms when it comes to storing Big Data, but the two terms are not synonymous. A data lake is a large pool of raw data for which no use has yet…

Data Lake And Quality Assurance

 Analytics Vidhya

A data lake is a centralized repository of data that allows you to store all your structured and unstructured data at any scale. You can store your data as-is, without having to first structure the…

Data Lake: an asset or a liability?

 Towards Data Science

A Data Lake, as its name suggests, is a central repository of enterprise data that stores structured and unstructured data. The promise of a Data Lake is “to gain more visibility or put an end to…

Do you really need a data lake?

 Towards Data Science

A data lake is an important component of any data strategy. What kinds of problems will a data lake solve, and how would it address Business Intelligence and Advanced Analytics issues?

How to build a data lake from scratch — Part 1: The setup

 Towards Data Science

A complete tutorial on using popular technologies to build a data lake and data engineering sandbox with docker-compose.

From Data Lakes to Data Reservoirs

 Towards Data Science

It is amusing that when we talk about data, the best analogy is typically rooted in water. This makes sense: in order to fathom the idea of data, which comes in all shapes and sizes, people tend to…

Lakehouse and the evolution of Data Lake

 Towards AI

Lakehouse's main goal is to bring the key features from data warehouses into the data lake model with the open-source storage layer Delta Lake.

What is Data Lakehouse? 👀

 Analytics Vidhya

Data warehouses are systems that contain relational data from the past, where we perform data transformations or data cleaning with ETLs. Data warehouses are commonly used to find answers to existing…

What is a Data Lake? It is not a Data Swamp

 Towards Data Science

At work, I am currently building a data lake on the Google Cloud Platform. While working, you really realize how much data a medium-sized company can already have. I work in the energy sector. In…

How to build a data lake from scratch — Part 2: Connecting the components

 Towards Data Science

A complete tutorial on using popular technologies to build a data lake and data engineering sandbox with docker-compose. Part 2.

Moving from a database mindset to a Data Lake mindset

 Towards Data Science

There are several key conceptual differences between working with databases and Data Lakes. In this post, let’s identify some of these differences which may not be intuitive at first sight…

A Gentle Introduction to Data Lakehouse

 Towards Data Science

Data Lakehouse is a new data architecture that has been mentioned a lot in the past few years. It has been proposed in order to solve the pain points that old and well-established data architectures…

Benefits of a Hybrid Data Lake

 Towards Data Science

Both data lakes and data warehouses are established terms when it comes to storing Big Data, but the two terms are not synonymous. A data lake is a large pool of raw data for which no use has yet…

From Data Warehouse to Data Lake to Data Lakehouse

 Towards Data Science

What’s for what, what you need, and what the advantages and limitations are. Before we go to the Data Lake, we need to go through the other data store technologies to see the full picture and to understan...

Databricks Delta Lake — Database on top of a Data Lake

 Towards Data Science

Going back 8 years, I still remember the days when I was adopting Big Data frameworks like Hadoop and Spark. Coming from a database background, this adaptation was challenging for many reasons. The…

Databricks Delta Lake — Database on top of a Data Lake — Part 2

 Towards Data Science

In Part 1 we explored how Delta Lake features like ACID Transactions, Checkpoints, Transaction Log & Time Travel can positively impact change data capture, processing and management. In this article…

Data Lakes, and SQL???

 Towards Data Science

With greater data volumes, the push is toward newer technologies and paradigm changes. SQL meanwhile has remained the mainstay. Here, I explore how SQL is used with Data Lakes and the new data…

Big Data & Data Lake a complete overview

 Oracle Developers

What’s the crack, jack? If you ever wanted to know what Big Data is and not what you think Big Data is, or if you ever wanted to know what a Data Lake is and not what you think a Data Lake is, you should che...

How to Ingest and Consume Data from Azure Data Lake

 Towards Data Science

Analysis of ingestion/consumption patterns, including a Delta Lake PoC

Build An Enterprise Data Lake On Cloud — Introduction

 Analytics Vidhya

This blog is part 1 of a series that covers how to create a data lake on the cloud. The first and most important thing for every solution/design architect to understand is that building the data…

Getting Started with Data Lakes

 Towards Data Science

Ideas and explanations behind the complement to Data Warehouses

Data Lake VS Data Warehouse

 Towards Data Science

Data Lakes and Data Warehouses are used widely to store large amounts of data. However, they are not interchangeable terms. You will be surprised to know that both of these approaches are…
