Roadmaps - fundamentals

Before learning about Data Science or Data Engineering you need to understand the fundamentals of Data Analytics, Statistics, and the most used frameworks. Follow the path below before jumping on any other path! Every box contains a link to free learning resources or official documentation.

Do you like to study on your own but with the support of a collaborative learning community? Then join the Data Science Foundation - study circle.

Fundamentals

Fundamentals
Matrices & Linear Algebra Fundamentals
Matrices & Linear Algebra Fun...
Database Basics
Database Basics
Relational &. non-relational databases
Relational &. non-relational databases
SQL + Joins
SQL + Joins
NoSQL
NoSQL
Tabular Data
Tabular Data
Data Frames & Series%3CmxGraphModel%3E%3Croot%3E%3CmxCell%20id%3D%220%22%2F%3E%3CmxCell%20id%3D%221%22%20parent%3D%220%22%2F%3E%3CmxCell%20id%3D%222%22%20value%3D%22Tabular%20Data%22%20style%3D%22rounded%3D1%3BwhiteSpace%3Dwrap%3Bhtml%3D1%3B%22%20vertex%3D%221%22%20parent%3D%221%22%3E%3CmxGeometry%20x%3D%22170%22%20y%3D%22350%22%20width%3D%22170%22%20height%3D%2230%22%20as%3D%22geometry%22%2F%3E%3C%2FmxCell%3E%3C%2Froot%3E%3C%2FmxGraphModel%3E
Data Frames & Series%3CmxGra...
Extract, Transform, Load 
Extract, Transform, Load 
Reporting vs BI vs Analytics
Reporting vs BI vs Analytics
Data Formats
Data Formats
JSON
JSON
XML
XML
Regular Expressions (RegEx)
Regular Expressions (RegEx)
Python Basics
Python Basics
Important libraries
Important libraries
Virtual Environments
Virtual Environments
Expressions
Expressions
Variables
Variables
Data Structures
Data Structures
Functions
Functions
Install packages (via pip, conda...)
Install packages (via pip, conda...)
Code Style
Code Style
Numpy
Numpy
Pandas
Pandas
▶️ Basics
▶️ Basics
Python Programming
Python Programming

📊 Exploratory Data Analysis 

📊 Exploratory Data Analysis...
Dimensionality
Reduction
Dimensionality...
Normalization
Normalization
Data Cleaning,
Handling Missing Values
Data Cleaning,...
Estimators
Estimators
Binning sparse values
Binning sparse values
Feature Extraction
Feature Extraction
Denoising
Denoising
Sampling
Sampling
Principal Component Analysis 
Principal Component Analysis 
CSV
CSV
Public Datasets
Public Datasets
Kaggle
Kaggle
Jupyter Notebooks / Lab
Jupyter Notebooks / Lab
🔀 Data Sources
🔀 Data Sources
Data Mining
Data Mining
Web Scraping
Web Scraping
MongoDB
MongoDB
PostgreSQL
PostgreSQL
Scikit-Learn
Scikit-Learn
PyTorch
PyTorch
TensorFlow
TensorFlow
Matplotlib
Matplotlib
Rapid MIner
Rapid MIner
Excel
Excel
Data Science
Roadmap
Data Science...
Data Engineer
Roadmap
Data Engineer...

🔎 Legend

Yellow boxes are key subjects to study. Purple boxes are derivative topics. Blue boxes are tools to master. Boxes contain links to relevant learning resources.

🔎 Legend...
⭕️ Join the 'Data Science Foundation' study circle.
Text is not SVG - cannot display