AI-powered search & chat for Data / Computer Science Students

Roadmaps - fundamentals

Before learning about Data Science or Data Engineering you need to understand some fundamentals. The Data Science Foundation roadmap cover topics like data sources, databases, Exploratory Data Analysis (EDA) and Python basics. Follow the path below before jumping on any other path.

Legend: yellow boxes are key subjects to study. Purple boxes are subtopics. Blue boxes are tools to master. Click on the boxes for recommended learning resources and official documentation. 

Fundamentals

Fundamentals
Matrices & Linear Algebra Fundamentals
Matrices & Linear Algebra Fun...
Database Basics
Database Basics
Relational &. non-relational databases
Relational &. non-relational databases
SQL + Joins
SQL + Joins
NoSQL
NoSQL
Tabular Data
Tabular Data
Data Frames & Series%3CmxGraphModel%3E%3Croot%3E%3CmxCell%20id%3D%220%22%2F%3E%3CmxCell%20id%3D%221%22%20parent%3D%220%22%2F%3E%3CmxCell%20id%3D%222%22%20value%3D%22Tabular%20Data%22%20style%3D%22rounded%3D1%3BwhiteSpace%3Dwrap%3Bhtml%3D1%3B%22%20vertex%3D%221%22%20parent%3D%221%22%3E%3CmxGeometry%20x%3D%22170%22%20y%3D%22350%22%20width%3D%22170%22%20height%3D%2230%22%20as%3D%22geometry%22%2F%3E%3C%2FmxCell%3E%3C%2Froot%3E%3C%2FmxGraphModel%3E
Data Frames & Series%3CmxGra...
Extract, Transform, Load 
Extract, Transform, Load 
Reporting vs BI vs Analytics
Reporting vs BI vs Analytics
Data Formats
Data Formats
JSON
JSON
XML
XML
Regular Expressions (RegEx)
Regular Expressions (RegEx)
Python Basics
Python Basics
Important libraries
Important libraries
Virtual Environments
Virtual Environments
Expressions
Expressions
Variables
Variables
Data Structures
Data Structures
Functions
Functions
Install packages (via pip, conda...)
Install packages (via pip, conda...)
Code Style
Code Style
Numpy
Numpy
Pandas
Pandas
▶️ Basics
▶️ Basics
Python Programming
Python Programming

📊 Exploratory Data Analysis 

📊 Exploratory Data Analysis...
Dimensionality
Reduction
Dimensionality...
Normalization
Normalization
Data Cleaning,
Handling Missing Values
Data Cleaning,...
Estimators
Estimators
Binning sparse values
Binning sparse values
Feature Extraction
Feature Extraction
Denoising
Denoising
Sampling
Sampling
Principal Component Analysis 
Principal Component Analysis 
CSV
CSV
Public Datasets
Public Datasets
Kaggle
Kaggle
Jupyter Notebooks / Lab
Jupyter Notebooks / Lab
🔀 Data Sources
🔀 Data Sources
Data Mining
Data Mining
Web Scraping
Web Scraping
MongoDB
MongoDB
PostgreSQL
PostgreSQL
Scikit-Learn
Scikit-Learn
PyTorch
PyTorch
TensorFlow
TensorFlow
Matplotlib
Matplotlib
Rapid MIner
Rapid MIner
Excel
Excel
Data Science
Roadmap
Data Science...
Data Engineer
Roadmap
Data Engineer...
Join the 'Data Science Foundation' self-teaching study circle.
Text is not SVG - cannot display