Skytowner Guides on Machine Learning

“Skytowner Guides on Machine Learning” provides a comprehensive overview of machine learning concepts and applications. The content delves into building machine learning models, running experiments, and interpreting results. It emphasizes the importance of understanding data augmentation, deterministic processes, and fault-tolerant architectures in machine learning projects. Additionally, the guide explores the challenges of working with big data, the significance of context in AI applications, and the use of generative AI for enterprise solutions. Overall, the guide aims to equip readers with practical insights and strategies for successful machine learning implementation.

Comprehensive Guide on Cross Validation

 Skytowner Guides on Machine Learning

Cross validation is a technique to measure the performance of a model through resampling. It is a standard practice in machine learning to split the dataset into training and testing sets. The trainin...

📚 Read more at Skytowner Guides on Machine Learning
🔎 Find similar documents

Comprehensive Guide on Mean Absolute Error (MAE)

 Skytowner Guides on Machine Learning

Mean absolute error , or MAE, measures the performance of a regression model. The mean absolute error is defined as the average of the all absolute differences between true and predicted values: $$\ma...

📚 Read more at Skytowner Guides on Machine Learning
🔎 Find similar documents

Comprehensive Guide on ROC Curve

 Skytowner Guides on Machine Learning

The ROC (Receiver Operating Characteristic) curve is a way to visualise the performance of a binary classifier. Confusion matrix Consider the following confusion matrix , which is essentially just a s...

📚 Read more at Skytowner Guides on Machine Learning
🔎 Find similar documents

Comprehensive Guide on Logistic Regression

 Skytowner Guides on Machine Learning

Logistic regression is a popular classifier that predicts the probability of a binary event occurring. For instance, suppose we were given past data about the number of hours students spent studying a...

📚 Read more at Skytowner Guides on Machine Learning
🔎 Find similar documents

Gentle but Comprehensive Guide to Naive Bayes

 Skytowner Guides on Machine Learning

Simple dataset Consider the following dataset: Color Size Target Brown Small Cat Black Large Dog White Large Dog Black Large Cat Brown Small Cat Here, color is a categorical variable with 3 levels (Br...

📚 Read more at Skytowner Guides on Machine Learning
🔎 Find similar documents

Simple linear regression

 Skytowner Guides on Machine Learning

Linear regression is one of the most popular and simplest machine learning models that capture the relationship between two or more features. The objective of linear regression is to draw a line of be...

📚 Read more at Skytowner Guides on Machine Learning
🔎 Find similar documents

Comprehensive Guide to k-means Clustering

 Skytowner Guides on Machine Learning

k-means is perhaps the most popular unsupervised algorithm (requires no labeled data) for clustering data points. The objective of k-means is to partition the data into a predefined number of clusters...

📚 Read more at Skytowner Guides on Machine Learning
🔎 Find similar documents

Comprehensive Guide on DBSCAN

 Skytowner Guides on Machine Learning

DBSCAN , or Density-Based Spatial Clustering of Applications with Noise , is a clustering technique that relies on density to group data points. The basic idea behind DBSCAN is that points that are cl...

📚 Read more at Skytowner Guides on Machine Learning
🔎 Find similar documents

Getting Started with PySpark

 Skytowner Guides on Machine Learning

PySpark is an API interface that allows you to write Python code to interact with Apache Spark, which is an open source distributed computing framework to handle big data. As the size of data grows ye...

📚 Read more at Skytowner Guides on Machine Learning
🔎 Find similar documents

Applying a custom function on PySpark Columns with user-defined functions

 Skytowner Guides on Machine Learning

PySpark comes with a rich set of built-in functions that you can leverage to implement most tasks, but there may be cases when you would have to roll out your own custom function. In PySpark, we can e...

📚 Read more at Skytowner Guides on Machine Learning
🔎 Find similar documents

Getting Started with PySpark on Databricks

 Skytowner Guides on Machine Learning

Databricks is the original creator of Spark and describes themselves as an "open and unified data analytics platform for data engineering, data science, machine learning and analytics." The company ad...

📚 Read more at Skytowner Guides on Machine Learning
🔎 Find similar documents

Comprehensive Guide on Perceptrons in Machine Learning

 Skytowner Guides on Machine Learning

A perceptron takes in multiple inputs and returns a single binary output. Visually, a perception can be depicted as follows: Note the following: our perceptron has $2$ inputs: $x_1$ and $x_2$ . In gen...

📚 Read more at Skytowner Guides on Machine Learning
🔎 Find similar documents