# Naïve Bayes — The Idiot Genius of Algorithms: Machine Learning in Python

Probability and Classification is one of the most important aspect of Machine Learning. They often go hand in hand with each other. We use various algorithms to classify data into distinguishable classes. One such algorithm is The Naïve Bayes Algorithm.

“A learner that uses Bayes’ theorem and assumes the effects are independent given the cause is called a Naïve Bayes classifier. That’s because, well, that’s such a naïve assumption.”
— Pedro Domingos

# The basics of Naïve Bayes Algorithm

In order to understand Naïve Bayes, we require some basic knowledge of probability and the Bayes Theorem. To understand this, lets consider an example:

A Dice and a…

# Support Vector Machine: Machine Learning in Python

Moving on with our knowledge from Logistic Regression — A Supervised Learning Algorithm for Classification of Data. We now study a much more geometrically motivated algorithm — The Support Vector Machine Algorithm.

With this blog, we will learn about the working and the agenda behind using SVM for classification. We will also discuss about the need of SVM over Logistic Regression Algorithm and when to use it. Although, Support Vector Machine can be used for both regression and classification tasks, it is mainly used for classification problems.

# Support Vector Machine — The way to Social Distancing!

Logistic Regression is a probabilistic binary classifier algorithm. This means it classifies the…

# K-Means Clustering: Machine Learning in Python

With our gathered knowledge from learning K-Nearest-Neighbors — KNN, which is a Supervised Learning Algorithm and a Lazy learner we found that all similar things exist in close proximity. It hinges on this assumption being true enough for the algorithm to be useful.

K-Means is no different. Based on the same basic principle of proximity or similarity, K-Means is also able to categorize data points into groups. In this blog, we will go over the Math behind K-Means Clustering and build a small model from scratch.

# K-Means Clustering — Introduction

K-Means Clustering, also known as Lloyd’s Algorithm, is an iterative, data-partitioning, Unsupervised Learning Algorithm…

# Logistic Regression: Machine Learning in Python

Finding whether or not something will happen is another dilemma we face everyday. We are faced with the question of Yes or No all the time. Researchers in the field of Machine Learning are no different.

In Machine Learning, to answer this question of probability of an event happening is solved using Logistic Regression. Although it is called “Regression”, Logistic Regression is an algorithm built to solve Classification Problems.

# Math behind Logistic Regression

In statistics, the logit function or the log-odds is the logarithm of the odds p/(1-p) where p is a probability. …

# Decision Tree and Random Forest: Machine Learning in Python

Decisions, Decisions, Decisions… we make numerous decisions everyday; unconsciously or consciously, sometimes doing it automatically with little effort and sometimes, agonizing for hours over another. If only there was a way to chart a path to the conclusion.

Decision Tree is one of the most popular and powerful tool for Classification and Regression. Clear from its name, it is a flowchart like tree structure. It’s this property of Decision Tree that makes them easy to understand and interpret.

# K-Nearest Neighbors : Machine Learning in Python

Today, we expect our machine to be autonomous, intelligent, and decision maker. We want them to make our lives easier and hassle free. In this blog, we will learn how a machine can be trained to take such decisions for us.

The oldest, shortest words — ‘Yes’ and ‘No’ — are those which require the most thought.
— Pythagoras

Classification — A problem of identifying to which set of categories a new observation belongs. Classification comes naturally to human beings, we see similar looking, feeling, or even smelling things and we put them under the same category. This technique is…

# Linear Regression: Machine Learning in Python

Getting into Machine Learning, one of the first things everyone learns is Regression. It is a Supervised Learning technique which helps us in finding the correlation between two or more variables. It enables us to predict the continuous output (dependent) variable based on one or more predictor (independent) variables.

“An approximate answer to the right problem is worth a good deal more than an exact answer to an approximate problem.”
— John Tukey

# Regression Algorithms

There are numerous Regression Algorithms used in Data Science and Machine Learning. Each algorithm is used in a different scenario, which majorly depends upon the type of…

# Why study Mathematics? II : Machine Learning in Python

In this follow-up blog, we shall study about the next concept of Mathematics behind Machine Learning.

“There are three types of lies — lies, damn lies, and statistics.”
— Benjamin Disraeli

We’ve already seen how Linear Algebra and Probability work and function in the world of Machine Learning. If you haven’t already, read the previous blog first: Why study Mathematics? : Machine Learning in Python | by Divyansh Chaudhary | Jan, 2021 | Medium. The next topics we encounter the most while working through data and predictions are Statistics and Calculus.

# Statistics

Statistics is an important tool of Machine Learning. It…

# Why study Mathematics? : Machine Learning in Python

“Mathematics is not about numbers, equations, computation, or algorithms: it is about understanding.”
— William Paul Thurston

Understanding is a crucial part on the journey of becoming a Machine Learning Professional. Even though one might argue that learning mathematics behind machine learning is not necessary as python provides numerous libraries to perform these mathematical operations, it is a fallacy that has created a false sense of expectation among inspiring ML Professionals.

# Mathematics behind Machine Learning

If you are feeling a bit of anxiety even before reading this blog, don’t worry. Mathematics can be quite complex for some people, especially for people coming from non-technical…

# Data Visualization III: Machine Learning in Python

With all our gathered knowledge from the previous Data Visualization blogs, Data Visualization and Data Visualization II, our next move should be to gain an even deeper understanding of visualization in Python. 