MACHINE LEARNING ALGORITHMS

WHAT IS MACHINE LEARNING?

Machine learning is the science of making computers learn and act like humans by feeding data and information without being explicitly defined.

MACHINE LEARNING IS OF TWO TYPES:

SUPERVISED LEARNING

UNSUPERVISED LEARNING

1.SUPERVISED LEARNING:

A model is able to predict with the help of labeled dataset

In supervised machine learning the algorithms teaches the machine to learn from the labeled data .

In the above diagram the known data is labeled with known response i.e either spoon or knife.The machine is trained based on the features such as height,sharpness etc.

When we enter a new data it predicts whether it is a knife or spoon based on the labeled trained data.

TYPES OF SUPERVISED LEARNING:

1. CLASSIFICATION

When the input variable is categorical i.e yes/no , true/false .We make use of classification.

TYPES OF CLASSIFICATION ALGORITHMS:

Logistic Regression
Naive Bayes Classifier
Support Vector Machines
Decision Trees
Random Forest
Nearest Neighbor
Neural Network

2. REGRESSION :

Regression is used in data where a change in one variable is associated with a change in another variable.

For example consider temperature and humidity.Temperature is the independent variable and humidity is the dependent variable.As temperature increases humidity decreases.Hence they are correlated.

When we feed this data into regression model it understand the relationship between them .If we place the temperature value in the model it will be able to predict the humidity.

TYPES OF REGRESSION MODELS:

Linear regression

Multiple linear regression

Polynomial linear regression

DETAILS OF CLASSIFICATION ALGORITHMS:

1.LOGISTIC REGRESSION:

Logistic regression is a statistical method for analyzing a dataset in which there are one or more independent variables that determine an outcome. The outcome is measured with a dichotomous variable (in which there are only two possible outcomes).

CONSIDER THIS EXAMPLE:

Imagine it's been a few years since you serviced your car and you want to predict when will your car break down.

The above graph is obtained when we plot the data of several car users using this classification technique.

Whers the y-axis representes the probability of breakdown and x-axis represents years.

From this graph we can predict a car will break or not.

We can observe that the blue curve is close to x-axis that means the probability of breakdown is less and as the years increases the probability of breakdown also increases.

Let us look briefly into the graph for better understanding:

The threshold valuse is calculated based on the data.

And the graph shows that after 3.5 years the car is more likely to break down.

Hence you can easily predict the data.

For math behind logistic regression visit:click here

APPLICATIONS OF LOGISTIC REGRESSION:

IMAGE CATEGORIZATION

HEALTH CARE(to predict the survival of a person)

2.NAIVE BAYES CLASSIFIER:

Naive Bayes classifier are a collection of classification algorithms based on Bayes Theorem .It is not a single algorithm but a family of algorithms where all of them share a common principle,i.e every pair of feature being classified is independent of each other.

Before we move ahead let us discuss about simple concepts of probability:

Lets us consider the example of tossing two coins:

The chances may be of getting Two Heads,Head and a Tail,Tail and Head,Two Tails.

p(Getting two heads)=1/4
p(At least one tail)=3/4
p(Getting two heads)=1/4

The formula for BAYES CLASSIFICATION is

Let us predict whether a person will purchase a product based on specific combination of day,discount and free delivery using Naive Bayes Classifier.

Let us consider the sample data:

Now calculate the probability for each column like number of yes and no probability for weekend,likewise for holiday ,yes and no attributes in discount and free delivery columns too.

The result will be like:

Now let us predict whether a person will buy or not if Day=holiday,Discount=yes,
Free delivery=yes

Using the formula of naive bayes we can predict by simply placing the above values in the formula based on the conditions.

The result for person will not buy will be:

The result for person likely to buy is:

Comparing both the values person will buy the product based on our given condition as probability of person buys is greater than probabilty of person not buying the products.

APPLICATIONS OF NAIVE BAYES:

NEWS CATEGORIZING

EMAIL SPAM DETECTION

FACE RECOGNITION

SENTIMENT ANALYSIS

DIGIT RECOGNITION

3.SUPPORT VECTOR MACHINE:

A support vector Machine is a discriminative classifier formally defined by a seperating hyperplane.In other words,given labeled training data(supervised learning),the algorithms outputs an optimal hyperplane which categorizes new examples.

EXAMPLE:

Let us consider there are set of people with different heights,weights

So what a SVM does is,it groups the people in the graph as below:

where y-axis represent height of the person and x-axis represents weight

We can divide both groups by a line.Support vector Machine follows some rules for dividing the data.

Let us divide the group by our wish without following SVM

Look at the below figure:

If we divide them like this there will be error when predicting the data.

As of the above figure the red person is a new data point and we have to predict whether the person belong to left group or right.

The groups wern't divided equally by the two lines.So the prediction can be wrong or may not be accurate

SVM follows some rules and divide the grouping like this:

SVM divides the grouping in such a way that there is maximum distance between the two dotted lines

The two points near the the dotted line are called Support Vector Points.

Hence we can easily predict the new data point.

APPLICATIONS OF SVM :

FACE DETECTION

TEXT AND HYPERTEXT CLASSFICATION

HAND WRITTING RECOGNITION

4.DECISION TREES:

A decision tree is a decision support tool that uses a tree like model of decisions and their possible consequences ,including chance event outcomes,resources costs,and utility.It is one way to display an algorithm that only contains conditional control statements.Each branch of tree represents possible decision,occurrence of reaction.

We must be aware of basic terms while discussing about decision tree:

ENTROPY: It is the measure of randomness or unpredictability in the data set.
INFORMATION GAIN: It is the measurement of decrease in entropy after the dataset is split

Let us consider the example of Animals:

Now based on the data caculate the entropy for the entire table,the result will be as follows:

Similarly calculate the Information gain of each attribute.

The one with the Highest Information gain will be the root node

In this case the Highest Information Gain is for yellow,therefore the splitting will be in such a way:

And similarly the height attribute have the second largest Information Gain in the table.

Hence second splitting is done Based on height.

APPLICATIONS OF DECISION TREES:

CUSTOMER RELATIONSHIP MANAGEMENT

FRAUDULENT STATEMENT DETECTION

FAULT DIAGNOSIS

HEALTH CARE MANAGEMENT

5.RANDOM FOREST:

Random forests or random decision forests are an ensemble learning method for classification,regression and other tasks that operates by constructing a multitude of decision trees at training time and outputting the class that is the mode of the classes or mean prediction of the individual trees.Random decision forests correct for decision trees habit of overfitting to their training sets

Training time is less.

Runs efficiently on large databases

Example:

Let us consider a data set of fruits:

Now what random forest does is,It constructs different decision trees based on differnt set of conditions. For the above data it construct trees with different root nodes such as colour=yellow, diameter>3 etc

The decision trees will be as follows:

Now if you want to predict a data , say for example you want to predict what kind of fruit is the below figure

Our decision trees will predict in such a way

The first tree predicted it as an orange

The second tree predicted it as a cherry

The third tree predicted it as an orange

As the majority of trees predicted it as ORANGE the machine will recognize it as Orange.

APPLICATIONS OF RANDOM FOREST:

BANKING

STOCK MARKET

MEDICINE

E-COMMERCE

6. K-NEAREST NEIGHBOUR:

K nearest neighbors is a simple algorithm that stores all available cases and classifies new cases based on a similarity measure (e.g., distance functions). KNN has been used in statistical estimation and pattern recognition already in the beginning of 1970’s as a non-parametric technique.

KNN algorithm is based on feature similarity.Choosing the right value of K is a process called parameter tuning,and is important for better accuracy.

KNN must be used in noisy free data for better accuracy

Before we discuss about Knn we must know about EUCLIDEAN DISTANCE

the formula for euclidean distance is:

Now let us consider a data set

It consists of height,weight and it says whether a person is overweight or normal.

Let us predict for the below data

We must plot all the points in a graph and calculate the euclidean distance from unknown point to the other known points as shown in below figure

How to choose value of k:

Write all the distances in tabular form

As for k=3 the last three rows represent normal condition...our data set will be predicted as normal person.

APPLICATIONS OF KNN:

RECOMMENDATION SYSTEM

CREDIT RATING

LOAN MANAGEMENTS

MONEY LAUNDERING ANALYSIS

7.NEURAL NETWORK:

Neural Networks are a class of models within the general machine learning literature. Neural networks are a specific set of algorithms that have revolutionized machine learning. They are inspired by biological neural networks and the current so-called deep neural networks have proven to work quite well. Neural Networks are themselves general function approximations, which is why they can be applied to almost any machine learning problem about learning a complex mapping from the input to the output space.

POPULAR NEURAL NETWORKS:

FEED FORWARD NEURAL NETWORK

RECURRENT NEURAL NETWORK

CONVENTIONAL NEURAL NETWORK

FEED FORWARD NEURAL NETWORK:

In a Feed Forward Neural Network ,information flows only in forward direction,from the input nodes through the hidden layers and to the output nodes.

There are no cycles or loops in the network.

DECISIONS ARE BASED ON CURRENT INPUT

NO MEMORY ABOUT PAST

NO FUTURE SCOPE

RECURRENT NEURAL NETWORK:

Recurrent Neural Networks are the state of the art algorithm for sequential data and among others used by Apples Siri and Googles Voice Search. This is because it is the first algorithm that remembers its input, due to an internal memory, which makes it perfectly suited for Machine Learning problems that involve sequential data

THIS IS ADVANCEMENT FFN

CAN HANDLE SEQUENTIAL DATA

CONSIDERS BOTH CURRENT AND PREVIOUS INPUT

CAN MEMORIZE PREVIOUS INPUTS

APPLICATIONS OF RNN:

USED IN IMAGE CAPTIONING

TIME SERIES PREDICTION

NATURAL LANGUAGE PROCESSING

MACHINE TRANSLATION

CONVOLUTIONAL NEURAL NETWORK:

CNN is a feed forward neural network that is generlly used to analyze visual images by processing data with grid like topology .A CNN is also known as 'CONVNET'

LAYERS OF CNN:

CONVOLUTION LAYER

RELU LAYER

POOLING LAYER

FLATTENING

FULLY CONNECTED LAYER

CONVOLUTION LAYER:

A convolution layer has a number of filters that performs convolution operation

RELU LAYER:

Real image is scanned in multiple convolution Relu layers for locating features

POOLING LAYER:

The feature map now goes through a pooling layer.pooling is a down sampling operation that reduces the dimensionality of feature map.

max number from each 2*2 matrix is collected and then placed in new matrix

FLATTENING:

It is the process of converting all the resultant 2 dimensional arrays from pooled feature map into a single long continous vector

FULLY CONNECTED LAYER:

The flattened matrix from the layer is fed as input to the fully connected layer to classify the image

HOW THE ENTIRE PROCESSING WORKS?

After completion of these 5 steps machine may identify an image.

APPLICATIONS OF CNN:

FACE RECOGNITION

SCENE LABELING

IMAGE CLASSIFICATION

ACTION RECOGNITION

TYPES OF UNSUPERVISED LEARNING:

1.CLUSTERING:

A clustering problem is where you want to discover the inherent groupings in the data, such as grouping customers by purchasing behavior.

EXAMPLE: K MEANS CLUSTERING

2.ASSOCIATION:

An association rule learning problem is where you want to discover rules that describe large portions of your data, such as people that buy X also tend to buy Y

EXAMPLE: APRIORI

K MEAN CLUSTERING:

K-means clustering is one of the simplest and popular unsupervised machine learning algorithms.

Typically, unsupervised algorithms make inferences from datasets using only input vectors without referring to known, or labelled, outcomes.

A cluster refers to a collection of data points aggregated together because of certain similarities.

How the K-means algorithm works:

To process the learning data,the K-means algorithm in data mining starts with a first group of randomly selected centroids, which are used as the beginning points for every cluster, and then performs iterative (repetitive) calculations to optimize the positions of the centroids

It halts creating and optimizing clusters when either:

The centroids are stabilized i.e there is no change in their values because the clustering has been successful.
The defined number of iterations has been achieved.

APPLICATIONS OF K MEAN CLUSTERING:

MOVIE RECOMMENDATION

FRAUD DETECTION

BANKING SECTOR

INSURANCE COMPANY