Interested in Personalized Training with Job Assistance? Know More

Complete Machine Learning Course in English > Regularization and Optimization

Underfitting and Overfitting in Models

17.7k

Start a new search

To find content from modules and lessons

Overview

Namaskar I am (name) from learnvern. (6 seconds pause, music)

In the continuation to the tutorial on machine learning we will continue ahead in today’s tutorial. In today’s tutorial; on machine learning we will see underfitting and overfitting although during discussions on algorithms many times you have heard about this concept and sometimes we have even touched upon this. Now, we will understand in proper manner as to what overfitting and underfitting is.

So when we talk about overfitting,over means more and under means less. From this also you can take a hint that over means something more is happening and under means something less is happening. Now what this more and less is that I shall tell you. Now in overfitting our model or the algorithm tries to cover all the data points and now if we collect data about all the data points then there arises a problem. Now the problem is that our training data has some noise and some inaccuracies and the model learns these noise and inaccuracies also during overfitting and the difficulty that arises is if you pass from this training data some data for prediction then you will get the correct output but if you pass any test data or new sample data then there will be a problem because accuracy there will be very less. So this is what happens in overfitting.

Now if we talk about underfitting then underfitting you would have already understood that the model here is not able to understand the data in the same way that it identifies a trend or creates a good mapping function between input and output. So this is the incapability of this model and it creates a lot of difficulty because even if you give training data, then too the accuracy will be poor and testing data also will not be generalized in a proper way and correct outputs won’t be displayed. So underfitting and overfitting both of them are a kind of problem for us.So for this reason we should identify such a fit that is optimum and which we can call a better or best fit.So in sklearn, let us refer to this document also and try to understand .

So here you can see the official document of sklearn when we applied a function which has a degree 1 which is a polynomial function, so here you can see that the degree is 1.So what happened in this particular function, in this particular function the blue line that you see is the line of our model and the true function is this orange line and sample are our data points. So this first example that we see here is underfitting because here the prediction on the training data which is plotted is going to be wrong and on the test data points it is obviously going to be wrong.

3:42

So this blue line is not ready in any way to give the output meaning that the accuracy will be very poor. Now when this degree was increased to four , at that time you will see that this blue line has taken the form of a curve and in a good manner it is matching with the true function, so we can say that this is optimized or its optimal fit. Now When the degree was further increased, meaning it was made 15 , in this case you can see that overfitting has happened and it is trying to cover every data point and because of this you will get accuracy on the training set but whenever new data is given then your output won’t be correct and accuracy will remain low. So let’s execute it once and see for the same example how underfitting , overfitting and optimal fitting is taking place.

So here I am executing and see here we have calculated the mean square error also . MSC, so scores dot mean, scores dot standard deviation also we have calculated , so come on let us see, so here the same implementation has been displayed with degree1, degree 4 and degree 15. So tell me which is better. So the center one with degree 4 is best. So this way through cross validation you can also check the one which is better and provides optimized fitting , that model, that particular mapping function , and by putting it through training you can choose the model and optimize it. So friends let us conclude here today , today’s session will end here, and the parts ahead we will see in the next session. So keep learning, remain motivated, thank you.

If you have any queries or comments, click the discussion button below the video and post there. This way, you will be able to connect to fellow learners and discuss the course. Also, Our Team will try to solve your query.

See More

Learner's Ratings

4.5

Overall Rating

78%
11%
0%
6%
5%

Reviews

A

Aryan Ambat

5

Yes

Z

zeyana Fathima

5

thanks for giving this wonderful course in a understandable way please provide the details from where can i get the datasets

L

Losika Nicholas

5

were can i get the dataset

K

Kumar Madduru

5

Thanks for giving this course

D

Dinesh Kumar

4

Your screen is very blur and it doesn't has clarity even in 720P.Please make sure that will not happen again.

D

DOGALA UDAYKUMAR

5

bettor

N

Naresh Kulunge

4

good learning but the content titles are jumbled up, like first title of this module is decision tree dichotomiser which is practical part ahead of theory part. Same with the SVM practical 1 title has

E

Eswar Veeranki

5

good

I

Isakki Alias Devi P

5

Wonderful course

S

sushma Yadla

5

yes, i am happy to learning for machine learning in LearnVern.it i s easily understanding for Beginners.

Show More

Recommended Courses

Free हिन्दी

Excel For Data Analysis

52113

3.7 Enroll For Free

Free हिन्दी

SQL For Data Analysis

19528

3.8 Enroll For Free

Course Content

Getting Started with Machine Learning

How to use LearnVern

Introduction to Machine Learning

Environment Setup Part 1

Environment Setup Part 2

Environment Setup Part 3

Data Wrangling

Importing Libraries and Dataset

Handling Missing Data

Handling Missing Data - Practical

Encoding Categorical Data

Encoding Catergorical Data - Practical

Splitting Dataset

Splitting Dataset - Practical

Normalizing the Data - Part 1

Normalizing the Data - Part 2

Finding Machine Learning Datasets

Exploratory Data Analysis

Plotting Graphs - Part 1

Plotting Graphs - Part 2

Distribution Models - Part 1

Distribution Models - Part 2

Assignment : Data Preprocessing for Machine Learning

Machine Learning Paradigms

Assignment : Machine Learning Paradigms

Decision Tree Iterative Dichotomiser 3

Random Forest

Support Vector Machine Classifier

Support Vector Machine Classifier - Practical 1

Support Vector Machine Classifier - Practical 2

Naive Bayes Classifier

Naive Bayes Classifier - Practical 1

Naive Bayes Classifier - Practical 2

Evaluating Classification Models Performance

Evaluating Classification Models Performance - Practical

Overview of Classification

Logistic Regression

Logistic Regression - Practical - 1

Logistic Regression - Practical - 2

KNN

KNN Practical - 1

KNN - Practical 2

Decision Trees for Classification

Decision Trees for Classification - Practical 1

Decision Trees for Classification - Practical 2

Assignment : Supervised Learning Algorithms

Simple Linear Regression

Simple Linear Regression - Practical

Salary Prediction using Linear Regression

Multi-Linear Regression

Startup Prediction using Multiple Regression

Support Vector Regressor

Support Vector Regressor - Practical 1

Support Vector Regressor - Practical 2

Decision Tree Regressor

Decision Tree Regressor - Practical 1

Decision Tree Regressor - Practical 2

Regressor Model Selection

Evaluating Regression Model Performance

Evaluating Regression Model Performance - Practical

Assignment : Regression Algorithms

Distance Metrics

K-Means Clustering

K-Means Clustering - Practical

Mall Customers Prediction using K Means Clustering

Hierarchical Clustering - Agglomerative , Divisive

Agglomerative Clustering - Practical

Divisive Clustering - Practical

DBscan Spatial Clustering

Mall Customers Prediction using Hierarchical Clustering

Assignment : Unsupervised Learning Algorithms

Association Rule Learning - Apriori, FP Growth

Association Rule Learning - Apriori Practical

Market Basket Analysis using Apriori

FP Growth

Market Basket Analysis using FP Growth

Assignment : Association Rule Mining

Reinforcement Learning Theory - Multi Armed Bandits

Upper Confidence Bound - Practical

Thompson Sampling - Practical

Q Learning

Assignment : Reinforcement Learning

Overview of Dimensoionality Reduction

Princinpal Component Analysis

Principal Component Analysis - Practical

Linear Discriminant Analysis

Linear Discriminant Analysis - Practical

Assignment : Dimensionality Reduction

Basics of Regularization and Optimization

Cross Validation

Hyperparameter Tuning

Sampling Methods

Underfitting and Overfitting in Models

Variance and Bias

Assignment : Regularization and Optimization

Advance Trends in Machine Learning

Introduction to Keras and Deep Learning

Practical Demonstration -Keras

Reinforcement Learning Project - Teach a Taxi Part 1

Reinforcement Learning Project - Teach a Taxi Part 2

Reinforcement Learning Project - Teach a Taxi Part 3

Reinforcement Learning Project - Teach a Taxi Part 4

Loan Prediction Project Part 1

Loan Prediction Project Part 2

Course Summary

Interview Questions Part 1

Interview Questions Part 2

Interview Questions Part 3

Career Guidelines

Enroll For Free

Complete Machine Learning Course in English Code

Free

Full Course, No Certificate

With Ads
No Certificate

₹999/-

No Ads

Full Course, with NSDC Certificate

Ad Free
Globally Recognized NSDC Certificate