
Encoding Categorical Data - Practical


Encoding categorical data is the process of converting categorical values into a numeric format so that the data can be fed into machine learning models. In data science, this kind of data preparation is an essential step before modelling.

Encoding is a method of turning categorical variables into numerical values so that a machine learning model can be fitted to them.
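As a minimal sketch of what such an encoding can look like, the snippet below implements two common schemes in plain Python: integer (label) encoding and one-hot encoding. The `colors` column and the helper names are hypothetical and chosen only for illustration; they are not taken from the course material.

```python
# Hypothetical toy column; the values are illustrative only.
colors = ["red", "green", "blue", "green", "red"]


def fit_categories(values):
    """Return the sorted list of distinct categories seen in `values`."""
    return sorted(set(values))


def label_encode(values, categories):
    """Map each value to the integer position of its category."""
    index = {cat: i for i, cat in enumerate(categories)}
    return [index[v] for v in values]


def one_hot_encode(values, categories):
    """Map each value to a 0/1 vector with a single 1 at its category's position."""
    index = {cat: i for i, cat in enumerate(categories)}
    rows = []
    for v in values:
        row = [0] * len(categories)
        row[index[v]] = 1
        rows.append(row)
    return rows


categories = fit_categories(colors)            # ['blue', 'green', 'red']
labels = label_encode(colors, categories)      # [2, 1, 0, 1, 2]
one_hot = one_hot_encode(colors, categories)   # first row: [0, 0, 1]
```

Integer codes are compact but imply an order between categories (blue < green < red here), which is rarely meaningful for nominal data. One-hot vectors avoid that implied order at the cost of one column per category.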

Some algorithms can work directly with categorical data. A decision tree, for example, can be learned straight from categorical data without any transformation, although this depends on the specific implementation. Many other machine learning algorithms, however, cannot operate on label data directly and require numeric inputs.
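To illustrate why label data is a problem for many algorithms, the sketch below uses a squared Euclidean distance, the kind of arithmetic that distance-based methods rely on. The records and the one-hot layout are hypothetical assumptions made for this example.

```python
def squared_distance(a, b):
    """Squared Euclidean distance between two equal-length feature vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


# Raw records mix a string label with a numeric feature.
raw_a, raw_b = ["red", 3.0], ["blue", 5.0]
try:
    squared_distance(raw_a, raw_b)
    raw_failed = False
except TypeError:
    raw_failed = True  # subtraction is undefined for strings

# Hypothetical one-hot layout for the same records: [is_blue, is_red, size]
enc_a, enc_b = [0, 1, 3.0], [1, 0, 5.0]
distance = squared_distance(enc_a, enc_b)  # 1 + 1 + 4 = 6.0
```

Once the label is encoded numerically, the same function runs without changes.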

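In general, handling missing data by replacing it with the mean, median, or mode is a crude method. For a categorical variable with unordered labels, the mode (the most frequent category) is the applicable choice, since the mean and median are not defined. Such an approximation can still be acceptable and produce good results in some circumstances, for example when the variable's variation is low or when it has little leverage over the response.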
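The replacement strategies described above can be sketched with Python's standard `statistics` module. The `impute` helper and the sample columns are hypothetical, and missing entries are assumed to be represented as `None`.

```python
from statistics import mean, median, mode


def impute(values, strategy):
    """Replace None entries with the mean, median, or mode of the observed entries."""
    observed = [v for v in values if v is not None]
    fill = {"mean": mean, "median": median, "mode": mode}[strategy](observed)
    return [fill if v is None else v for v in values]


ages = [25, None, 31, 40, None]            # numeric column with gaps
cities = ["Pune", "Delhi", None, "Pune"]   # categorical column with a gap

ages_mean = impute(ages, "mean")        # gaps filled with 32
ages_median = impute(ages, "median")    # gaps filled with 31
cities_mode = impute(cities, "mode")    # gap filled with "Pune"
```

Every gap in a column receives the same fill value, which is why this approach shrinks the variable's variance and works best when that variance is low to begin with.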





Course Content

Introduction to Machine Learning

Environment Setup part 1

Environment Setup part 2

Environment Setup part 3

Data Wrangling

Importing Libraries and Dataset

Handling Missing Data

Handling Missing Data - Practical

Encoding Categorical Data

Encoding Categorical Data - Practical

Splitting Dataset

Splitting Dataset - Practical

Normalizing the Data - Part 1

Normalizing the Data - Part 2

Finding Machine Learning Datasets

Exploratory Data Analysis

Plotting Graphs - Part 1

Plotting Graphs - Part 2

Distribution Models - Part 1

Distribution Models - Part 2

Assignment of Data Preprocessing for Machine Learning

Machine Learning Paradigms

Sampling Methods

Underfitting and Overfitting in Models

Variance and Bias

Distance Metrics

K-Means Clustering

K-Means Clustering - Practical

Hierarchical Clustering - Agglomerative, Divisive

Agglomerative Clustering - Practical

Divisive Clustering - Practical

DBSCAN Spatial Clustering

FP Growth

Assignment of Unsupervised Learning Algorithms

Overview of Dimensionality Reduction

Principal Component Analysis

Principal Component Analysis - Practical

Linear Discriminant Analysis

Linear Discriminant Analysis - Practical

Assignment of Dimensionality Reduction

Advanced Trends in Machine Learning

Course Summary

Interview Questions part 1

Interview Questions part 2

Interview Questions part 3

Career Guidelines