This is my customer segmentation project

Reading the file using pandas
checking the columns data types using describe function
checking for null values
checking for duplicate records and drop the duplicated values

Univariate analysis

perform the univariate analysis i.e. how the each feature is distributed
if any value treaten as null value replace it with mode of perticular feature

Outlier detection

Detect the outliers in categorical columns using box plot
impute te IQR values
replace the outliers with IQR values

Bivariate analysis

Bivariate analysis is knowing how feature is distrubuted with respect totarget variable

Encoding

Transforming categorical features into numerical values

Feature selection

checking for the features for correlated or not by setting threshold value

splitting the data

splitting the data into train data and test data

Balencing the data

Here the data is imbalenced balencing the train data using oversampling technique called SMOTTEN

standerdization

Transforming train data from range of numerical values into between 0 to

Model building

Building the model and train with different parameters and find the best parameters using gridserchCV
check the performance parameters and check the model accuracy using AURROC metric
find the best model with high accuracy and high AOUROC value

Name		Name	Last commit message	Last commit date
Latest commit History 30 Commits
App2.py		App2.py
README.md		README.md
customer -segmentation-project.ipynb		customer -segmentation-project.ipynb
customer prediction.docx		customer prediction.docx
customer_segment_xgb.pkl		customer_segment_xgb.pkl
customer_xgb.pkl		customer_xgb.pkl
requirements.txt		requirements.txt
train.csv		train.csv

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

This is my customer segmentation project

Univariate analysis

Outlier detection

Bivariate analysis

Encoding

Feature selection

splitting the data

Balencing the data

standerdization

Model building

About

Releases

Packages

Languages

Palemravichandra/customer-segmentation

Folders and files

Latest commit

History

Repository files navigation

This is my customer segmentation project

Univariate analysis

Outlier detection

Bivariate analysis

Encoding

Feature selection

splitting the data

Balencing the data

standerdization

Model building

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages