Project Summary :-

TED=Technology Entertainment And Design

TED is an unprofitable organisation that post videos online in Free. TED is devoted to spreading powerful ideas on just about any topic. These datasets contain over 4,000+ TED talks including transcripts in many languages. It was basically started as a conference in 1984 that designed by Richard saul wurman but due to some reasons it was unsuccesful. After 6 years in 1990 ,it back with a bang. In TED, speaker share their views and ideas to the society in 18 minutes. As of 2015, TED and its sister TEDx chapters have published more than 2000 talks for free consumption by the masses and its speaker list boasts of the likes of Al Gore, Jimmy Wales, Shahrukh Khan, and Bill Gates.

            Slogan Of TED :- IDEAS WORTH SPREADING ⏰

Objective :

The main objective is to build a predictive model, which could help in predicting the views of the videos uploaded on the TEDx website.

Dataset info :

1.Number of records: 4,005
2.Number of features: 19

The dataset contains features like :

talk_id: A unique identifier for each TED Talk video.
title: The title of the talk.
speaker_1: The primary speaker for the talk.
all_speakers: A list of all the speakers for the talk.
occupations: The occupations of the speakers.
about_speakers: Information about the speakers, such as their backgrounds and expertise.
recorded_date: The date the talk was recorded.
published_date: The date the talk was published on the TED Talks YouTube channel.
event: The name of the TED event where the talk was given.
native_lang: The language the talk was given in.
available_lang: The languages the talk is available in.
duration: The length of the video.(in sec.)
topics: The topics covered in the talk.
related talks: Other TED Talks that are related to this talk.
url: The URL of the video.
description: A brief description of the talk.
transcript: A transcript of the talk.

Target Variable :

views: The number of views the video has received.

Project Work flow :-

Importing Neccessary Libraries
Data Wrangling

      1. Gathering Data 
          - CSV and others files 
          - APIs 
          - Web Scraping 
          - Databases 
      2. Assessing Data
      3. Cleaning Data

EDA and Features Engineering on features
Features Transformation and selection
Remove Multicollinearity
model implementation and pre-processing

      1. Train, Test and Split
      2. preprocessing using column-transformer
      3. make best pipeline

Fitting the regression models and HyperParameter Tuning
Final selection of the model
Conclusion

Algorithms used for ML model implementation :-

Linear Regression
Ridge Regression(L2)
Lasso Regression(L1)
DecisionTreeRegressor
RandomForestRegressor
AdaBoostRegressor
GradientBoostingRegressor
XGBRegressor
VotingRegressor
StackingRegressor

Best Models to Achieve our Objective :-

🥇 RandomForestRegressor with hyperparameter tuning 🥇

Training data R2 and Adjusted R2 Score

R2 score 0.9108
Adjusted R2 score 0.9106

Testing data R2 and Adjusted R2 Score

R2 score 0.8977
Adjusted R2 score 0.8968

Cross-validation score

0.8974

The performance metrics

MAE 0.2613
MSE 0.1055
RMSE 0.3249

🥈 VotingRegressor with hyperparameter tuning 🥈

Training data R2 and Adjusted R2 Score

R2 score 0.9109
Adjusted R2 score 0.9107

Testing data R2 and Adjusted R2 Score

R2 score 0.8981
Adjusted R2 score 0.8972

Cross-validation score

0.8977

The performance metrics

MAE 0.2615
MSE 0.1051
RMSE 0.3243

click here To access the solution of the Capstone Project on TED Talks Views Prediction using Supervised Learning, which includes the usage of 10 different algorithms with detailed explanations and conclusions.

Name		Name	Last commit message	Last commit date
Latest commit History 27 Commits
Basic info about all algorithms		Basic info about all algorithms
certificate		certificate
README.md		README.md
TED_Talks_Views_Prediction_Supervised_learning_project.ipynb		TED_Talks_Views_Prediction_Supervised_learning_project.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Project Summary :-

TED=Technology Entertainment And Design

Objective :

Dataset info :

The dataset contains features like :

Target Variable :

Project Work flow :-

Algorithms used for ML model implementation :-

Best Models to Achieve our Objective :-

🥇 RandomForestRegressor with hyperparameter tuning 🥇

🥈 VotingRegressor with hyperparameter tuning 🥈

click here To access the solution of the Capstone Project on TED Talks Views Prediction using Supervised Learning, which includes the usage of 10 different algorithms with detailed explanations and conclusions.

CERTIFICATE

About

Releases

Packages

Languages

SarangGami/TED-Talks-Views-Prediction-Supervised-learning

Folders and files

Latest commit

History

Repository files navigation

Project Summary :-

TED=Technology Entertainment And Design

Objective :

Dataset info :

The dataset contains features like :

Target Variable :

Project Work flow :-

Algorithms used for ML model implementation :-

Best Models to Achieve our Objective :-

🥇 RandomForestRegressor with hyperparameter tuning 🥇

🥈 VotingRegressor with hyperparameter tuning 🥈

click here To access the solution of the Capstone Project on TED Talks Views Prediction using Supervised Learning, which includes the usage of 10 different algorithms with detailed explanations and conclusions.

CERTIFICATE

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages