- Project Brief
- Getting Started
- Usage
- Project Structure
- Contributing
- License
- Related Links
- Acknowledgments
You are a newly employed data analyst in the Customer Experience (CX) team at Vanguard, the US-based investment management company. You’ve been thrown straight into the deep end with your first task. Before your arrival, the team launched an exciting digital experiment, and now, they’re eagerly waiting to uncover the results and need your help!
The digital world is evolving, and so are Vanguard’s clients. Vanguard believed that a more intuitive and modern User Interface (UI), coupled with timely in-context prompts (cues, messages, hints, or instructions provided to users directly within the context of their current task or action), could make the online process smoother for clients. The critical question was: Would these changes encourage more clients to complete the process?
An A/B test was set into motion from 3/15/2017 to 6/20/2017 by the team.
- Control Group: Clients interacted with Vanguard’s traditional online process.
- Test Group: Clients experienced the new, spruced-up digital interface.
Both groups navigated through an identical process sequence: an initial page, three subsequent steps, and finally, a confirmation page signaling process completion.
The goal is to see if the new design leads to a better user experience and higher process completion rates.
Note: Review what A/B testing is here if you are not already aware of how it works.
This repository contains the code and dataset used for cleaning a web log dataset. The dataset consists of web visit logs with several attributes related to user activity on a website.
-
Web Data
- Files:
df_final_web_data_pt_1.txt
anddf_final_web_data_pt_2.txt
- Columns:
client_id
: Unique identifier for each client.visitor_id
: Unique identifier for each visitor session.visit_id
: Unique identifier for each visit.process_step
: The step of the process the client is currently on.date_time
: Timestamp of when the process step occurred.
- Files:
-
Demographic Data
- File:
df_final_demo.txt
- Columns:
client_id
: Unique identifier for each client.age
: Age of the client.gender
: Gender of the client.income_level
: Income level of the client.
- File:
-
Experiment Clients Data
- File:
df_final_experiment_clients.txt
- Columns:
client_id
: Unique identifier for each client.group
: Experimental group (control or test) to which the client belongs.
- File:
client_id | visitor_id | visit_id | process_step | date_time |
---|---|---|---|---|
9988021 | 580560515_7732621733 | 781255054_21935453173_531117 | step_3 | 2017-04-17 15:27:07 |
9988021 | 580560515_7732621733 | 781255054_21935453173_531117 | step_2 | 2017-04-17 15:26:51 |
client_id | age | gender | income_level |
---|---|---|---|
9988021 | 34 | Female | High |
9988022 | 45 | Male | Medium |
client_id | group |
---|---|
9988021 | test |
9988022 | control |
The data cleaning process was performed in a Jupyter Notebook (Dataset_pt_Cleaning.ipynb
) and involved the following steps:
- Remove Duplicates
- Check for Missing Values
- Convert Data Types
- Normalize Data
- Analyze Categories in process_step
- Figure 1: Distribution of clients across different process steps.
- Figure 2: Comparison of completion rates between the control and test groups.
- Figure 3: Comparison of error rates between the control and test groups.
- Figure 4: Tableau Dashboard Fig.
========================================================
- Python 3.x
- Jupyter Notebook
- Pandas
- NumPy
- Matplotlib / Seaborn (for data visualization)
- Clone the repository:
git clone https://github.com/your-username/vanguard-ab-test.git
- Navigate to the project directory:
cd vanguard-ab-test
- Install the required packages:
pip install -r requirements.txt
- Open the Jupyter Notebook:
jupyter notebook
- Open the notebook file
main.ipynb
and run through the cells to see the analysis.
Data/
: Contains the dataset used for the analysis.Notebooks/
: Jupyter Notebooks with detailed analysis.Figures/
: Contains the figures.README.md
: Project documentation.
- Fork the repository.
- Create your feature branch:
git checkout -b feature/YourFeature
- Commit your changes:
git commit -m 'Add some feature'
- Push to the branch:
git push origin feature/YourFeature
- Open a pull request.
This project is licensed under the Ironhack.
- Project Presentation Google Slides
- Tableau Tableau
- Streamlit Streamlit