First project of the Machine Learning Master's course at EPFL.

The goal of the project is to explore and compare different supervised learning algorithms on a physics dataset from CERN, in order to best predict the presence of the Higgs boson.

How to run the solution

Run the command: "python run.py"

The train and test CSV files should be located in a folder named "data", one directory above the location of the run.py file. The predictions are saved in a file named 'result_LS.csv'.
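As a quick check before running, the expected folder can be resolved from the script location. This is a minimal sketch, not code from the repository: the helper name data_dir_for is hypothetical, and it assumes "one directory above" means the parent of the folder that contains run.py.

```python
import os
from pathlib import Path


def data_dir_for(script_path):
    """Return the "data" folder one directory above the folder holding script_path.

    Hypothetical helper: it only encodes one reading of the layout described
    in this README and is not part of the project code.
    """
    script_dir = Path(os.path.abspath(script_path)).parent
    return script_dir.parent / "data"


if __name__ == "__main__":
    data_dir = data_dir_for("run.py")
    print(f"Expecting the CSV files in: {data_dir}")
    if not data_dir.is_dir():
        print("Folder not found; create it and add the CSV files before running run.py")
```

Running this from the folder that contains run.py prints the path where the CSV files are expected under that assumption.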

Code Organization

We organized our codebase using several files:

implementations.py contains the 6 mandatory functions, together with some helper functions used in their implementation.
run.py is the execution script.

Notes on some of the helper functions:

At the beginning of the run.py file we have placed some useful functions that are used throughout the code.

def number_to_nan(tX)
def nan_to_median(tX)
def number_to_other_number(tX, new_value)

These functions are used in the data cleaning process. They replace the placeholder value -999 with a more useful value: NaN, the median, or another number passed as new_value.
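The cleaning step can be sketched roughly as follows. This is one possible implementation under stated assumptions, not the code in run.py: the signatures come from this README, but the bodies, the MISSING constant, the choice of a per-column median, and the decision to return copies rather than modify tX in place are all assumptions.

```python
import numpy as np

MISSING = -999  # placeholder value named in this README


def number_to_nan(tX):
    """Return a float copy of tX with every -999 entry replaced by NaN."""
    out = np.array(tX, dtype=float)
    out[out == MISSING] = np.nan
    return out


def nan_to_median(tX):
    """Return a copy of tX with each NaN replaced by the median of its column.

    Assumption: the median is taken per feature (column), ignoring NaNs.
    """
    out = np.array(tX, dtype=float)
    medians = np.nanmedian(out, axis=0)
    rows, cols = np.where(np.isnan(out))
    out[rows, cols] = medians[cols]
    return out


def number_to_other_number(tX, new_value):
    """Return a float copy of tX with every -999 entry replaced by new_value."""
    out = np.array(tX, dtype=float)
    out[out == MISSING] = new_value
    return out
```

Under these assumptions, nan_to_median(number_to_nan(tX)) fills each missing entry with the median of the observed values of that feature, while number_to_other_number(tX, 0) simply zeroes it.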

def accuracy(y_true, y_pred)

In some cases we found it more useful to measure the accuracy of the predictions than the Mean Squared Error. This function computes that accuracy.
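A minimal sketch of such a metric is shown below. It assumes accuracy means the fraction of predictions that exactly match the true labels; the body is illustrative and may differ from the version in run.py.

```python
import numpy as np


def accuracy(y_true, y_pred):
    """Return the fraction of entries where y_pred equals y_true."""
    y_true = np.asarray(y_true)
    y_pred = np.asarray(y_pred)
    return float(np.mean(y_true == y_pred))
```

Unlike the Mean Squared Error, this value does not depend on how the two classes are encoded numerically, as long as y_true and y_pred use the same labels.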

def build_poly(x, degree)

We use this function to expand our features to a higher dimension. For a detailed explanation of how it works and why we chose to do the expansion this way, see the report.
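For orientation only, a common form of polynomial feature expansion looks like the sketch below. It is an assumption, not the scheme described in the report: it prepends a constant column and then appends every feature raised to the powers 1 through degree, with no cross terms.

```python
import numpy as np


def build_poly(x, degree):
    """Expand x into [1, x, x**2, ..., x**degree], column-wise.

    Generic sketch: the expansion actually used in this project is
    described in the report and may differ from this one.
    """
    x = np.asarray(x, dtype=float)
    if x.ndim == 1:
        # Treat a 1-D input as a single feature column.
        x = x[:, np.newaxis]
    columns = [np.ones((x.shape[0], 1))]  # constant (bias) column
    for d in range(1, degree + 1):
        columns.append(x ** d)
    return np.hstack(columns)
```

With this form, an input of shape (N, D) becomes a matrix of shape (N, 1 + D * degree).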
