Skip to content

Building machine learning pipelines with procedural programming, custom-pipeline or third-party code using the titanic data set from Kaggle

Notifications You must be signed in to change notification settings

ezekielolugbami/ml_pipeline_from_scratch

Repository files navigation

ML Pipeline From Scratch

This is an all level friendly repo showing how machine learning pipeline can be built from scratch adopting the procedural programming approach or custom pipeline code or third-party code leveraging on the sckit-learn library. All three pipelines are built with the Titanic data set from Kaggle in mind https://www.kaggle.com/c/titanic/data.

It is meant to show you how codes from the research environment 'Jupyter Notebook' are gradually been transformed into reusable pipelines while ensuring reproducibility and modularity in mind. I have also organised the code in a way that is easy for you to edit if you want to make changes to any of the file.

Installation

pip install pandas==1.18.1
pip install numpy==0.25.3
pip install Scikit-Learn==0.22.1

Contact

@OlugbamiEzekielezekiel.olugbami@gmail.com

https://github.com/ezekielolugbami

Contributing

Fork it (https://github.com/ezekielolugbami/ml_pipeline_from_scratch.git)

About

Building machine learning pipelines with procedural programming, custom-pipeline or third-party code using the titanic data set from Kaggle

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages