ML Pipeline From Scratch

This repo is suitable for all levels. It shows how a machine learning pipeline can be built from scratch in three ways: with a procedural programming approach, with custom pipeline code, or with third-party code that leverages the scikit-learn library. All three pipelines are built around the Titanic data set from Kaggle: https://www.kaggle.com/c/titanic/data.
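As a rough illustration of the third-party approach, the sketch below chains preprocessing and a classifier with scikit-learn's Pipeline and ColumnTransformer. It is not the code from this repo: the column names follow the Kaggle Titanic data set, but the rows are made up for the example, and the choice of features, imputation strategies and LogisticRegression are assumptions.

```python
import numpy as np
import pandas as pd
from sklearn.compose import ColumnTransformer
from sklearn.impute import SimpleImputer
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import OneHotEncoder, StandardScaler

# Made-up rows that only mimic the Kaggle Titanic columns (assumption).
df = pd.DataFrame({
    "Pclass": [3, 1, 3, 1, 3, 2, 3, 1],
    "Sex": ["male", "female", "female", "female",
            "male", "male", "female", "male"],
    "Age": [22.0, 38.0, 26.0, 35.0, np.nan, 54.0, 2.0, np.nan],
    "Fare": [7.25, 71.28, 7.92, 53.10, 8.05, 51.86, 21.07, 30.0],
    "Embarked": ["S", "C", "S", "S", "Q", "S", "S", "C"],
    "Survived": [0, 1, 1, 1, 0, 0, 1, 0],
})

numeric_features = ["Pclass", "Age", "Fare"]
categorical_features = ["Sex", "Embarked"]

# Numeric columns: fill missing values with the median, then standardise.
numeric_steps = Pipeline([
    ("impute", SimpleImputer(strategy="median")),
    ("scale", StandardScaler()),
])

# Categorical columns: fill missing values with the mode, then one-hot encode.
categorical_steps = Pipeline([
    ("impute", SimpleImputer(strategy="most_frequent")),
    ("encode", OneHotEncoder(handle_unknown="ignore")),
])

preprocess = ColumnTransformer([
    ("num", numeric_steps, numeric_features),
    ("cat", categorical_steps, categorical_features),
])

# One object holds every step, so fit and predict always apply the same
# transformations in the same order.
model = Pipeline([
    ("preprocess", preprocess),
    ("classifier", LogisticRegression(random_state=0)),
])

X = df.drop(columns="Survived")
y = df["Survived"]
model.fit(X, y)
predictions = model.predict(X)
```

Because the preprocessing lives inside the fitted pipeline, the same object can be saved and reused on new data without repeating any manual steps.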

It is meant to show you how code from the research environment (a Jupyter Notebook) is gradually transformed into reusable pipelines, with reproducibility and modularity in mind. I have also organised the code so that it is easy to edit if you want to make changes to any of the files.
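To make the notebook-to-pipeline idea concrete, here is a minimal sketch of the procedural style: each notebook cell becomes a small function, values learned from the training data are passed in explicitly, and a fixed seed keeps runs repeatable. The function names (impute_median, encode_sex, prepare, train), the feature list and the sample rows are all hypothetical and are not taken from this repo.

```python
import numpy as np
import pandas as pd
from sklearn.linear_model import LogisticRegression

SEED = 0  # fixed seed so repeated runs produce the same model
FEATURES = ["Pclass", "Sex", "Age", "Fare"]  # assumed feature subset


def impute_median(df, column, median):
    """Return a copy of df with missing values in column set to median."""
    out = df.copy()
    out[column] = out[column].fillna(median)
    return out


def encode_sex(df):
    """Return a copy of df with Sex mapped to 0 (male) or 1 (female)."""
    out = df.copy()
    out["Sex"] = out["Sex"].map({"male": 0, "female": 1})
    return out


def prepare(df, age_median):
    """Apply every preprocessing step in a fixed order."""
    df = impute_median(df, "Age", age_median)
    df = encode_sex(df)
    return df[FEATURES]


def train(df, target):
    """Learn the Age median from the training data and fit the model."""
    age_median = df["Age"].median()
    model = LogisticRegression(random_state=SEED)
    model.fit(prepare(df, age_median), target)
    return model, age_median


# Made-up rows that only mimic the Kaggle Titanic columns (assumption).
data = pd.DataFrame({
    "Pclass": [3, 1, 3, 1, 3, 2, 3, 1],
    "Sex": ["male", "female", "female", "female",
            "male", "male", "female", "male"],
    "Age": [22.0, 38.0, 26.0, 35.0, np.nan, 54.0, 2.0, np.nan],
    "Fare": [7.25, 71.28, 7.92, 53.10, 8.05, 51.86, 21.07, 30.0],
    "Survived": [0, 1, 1, 1, 0, 0, 1, 0],
})

model, age_median = train(data, data["Survived"])
predictions = model.predict(prepare(data, age_median))
```

Passing age_median into prepare, rather than recomputing it, means new data is imputed with the value learned during training, which is one of the habits that keeps results reproducible.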

Installation

pip install pandas==0.25.3
pip install numpy==1.18.1
pip install scikit-learn==0.22.1

Contact

@OlugbamiEzekiel – ezekiel.olugbami@gmail.com

https://github.com/ezekielolugbami

Contributing

Fork it (https://github.com/ezekielolugbami/ml_pipeline_from_scratch.git)

