This repository has been archived by the owner on Nov 3, 2021. It is now read-only.


Olist: Categorize clients for an e-commerce website

Repository of OpenClassrooms' AI Engineer path, project #5

Goal: use Jupyter Notebook and scikit-learn to create, assess, and improve a clustering model based on customer and order data.
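
As a rough illustration of the "create, assess and improve" loop, the sketch below fits K-Means for several cluster counts and compares them with the silhouette score. It is not taken from the project notebook: the synthetic features (days since last order, number of orders, total spent), the choice of K-Means, and the range of cluster counts are all assumptions made for the example.

```python
import numpy as np
from sklearn.cluster import KMeans
from sklearn.metrics import silhouette_score
from sklearn.preprocessing import StandardScaler

# Hypothetical customer-level features (NOT the Olist data). Columns stand for:
# days since last order, number of orders, total amount spent.
rng = np.random.default_rng(0)
centers = np.array([[30.0, 8.0, 900.0], [200.0, 4.0, 400.0], [400.0, 1.0, 50.0]])
spreads = np.array([10.0, 0.5, 20.0])
features = np.vstack(
    [center + rng.normal(scale=spreads, size=(100, 3)) for center in centers]
)

# Put all features on the same scale so no single column dominates the distances.
scaled = StandardScaler().fit_transform(features)

# Create one model per candidate cluster count and assess it with the silhouette score.
scores = {}
for k in range(2, 7):
    labels = KMeans(n_clusters=k, n_init=10, random_state=0).fit_predict(scaled)
    scores[k] = silhouette_score(scaled, labels)

# Keep the cluster count with the highest silhouette score.
best_k = max(scores, key=scores.get)
print(f"best k: {best_k}")
for k, score in scores.items():
    print(f"k={k}: silhouette={score:.3f}")
```

The silhouette score is only one possible criterion; on real customer data, cluster stability and business interpretability may matter just as much.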

You can see the results here:

Installation

Prerequisites

Virtual environment

make venv
source env/bin/activate

Dependencies

# pip install kaggle jupyterlab ipywidgets numpy pandas scipy statsmodels scikit-learn plotly matplotlib
# > or :
# pip install -r requirements.txt
# > or :
make install

Usage

Run Notebook

jupyter-lab notebooks/main.ipynb

Quality Assurance

# make isort
# make format
# make lint
# make bandit
# make mypy
# make test
# > or just :
make qa

Troubleshooting

  • Fix Plotly issues with JupyterLab

cf. Plotly troubleshooting

jupyter labextension install jupyterlab-plotly

  • If using Jupyter Notebook instead of JupyterLab, uncomment the following lines in the notebook:

import plotly.io as pio
pio.renderers.default = "notebook"