[rewrite] rewrite all repo to properly use DVC, MLFlow and data engineering practices #15

Chouffe · 2024-05-16T10:04:39Z

Description of the PR:

Rewrite the repo's code from scratch, following best practices and making model training reproducible.
Add random hyperparameter search, with instructions in the README file.
Move all data assets to DVC, backed by an S3 bucket, with instructions in the README file.
Only the baselines and the best models are stored with DVC; they are a dvc pull away.
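
For readers unfamiliar with random hyperparameter search, here is a minimal, generic sketch of the idea in Python. It is not the implementation from this PR: the search space, the parameter names (`lr0`, `batch`, `epochs`) and the helper functions are hypothetical and only illustrate sampling each parameter independently with a fixed seed.

```python
import math
import random

# Hypothetical search space: names and ranges are illustrative assumptions,
# not the values used in this repo.
SEARCH_SPACE = {
    "lr0": ("loguniform", 1e-4, 1e-1),
    "batch": ("choice", [16, 32, 64]),
    "epochs": ("randint", 50, 200),
}


def sample_config(rng: random.Random) -> dict:
    """Draw one configuration by sampling each parameter independently."""
    config = {}
    for name, (kind, *spec) in SEARCH_SPACE.items():
        if kind == "loguniform":
            low, high = spec
            # Sample uniformly in log space so small values are well covered.
            config[name] = math.exp(rng.uniform(math.log(low), math.log(high)))
        elif kind == "choice":
            (options,) = spec
            config[name] = rng.choice(options)
        elif kind == "randint":
            low, high = spec
            config[name] = rng.randint(low, high)  # inclusive on both ends
        else:
            raise ValueError(f"unknown sampler: {kind}")
    return config


def random_search(n: int, seed: int = 0) -> list:
    """Return n independently sampled configurations (reproducible via seed)."""
    rng = random.Random(seed)
    return [sample_config(rng) for _ in range(n)]


configs = random_search(n=10)
```

Each sampled configuration would then be passed to one training run; fixing the seed keeps the set of trials reproducible.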

How to test:

Follow the README file to test the repo and dvc setup.

I just copied and pasted all code from the repo developed here: https://github.com/earthtoolsmaker/pyronear-mlops


MateoLostanlen

Hi @Chouffe,

Thanks a lot, Arthur, for this amazing work!

I left you a comment on something that's buggy, but the rest is fine with me.

Just one thing, I'm not sure about the relevance of having all three stages:

train_yolov8_baseline_small_dataset
train_yolov8_baseline_full_dataset
train_yolov8_best

For me, the first two are simple tests. You can explain how to launch them in the README, but I don't think they should be there. I suggest keeping train_yolov8_best and renaming it train_yolov8. What do you think?

MateoLostanlen · 2024-06-05T09:46:59Z

Makefile

+	  --n 10 \
+	  --loglevel "info"
+
+yolov8_benchmark:


Got an error here:

(.venv) mateo@mateo:~/pyronear/vision/mlops/pyro-mlops$ make yolov8_benchmark
python ./scripts/model/yolov8/benchmark.py \
  --input-dir ./data/04_models/yolov8/ \
  --output-dir ./data/06_reporting/yolov8/ \
  --loglevel "info"
INFO:root:{'input_dir': PosixPath('data/04_models/yolov8'), 'output_dir': PosixPath('data/06_reporting/yolov8'), 'loglevel': 'info'}
INFO:root:Loading files data/04_models/yolov8/best/args.yaml and data/04_models/yolov8/best/results.csv
/home/mateo/pyronear/vision/mlops/pyro-mlops/./scripts/model/yolov8/benchmark.py:77: PerformanceWarning: DataFrame is highly fragmented. This is usually the result of calling `frame.insert` many times, which has poor performance. Consider joining all columns at once using pd.concat(axis=1) instead. To get a de-fragmented frame, use `newframe = frame.copy()`
  df[k] = str(args[k])
/home/mateo/pyronear/vision/mlops/pyro-mlops/./scripts/model/yolov8/benchmark.py:77: PerformanceWarning: DataFrame is highly fragmented. This is usually the result of calling `frame.insert` many times, which has poor performance. Consider joining all columns at once using pd.concat(axis=1) instead. To get a de-fragmented frame, use `newframe = frame.copy()`
  df[k] = str(args[k])

Is it only a log warning?
If that's the case, I would not mind, to be honest. Alternatively, we could spend some time optimizing the DataFrame operations to address the warning.
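
If we go the optimization route, a minimal sketch could look like the following. It assumes the loop around line 77 of benchmark.py adds one constant string column per key of `args`, as the `df[k] = str(args[k])` line in the warning suggests. The function name `add_args_columns` and the sample data are hypothetical and not taken from the repo.

```python
import pandas as pd


def add_args_columns(df: pd.DataFrame, args: dict) -> pd.DataFrame:
    """Attach every entry of `args` as a constant string column in one concat."""
    # Build all new columns at once; scalar values are broadcast over the index.
    args_df = pd.DataFrame(
        {key: str(value) for key, value in args.items()},
        index=df.index,
    )
    # Mirror the overwrite behaviour of `df[k] = ...` by dropping any column
    # of df that shares a name with a key of args.
    overlap = list(df.columns.intersection(args_df.columns))
    base = df.drop(columns=overlap)
    # A single concat replaces the per-key insertions that fragment the frame.
    return pd.concat([base, args_df], axis=1)


# Hypothetical stand-ins for results.csv and args.yaml.
results = pd.DataFrame({"epoch": [1, 2], "metrics/mAP50(B)": [0.41, 0.52]})
args = {"epochs": 2, "imgsz": 640, "optimizer": "auto"}
combined = add_args_columns(results, args)
```

This follows the suggestion in the warning itself (joining all columns at once with `pd.concat(axis=1)`), so the output should be the same as before apart from column order when a key overwrites an existing column.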

Chouffe · 2024-06-08T13:16:49Z

Thanks for taking the time to review this @MateoLostanlen.
I left a comment on the log warning issue you mentioned.

The small stages were used to tighten the feedback loop when training YOLO models, by working with a smaller dataset. That usually saves me a lot of time, instead of burning GPU compute on something that is not sure to work well.

I think we can get rid of them if you find them confusing; the other option would be to add some comments.
Let me know what you prefer and I can take care of it.
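
To make the idea behind the small-dataset stage concrete, here is a generic sketch of drawing a reproducible subset of a dataset. It is an illustration only, not how the stage is built in this PR: the `subsample` helper, the 10% fraction and the file names are all hypothetical.

```python
import random
from pathlib import Path


def subsample(paths: list, fraction: float, seed: int = 0) -> list:
    """Return a reproducible random subset of `paths` (at least one item)."""
    if not 0 < fraction <= 1:
        raise ValueError("fraction must be in (0, 1]")
    if not paths:
        return []
    k = max(1, round(len(paths) * fraction))
    rng = random.Random(seed)
    # Sort first so the subset does not depend on the input ordering.
    return sorted(rng.sample(sorted(paths), k))


# Hypothetical image list standing in for the full dataset.
images = [Path(f"img_{i:03d}.jpg") for i in range(100)]
small = subsample(images, fraction=0.1)
```

Training on such a subset first gives a quick signal on whether the pipeline works before committing GPU time to the full dataset.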

Chouffe · 2024-07-19T13:57:35Z

@MateoLostanlen I have been working directly in my GitHub repo to add hyperparameter search for YOLOv9 and YOLOv10.
Happy to open a PR once we get this one merged.

Chouffe added 2 commits May 16, 2024 11:56

[rewrite] rewrite all repo to properly use DVC, MLFlow and data engineering practices

6870de9

[dvc] remove tmp config

56e781d

Chouffe requested review from MateoLostanlen and gaetanbrison May 16, 2024 10:04

Chouffe assigned MateoLostanlen May 17, 2024

MateoLostanlen reviewed Jun 5, 2024


MateoLostanlen added the "type: enhancement" (New feature or request) label Jun 5, 2024

MateoLostanlen assigned Chouffe and unassigned MateoLostanlen Jun 5, 2024

Chouffe requested a review from MateoLostanlen July 19, 2024 13:56
