I do a lot of data transformations or simple model fine tuning in Jupyter notebooks. After listening to a few MLOps podcasts, it seems like that’s generally advised against.
I guess because you want reproducibility and more modular, more manageable code.
The good thing about notebooks for me is the UI where you can inspect Python variables and debug functions quickly.
So, my approach has been to develop in notebooks, write functions for everything and once I stop touching certain functions I separate them into a “lib” type of repo and continue importing them from there.
What’s your take on this? What are the pros and cons of notebooks for you and how do you mitigate the cons?