https://github.com/irvineAlgotrading/quoteripper/blob/master/quoteripper.ipynb
Made this for collecting large amounts of text quotes from particular people, then cleaning them all the way down until they're ready to feed into language models.
It still needs some very minor tweaks, but I wanted to make it public since it's already so fast and helpful as it is.
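
For anyone curious what the "cleaning all the way down" part can look like, here's a rough sketch. To be clear, this is not the code from the notebook, just one plausible way to do it with the standard library. The function names (`clean_quote`, `clean_quotes`) and the specific steps (unescaping HTML entities, dropping tags, normalizing Unicode, straightening curly quotes, collapsing whitespace, de-duplicating) are my own assumptions about what a quote-cleaning pass usually needs.

```python
import html
import re
import unicodedata

# Hypothetical mapping: curly quotation marks -> plain ASCII ones.
_QUOTE_MAP = str.maketrans({
    "\u201c": '"', "\u201d": '"',
    "\u2018": "'", "\u2019": "'",
})


def clean_quote(raw: str) -> str:
    """Clean a single scraped quote (illustrative steps, not the notebook's)."""
    text = html.unescape(raw)                   # &amp; -> &, &ldquo; -> curly quote
    text = re.sub(r"<[^>]+>", " ", text)        # replace HTML tags with a space
    text = unicodedata.normalize("NFKC", text)  # fold compatibility characters
    text = text.translate(_QUOTE_MAP)           # straighten curly quotes
    text = re.sub(r"\s+", " ", text).strip()    # collapse runs of whitespace
    # Drop wrapping quote marks. Note: this also removes a legitimate
    # trailing apostrophe, which may or may not matter for your data.
    return text.strip("\"'").strip()


def clean_quotes(raws):
    """Clean many quotes, skipping empties and case-insensitive duplicates."""
    seen = set()
    cleaned = []
    for raw in raws:
        quote = clean_quote(raw)
        key = quote.casefold()
        if quote and key not in seen:
            seen.add(key)
            cleaned.append(quote)
    return cleaned
```

Something like `clean_quotes(list_of_scraped_strings)` would then give a de-duplicated list you could write out one quote per line for training.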
cheers!!