To help automate the process of conducting systematic reviews of the scientific literature on a topic, we've created https://github.com/scottleibrand/SystematicReviewer, which automates the process of using GPT-3 to systematically answer your list of questions against a list of paper URLs.
To do so, download_articles_and_embeddings.py first takes as input a papers.csv with a list of paper URLs, downloads the full text HTML or PDF, splits it into sections, generates embeddings for each section, and stores the results.
Then answer_questions.py takes as input a questions.csv of questions to answer (or a single question string), calculates the embedding for each question, finds the top_n sections most relevant to the question, and then feeds those sections (plus the title and abstract, if provided in the papers.csv) to InstructGPT to answer them, and then combines the top_n answers into a single answer for each question. After all questions are answered, it writes a copy of the original papers.csv with additional answer columns.