Use GPT-3 to do semantic search over exported Slack messages to find answers to questions about previously discussed topics
slackAskBot is a project that uses natural language processing (NLP) to search through a given dataset for messages that are similar to a given search string. It uses OpenAI's text-embedding-ada-002 engine to generate embeddings for the search string and the messages in the dataset, and then uses cosine similarity to find the most similar messages. It then prints out the top n results, and uses OpenAI's text-davinci-003 engine to generate a summary of the context and answer the question.
https://github.com/scottleibrand/slackAskBot
It does a good job of answering questions about topics that were previously discussed in the exported Slack history.
If you're a Slack workspace or org Owner/Admin, you can export your Slack history by following the steps at https://slack.com/help/articles/201658943-Export-your-workspace-data
If you're not a Slack admin, you can ask an admin to export only your Slack history.
Once you've got a Slack export, there are scripts to perform all the necessary processing to filter out just the Slack messages and generate and store (as files) the semantic search embeddings required to search for the content most relevant to the search inquiry.
Once you've done all the initial processing on your Slack export, you can simply run search.py everything.json "your topical inquiry or question" and it will find the most relevant results, and ask InstructGPT to use them to summarize the context most relevant to your inquiry and answer any explicit question you asked.
Future work:
- Make a Slack bot that runs search.py
- Orchestrate all the above scripts to run all of the necessary preprocessing on a Slack export
- Figure out how to extend this to Discord
Use GPT-3 to do semantic search over exported Slack messages to find answers to questions about previously discussed topics - GitHub - scottleibrand/slackAskBot: Use GPT-3 to do semantic search ove...