very interested in this angle - especially if it's something that can translate to local models
Ultra-low latency (milliseconds)
uber-fast LLMs RAG'd with a corpus focused on tool calling is quite an exciting idea
https://tenor.com/bkrQOwoAa7h.gif