#Tool to evaluate LLMs on summarization
I need a tool to evaluate LLMs on a document summarization dataset. Do you know of a tool that takes the dataset in JSON, runs the models on it, and calculates the BLEU scores?
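For illustration only, the workflow described above (load JSON, generate a summary per document, score against the reference) could be sketched roughly as follows. Everything specific here is an assumption rather than part of the question: the record fields `document` and `summary`, the hypothetical `summarize` callable standing in for a model, whitespace tokenization, a single reference per document, and unsmoothed corpus-level BLEU. An established implementation such as sacreBLEU or NLTK would normally be preferable to hand-rolled scoring.

```python
import json
import math
from collections import Counter


def ngrams(tokens, n):
    """Count the n-grams of length n in a token list."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def corpus_bleu(candidates, references, max_n=4):
    """Corpus-level BLEU with one reference per candidate and no smoothing."""
    matches = [0] * max_n
    totals = [0] * max_n
    cand_len = ref_len = 0
    for cand, ref in zip(candidates, references):
        c, r = cand.split(), ref.split()  # assumption: whitespace tokenization
        cand_len += len(c)
        ref_len += len(r)
        for n in range(1, max_n + 1):
            c_counts, r_counts = ngrams(c, n), ngrams(r, n)
            # Clipped matches: a candidate n-gram counts at most as often
            # as it appears in the reference.
            matches[n - 1] += sum((c_counts & r_counts).values())
            totals[n - 1] += sum(c_counts.values())
    if cand_len == 0 or min(matches) == 0:
        return 0.0  # without smoothing, any zero precision gives BLEU 0
    log_precision = sum(math.log(m / t) for m, t in zip(matches, totals)) / max_n
    brevity_penalty = 1.0 if cand_len > ref_len else math.exp(1 - ref_len / cand_len)
    return brevity_penalty * math.exp(log_precision)


def evaluate(dataset_json, summarize):
    """Score a model on a JSON dataset.

    dataset_json: JSON text holding a list of records with the assumed
                  fields "document" and "summary".
    summarize:    hypothetical callable mapping a document string to a
                  generated summary string (the model under test).
    """
    records = json.loads(dataset_json)
    candidates = [summarize(rec["document"]) for rec in records]
    references = [rec["summary"] for rec in records]
    return corpus_bleu(candidates, references)
```

Calling `evaluate(open("data.json").read(), my_model)` with a hypothetical file and model function would return a single score between 0 and 1; comparing several models means calling it once per model on the same dataset.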