#StarChat2-15B-v0.1

10 messages · Page 1 of 1 (latest)

pliant stratus
pliant stratus
#

@deep thunder The twitter post you mentioned says that this is also hosted by HuggingFace Inference API

@iamRezaSayar @Gradio @huggingface I made you a list:

HuggingFaceM4/idefics2-8b
codellama/CodeLlama-7b-hf
HuggingFaceH4/zephyr-7b-alpha
google/flan-t5-xxl
bigcode/octocoder
bigcode/santacoder
bigcode/starcoder
bigcode/starcoder2-15b
bigcode/starcoder2-3b
codellama/CodeLlama-13b-hf…

#

Its benchmark performance is way better than BigCode's instruct finetune of StarCoder2

deep thunder
deep thunder
pliant stratus
#

StarChat2 - 71.3 HumanEval+, 64.6 MBPP+
StarCoder2-Instruct - 60.4 HumanEval+, 65.1 MBPP+
Speechless-StarCoder2 - 62.8 HumanEval+, 62.1 MBPP+

#

oh, it's not as good at MBPP+

#

unfortunately we have no benchmark data from it

#

other than EvalPlus

#

also original MBPP is probably completely useless, since it contains many nonsense questions and questions with incorrect test cases