Marco Mini and Nano Instruct | OpenRouter | Page 1

high cove Apr 9, 2026, 5:38 AM

#

https://huggingface.co/AIDC-AI/Marco-Mini-Instruct

https://huggingface.co/AIDC-AI/Marco-Nano-Instruct

AIDC-AI/Marco-Mini-Instruct · Hugging Face

We’re on a journey to advance and democratize artificial intelligence through open source and open science.

AIDC-AI/Marco-Nano-Instruct · Hugging Face

We’re on a journey to advance and democratize artificial intelligence through open source and open science.

#

The 17b one shows strong multilingual perf and offers unprecedented activation rations for a model its size. Here's what I got off ModelScopes twitter:

#

Marco Mini and Nano Instruct

#

https://x.com/ModelScope2022/status/2042084482661191942

ModelScope (@ModelScope2022)

Meet Marco-Mini-Instruct: a highly sparse MoE multilingual model from Alibaba International. 17.3B total params, only 0.86B active (5% activation ratio). 🚀

Beats Qwen3-4B, Gemma3-12B, Granite4-Small on English, multilingual general, and cultural benchmarks — with a fraction of

autumn stag Apr 9, 2026, 3:48 PM

#

Interesting 🤔

high cove Apr 9, 2026, 8:07 PM

#

I just tried 4Q_K_M of the 17B A0.86B. It's reasonably fast for AVX2 on two threads. It's math is certainly at least on par with Qwen3 4B, and it's not even a thinking model. So far stable, and no issues.

#

CtxLimit:329/8192, Amt:300/300, Init:0.05s, Process:0.69s (21.65T/s), Generate:21.29s (14.09T/s),

low coral Apr 9, 2026, 8:49 PM

#

high cove I just tried 4Q_K_M of the 17B A0.86B. It's reasonably fast for AVX2 on two thre...

Did you use llama.cpp or what?

high cove Apr 9, 2026, 8:50 PM

#

low coral Did you use llama.cpp or what?

Mhm. There are GGUFs out. It's just qwen3moe arch. I used kobold.ccp.

low coral Apr 9, 2026, 8:50 PM

#

high cove Mhm. There are GGUFs out. It's just qwen3moe arch. I used kobold.ccp.

Cool thanks, wasn't sure if it was supported yet.

high cove Apr 9, 2026, 8:50 PM

#

NP ^^

low coral Apr 9, 2026, 8:51 PM

#

It didn't cross my mind that it isn't a thinking model; I guess I just assume all new releases are.

high cove Apr 9, 2026, 8:54 PM

#

If you give it math or a logic puzzle, it will give you chain of thought. It is still a qwen model afterall. But, it doesn't use thinking tags by default.

#Marco Mini and Nano Instruct