Why are fp4 providers allowed to be used ahead of bf16 providers only due to a better price? | OpenRouter | Page 1

icy olive Dec 11, 2025, 3:25 AM

#

Currently, the top provider for gpt-oss-120b is an fp4 provider which is $0.01 cheaper than a bf16 provider. The bf16 model will have significantly better response quality. It seems like you are penalizing the bf16 providers, or rather incentivizing open source inference providers to provide the most quantized and low quality version of open source models possible by not factoring quantization into your best bid algorithm.

still yacht Dec 12, 2025, 3:08 PM

#

Our price sorting isn't so aggressive that the BF16 provider will get no traffic with the difference being $0.01. Regarding "significantly better response quality", are there any specific evals you ran to measure that difference? I'm sure the team is happy to factor that into consideration if we can reproduce those results

icy olive Jan 3, 2026, 4:54 AM

#

I do not have an eval, but anecdotal reports from open source subreddits are that fp4 will have a noticeable quality and intelligence reduction from 8 bit or 16 bit

brisk kindle Jan 3, 2026, 7:09 PM

#

Ive never heard of open source subreddits

icy olive Jan 4, 2026, 8:38 PM

#

open source subreddit: https://www.reddit.com/r/LocalLLaMA/
Anecdotal reports:
https://www.google.com/search?q=quantized+performance+site%3Areddit.com
Looks like it isn't too much of a difference for large param models to use fp4 but smaller models suffer performance degredation

LocalLlama

Subreddit to discuss AI & Llama, the large language model created by Meta AI.

www.google.com

🔎 quantized performance site:reddit.com - Google Search

brisk kindle Jan 5, 2026, 9:39 AM

#

icy olive open source subreddit: https://www.reddit.com/r/LocalLLaMA/ Anecdotal reports: h...

That’s a local llama subreddit. Reddit is inherently closed source. I think you mean public instead of open source

icy olive Jan 9, 2026, 5:32 AM

#

No need to be intentionally obtuse

#Why are fp4 providers allowed to be used ahead of bf16 providers only due to a better price?