#Alibaba releases an open-source QwQ-32B chain of thought challenger to OpenAI’s o1 reasoning model

1 messages · Page 1 of 1 (latest)

outer aspen
#

https://www.msn.com/en-us/news/other/alibaba-releases-an-open-challenger-to-openai-s-o1-reasoning-model/ar-AA1uSFko?ocid=BingNewsSerp

Alibaba's Qwen team has released QwQ-32B-Preview, a new AI reasoning model containing 32.5 billion parameters that rivals OpenAI's o1 models. The model outperforms o1-preview and o1-mini on certain benchmarks like AIME and MATH tests, demonstrating strong capabilities in logic puzzles and mathematical problems while featuring self-fact-checking abilities. However, it has limitations including unexpected language switches and underperformance on common sense tasks, and notably reflects its Chinese origin by adhering to government perspectives on sensitive political topics.

The release of QwQ-32B-Preview highlights the pause movement's concerns about an accelerating US-China race towards AGI, as Alibaba has quickly matched OpenAI's capabilities in reasoning models. This accomplishment suggests the competitive rush is outpacing our ability to ensure safe AI development.

wet ember
#

Got a link?

stone sleet
#

Whoah, that's very impressive...

agile quest
#

Well I didn't expect there to ever be a frontier-beating open-weights CHINESE model anytime soon, this surprises me

stone sleet
#

Sota on math? Near sota livecodebench? Holy shit

agile quest
#

Let's write a post on this

stone sleet
agile quest
#

Hmm

#

I thought the Chinese would want to keep their models under strict control, not release them into the wild

stone sleet
#

I wonder how they manage to train these given export restrictions

agile quest
#

Yeah

#

apparently export control doesn't work

stone sleet
#

Well we don't know the counterfactual. Maybe they'd have way better models if the controls weren't in place

agile quest
#

Yeah, perhaps.

Probably this strengthens the commission's recommendation to race ahead

outer aspen
outer aspen
# agile quest I thought the Chinese would want to keep their models under strict control, not ...

Perhaps they are using the same business model as Facebook. If you release open source models then that deprives your competitors of income making them weak in the long run.

Also, China thinks that it can make its open source model available to developing countries making them win that aspect of the race. Developing countries would think: Why pay for American models when Chinese models are free.

soft cedar
#

Takeaways: it is still a suicide race to race. And benchmarks can be gamed, which doesn't speak to real capabilities.

#

We clearly need better export controls, however.

night bridge
#

Fact is we have no clue what their government is doing if they are open sourcing qwq this they feel safe in their state backed ones

agile quest
#

Very interesting.