Alibaba releases an open-source QwQ-32B chain of thought challenger to OpenAI’s o1 reasoning model | PauseAI | Page 1

outer aspen Nov 28, 2024, 11:40 AM

#

https://www.msn.com/en-us/news/other/alibaba-releases-an-open-challenger-to-openai-s-o1-reasoning-model/ar-AA1uSFko?ocid=BingNewsSerp

Alibaba's Qwen team has released QwQ-32B-Preview, a new AI reasoning model containing 32.5 billion parameters that rivals OpenAI's o1 models. The model outperforms o1-preview and o1-mini on certain benchmarks like AIME and MATH tests, demonstrating strong capabilities in logic puzzles and mathematical problems while featuring self-fact-checking abilities. However, it has limitations including unexpected language switches and underperformance on common sense tasks, and notably reflects its Chinese origin by adhering to government perspectives on sensitive political topics.

The release of QwQ-32B-Preview highlights the pause movement's concerns about an accelerating US-China race towards AGI, as Alibaba has quickly matched OpenAI's capabilities in reasoning models. This accomplishment suggests the competitive rush is outpacing our ability to ensure safe AI development.

wet ember Nov 28, 2024, 12:15 PM

#

Got a link?

#

https://qwenlm.github.io/blog/qwq-32b-preview/

Qwen

QwQ: Reflect Deeply on the Boundaries of the Unknown

GITHUB HUGGING FACE MODELSCOPE DEMO DISCORD
Note: This is the pronunciation of QwQ: /kwju:/ , similar to the word “quill”.
What does it mean to think, to question, to understand? These are the deep waters that QwQ (Qwen with Questions) wades into. Like an eternal student of wisdom, it approaches every problem - be it mathematics, code, or knowle...

stone sleet Nov 28, 2024, 12:29 PM

#

Whoah, that's very impressive...

agile quest Nov 28, 2024, 12:29 PM

#

Well I didn't expect there to ever be a frontier-beating open-weights CHINESE model anytime soon, this surprises me

stone sleet Nov 28, 2024, 12:30 PM

#

Sota on math? Near sota livecodebench? Holy shit

agile quest Nov 28, 2024, 12:30 PM

#

Let's write a post on this

stone sleet Nov 28, 2024, 12:30 PM

#

agile quest Well I didn't expect there to ever be a frontier-beating open-weights CHINESE mo...

DeepSeek is also very close to this btw

agile quest Nov 28, 2024, 12:30 PM

#

Hmm

#

I thought the Chinese would want to keep their models under strict control, not release them into the wild

stone sleet Nov 28, 2024, 12:31 PM

#

I wonder how they manage to train these given export restrictions

agile quest Nov 28, 2024, 12:31 PM

#

Yeah

#

apparently export control doesn't work

stone sleet Nov 28, 2024, 12:33 PM

#

Well we don't know the counterfactual. Maybe they'd have way better models if the controls weren't in place

agile quest Nov 28, 2024, 12:34 PM

#

Yeah, perhaps.

Probably this strengthens the commission's recommendation to race ahead

outer aspen Nov 28, 2024, 1:18 PM

#

stone sleet I wonder how they manage to train these given export restrictions

Kai Fu Lee said that desperation is necessary for innovation.

This is forcing China to optimize for algorithms given the lack of scaling using GPUs because of sanctions.

outer aspen Nov 28, 2024, 1:21 PM

#

agile quest I thought the Chinese would want to keep their models under strict control, not ...

Perhaps they are using the same business model as Facebook. If you release open source models then that deprives your competitors of income making them weak in the long run.

Also, China thinks that it can make its open source model available to developing countries making them win that aspect of the race. Developing countries would think: Why pay for American models when Chinese models are free.

soft cedar Nov 28, 2024, 11:20 PM

#

Takeaways: it is still a suicide race to race. And benchmarks can be gamed, which doesn't speak to real capabilities.

#

We clearly need better export controls, however.

night bridge Dec 1, 2024, 11:45 PM

#

agile quest I thought the Chinese would want to keep their models under strict control, not ...

These are small ones inn2022 they dedicated an exascale compute center to training a 174 trillion parameter ai and then shut up about it real fast .

https://keg.cs.tsinghua.edu.cn/jietang/publications/PPOPP22-Ma et al.-BaGuaLu Targeting Brain Scale Pretrained Models w.pdf

#

Fact is we have no clue what their government is doing if they are open sourcing qwq this they feel safe in their state backed ones

#

agile quest Dec 2, 2024, 1:24 AM

#

Very interesting.

#Alibaba releases an open-source QwQ-32B chain of thought challenger to OpenAI’s o1 reasoning model