#Tongyi DeepResearch
29 messages · Page 1 of 1 (latest)
I second this motion
Tongyi DeepResearch is an agentic large language model developed by Tongyi Lab, with 30 billion total parameters activating only 3 billion per token. It's optimized for long-horizon, deep information-seeking tasks and delivers state-of-the-art performance on benchmarks like Humanity's Last Exam, BrowserComp, BrowserComp-ZH, WebWalkerQA, GAIA, xb...
alibaba puts a heavy premium on their endpoints
lovely pricing
oh this would be nice to have, would like to try deep research models with open deep research platforms
Let’s gooo
Super cool
i don’t understand this
is this supposed to search the web? how do we make it do that
Give your own tools
It's good at using it
This fails at tool calling for me for some reason xD
Yeah @fossil tundra Is this natively supposed to have access to web search or do we need to provide it a web search tool?
It doesn’t have native web search, but you can add :online to to the end of the model name to enable Exa-powered web search
oh my god this model is incredible now
it was hallucinating every url before and i didn't understand why but that makes total sense now
Nice :)
Def getting some errors on some requests though like
(Tongyi request failed)
or
We need to perform a new web search. Let's use the appropriate tool.
</think>
We to=assistant<|channel|>commentary<|message|>Error: Tool call is not valid.
</tool_call>
</tool_response
This model is not for general purpose
It's an agentic deep research model that makes use of tools and is really good at assimilating sources and making queries for deep research
It needs to be paired with a deep research platform/code
I’d suggest checking out the Alibaba Cloud Developer Community on Discord,
the admin Lara usually loops in the Qwen team to help solve issues directly.
It’s been pretty helpful for a lot of people
Are there any examples of this? Curious what kind of harness is required for this. Is it just a while loop running over and over again checking the queries its generated and the tool calls its made until a certain condition is met?
Ah, will do!
The best way to learn about the procedure is through their own report:
https://github.com/Alibaba-NLP/DeepResearch/
This github repo has the pipeline required in here, they also mention how you can connect it via openrouter api
So, inorder to use this model effectively, you need to download this repo and follow the steps mentioned in this repo and then use the model with this pipeline(It already has the code and everything, you probably just need to plug in some api key and do some minor changes)