When you choose a reasoning model for deep research, the reasoning model is primarily used for tasks that require complex problem-solving, strategizing, and planning. According to the information available, reasoning models like the o-series are particularly effective at multi-step agentic planning. This means they are well-suited for creating a detailed, multi-step solution to a problem, such as making a research plan or drafting an outline.
Once the reasoning model has developed a plan, the execution of the plan can involve both reasoning and non-reasoning models, depending on the specific needs of each step. The reasoning model can orchestrate the workflow, selecting and assigning the appropriate model for each task based on whether high intelligence or low latency is more important. For example, the reasoning model might handle the initial planning and complex decision-making, while a GPT model could be used for executing specific tasks that require speed and efficiency.
In essence, the reasoning model acts as "the planner," and it can delegate execution tasks to "the doer" (a GPT model) when appropriate. This approach allows for a combination of strategic planning and efficient task execution, leveraging the strengths of both model types.
For more detailed guidance on how reasoning models are used in multi-step tasks, you can refer to the Reasoning Best Practices guide.
Note: you can ask me a follow up question by @ mentioning me again :speech_balloon: