Chapter 6: Planning

Formulate multi-step strategies to achieve complex goals through autonomous task decomposition

Intermediate16 min readInteractive Playground

Chapter 6 • Core Pattern

Planning Pattern

Master the art of breaking down complex tasks into manageable steps with intelligent planning agents

At a Glance

Quick Overview

❓ What

Complex problems often cannot be solved with a single action and require foresight to achieve a desired outcome. Without a structured approach, an agentic system struggles to handle multifaceted requests that involve multiple steps and dependencies.

💡 Why

The Planning pattern offers a standardized solution by having an agentic system first create a coherent plan to address a goal. It involves decomposing a high-level objective into a sequence of smaller, actionable steps or sub-goals.

🎯 Rule of Thumb

Use this pattern when a user's request is too complex to be handled by a single action or tool. It is ideal for automating multi-step processes, such as generating a detailed research report or executing a competitive analysis.

What is Planning?

For Beginners

🎯 Simple Analogy: The Trip Planner

Imagine you're planning a road trip from New York to Los Angeles. You don't just start driving randomly! You break it down: check the route, identify stops for gas and food, book hotels, estimate driving time for each day.

Planning agents work the same way. When given a complex task like "analyze this dataset and create a report," they don't jump straight to execution. They first create a step-by-step plan: load data → clean data → analyze patterns → generate visualizations → write summary → compile report.

How Planning Works

📋

1. Task Analysis

Break down the goal into smaller, manageable sub-tasks

🔄

2. Plan Generation

Create a sequence of actions with dependencies

⚡

3. Execution

Execute each step in order, adapting as needed

✅

4. Verification

Check results and adjust plan if necessary

Planning pattern workflow diagram showing task analysis, plan generation, execution, and verification

Topic: planning

Image placeholder - upload your image to replace

🔧 How Planning Agents Work

Planning Strategies

ReAct (Reasoning + Acting)

Interleaves reasoning and action steps. The agent thinks about what to do, takes an action, observes the result, then reasons about the next step.

Thought → Action → Observation → Thought → ...

Plan-and-Execute

Creates a complete plan upfront, then executes each step. More efficient but less adaptive to changes.

Plan All Steps → Execute Step 1 → Execute Step 2 → ...

Hierarchical Planning

Breaks tasks into high-level goals, then decomposes each goal into sub-tasks recursively.

Goal → Sub-goals → Tasks → Sub-tasks

💡 Key Insight

Planning is especially powerful for multi-step tasks where the order matters and where intermediate results inform future actions. It's the difference between "winging it" and having a roadmap.

🚀 When to Use Planning

✅ Great For:

•Multi-step workflows with dependencies
•Complex research or analysis tasks
•Tasks requiring tool orchestration
•Goal-oriented problem solving

❌ Not Ideal For:

•Simple, single-step tasks
•Real-time reactive systems
•Highly unpredictable environments
•Tasks with unclear goals

⚖️ Planning vs Fixed Workflows

The Trade-off: Flexibility vs Predictability

A hallmark of planning is adaptability. An initial plan is merely a starting point, not a rigid script. The agent's real power is its ability to incorporate new information and steer the project around obstacles.

However, it is crucial to recognize the trade-off between flexibility and predictability. Dynamic planning is a specific tool, not a universal solution. When a problem's solution is already well-understood and repeatable, constraining the agent to a predetermined, fixed workflow is more effective.

The decision to use a planning agent versus a simple task-execution agent hinges on a single question: does the "how" need to be discovered, or is it already known?

Use Planning When:

•The solution path is unknown or varies by context
•The environment is dynamic and requires adaptation
•Multiple valid approaches exist
•Obstacles may require plan adjustments

Use Fixed Workflows When:

•The solution is well-understood and repeatable
•Predictability and consistency are critical
•The process is standardized and proven
•Reducing uncertainty is more important than flexibility

🔍 Google DeepResearch

Google Gemini DeepResearch is an agent-based system designed for autonomous information retrieval and synthesis. It functions through a multi-step agentic pipeline that dynamically and iteratively queries Google Search to systematically explore complex topics.

How DeepResearch Works

1. Multi-Point Research Plan

Deconstructs a user's prompt into a multi-point research plan, presented to the user for review and modification before execution.

2. Iterative Search-and-Analysis Loop

Dynamically formulates and refines queries based on gathered information, actively identifying knowledge gaps, corroborating data points, and resolving discrepancies.

3. Asynchronous Processing

Manages the investigation asynchronously, analyzing hundreds of sources while being resilient to single-point failures.

4. Synthesis Phase

Performs critical evaluation of collected information, identifying major themes and organizing content into a coherent narrative with logical sections, citations, and interactive features.

💡 Key Benefits

Efficiency: Automates the iterative search-and-filter cycle, a core bottleneck in manual research
Comprehensiveness: Processes larger volume and variety of sources than typically feasible for humans
Transparency: Returns full list of sources with citations for verification

🤖 OpenAI Deep Research API

The OpenAI Deep Research API is a specialized tool designed to automate complex research tasks. It utilizes an advanced, agentic model (like o3-deep-research-2025-06-26) that can independently reason, plan, and synthesize information from real-world sources.

Structured, Cited Output

Produces well-organized reports with inline citations linked to source metadata, ensuring claims are verifiable and data-backed.

Transparency

Exposes all intermediate steps, including the agent's reasoning, specific web search queries, and any code it ran.

Extensibility

Supports Model Context Protocol (MCP), enabling connection to private knowledge bases and internal data sources.

Example API Usage

response = client.responses.create(
  model="o3-deep-research-2025-06-26",
  input=[
    {"role": "developer", "content": [{"type": "input_text", "text": system_message}]},
    {"role": "user", "content": [{"type": "input_text", "text": user_query}]}
  ],
  reasoning={"summary": "auto"},
  tools=[{"type": "web_search_preview"}]
)

📊 Visual Summary

Goal Decomposition

Break complex objectives into manageable sub-tasks

Sequential Execution

Execute steps in logical order with dependencies

Adaptive Planning

Adjust plans based on intermediate results

Goal Achievement

Synthesize results into final outcome

🎯 Key Takeaways

1. Planning enables agents to break down complex goals into actionable, sequential steps

2. It is essential for handling multi-step tasks, workflow automation, and navigating complex environments

3. LLMs can perform planning by generating step-by-step approaches based on task descriptions

4. Explicitly prompting or designing tasks to require planning steps encourages this behavior in agent frameworks

5. Google Deep Research and OpenAI Deep Research exemplify advanced planning systems that reflect, plan, and execute autonomously