Understanding the agent loop

Note

Copilot SDK is currently in public preview. Functionality and availability are subject to change.

Architecture

Diagram showing the flow from your app to the SDK session, to the Copilot CLI, to the LLM, and back.

The Copilot SDK is a transport layer—it sends your prompt to Copilot CLI over JSON-RPC and surfaces events back to your app. The CLI is the orchestrator that runs the agentic tool-use loop, making one or more LLM API calls until the task is done.

The tool-use loop

When you call session.send({ prompt }), Copilot CLI enters a loop:

Diagram of the tool-use loop: prompt to LLM call, tool execution if requested, otherwise session.idle.

The model sees the full conversation history on each call—system prompt, user message, and all prior tool calls and results.

Each iteration of this loop is exactly one LLM API call, visible as one assistant.turn_start / assistant.turn_end pair in the event log. There are no hidden calls.

Turns

A turn is a single LLM API call and its consequences:

Copilot CLI sends the conversation history to the LLM
The LLM responds (possibly with tool requests)
If tools were requested, Copilot CLI executes them
assistant.turn_end is emitted

A single user message typically results in multiple turns. For example, a question like "how does X work in this codebase?" might produce:

Turn	What the model does	toolRequests?
1	Calls `grep` and `glob` to search the codebase	Yes
2	Reads specific files based on search results	Yes
3	Reads more files for deeper context	Yes
4	Produces the final text answer	No (loop ends)

The model decides on each turn whether to request more tools or produce a final answer. Each call sees the full accumulated context (all prior tool calls and results), so it can make an informed decision about whether it has enough information.

Event flow for a multi-turn interaction

Diagram of three turns in an agent interaction, each with start, message, tool, and end events, ending in session.idle.

Who triggers each turn

Actor	Responsibility
Your app	Sends the initial prompt via `session.send()`
Copilot CLI	Runs the tool-use loop—executes tools and feeds results back to the LLM for the next turn
LLM	Decides whether to request tools (continue looping) or produce a final response (stop)
Copilot SDK	Passes events through; does not control the loop

Copilot CLI is purely mechanical: "model asked for tools → execute → call model again." The model is the decision-maker for when to stop.

`session.idle` vs `session.task_complete`

These are two different completion signals with very different guarantees.

`session.idle`

Always emitted when the tool-use loop ends
Ephemeral: not persisted to disk, not replayed on session resume
Means: "the agent has stopped processing and is ready for the next message"
Use this as your reliable "done" signal

The Copilot SDK's sendAndWait() method waits for this event:

// Blocks until session.idle fires
const response = await session.sendAndWait({ prompt: "Fix the bug" });

`session.task_complete`

Optionally emitted: requires the model to explicitly signal it
Persisted: saved to the session event log on disk
Means: "the agent considers the overall task fulfilled"
Carries an optional summary field

session.on("session.task_complete", (event) => {
    console.log("Task done:", event.data.summary);
});

Autopilot mode

In autopilot mode (headless or autonomous operation), Copilot CLI actively tracks whether the model has called task_complete. If the tool-use loop ends without it, Copilot CLI injects a synthetic user message nudging the model to continue working.

This creates a two-level completion mechanism in autopilot:

The model calls task_complete with a summary → Copilot CLI emits session.task_complete → done
The model stops without calling it → Copilot CLI nudges → model continues or calls task_complete

Why `task_complete` might not appear

In interactive mode (normal chat), Copilot CLI does not nudge for task_complete. The model may skip it entirely. Common reasons:

Conversational Q&A: the model answers a question and simply stops—there's no discrete "task" to complete
Model discretion: the model produces a final text response without calling the task-complete signal
Interrupted sessions: the session ends before the model reaches a completion point

Copilot CLI emits session.idle regardless, because it's a mechanical signal (the loop ended), not a semantic one (the model thinks it's done).

Which signal to use

Use case	Signal
Wait for the agent to finish processing	`session.idle`
Know when a coding task is done	`session.task_complete` (best-effort)
Timeout and error handling	`session.idle` + `session.error`

Counting LLM calls

The number of assistant.turn_start / assistant.turn_end pairs in the event log equals the total number of LLM API calls made. There are no hidden calls for planning, evaluation, or completion checking.

Who can use this feature?

In this article