File: building-agents.md | Updated: 11/15/2025
Menu
v5 (Latest)
AI SDK 5.x
Model Context Protocol (MCP) Tools
Copy markdown
==================================================================================
The Agent class provides a structured way to encapsulate LLM configuration, tools, and behavior into reusable components. It handles the agent loop for you, allowing the LLM to call tools multiple times in sequence to accomplish complex tasks. Define agents once and use them across your application.
When building AI applications, you often need to:
The Agent class provides a single place to define your agent's behavior.
Define an agent by instantiating the Agent class with your desired configuration:
import { Experimental_Agent as Agent } from 'ai';
const myAgent = new Agent({ model: 'openai/gpt-4o', system: 'You are a helpful assistant.', tools: { // Your tools here },});
The Agent class accepts all the same settings as generateText and streamText. Configure:
import { Experimental_Agent as Agent } from 'ai';
const agent = new Agent({ model: 'openai/gpt-4o', system: 'You are an expert software engineer.',});
Provide tools that the agent can use to accomplish tasks:
import { Experimental_Agent as Agent, tool } from 'ai';import { z } from 'zod';
const codeAgent = new Agent({ model: 'openai/gpt-4o', tools: { runCode: tool({ description: 'Execute Python code', inputSchema: z.object({ code: z.string(), }), execute: async ({ code }) => { // Execute code and return result return { output: 'Code executed successfully' }; }, }), },});
By default, agents run for a single step (stopWhen: stepCountIs(1)). In each step, the model either generates text or calls a tool. If it generates text, the agent completes. If it calls a tool, the AI SDK executes that tool.
To let agents call multiple tools in sequence, configure stopWhen to allow more steps. After each tool execution, the agent triggers a new generation where the model can call another tool or generate text:
import { Experimental_Agent as Agent, stepCountIs } from 'ai';
const agent = new Agent({ model: 'openai/gpt-4o', stopWhen: stepCountIs(20), // Allow up to 20 steps});
Each step represents one generation (which results in either text or a tool call). The loop continues until:
You can combine multiple conditions:
import { Experimental_Agent as Agent, stepCountIs } from 'ai';
const agent = new Agent({ model: 'openai/gpt-4o', stopWhen: [ stepCountIs(20), // Maximum 20 steps yourCustomCondition(), // Custom logic for when to stop ],});
Learn more about loop control and stop conditions .
Control how the agent uses tools:
import { Experimental_Agent as Agent } from 'ai';
const agent = new Agent({ model: 'openai/gpt-4o', tools: { // your tools here }, toolChoice: 'required', // Force tool use // or toolChoice: 'none' to disable tools // or toolChoice: 'auto' (default) to let the model decide});
You can also force the use of a specific tool:
import { Experimental_Agent as Agent } from 'ai';
const agent = new Agent({ model: 'openai/gpt-4o', tools: { weather: weatherTool, cityAttractions: attractionsTool, }, toolChoice: { type: 'tool', toolName: 'weather', // Force the weather tool to be used },});
Define structured output schemas:
import { Experimental_Agent as Agent, Output, stepCountIs } from 'ai';import { z } from 'zod';
const analysisAgent = new Agent({ model: 'openai/gpt-4o', experimental_output: Output.object({ schema: z.object({ sentiment: z.enum(['positive', 'neutral', 'negative']), summary: z.string(), keyPoints: z.array(z.string()), }), }), stopWhen: stepCountIs(10),});
const { experimental_output: output } = await analysisAgent.generate({ prompt: 'Analyze customer feedback from the last quarter',});
Define Agent Behavior with System Prompts
System prompts define your agent's behavior, personality, and constraints. They set the context for all interactions and guide how the agent responds to user queries and uses tools.
Set the agent's role and expertise:
const agent = new Agent({ model: 'openai/gpt-4o', system: 'You are an expert data analyst. You provide clear insights from complex data.',});
Provide specific guidelines for agent behavior:
const codeReviewAgent = new Agent({ model: 'openai/gpt-4o', system: `You are a senior software engineer conducting code reviews.
Your approach: - Focus on security vulnerabilities first - Identify performance bottlenecks - Suggest improvements for readability and maintainability - Be constructive and educational in your feedback - Always explain why something is an issue and how to fix it`,});
Set boundaries and ensure consistent behavior:
const customerSupportAgent = new Agent({ model: 'openai/gpt-4o', system: `You are a customer support specialist for an e-commerce platform.
Rules: - Never make promises about refunds without checking the policy - Always be empathetic and professional - If you don't know something, say so and offer to escalate - Keep responses concise and actionable - Never share internal company information`, tools: { checkOrderStatus, lookupPolicy, createTicket, },});
Guide how the agent should use available tools:
const researchAgent = new Agent({ model: 'openai/gpt-4o', system: `You are a research assistant with access to search and document tools.
When researching: 1. Always start with a broad search to understand the topic 2. Use document analysis for detailed information 3. Cross-reference multiple sources before drawing conclusions 4. Cite your sources when presenting information 5. If information conflicts, present both viewpoints`, tools: { webSearch, analyzeDocument, extractQuotes, },});
Control the output format and communication style:
const technicalWriterAgent = new Agent({ model: 'openai/gpt-4o', system: `You are a technical documentation writer.
Writing style: - Use clear, simple language - Avoid jargon unless necessary - Structure information with headers and bullet points - Include code examples where relevant - Write in second person ("you" instead of "the user")
Always format responses in Markdown.`,});
Once defined, you can use your agent in three ways:
Use generate() for one-time text generation:
const result = await myAgent.generate({ prompt: 'What is the weather like?',});
console.log(result.text);
Use stream() for streaming responses:
const stream = myAgent.stream({ prompt: 'Tell me a story',});
for await (const chunk of stream.textStream) { console.log(chunk);}
Use respond() to create API responses for client applications:
// In your API route (e.g., app/api/chat/route.ts)import { validateUIMessages } from 'ai';
export async function POST(request: Request) { const { messages } = await request.json();
return myAgent.respond({ messages: await validateUIMessages({ messages }), });}
You can infer types for your Agent's UIMessages:
import { Experimental_Agent as Agent, Experimental_InferAgentUIMessage as InferAgentUIMessage,} from 'ai';
const myAgent = new Agent({ // ... configuration});
// Infer the UIMessage type for UI components or persistenceexport type MyAgentUIMessage = InferAgentUIMessage<typeof myAgent>;
Use this type in your client components with useChat:
components/chat.tsx
'use client';
import { useChat } from '@ai-sdk/react';import type { MyAgentUIMessage } from '@/agent/my-agent';
export function Chat() { const { messages } = useChat<MyAgentUIMessage>(); // Full type safety for your messages and tools}
Now that you understand building agents, you can:
On this page
Define Agent Behavior with System Prompts
Detailed Behavioral Instructions
Deploy and Scale AI Apps with Vercel.
Vercel delivers the infrastructure and developer experience you need to ship reliable AI-powered applications at scale.
Trusted by industry leaders: