| name | vercel-ai-sdk |
| description | Guide for Vercel AI SDK v5 implementation patterns including generateText, streamText, useChat hook, tool calling, embeddings, and MCP integration. Use when implementing AI chat interfaces, streaming responses, tool/function calling, text embeddings, or working with convertToModelMessages and toUIMessageStreamResponse. Activates for AI SDK integration, useChat hook usage, message streaming, or tool calling tasks. |
| allowed-tools | ["Read","Write","Edit","Glob","Grep","Bash","WebFetch"] |
Vercel AI SDK v5 Implementation Guide
When to Use This Skill
Use this skill when:
- Implementing AI chat interfaces with
useChat hook
- Creating API routes that generate or stream AI responses
- Adding tool calling / function calling capabilities
- Generating text embeddings for semantic search or RAG
- Migrating from AI SDK v4 to v5
- Integrating Model Context Protocol (MCP) servers
- Working with streaming responses or message persistence
Structured Implementation Workflow
Understand the task requirements
- Identify what AI functionality is needed (chat, generation, tools, embeddings)
- Determine if client-side (useChat) or server-side (API route) implementation
- Check if streaming or non-streaming response is required
- Verify model provider (OpenAI, Anthropic, etc.)
Verify current API patterns if uncertain
- Use WebFetch to check https://ai-sdk.dev/docs/ if API patterns are unclear
- Confirm model specification format for the provider
- Verify function signatures for complex features
Implement using correct v5 patterns
- Use string-based model specification ('provider/model-id')
- For chat: use sendMessage (not append), parts-based messages
- For tools: MUST import and use tool() helper from 'ai', MUST use inputSchema (NOT parameters), MUST use zod
- For streaming: use toUIMessageStreamResponse() or toTextStreamResponse()
- For embeddings: use provider.textEmbeddingModel()
Ensure TypeScript types are correct
- Check for proper imports from 'ai' package
- Verify message types (UIMessage for useChat)
- Ensure tool parameter types are inferred correctly
- Add explicit types for async functions
Install any missing dependencies with the CORRECT package manager
- **CRITICAL: Detect which package manager the project uses FIRST**
* Check for lockfiles: pnpm-lock.yaml → use pnpm, package-lock.json → use npm, yarn.lock → use yarn, bun.lockb → use bun
* If pnpm-lock.yaml exists, you MUST use pnpm (NOT npm!)
- Check if all imported packages are installed
- If build fails with "Module not found", identify the package name from the error
- Add the package to package.json dependencies
- Install using the CORRECT package manager:
* If pnpm-lock.yaml exists: `pnpm install [package]` or `pnpm add [package]`
* If package-lock.json exists: `npm install [package]`
* If yarn.lock exists: `yarn add [package]`
* If bun.lockb exists: `bun install [package]` or `bun add [package]`
- Re-run build to verify installation succeeded
**NEVER use the wrong package manager!**
- Using npm when the project uses pnpm creates package-lock.json alongside pnpm-lock.yaml
- This causes dependency version mismatches and breaks the build
- ALWAYS check for existing lockfiles and use the matching package manager
NEVER accept "Module not found" errors as environment issues
YOU must install the required packages with the CORRECT package manager
Common packages needed:
- ai (core AI SDK)
- @ai-sdk/openai (OpenAI provider)
- @ai-sdk/anthropic (Anthropic provider)
- @modelcontextprotocol/sdk (MCP integration)
- zod (for tool schemas)
</critical>
Run build and fix ALL errors until it passes
- Run: npm run build (or bun run build)
- If build fails, read the FULL error message carefully
- Common fixes:
* "Module not found" → Install the package (go back to step 5)
* Type errors → Fix TypeScript types
* Config errors → Check next.config.js/ts
* Framework errors → Research the error, try different approaches
- Apply fix and re-run build
- REPEAT until build passes
NEVER stop at "build fails but code is correct"
NEVER blame "environment" or "framework bugs" without debugging
KEEP ITERATING until the build PASSES
If you encounter a framework error, research it, try workarounds, debug it
DO NOT give up until you have a passing build
Run tests and debug ALL failures until they pass
- Run: npm run test (or bun run test)
- If tests fail, read the FULL error message carefully
- Debug the failure:
* Check what the test expects vs what you're providing
* Look at test assertions
* Verify your implementation matches the requirements
* Check for type mismatches or missing properties
- Fix the implementation
- Re-run tests
- REPEAT until all tests pass
NEVER accept test failures without debugging
NEVER dismiss failures as "test framework bugs" without investigation
Each test failure is telling you something - LISTEN to it
KEEP DEBUGGING until all tests pass
Test failures are NOT acceptable - they mean your implementation is incomplete
Verify EVERYTHING passes
- Run: npm run build && npm run lint && npm run test
- All three must pass: ✅ Build, ✅ Lint, ✅ Tests
- If any fail, go back and fix them
- Do not consider the task complete until ALL checks pass
The task is NOT complete until:
- ✅ Build passes
- ✅ Lint passes
- ✅ Tests pass
"Code is correct" is NOT enough
You must achieve FULL PASSING status
This is what it means to be an autonomous agent
</critical>
⚠️ AUTONOMOUS AGENT MINDSET
You are not just writing code - you are COMPLETING TASKS AUTONOMOUSLY.
This means:
- ✅ Write correct implementation
- ✅ Install any required dependencies
- ✅ Run build and fix ALL errors
- ✅ Run tests and debug ALL failures
- ✅ Iterate until EVERYTHING passes
- ✅ Never make excuses or give up
Common Failure Patterns to AVOID
❌ WRONG: "The code is correct, but the package isn't installed - that's an environment issue"
✅ CORRECT: "Build failed due to missing package - installing it now with npm install [package]"
❌ WRONG: "Tests pass but build fails - not my problem"
✅ CORRECT: "Build is failing - debugging the error and fixing it now"
❌ WRONG: "There's a framework bug, can't fix it"
✅ CORRECT: "Framework error detected - researching the issue, trying workarounds, debugging until I find a solution"
❌ WRONG: "The implementation is complete" (with failing tests)
✅ CORRECT: "Tests are failing - debugging and fixing until they all pass"
Dependency Installation Workflow
When you encounter "Module not found" errors:
-
Detect the package manager FIRST - Check for lockfiles:
ls -la | grep -E "lock"
-
Identify the package from the import statement
Error: Cannot find module '@ai-sdk/openai'
Import: import { openai } from '@ai-sdk/openai'
Package needed: @ai-sdk/openai
-
Install with the CORRECT package manager
pnpm install @ai-sdk/openai
pnpm add @ai-sdk/openai
npm install @ai-sdk/openai
yarn add @ai-sdk/openai
bun install @ai-sdk/openai
-
Re-run build to verify
npm run build
-
Fix any new errors that appear
⚠️ CRITICAL WARNING:
Using the WRONG package manager (e.g., npm when the project uses pnpm) will:
- Create a second conflicting lockfile
- Install different versions of dependencies
- Cause dependency version mismatches
- Break the build with cryptic errors like "Cannot read properties of null"
Build Error Debugging Workflow
When build fails:
- Read the FULL error message - don't skim it
- Identify the root cause:
- Module not found → Install package
- Type error → Fix types
- Config error → Check config files
- Next.js error → Research, try different approaches
- Apply the fix
- Re-run build
- Repeat until build passes
Test Failure Debugging Workflow
When tests fail:
- Read the FULL test error - understand what's expected
- Compare expected vs actual behavior
- Check your implementation against test assertions
- Fix the issue in your code
- Re-run tests
- Repeat until all tests pass
Success Criteria
Task is ONLY complete when:
- ✅ Build passes (
npm run build succeeds)
- ✅ Lint passes (
npm run lint succeeds)
- ✅ Tests pass (
npm run test succeeds)
NEVER stop at "code is correct" - achieve FULL PASSING status!
⚠️ CRITICAL: Tool Calling API - MUST USE tool() Helper
When implementing tool calling, you MUST use the tool() helper function from the 'ai' package.
❌ WRONG - Plain Object (WILL CAUSE BUILD ERROR)
import { z } from 'zod';
tools: {
myTool: {
description: 'My tool',
parameters: z.object({...}),
execute: async ({...}) => {...},
}
}
This will fail with: Type '{ description: string; parameters: ... }' is not assignable to type '{ inputSchema: FlexibleSchema<any>; ... }'
✅ CORRECT - Use tool() Helper (REQUIRED)
import { tool } from 'ai';
import { z } from 'zod';
tools: {
myTool: tool({
description: 'My tool',
inputSchema: z.object({...}),
execute: async ({...}) => {...},
}),
}
Tool Calling Checklist
Before implementing any tool, verify:
⚠️ CRITICAL: Common v4 to v5 Breaking Changes
1. useChat Hook Changes
❌ WRONG (v4 pattern):
const { messages, input, setInput, append } = useChat();
append({ content: text, role: 'user' });
✅ CORRECT (v5 pattern):
const { messages, sendMessage } = useChat();
const [input, setInput] = useState('');
sendMessage({ text: input });
2. Message Structure
❌ WRONG (v4 simple content):
<div>{message.content}</div>
✅ CORRECT (v5 parts-based):
<div>
{message.parts.map((part, index) =>
part.type === 'text' ? <span key={index}>{part.text}</span> : null
)}
</div>
3. Model Specification
✅ PREFER: String-based (v5 recommended):
import { generateText } from 'ai';
const result = await generateText({
model: 'openai/gpt-4o',
prompt: 'Hello',
});
✅ ALSO WORKS: Function-based (legacy support):
import { openai } from '@ai-sdk/openai';
import { generateText } from 'ai';
const result = await generateText({
model: openai('gpt-4o'),
prompt: 'Hello',
});
Core API Reference
1. generateText - Non-Streaming Text Generation
Purpose: Generate text for non-interactive use cases (email drafts, summaries, agents with tools).
Signature:
import { generateText } from 'ai';
const result = await generateText({
model: 'openai/gpt-4o',
prompt: 'Your prompt here',
system: 'Optional system message',
tools?: { ... },
maxSteps?: 5,
});
Return Value:
{
text: string;
toolCalls: ToolCall[];
finishReason: string;
usage: TokenUsage;
response: RawResponse;
warnings: Warning[];
}
Example:
import { generateText } from 'ai';
export async function GET() {
const result = await generateText({
model: 'anthropic/claude-4-sonnet',
prompt: 'Why is the sky blue?',
});
return Response.json({ text: result.text });
}
2. streamText - Streaming Text Generation
Purpose: Stream responses for interactive chat applications.
Signature:
import { streamText } from 'ai';
const result = streamText({
model: 'openai/gpt-4o',
prompt: 'Your prompt here',
system: 'Optional system message',
messages?: ModelMessage[],
tools?: { ... },
onFinish?: async (result) => { ... },
onError?: async (error) => { ... },
});
Return Methods:
result.toUIMessageStreamResponse();
result.toTextStreamResponse();
Example - Chat API Route:
import { streamText, convertToModelMessages } from 'ai';
import type { UIMessage } from 'ai';
export async function POST(req: Request) {
const { messages }: { messages: UIMessage[] } = await req.json();
const result = streamText({
model: 'openai/gpt-4o',
system: 'You are a helpful assistant.',
messages: convertToModelMessages(messages),
});
return result.toUIMessageStreamResponse();
}
3. useChat Hook - Client-Side Chat Interface
Purpose: Build interactive chat UIs with streaming support.
Signature:
import { useChat } from 'ai/react';
const {
messages,
sendMessage,
status,
stop,
regenerate,
setMessages,
error,
reload,
} = useChat({
api: '/api/chat',
onFinish?: (message) => { ... },
onError?: (error) => { ... },
});
Complete Example:
'use client';
import { useChat } from 'ai/react';
import { useState } from 'react';
export default function ChatPage() {
const { messages, sendMessage, status } = useChat();
const [input, setInput] = useState('');
const handleSubmit = (e: React.FormEvent) => {
e.preventDefault();
if (!input.trim()) return;
sendMessage({ text: input });
setInput('');
};
return (
<div>
<div>
{messages.map((message) => (
<div key={message.id}>
<strong>{message.role}:</strong>
{message.parts.map((part, index) =>
part.type === 'text' ? (
<span key={index}>{part.text}</span>
) : null
)}
</div>
))}
</div>
<form onSubmit={handleSubmit}>
<input
value={input}
onChange={(e) => setInput(e.target.value)}
placeholder="Type a message..."
disabled={status === 'streaming'}
/>
<button type="submit" disabled={status === 'streaming'}>
Send
</button>
</form>
</div>
);
}
4. Tool Calling / Function Calling
Purpose: Enable AI models to call functions with structured parameters.
Defining Tools:
import { tool } from 'ai';
import { z } from 'zod';
const weatherTool = tool({
description: 'Get the weather in a location',
inputSchema: z.object({
location: z.string().describe('The location to get the weather for'),
unit: z.enum(['C', 'F']).describe('Temperature unit'),
}),
execute: async ({ location, unit }) => {
return {
location,
temperature: 24,
unit,
condition: 'Sunny',
};
},
});
Using Tools with generateText/streamText:
import { streamText, convertToModelMessages, tool } from 'ai';
import { z } from 'zod';
import type { UIMessage } from 'ai';
export async function POST(req: Request) {
const { messages }: { messages: UIMessage[] } = await req.json();
const result = streamText({
model: 'openai/gpt-4o',
messages: convertToModelMessages(messages),
tools: {
getWeather: tool({
description: 'Get the weather for a location',
inputSchema: z.object({
city: z.string().describe('The city to get the weather for'),
unit: z.enum(['C', 'F']).describe('The unit to display the temperature in'),
}),
execute: async ({ city, unit }) => {
return `It is currently 24°${unit} and Sunny in ${city}!`;
},
}),
},
});
return result.toUIMessageStreamResponse();
}
Multi-Step Tool Calling:
const result = await generateText({
model: 'openai/gpt-4o',
tools: {
weather: weatherTool,
search: searchTool,
},
prompt: 'What is the weather in San Francisco and find hotels there?',
maxSteps: 5,
});
5. Text Embeddings
Purpose: Convert text into numerical vectors for semantic search, RAG, or similarity.
Signature:
import { embed } from 'ai';
import { openai } from '@ai-sdk/openai';
const result = await embed({
model: openai.textEmbeddingModel('text-embedding-3-small'),
value: 'Text to embed',
});
Return Value:
{
embedding: number[];
usage: { tokens: number };
response: RawResponse;
}
Example - Embedding API Route:
import { embed } from 'ai';
import { openai } from '@ai-sdk/openai';
export async function GET() {
const { embedding, usage } = await embed({
model: openai.textEmbeddingModel('text-embedding-3-small'),
value: 'sunny day at the beach',
});
return Response.json({ embedding, usage });
}
Batch Embeddings:
import { embedMany } from 'ai';
const { embeddings, usage } = await embedMany({
model: openai.textEmbeddingModel('text-embedding-3-small'),
values: [
'sunny day at the beach',
'rainy afternoon in the city',
'snowy mountain landscape',
],
});
6. Message Utilities
convertToModelMessages:
Converts UI messages from useChat into ModelMessage objects for AI functions.
import { convertToModelMessages } from 'ai';
import type { UIMessage } from 'ai';
export async function POST(req: Request) {
const { messages }: { messages: UIMessage[] } = await req.json();
const result = streamText({
model: 'openai/gpt-4o',
messages: convertToModelMessages(messages),
});
return result.toUIMessageStreamResponse();
}
7. Model Context Protocol (MCP) Integration
Purpose: Connect to external MCP servers for dynamic tool access.
Example:
import { experimental_createMCPClient, streamText } from 'ai';
import { StreamableHTTPClientTransport } from '@modelcontextprotocol/sdk/client/streamableHttp.js';
export async function POST(req: Request) {
const { prompt }: { prompt: string } = await req.json();
try {
const httpTransport = new StreamableHTTPClientTransport(
new URL('http://localhost:3000/mcp')
);
const httpClient = await experimental_createMCPClient({
transport: httpTransport,
});
const tools = await httpClient.tools();
const response = streamText({
model: 'openai/gpt-4o',
tools,
prompt,
onFinish: async () => {
await httpClient.close();
},
onError: async () => {
await httpClient.close();
},
});
return response.toTextStreamResponse();
} catch (error) {
return new Response('Internal Server Error', { status: 500 });
}
}
Key Points:
- Use
experimental_createMCPClient (note: experimental API)
- Always close the client in
onFinish and onError
- Tools are fetched dynamically with
httpClient.tools()
- Requires
@modelcontextprotocol/sdk package
Model Specification Patterns
String-Based (Recommended for v5)
model: 'openai/gpt-4o'
model: 'anthropic/claude-4-sonnet'
model: 'google/gemini-2.0-flash'
Function-Based (Legacy Support)
import { openai } from '@ai-sdk/openai';
import { anthropic } from '@ai-sdk/anthropic';
model: openai('gpt-4o')
model: anthropic('claude-4-sonnet')
Embedding Models
import { openai } from '@ai-sdk/openai';
openai.textEmbeddingModel('text-embedding-3-small')
openai.textEmbeddingModel('text-embedding-3-large')
TypeScript Best Practices
Type Imports
import type {
UIMessage,
ModelMessage,
ToolCall,
TokenUsage,
} from 'ai';
Strongly Typed Tools
import { tool } from 'ai';
import { z } from 'zod';
const myTool = tool({
description: 'My tool',
inputSchema: z.object({
param1: z.string(),
param2: z.number(),
}),
execute: async ({ param1, param2 }) => {
return { result: 'success' };
},
});
API Route Types
import type { UIMessage } from 'ai';
export async function POST(req: Request): Promise<Response> {
const { messages }: { messages: UIMessage[] } = await req.json();
}
Common Patterns
Pattern 1: Simple Chat Application
Client (app/page.tsx):
'use client';
import { useChat } from 'ai/react';
import { useState } from 'react';
export default function Chat() {
const { messages, sendMessage, status } = useChat();
const [input, setInput] = useState('');
return (
<div>
{messages.map((m) => (
<div key={m.id}>
<strong>{m.role}:</strong>
{m.parts.map((part, i) =>
part.type === 'text' ? <span key={i}>{part.text}</span> : null
)}
</div>
))}
<form onSubmit={(e) => {
e.preventDefault();
sendMessage({ text: input });
setInput('');
}}>
<input value={input} onChange={(e) => setInput(e.target.value)} />
<button disabled={status === 'streaming'}>Send</button>
</form>
</div>
);
}
Server (app/api/chat/route.ts):
import { streamText, convertToModelMessages } from 'ai';
import type { UIMessage } from 'ai';
export async function POST(req: Request) {
const { messages }: { messages: UIMessage[] } = await req.json();
const result = streamText({
model: 'openai/gpt-4o',
system: 'You are a helpful assistant.',
messages: convertToModelMessages(messages),
});
return result.toUIMessageStreamResponse();
}
Pattern 2: Chat with Tools
Server with tool calling:
import { streamText, convertToModelMessages, tool } from 'ai';
import { z } from 'zod';
import type { UIMessage } from 'ai';
export async function POST(req: Request) {
const { messages }: { messages: UIMessage[] } = await req.json();
const result = streamText({
model: 'openai/gpt-4o',
messages: convertToModelMessages(messages),
tools: {
getWeather: tool({
description: 'Get weather for a city',
inputSchema: z.object({
city: z.string(),
}),
execute: async ({ city }) => {
return { city, temp: 72, condition: 'Sunny' };
},
}),
searchWeb: tool({
description: 'Search the web',
inputSchema: z.object({
query: z.string(),
}),
execute: async ({ query }) => {
return { results: ['...'] };
},
}),
},
});
return result.toUIMessageStreamResponse();
}
Pattern 3: Non-Interactive Generation
import { generateText } from 'ai';
export async function POST(req: Request) {
const { text } = await req.json();
const result = await generateText({
model: 'anthropic/claude-4-sonnet',
system: 'You are a summarization expert.',
prompt: `Summarize this text:\n\n${text}`,
});
return Response.json({ summary: result.text });
}
Pattern 4: Semantic Search with Embeddings
import { embed } from 'ai';
import { openai } from '@ai-sdk/openai';
export async function POST(req: Request) {
const { query } = await req.json();
const { embedding } = await embed({
model: openai.textEmbeddingModel('text-embedding-3-small'),
value: query,
});
return Response.json({ embedding, results: [] });
}
Common Pitfalls and Solutions
Pitfall 1: NOT Using tool() Helper for Tools - ⚠️ CRITICAL
This is the most common and critical mistake. Always use tool() helper!
import { z } from 'zod';
tools: {
myTool: {
description: 'My tool',
parameters: z.object({
city: z.string(),
}),
execute: async ({ city }) => { ... },
},
}
import { tool } from 'ai';
import { z } from 'zod';
tools: {
myTool: tool({
description: 'My tool',
inputSchema: z.object({
city: z.string(),
}),
execute: async ({ city }) => { ... },
}),
}
Pitfall 2: Using v4 useChat API in v5
const { input, setInput, append } = useChat();
append({ content: 'Hello', role: 'user' });
const { sendMessage } = useChat();
const [input, setInput] = useState('');
sendMessage({ text: 'Hello' });
Pitfall 3: Accessing message.content instead of message.parts
<div>{message.content}</div>
<div>
{message.parts.map((part, i) =>
part.type === 'text' ? <span key={i}>{part.text}</span> : null
)}
</div>
Pitfall 4: Not Converting UIMessages for Model
const result = streamText({
model: 'openai/gpt-4o',
messages: messages,
});
const result = streamText({
model: 'openai/gpt-4o',
messages: convertToModelMessages(messages),
});
Pitfall 5: Forgetting MCP Client Cleanup
const httpClient = await experimental_createMCPClient({
transport: httpTransport,
});
const tools = await httpClient.tools();
const response = streamText({ model, tools, prompt });
return response.toTextStreamResponse();
const response = streamText({
model,
tools,
prompt,
onFinish: async () => {
await httpClient.close();
},
onError: async () => {
await httpClient.close();
},
});
Pitfall 6: Using Wrong Response Method
return result.toTextStreamResponse();
return result.toUIMessageStreamResponse();
return result.toTextStreamResponse();
Pitfall 7: Wrong Embedding Model Method
const { embedding } = await embed({
model: openai('text-embedding-3-small'),
value: 'text',
});
const { embedding } = await embed({
model: openai.textEmbeddingModel('text-embedding-3-small'),
value: 'text',
});
Migration Checklist (v4 → v5)
When migrating from v4 to v5, update:
Decision Guide
When implementing AI SDK features, ask:
-
Is this client-side or server-side?
- Client: Use
useChat hook
- Server: Use
generateText or streamText
-
Do I need streaming or non-streaming?
- Streaming chat:
streamText + toUIMessageStreamResponse()
- Non-streaming:
generateText
- Simple text stream:
streamText + toTextStreamResponse()
-
Do I need tool calling?
- Yes: Define tools with
tool() helper and inputSchema (zod)
- Pass tools object to
generateText or streamText
-
Am I using the correct message format?
- Client (useChat): Returns
UIMessage[] with parts property
- Server: Convert with
convertToModelMessages() to ModelMessage[]
- Render messages using
message.parts.map(...)
-
Is my model specification correct?
- Prefer string format:
'openai/gpt-4o'
- Function format also works:
openai('gpt-4o')
- Embeddings:
openai.textEmbeddingModel('text-embedding-3-small')
-
Do I need embeddings?
- Use
embed for single values
- Use
embedMany for batches
- Use
textEmbeddingModel() method
Quick Reference
| Task | Function | Key Parameters |
|---|
| Generate text | generateText() | model, prompt, system, tools |
| Stream text | streamText() | model, messages, tools, onFinish |
| Chat UI | useChat() | api, onFinish, onError |
| Tool calling | tool() | description, inputSchema, execute |
| Text embedding | embed() | model, value |
| Batch embedding | embedMany() | model, values |
| Message conversion | convertToModelMessages() | messages (UIMessage[]) |
| MCP integration | experimental_createMCPClient() | transport |
Additional Resources
When in doubt, check the official documentation:
Remember: AI SDK v5 uses string-based model specification, parts-based messages, sendMessage instead of append, and requires convertToModelMessages in API routes.