On October 6 at its 2025 Dev Day, OpenAI unveiled AgentKit—a powerful new toolkit for building, deploying, and optimizing AI agents. Released alongside ChatGPT Apps and the general launch of Codex, it marks a major step in turning ChatGPT into a full development platform.
AgentKit brings together everything developers need: visual workflow design, real-time chat UIs, secure data connectors, and built-in evaluation tools. It replaces the need for custom orchestration, manual prompt tuning, and frontend work.
But perhaps most importantly, AgentKit positions OpenAI as a direct challenger to traditional no-code platforms like Zapier, n8n, and Make. While those tools helped define automation for the SaaS era, AgentKit aims to redefine it for the AI era—introducing not just task execution, but contextual understanding, dynamic decision-making, and seamless human-AI interaction.
Introducing AgentKit—build, deploy, and optimize agentic workflows.
— OpenAI Developers (@OpenAIDevs) October 6, 2025
💬 ChatKit: Embeddable, customizable chat UI
👷 Agent Builder: WYSIWYG workflow creator
🛤️ Guardrails: Safety screening for inputs/outputs
⚖️ Evals: Datasets, trace grading, auto-prompt optimization pic.twitter.com/pGgNHKOvj3
Building Agents Was Messy
Before AgentKit, creating AI agents meant juggling a messy and fragmented toolset. Developers had to manually orchestrate logic, fine-tune prompts, write custom connectors for every integration, and build UI components from scratch—all without built-in versioning, safety guardrails, or evaluation tools.
Even general automation platforms like Zapier or n8n weren’t built for this. They helped with triggers and tasks, but not with dynamic reasoning, multi-agent workflows, or real-time chat interfaces. Developers were left cobbling together scripts, APIs, dashboards, and models just to get something usable.
For teams at companies like Klarna or Clay, launching production-ready agents could take weeks or months. Iterating was slow. Debugging was complex. And coordinating between product, legal, and engineering teams often led to confusion and delays.
The AgentKit Is a Full AI Agent Stack
AgentKit is OpenAI’s unified stack for building, deploying, and optimizing AI agents. It replaces fragmented tools with an integrated system covering workflow design, data integration, chat interfaces, and performance evaluation.
Instead of juggling orchestration scripts, frontend builds, and manual evals, developers get one clean, AI-native toolkit.
Visual Workflow Design
At the heart of AgentKit is Agent Builder, a drag-and-drop interface for constructing multi-agent workflows. Each block on the canvas can represent a model, tool, rule, guardrail, or branching logic, making it easy to map out complex flows without touching orchestration code.
Developers can visually connect steps like classification, retrieval, user approval, and conditional branching. Built-in features like preview runs, inline evaluations, and version tracking make iteration fast and safe. You can run a test, trace output, and deploy—all in one place.
Security is baked in. Guardrails can be configured to detect jailbreak attempts, redact sensitive data, or flag unusual behavior. Crucially, Agent Builder’s interface allows cross-functional collaboration with product teams, legal, and operations—turning agent design from a siloed effort into a transparent, shared process.
“What once took months of backend work can now be done in a single sprint.”

Connector Registry
AgentKit also introduces the Connector Registry, a centralized hub for managing how your agents access third-party systems and data sources. This is especially important for enterprise teams operating across multiple workspaces and tools.
Admins can define and enforce access control policies, manage auth flows, and track usage—all from one place. This reduces integration complexity and ensures governance standards are met across departments.
ChatKit
Deploying agents into real products typically means building a frontend from scratch. ChatKit removes that step.
ChatKit is an embeddable UI toolkit that lets developers drop a production-ready chat interface into their app or website in minutes. It supports:
- Real-time streaming responses
- Threaded conversations
- Typing indicators and model “thinking” states
- Full theming and branding customization
Whether you’re adding an onboarding assistant to a SaaS product or a customer support bot to your help center, ChatKit makes it feel seamless. Teams like Canva and HubSpot have already used ChatKit to launch agents in under an hour—saving weeks of frontend work and design iterations.

Native Optimization Tools
Reliable agents require more than working logic—they need to be measured, tested, and improved over time. That’s where AgentKit’s Evals system comes in.
Developers can build structured evaluation datasets, run step-by-step trace grading to find failure points, and use the Prompt Optimizer to generate improved versions automatically. Evaluations can be applied not only to OpenAI models, but to third-party LLMs as well—making it model-agnostic.
For teams looking to push performance further, Reinforcement Fine-Tuning (RFT) is available on select models like o4-mini, with GPT‑5 support currently in beta. RFT allows developers to define custom success criteria, train models to choose the right tools at the right moment, and adjust outputs for highly specialized use cases.
Pricing & Availability
AgentKit is included with standard OpenAI API pricing—no additional tiers or hidden costs. Here is what’s available today:
| Tool | Status |
|---|---|
| Agent Builder | Beta |
| ChatKit | General Availability |
| Evals Upgrades | General Availability |
| Connector Registry | Beta (Enterprise only) |
Coming Soon
- Standalone Workflows API
- Agent deployment directly inside ChatGPT
- More granular permissions and monetization controls
- A public directory of published agents
AgentKit is the new foundation for how OpenAI sees the future of building with AI: streamlined, production-ready, and easy to integrate.
Final Thoughts
OpenAI is pushing the agent world from proof-of-concept to production. By bundling workflow design, data connectors, chat UI, and evaluation in a single stack, AgentKit gives small teams the leverage that only well-funded tech giants had a year ago. For companies already using ChatGPT or the OpenAI API, the upgrade path is almost friction-free—just drop your logic onto the canvas and hit deploy.
This puts traditional no-code leaders such as Zapier, Make, and n8n in an uncomfortable spot. Their drag-and-drop flows handle triggers and tasks well, but they lack built-in reasoning and evaluation for LLMs. If OpenAI wins developer mindshare for AI-native automation, the market may split: rule-based zaps on one side, adaptive agents on the other. Expect acquisitions, partnerships, or fast follow-ups as incumbents race to add deeper AI hooks.
Enterprises also get something they have long demanded: centralized governance. The Connector Registry and Guardrails let security, legal, and engineering work from the same playbook. That makes it easier for regulated industries—finance, healthcare, public sector—to roll out chat interfaces that talk to real corporate data without opening new compliance holes.
Looking ahead, success will hinge on two things. First, how quickly OpenAI can widen Connector Registry access and finalize the Workflows API so agents can live outside the OpenAI ecosystem. Second, how well the community addresses model safety and reliability at scale. If those pieces fall into place, AgentKit could become the Rosetta Stone that translates today’s LLM hype into everyday business automation.




