Building Agents with OpenAI Agents SDK

Name: Building Agents with OpenAI Agents SDK | Create practical AI agents and agentic systems through hands-on projects
Brand: De Gruyter
Price: 32.39 EUR
Availability: OnlineOnly

Create practical AI agents and agentic systems through hands-on projects

Henry Habib(Autor*in)

De Gruyter (Verlag)

1. Auflage

Erschienen am 10. Oktober 2025

276 Seiten

E-Book

ePUB mit Adobe-DRM

Systemvoraussetzungen

E-Book

ePUB ohne DRM

Systemvoraussetzungen

978-1-80611-200-5 (ISBN)

ab 32,39 €

Als Download verfügbar

Merkliste: siehe Preise

Beschreibung

Master OpenAI's Agents SDK to design production-ready AI agents and agentic systems that solve real-world problems with practical guidance Get your book with a free PDF, AI Assistant, and Next-Gen ReaderKey Features - Gain a complete understanding of the OpenAI Agents SDK features including models, tools, memory, guardrails, orchestration, tracing, and multi-agent systems
- Progressively build AI agents through several hands-on projects that evolve from a simple workflow to a complex multi-agent system
- Implement advanced agent capabilities such as RAG, MCPs, administration, workflow integration, and much more
Book DescriptionEveryone's talking about AI agents, but how do you build one that works in the real world? Not a toy demo, but an agent that solves real problems, saves time, and integrates into workflows. With vague frameworks, fragmented tooling, and endless hype, most developers are left without a clear path. The hardest part isn't technical; it is knowing where to start. This book gives you that starting point. It's a complete guide to building intelligent AI agents and agentic systems using the official OpenAI Agents SDK. It begins by grounding you in the core concepts, design principles, and architecture of AI agents, how they differ from other traditional systems, their advantages, and why that matters. Through practical step-by-step projects, you'll master every feature of the SDK-tools, memory, RAG, multi-agent orchestration, tracing, handoffs, and more-while contributing to an end-to-end agent system that grows in complexity. Projects include a custom support agent, invoice and inventory assistant, health advisor, sales trainer, and data analyst, giving you production-ready skills. By the end, you'll know how to design, build, and deploy agentic systems that interact with APIs, query databases, hand off to external systems, and drive meaningful outcomes. You won't just understand AI agents; you'll be ready to ship them.What you will learn - Understand the core principles of AI agents and why they matter
- Use the OpenAI Agents SDK to build real, working agents from scratch
- Design both single-agent and multi-agent systems
- Integrate external tools, APIs, and data sources to extend agent capabilities
- Add memory and stateful context to your agents so they can "remember" and adapt over time
- Coordinate agent-to-agent handoff orchestrations
- Secure, monitor, and scale agents in production
Who this book is forThis book is for LLM engineers, developers, tech-savvy professionals, analysts, and consultants who want to build practical agentic AI solutions using the OpenAI SDK. A basic understanding of Python and AI concepts is recommended, but no prior experience with agents is required. Whether you're exploring agents for the first time or want to deepen your skills with hands-on projects, this guide provides structured, production-ready knowledge.

Alle Preise

Weitere Details

Inhalt

Cover
Title Page
Copyright Page
Contributors
Table of Contents
Preface
Your Book Comes with Exclusive Perks - Here's How to Unlock Them
Part 1: AI Agents
Chapter 1: Introduction to AI Agents
Technical requirements
Overview of AI agents
What is an AI agent?
Understanding AI agents with a simple analogy
Strengths and weaknesses of AI agents versus traditional systems
Practical applications of AI agents
Productivity gains
Better interactivity
New businesses
Build methodology of AI agents
Anatomy of an AI agent
Model
Tooling interface
Memory and knowledge
Design patterns
CoT
ReAct (Reasoning + Acting)
Planner-execution
Hierarchical/multi-agent
Summary
Chapter 2: Introduction to OpenAI Agents SDK
Technical requirements
Design features of OpenAI Agents SDK
Framework for building AI agents
Multi-agent orchestration
Minimal abstraction
Pythonic, extensible, and open sourced
Core primitives
Agent
Runner
Tools
Handoff
Guardrails
Tracing
Summary
Chapter 3: Environment Setup and Developing Your First Agent
Technical requirements
Environment setup
Python version and dependencies
Project directory, virtual environment, and installations
Registering for OpenAI API and setting up the API key
Verifying the environment setup
Alternative methods: Google Colab
Development prerequisites
Python functions architecture
Python asynchronous programming
Python Pydantic data validation
Developing your first AI Agent
A simple customer service agent
Adding a tool
Adding a handoff
Summary
Part 2: OpenAI Agents SDK
Chapter 4: Agent Tools and MCPs
Technical requirements
Using custom tools with Python functions
Defining a new tool
Agent and tool behavior
Tool choice
Tool use behavior
Complex tool inputs with Pydantic
Examples of custom tools
Arithmetic computation tool
External API call tool
Database query tool
Chained tool calls
OpenAI hosted tools
WebSearchTool
FileSearchTool
ImageGenerationTool
CodeInterpreterTool
Handoff versus agent-as-tool patterns
Functionality
MCP
What is MCP?
Adding an MCP server as a tool
Summary
Chapter 5: Memory and Knowledge
Technical requirements
Working memory
Managing inputs and responses
Chat conversations
Conversation management with Sessions
Managing large conversation threads
Sliding message window
Message summarization
Long-term memory
Persistent message logs
Structured memory recall
Training knowledge
Retrieved knowledge
Unstructured data
Document ingestion
Retrieval
Using vector stores and FileSearchTool in the Agents SDK
Limitations
Summary
Chapter 6: Multi-Agent Systems and Handoffs
Technical requirements
Multi-agent orchestrations
Deterministic orchestration
Dynamic orchestration
Handoffs in OpenAI Agents SDK
Introduction to handoffs
Multi-agent switching
Customizing handoffs
Handoff prompting
Multi-agent patterns
Centralized system
Hierarchical system
Decentralized system
Swarm system
Summary
Chapter 7: Model and Context Management
Technical requirements
Model management
Adjusting the underlying model
Adjusting the model settings
Third-party models
Context management
Local context
Summary
Chapter 8: Agent System Management
Technical requirements
Agent visualization
Guardrails
Input guardrails
Output guardrails
Logging, tracing, and observability
Custom traces and spans
Grouping multiple traces and spans together
Disabling traces
Agent testing
End-to-end testing
Unit testing
Summary
Part 3: Build AI Agents
Chapter 9: Building AI Agents and Agentic Systems
Technical requirements
Building a customer service employee AI agent
Setting up the database
Setting up a vector store
Creating a function tool to query data
Creating a vector store search tool
Creating an input guardrail
Creating a retention agent
Creating a customer service agent
Building the runner
Testing the agent
Orchestrating an automated multi-agent workflow
Setting up a customer database
Setting up the transcripts JSON
Creating function tools to retrieve data and search the web
Creating the customer research agent
Creating the email creation agent
Orchestrating the workflow
Testing the workflow
Summary
Packt Page
Other Books You May Enjoy
Index
Blank Page

1 Introduction to AI Agents

AI agents are changing the way we work. Software has typically created deterministic (if X, then Y) and rigid systems that cannot address ambiguity or adapt to different goals - but this is changing. With the advancements of large language models (LLMs), intelligent systems are being created that can independently reason through steps and take actions to complete a goal. These AI agents are taking a larger share of work previously thought only a human could do, and it's just beginning.

By the end of this book, you will become a master at creating AI agents through OpenAI Agents SDK. The best way to learn this is to get your hands dirty and start building AI agent systems using that framework. Before we do this, however, we need to start at the most basic level, which is answering the question, "What is an AI agent?".

This chapter goes through everything you need to know to answer that question and, more importantly, lays the foundation we'll build in the rest of the book. We will explain exactly what an AI agent is and how it differs from traditional systems. This is important as many readers often confuse AI agents with sophisticated applications, such as chatbots or fraud detection systems. It's important to understand how AI agent systems work before we start building them. We will explore AI agents' practical applications beyond productivity. Finally, we will go through the different design patterns and frameworks available when building an AI agent, and understand why OpenAI Agents SDK is the pragmatic choice for most production systems.

Here is what we will cover in this first chapter:

Overview of the AI agent system and its strengths and weaknesses compared to more traditional systems
Practical applications of AI agents
How AI agents are built, by understanding their anatomy and different design/framework patterns used to build them

By the end of this opening chapter, we will have a strong mental blueprint for how every real-world AI agent is assembled, which will serve as our compass for when we start building our own.

Technical requirements

This chapter will be an overview of AI agents from a theoretical point of view to set a good foundation before we start building them. As a result, we will not be writing any code or developing any applications in this chapter. However, to follow along and complete the exercises and projects discussed throughout the rest of the book, make sure you have the following set up in your development environment:

Operating system: Windows 10/11, macOS, or Linux-based distribution (Ubuntu recommended).
Python version: Python 3.8 or later. You can verify your Python version by running python --version in your terminal or Command Prompt.
OpenAI account: Sign up at https://platform.openai.com/signup.
OpenAI API key: Obtained by creating an account with OpenAI. You will require this to utilize OpenAI Agents SDK.
Code editor: VS Code, PyCharm, or any IDE/editor you prefer.

Throughout this book, practical examples and the complete code from each chapter will be made available via the accompanying GitHub repository at https://github.com/PacktPublishing/Building-Agents-with-OpenAI-Agents-SDK.

You are encouraged to clone the repository, reuse and adapt the provided code samples, and refer to it as needed while progressing through the chapters.

Overview of AI agents

Before exploring AI agents in depth, we must first establish an intuitive understanding of what an AI agent actually is, how it fundamentally differs from traditional software, and what advantages and disadvantages this brings. This is difficult as there are varying definitions that often evolve with technological advancements. By clearly defining the key concepts upfront - including its benefits such as intelligent autonomy, reasoning abilities, and adaptive problem-solving - we can set the stage for understanding its practical applications and building approaches.

What is an AI agent?

An AI agent is an intelligent system that can operate independently to accomplish a specific goal by perceiving the world around it and taking action. Key distinguishing features of an AI agent include its ability to think and reason from a broad and sometimes ambiguous goal, its ability to create a plan to accomplish that goal, and its ability to autonomously complete that goal using a set of tools at its disposal that interact with the world.

This is in direct contrast to other conventional software systems that are deterministic (i.e., they follow a strict set of instructions based on a predefined plan) and cannot reason if situations outside of that plan are encountered. AI agents, on the other hand, can observe their environment, reason about what needs to be done, and act upon it in a continuous manner.

AI agents achieve this by combining the intelligence and reasoning abilities of LLMs with actions through standardized API calls. Let's explore the concepts and strengths of AI agents through a simple analogy to cement our understanding and differentiate it from classical software automation frameworks.

Understanding AI agents with a simple analogy

Imagine you are the head chef of a five-star restaurant, and you need to train two junior chefs, Carlos and Adam. Carlos is like a conventional automation software system or model, whereas Adam is like an AI agent. The way you would train these two chefs and the way that these two chefs operate are completely different.

Carlos requires you to teach him exactly what to do to prepare every dish. If you're teaching him to make an omelet, you must teach him how to open the fridge, take an egg, turn on the stove, pour some oil, crack the egg, and so on. Each step must be meticulously defined and shown to Carlos. When asked to make an omelet, Carlos performs the task exactly as-is, to perfection.

Adam works a different way, more like a human. Instead of giving him predefined steps, you show him how to perform actions around the kitchen - this is how you grab ingredients from the fridge, this is how you operate a stove, these are the basics of gastronomy, and so on. When asked to make an omelet, Adam relies on his reasoning ability and the set of tools/knowledge he's been given to accomplish that task, rather than following predefined steps.

Both Carlos and Adam are amazing chefs but have different strengths and weaknesses. In particular, Adam can embrace complexity and ambiguity. Because he can reason and is taught how to perform general actions, he can cook more than just an omelet - he can theoretically cook all kinds of foods as they all use the same actions.

This acts as the perfect analogy between AI agents and classical automation software/models. In short, the intelligent autonomy afforded to an AI agent enables it to perform a diverse set of ambiguous tasks that just cannot be replicated.

Note

It's important to mention that intelligent autonomy comes with the need for safeguards. An autonomous agent might make a poor decision if its "brain" (the AI model) is misinformed. We will later discuss how to guide and constrain agents (through prompt instructions and guardrails) to ensure their autonomy is exercised responsibly. The key takeaway here is that AI agents bring a level of smart, goal-directed independence that sets them apart from traditional automated systems.

Strengths and weaknesses of AI agents versus traditional systems

The preceding analogy describes the key differences and advantages that AI agents have over other systems in addition to their ability to embrace complexity. Adam has goal-directed autonomy, which enables him to cook more than just an omelet; he can make scrambled eggs, poached eggs, and even sunny-side-up eggs. In fact, Adam can create new/novel creations that he has not been explicitly trained on as long as his set of actions is sufficient to perform that task. Adam can also complete tasks in another order if appropriate.

Adam exhibits reasoning, which means he can perform adaptive problem-solving, which enables him to do the following, which would be impossible for Carlos:

Vary his cooking style to meet customer requests - Adam can cook an omelet more or less runny because he knows that leaving food on the stovetop for longer will make them more dry.
If there is an ingredient missing, Adam can compromise and see whether there are any substitutions that he can make. He can handle real-world ambiguity and thrive on it.

Carlos would find these tasks impossible as he has been taught and can only cook one single way and cannot reason otherwise. If there are any externalities that prevent him from opening the fridge or turning on the stove, Carlos cannot proceed and stalls, whereas Adam could adapt.

There are, however, weaknesses with the AI agent model that, for certain use cases, may be so large and impactful that they are not the best options. Adam's brain...

Systemvoraussetzungen

Dateiformat: ePUB
Kopierschutz: Adobe-DRM (Digital Rights Management)

Systemvoraussetzungen:

Computer (Windows; MacOS X; Linux): Installieren Sie bereits vor dem Download die kostenlose Software Adobe Digital Editions (siehe E-Book Hilfe).
Tablet/Smartphone (Android; iOS): Installieren Sie bereits vor dem Download die kostenlose App Adobe Digital Editions oder die App PocketBook (siehe E-Book Hilfe).
E-Book-Reader: Bookeen, Kobo, Pocketbook, Sony, Tolino u.v.a.m. (nicht Kindle)

Das Dateiformat ePUB ist sehr gut für Romane und Sachbücher geeignet – also für „fließenden” Text ohne komplexes Layout. Bei E-Readern oder Smartphones passt sich der Zeilen- und Seitenumbruch automatisch den kleinen Displays an.
Mit Adobe-DRM wird hier ein „harter” Kopierschutz verwendet. Wenn die notwendigen Voraussetzungen nicht vorliegen, können Sie das E-Book leider nicht öffnen. Daher müssen Sie bereits vor dem Download Ihre Lese-Hardware vorbereiten.

Bitte beachten Sie: Wir empfehlen Ihnen unbedingt nach Installation der Lese-Software diese mit Ihrer persönlichen Adobe-ID zu autorisieren!

Weitere Informationen finden Sie in unserer E-Book Hilfe.

Dateiformat: ePUB
Kopierschutz: ohne DRM (Digital Rights Management)

Systemvoraussetzungen:

Computer (Windows; MacOS X; Linux): Verwenden Sie eine Lese-Software, die das Dateiformat ePUB verarbeiten kann: z.B. Adobe Digital Editions oder FBReader – beide kostenlos (siehe E-Book Hilfe).
Tablet/Smartphone (Android; iOS): Installieren Sie bereits vor dem Download die kostenlose App Adobe Digital Editions oder die App PocketBook (siehe E-Book Hilfe).
E-Book-Reader: Bookeen, Kobo, Pocketbook, Sony, Tolino u.v.a.m.

Das Dateiformat ePUB ist sehr gut für Romane und Sachbücher geeignet – also für „glatten” Text ohne komplexes Layout. Bei E-Readern oder Smartphones passt sich der Zeilen- und Seitenumbruch automatisch den kleinen Displays an.
Ein Kopierschutz bzw. Digital Rights Management wird bei diesem E-Book nicht eingesetzt.

Weitere Informationen finden Sie in unserer E-Book Hilfe.

Als PDF speichern Als Link merken