You're reading from Building Agentic AI Systems Create intelligent, autonomous AI agents that can reason, plan, and adapt

Product type Paperback

Published in Apr 2025

Publisher Packt

ISBN-13 9781803238753

Length 288 pages

Edition 1st Edition

Concepts

GPT/LLMs

Authors (2):

Anjanava Biswas

Wrick Talukdar

View More author details

Table of Contents (17) Chapters

Preface

1. Part 1: Foundations of Generative AI and Agentic Systems

2. Chapter 1: Fundamentals of Generative AI FREE CHAPTER

3. Chapter 2: Principles of Agentic Systems

4. Chapter 3: Essential Components of Intelligent Agents

5. Part 2: Designing and Implementing Generative AI-Based Agents

6. Chapter 4: Reflection and Introspection in Agents

7. Chapter 5: Enabling Tool Use and Planning in Agents

8. Chapter 6: Exploring the Coordinator, Worker, and Delegator Approach

9. Chapter 7: Effective Agentic System Design Techniques

10. Part 3: Trust, Safety, Ethics, and Applications

11. Chapter 8: Building Trust in Generative AI Systems

12. Chapter 9: Managing Safety and Ethical Considerations

13. Chapter 10: Common Use Cases and Applications

14. Chapter 11: Conclusion and Future Outlook

15. Index

Why subscribe?

16. Other Books You May Enjoy

Agent memory architecture and context management

Memory architecture and context management are fundamental components that enable intelligent agents to maintain coherent interactions and make informed decisions based on past experiences and current context. This section explores the design principles and implementation strategies for creating effective memory systems and managing contextual information in agent-based systems. Agent memory architectures typically incorporate three distinct types of memory, each serving different purposes in the agent’s operation: short-term memory, long-term memory, and episodic memory. Let us discuss these memory architectures in detail.

Short-term memory (working memory)

Short-term memory, also known as working memory, serves as the agent’s immediate cognitive workspace. It temporarily holds and manages information relevant to the current interaction or task being processed. This type of memory is particularly crucial for maintaining conversation context, handling multi-step processes, and managing active user sessions. In our travel agent system, short-term memory is essential for tracking ongoing search parameters, maintaining the current state of a booking process, and remembering context-specific details that might influence immediate decisions.

For example, when a customer is searching for flights, the short-term memory would maintain details such as their current search criteria, recently viewed options, and any temporary preferences they’ve expressed during the current session. This information doesn’t need to be stored permanently but is critical for providing a coherent and personalized experience during active interaction. The temporary nature of this memory also helps in managing system resources efficiently, as the data is cleared once the session ends or the information becomes irrelevant.

For our travel agent system, a practical implementation of short-term memory might include the following Python class. This class defines the parameters required for an active real-time conversation such as customer_id, the session start timestamp, the current query in the conversation thread, and any specific preferences that are deduced from the user query. The update_context function is used to update the properties of current_interaction as the conversation progresses, keeping the short-term memory up to date with the current information. Since short-term memory is often ephemeral and session-specific, the clear_session function is used to remove and reset the state of current_session to prepare it for subsequent new sessions:

class WorkingMemory:
    def __init__(self):
        self.current_interaction = {
            'customer_id': None,
            'session_start': None,
            'current_query': None,
            'active_searches': [],
            'temporary_preferences': {}
        }
    def update_context(self, new_information):
        # Update current interaction context
        self.current_interaction.update(new_information)
    def clear_session(self):
        # Reset temporary session data
        self.__init__()

While short-term memory helps provide sufficient context for the intelligent agent to perform its task, there is often additional persistent information, as opposed to ephemeral information, that is important for the intelligent agent to achieve its goal. Let us take a deeper look at what long-term memory, also known as the knowledge base, entails.

Long-term memory (knowledge base)

Long-term memory functions as the agent’s persistent knowledge repository, storing information that remains relevant and valuable across multiple interactions and sessions. Unlike short-term memory, this type of storage is designed for data that needs to be preserved and accessed over extended periods. It serves as the foundation for the agent’s accumulated knowledge, learned patterns, and established relationships with customers.

Long-term memory is particularly crucial for maintaining consistency in customer service and enabling personalized interactions based on historical data. For instance, in our travel agent system, this would include storing customer preferences discovered over multiple bookings, maintaining records of past travel arrangements, and preserving knowledge about destinations, seasonal patterns, and service provider relationships. This persistent storage allows the agent to make informed decisions based on historical patterns and provide personalized service without requiring customers to repeat their preferences in every interaction.

The implementation of long-term memory typically requires careful consideration of data organization, retrieval efficiency, and update mechanisms to ensure that the stored information remains accurate and accessible. In our travel agent system, this may include the following:

Customer profiles and preferences:

class CustomerMemory:
    def __init__(self):
        self.profiles = {
            'preferences': {},
            'travel_history': [],
            'feedback_history': [],
            'special_requirements': {},
            'loyalty_status': None
        }
    def update_profile(self, customer_id, new_data):
        # Merge new information with existing profile
        self.profiles[customer_id] = {
            **self.profiles.get(customer_id, {}),
            **new_data
        }

Travel knowledge base:

class TravelKnowledge:
    def __init__(self):
        self.destination_info = {}
        self.seasonal_patterns = {}
        self.service_providers = {}
        self.travel_regulations = {}
    def update_knowledge(self, category, key, value):
        # Update specific knowledge category
        getattr(self, category)[key] = value

Short-term and long-term memory serves as the important cornerstones of intelligent agentic systems. However, a third type of memory, known as episodic memory, has emerged, especially for conversational interfaces such as chatbots. This type of memory helps LLMs and intelligent agents further refine their actions and provide prescriptive outputs to the user.

Episodic memory (interaction history)

Episodic memory represents a specialized form of memory that captures and stores specific interactions, events, and their outcomes as discrete episodes. This type of memory enables the agent to learn from past experiences and use historical interactions to inform future decisions. Unlike general long-term memory, episodic memory focuses on the temporal sequence and context of events, making it particularly valuable for understanding patterns in customer behavior and service outcomes.

In the context of our travel agent system, episodic memory serves multiple critical functions. It helps identify successful booking patterns, understand common customer journey paths, and recognize situations that have led to either positive outcomes or challenges in the past. For example, if a customer previously encountered issues with layover times in specific airports, the agent can use this episodic information to avoid similar situations in future bookings. This memory type also enables the agent to provide more contextually relevant responses by referencing past interactions and their outcomes.

The implementation of episodic memory requires careful consideration of how to structure and store interaction records in a way that facilitates efficient retrieval and pattern recognition. For our travel agent system, this may include the following:

class EpisodicMemory:
    def __init__(self):
        self.interaction_history = []
    def record_interaction(self, interaction_data):
        # Add timestamp and store interaction
        interaction_data['timestamp'] = datetime.now()
        self.interaction_history.append(interaction_data)
    def retrieve_relevant_episodes(self, context):
        # Find similar past interactions
        return [episode for episode in
               self.interaction_history
                if self._is_relevant(episode, context)]

Having established the core memory systems, we now turn our attention to how these different types of memory work together in practice. The agent needs sophisticated mechanisms to manage the flow of information between these memory systems and ensure that the right information is available at the right time. This brings us to two critical components: context management and decision-making integration.

Context management

Effective context management ensures that the agent maintains appropriate awareness of the current situation and relevant historical information. Imagine our travel booking agent assisting a customer with planning a multi-city business trip to Tokyo and Singapore. The agent must maintain awareness of various contextual elements: the customer’s corporate travel policy limiting flight costs to $2,000, their preference for morning flights due to scheduled meetings, and the need to coordinate hotel bookings within walking distance of specific office locations. As the booking process unfolds, the agent continuously references and updates this information while navigating between flight searches, hotel availability, and meeting schedule constraints. This real-world scenario demonstrates why robust context management is essential for handling complex, multi-step travel arrangements. Effective context management ensures that the agent maintains appropriate awareness of the current situation and relevant historical information. This involves several key components:

Context hierarchy: The context management system should maintain different levels of context:
- Global context:
  - System-wide settings and constraints
  - Current operational status
  - Global travel alerts and advisories
- Session context:
  - Current customer interaction state
  - Active searches and queries
  - Temporary preferences and constraints
- Task context:
  - Specific booking details
  - Current step in multi-step processes
  - Related bookings and dependencies
Context switching: Context switching is a critical capability that allows the agent to smoothly transition between different operational contexts while maintaining coherence and continuity. This process involves several key aspects:
- Context preservation:
  - Saving the current state before switching
  - Maintaining a history of context changes
  - Ensuring no critical information is lost during transitions
- Context restoration:
  - Retrieving previous contexts when needed
  - Rebuilding the operational environment
  - Reestablishing relevant connections and states
- Context merging:
  - Combining information from multiple contexts
  - Resolving conflicts between different contexts
  - Maintaining consistency across context changes

The sophisticated interplay between memory systems and context management ultimately serves one primary purpose: enabling intelligent decision-making. By maintaining awareness of both historical data and current context, the agent can make more informed and effective decisions. Let’s examine how these components come together to support the agent’s decision-making processes.

Integration with decision-making

The memory architecture and context management system must effectively support the agent’s decision-making processes through several key mechanisms:

Information retrieval: The system must efficiently gather and synthesize relevant information from various memory components to support decision-making. This includes the following:
- Accessing customer history and preferences
- Retrieving similar past cases and their outcomes
- Combining current context with historical data
- Filtering and prioritizing relevant information
Pattern recognition: Pattern recognition capabilities enable the agent to identify relevant patterns and trends that can inform decisions:
- Analyzing historical interaction patterns
- Identifying successful booking patterns
- Recognizing potential issues based on past experiences
- Detecting seasonal trends and preferences
Decision optimization: The decision-making process should incorporate multiple factors and optimize outcomes based on the following:
- Weighted evaluation of different options
- Consideration of multiple constraints
- Balance between customer preferences and system requirements
- Risk assessment and mitigation strategies

The effective integration of memory architecture and context management systems enables agents to maintain coherent interactions, learn from past experiences, and make more informed decisions. By carefully designing these components and ensuring they work together seamlessly, agents can provide a more personalized and effective service while maintaining consistency across interactions.