Agentic System Design
30 items across fundamentals, orchestration patterns, named case studies (MACRS, NVIDIA Eureka, ChainBuddy, WebVoyager, MuLan, OpenClaw), build-it-yourself implementations with Google ADK, and open design exercises. Each kind follows a consistent H2 template so writeups are scannable across topics.
Fundamentals
7 items What makes a system 'agentic' rather than a plain LLM call. The four-component reference architecture, memory types, the perception–reasoning–action loop, and the open problems no framework solves.
Fundamentals
7 items- What Is an AI Agent?
What makes a system 'agentic' vs a plain LLM call. Autonomy, tools, memory, and the perception–reasoning–action loop.
Concept Foundational - Agent Architecture Overview
The four-component reference architecture: model, tools, memory, instructions. How requests flow through each.
Concept Foundational - Agent Memory
Short-term working context vs long-term storage. Episodic, semantic, and procedural memory in agent systems.
Concept Foundational - Perception and Grounding
How agents take input — text, vision, audio, structured data — and ground it to actions in their environment.
Concept Foundational - Reasoning and Planning
Chain-of-thought, tree-of-thought, plan-then-execute. Where the agent's 'thinking' happens before it acts.
Concept Intermediate - Action and Tool Use
How agents act in the world. Function calls, code execution, API requests, and the safety boundary around each.
Concept Foundational - Key Challenges in Agentic Systems
Hallucination, long-horizon drift, cost overruns, evaluation difficulty, prompt injection, and the open problems no framework solves for you.
Concept Intermediate
Patterns
7 items The recurring orchestration patterns that show up across agent systems — ReAct loop, function calling, reflection, hierarchical planning, multi-agent shapes, human-in-the-loop, guardrails.
Patterns
7 items- The ReAct Loop
Reason, then act, then observe — the foundational interleaved-thought-and-tool-use pattern that powers most modern agents.
Pattern Foundational - Function Calling and Tool Use
Schema-typed tool calls as the agent's verb. Parallel calls, structured outputs, and the lifecycle of a tool invocation.
Pattern Foundational - Reflection and Self-Critique
Letting an agent review its own output before committing. Self-correction loops, judge-models, and the diminishing-returns curve.
Pattern Intermediate - Hierarchical Planning
High-level planner + low-level executor. When to decompose, how deep to go, and how to keep sub-plans aligned with the goal.
Pattern Intermediate - Multi-Agent Orchestration
Sequential, router, swarm, supervisor-worker. The four shapes of multi-agent coordination and what each costs.
Pattern Intermediate - Human-in-the-Loop
When and how to ask for human confirmation, feedback, or override. Designing the handoff so it's neither annoying nor unsafe.
Pattern Foundational - Guardrails and Safety
Pre-call validation, post-call filtering, content policies, action allowlists. The defense-in-depth pattern for agent safety.
Pattern Intermediate
Case Studies
6 items Deep-dives into named research and industry agent systems — MACRS, NVIDIA Eureka, ChainBuddy, WebVoyager, MuLan, OpenClaw. Concrete designs you can learn from.
Case Studies
6 items- MACRS — Multi-Agent Conversational Recommender
A multi-agent system for goal-directed conversational recommendations. Multi-agent act planning + user-feedback-aware reflection.
System Advanced - NVIDIA Eureka — LLM-Driven Reward Design
Coding LLMs that autonomously design and refine RL reward functions. Zero-shot generation, evolutionary search, reward reflection.
System Advanced - ChainBuddy — LLM Pipeline Generator
An agent that solves the 'blank page problem' for LLM-evaluation workflows. Requirement-gathering chat + multi-agent pipeline generation.
System Intermediate - WebVoyager — Multimodal Web Agent
A vision-and-text agent that navigates real websites. Multimodal ReAct loop, screenshot grounding, end-to-end task completion.
System Advanced - MuLan — Multimodal Diffusion Agent
An LLM-orchestrated approach to multi-object text-to-image generation. Planning, progressive generation, VLM-feedback control.
System Advanced - OpenClaw — Personal AI Assistant
A personal-assistant design: how to compose an agent that mixes calendar, mail, search, and reminders behind a single conversational surface.
System Intermediate
Implementations
5 items Build-it-yourself guides — wiring a Eureka-style reward loop and a multimodal web agent end-to-end using Google's Agent Development Kit (ADK).
Implementations
5 items- Agent Development Kit (ADK) Overview
Google's Agent Development Kit. What's in the box: agent primitives, tools, orchestration, evaluation harness, and the workflow it expects.
Implementation Foundational - Setting Up and Grounding an Agent
Wiring an agent to its environment — env vars, tool registration, prompt scaffolding, and the smallest workable hello-world loop.
Implementation Intermediate - Building a Eureka-Style Reward Loop with ADK
End-to-end implementation: reward generation, evaluation, selection, reflection, and human feedback — wired together as a closed loop.
Implementation Advanced - Building a Multimodal Web Agent with ADK
Playwright state management, screenshot-and-DOM grounding tools, and a multimodal ReAct loop assembled with ADK primitives.
Implementation Advanced - Loop Control and Exit Conditions
When to stop. Step budgets, success predicates, stagnation detection, escape-hatch handoffs — the under-loved half of every agent.
Implementation Intermediate
Design Exercises
5 items Open-ended design challenges — smart parking, AI hospital, self-improving web agent. Practice surfaces for translating intent into agent architecture.
Design Exercises
5 items- Design a Smart Parking Agent
A constrained-environment design exercise: turn a sensor + ticketing system into a proactive, intent-aware parking agent.
Exercise Foundational - Design a Multi-Agent Medical Diagnosis System
A safety-critical multi-agent design: triage, diagnosis, second-opinion, and uncertainty handling for a hospital setting.
Exercise Advanced - Design a Self-Improving Web Agent
How to evolve a WebVoyager-style agent over time: experience replay, failure mining, prompt-update loops, eval harness.
Exercise Advanced - Design Your First Agent
An open-ended exercise: pick a real workflow you do daily, decompose it, and design the agent that would automate it.
Exercise Foundational - Mock Interview — Agent System Design
A 45-minute mock loop for an agent-system-design interview. The prompt, the rubric, and the common follow-up questions.
Exercise Intermediate