Agentic System Design

30 items across fundamentals, orchestration patterns, named case studies (MACRS, NVIDIA Eureka, ChainBuddy, WebVoyager, MuLan, OpenClaw), build-it-yourself implementations with Google ADK, and open design exercises. Each kind follows a consistent H2 template so writeups are scannable across topics.

11 Foundational 11 Intermediate 8 Advanced 5 topics RSS

Kind: Topic: Level:

Fundamentals

7 items

What makes a system 'agentic' rather than a plain LLM call. The four-component reference architecture, memory types, the perception–reasoning–action loop, and the open problems no framework solves.

What Is an AI Agent?
What makes a system 'agentic' vs a plain LLM call. Autonomy, tools, memory, and the perception–reasoning–action loop.

Concept Foundational
Agent Architecture Overview
The four-component reference architecture: model, tools, memory, instructions. How requests flow through each.

Concept Foundational
Agent Memory
Short-term working context vs long-term storage. Episodic, semantic, and procedural memory in agent systems.

Concept Foundational
Perception and Grounding
How agents take input — text, vision, audio, structured data — and ground it to actions in their environment.

Concept Foundational
Reasoning and Planning
Chain-of-thought, tree-of-thought, plan-then-execute. Where the agent's 'thinking' happens before it acts.

Concept Intermediate
Action and Tool Use
How agents act in the world. Function calls, code execution, API requests, and the safety boundary around each.

Concept Foundational
Key Challenges in Agentic Systems
Hallucination, long-horizon drift, cost overruns, evaluation difficulty, prompt injection, and the open problems no framework solves for you.

Concept Intermediate

Patterns

7 items

The recurring orchestration patterns that show up across agent systems — ReAct loop, function calling, reflection, hierarchical planning, multi-agent shapes, human-in-the-loop, guardrails.

The ReAct Loop
Reason, then act, then observe — the foundational interleaved-thought-and-tool-use pattern that powers most modern agents.

Pattern Foundational
Function Calling and Tool Use
Schema-typed tool calls as the agent's verb. Parallel calls, structured outputs, and the lifecycle of a tool invocation.

Pattern Foundational
Reflection and Self-Critique
Letting an agent review its own output before committing. Self-correction loops, judge-models, and the diminishing-returns curve.

Pattern Intermediate
Hierarchical Planning
High-level planner + low-level executor. When to decompose, how deep to go, and how to keep sub-plans aligned with the goal.

Pattern Intermediate
Multi-Agent Orchestration
Sequential, router, swarm, supervisor-worker. The four shapes of multi-agent coordination and what each costs.

Pattern Intermediate
Human-in-the-Loop
When and how to ask for human confirmation, feedback, or override. Designing the handoff so it's neither annoying nor unsafe.

Pattern Foundational
Guardrails and Safety
Pre-call validation, post-call filtering, content policies, action allowlists. The defense-in-depth pattern for agent safety.

Pattern Intermediate

Case Studies

6 items

Deep-dives into named research and industry agent systems — MACRS, NVIDIA Eureka, ChainBuddy, WebVoyager, MuLan, OpenClaw. Concrete designs you can learn from.

MACRS — Multi-Agent Conversational Recommender
A multi-agent system for goal-directed conversational recommendations. Multi-agent act planning + user-feedback-aware reflection.

System Advanced
NVIDIA Eureka — LLM-Driven Reward Design
Coding LLMs that autonomously design and refine RL reward functions. Zero-shot generation, evolutionary search, reward reflection.

System Advanced
ChainBuddy — LLM Pipeline Generator
An agent that solves the 'blank page problem' for LLM-evaluation workflows. Requirement-gathering chat + multi-agent pipeline generation.

System Intermediate
WebVoyager — Multimodal Web Agent
A vision-and-text agent that navigates real websites. Multimodal ReAct loop, screenshot grounding, end-to-end task completion.

System Advanced
MuLan — Multimodal Diffusion Agent
An LLM-orchestrated approach to multi-object text-to-image generation. Planning, progressive generation, VLM-feedback control.

System Advanced
OpenClaw — Personal AI Assistant
A personal-assistant design: how to compose an agent that mixes calendar, mail, search, and reminders behind a single conversational surface.

System Intermediate

Implementations

5 items

Build-it-yourself guides — wiring a Eureka-style reward loop and a multimodal web agent end-to-end using Google's Agent Development Kit (ADK).

Agent Development Kit (ADK) Overview
Google's Agent Development Kit. What's in the box: agent primitives, tools, orchestration, evaluation harness, and the workflow it expects.

Implementation Foundational
Setting Up and Grounding an Agent
Wiring an agent to its environment — env vars, tool registration, prompt scaffolding, and the smallest workable hello-world loop.

Implementation Intermediate
Building a Eureka-Style Reward Loop with ADK
End-to-end implementation: reward generation, evaluation, selection, reflection, and human feedback — wired together as a closed loop.

Implementation Advanced
Building a Multimodal Web Agent with ADK
Playwright state management, screenshot-and-DOM grounding tools, and a multimodal ReAct loop assembled with ADK primitives.

Implementation Advanced
Loop Control and Exit Conditions
When to stop. Step budgets, success predicates, stagnation detection, escape-hatch handoffs — the under-loved half of every agent.

Implementation Intermediate

Design Exercises

5 items

Open-ended design challenges — smart parking, AI hospital, self-improving web agent. Practice surfaces for translating intent into agent architecture.

Design a Smart Parking Agent
A constrained-environment design exercise: turn a sensor + ticketing system into a proactive, intent-aware parking agent.

Exercise Foundational
Design a Multi-Agent Medical Diagnosis System
A safety-critical multi-agent design: triage, diagnosis, second-opinion, and uncertainty handling for a hospital setting.

Exercise Advanced
Design a Self-Improving Web Agent
How to evolve a WebVoyager-style agent over time: experience replay, failure mining, prompt-update loops, eval harness.

Exercise Advanced
Design Your First Agent
An open-ended exercise: pick a real workflow you do daily, decompose it, and design the agent that would automate it.

Exercise Foundational
Mock Interview — Agent System Design
A 45-minute mock loop for an agent-system-design interview. The prompt, the rubric, and the common follow-up questions.

Exercise Intermediate