Skip to main content
ClaudeWave
Skill2.1k repo starsupdated 2mo ago

google-adk-python

Google ADK Python is an open-source toolkit for building, evaluating, and deploying AI agents with tool integration and multi-agent orchestration capabilities. Use this skill when implementing AI agents that require tool integration, multi-agent coordination, workflow automation through sequential or parallel processing, evaluation frameworks, and deployment to Google Cloud infrastructure or custom environments.

Install in Claude Code
Copy
git clone --depth 1 https://github.com/mrgoonie/claudekit-skills /tmp/google-adk-python && cp -r /tmp/google-adk-python/.claude/skills/google-adk-python ~/.claude/skills/google-adk-python
Then start a new Claude Code session; the skill loads automatically.

SKILL.md

# Google ADK Python Skill

You are an expert guide for Google's Agent Development Kit (ADK) Python - an open-source, code-first toolkit for building, evaluating, and deploying AI agents.

## When to Use This Skill

Use this skill when users need to:
- Build AI agents with tool integration and orchestration capabilities
- Create multi-agent systems with hierarchical coordination
- Implement workflow agents (sequential, parallel, loop) for predictable pipelines
- Integrate LLM-powered agents with Google Search, Code Execution, or custom tools
- Deploy agents to Vertex AI Agent Engine, Cloud Run, or custom infrastructure
- Evaluate and test agent performance systematically
- Implement human-in-the-loop approval flows for tool execution

## Core Concepts

### Agent Types

**LlmAgent**: LLM-powered agents capable of dynamic routing and adaptive behavior
- Define with name, model, instruction, description, and tools
- Supports sub-agents for delegation and coordination
- Intelligent decision-making based on context

**Workflow Agents**: Structured, predictable orchestration patterns
- **SequentialAgent**: Execute agents in defined order
- **ParallelAgent**: Run multiple agents concurrently
- **LoopAgent**: Repeat execution with iteration logic

**BaseAgent**: Foundation for custom agent implementations

### Key Components

**Tools Ecosystem**:
- Pre-built tools (google_search, code_execution)
- Custom Python functions as tools
- OpenAPI specification integration
- Tool confirmation flows for human approval

**Multi-Agent Architecture**:
- Hierarchical agent composition
- Specialized agents for specific domains
- Coordinator agents for delegation

## Installation

```bash
# Stable release (recommended)
pip install google-adk

# Development version (latest features)
pip install git+https://github.com/google/adk-python.git@main
```

## Implementation Patterns

### Single Agent with Tools

```python
from google.adk.agents import LlmAgent
from google.adk.tools import google_search

agent = LlmAgent(
    name="search_assistant",
    model="gemini-2.5-flash",
    instruction="You are a helpful assistant that searches the web for information.",
    description="Search assistant for web queries",
    tools=[google_search]
)
```

### Multi-Agent System

```python
from google.adk.agents import LlmAgent

# Specialized agents
researcher = LlmAgent(
    name="Researcher",
    model="gemini-2.5-flash",
    instruction="Research topics thoroughly using web search.",
    tools=[google_search]
)

writer = LlmAgent(
    name="Writer",
    model="gemini-2.5-flash",
    instruction="Write clear, engaging content based on research.",
)

# Coordinator agent
coordinator = LlmAgent(
    name="Coordinator",
    model="gemini-2.5-flash",
    instruction="Delegate tasks to researcher and writer agents.",
    sub_agents=[researcher, writer]
)
```

### Custom Tool Creation

```python
from google.adk.tools import Tool

def calculate_sum(a: int, b: int) -> int:
    """Calculate the sum of two numbers."""
    return a + b

# Convert function to tool
sum_tool = Tool.from_function(calculate_sum)

agent = LlmAgent(
    name="calculator",
    model="gemini-2.5-flash",
    tools=[sum_tool]
)
```

### Sequential Workflow

```python
from google.adk.agents import SequentialAgent

workflow = SequentialAgent(
    name="research_workflow",
    agents=[researcher, summarizer, writer]
)
```

### Parallel Workflow

```python
from google.adk.agents import ParallelAgent

parallel_research = ParallelAgent(
    name="parallel_research",
    agents=[web_researcher, paper_researcher, expert_researcher]
)
```

### Human-in-the-Loop

```python
from google.adk.tools import google_search

# Tool with confirmation required
agent = LlmAgent(
    name="careful_searcher",
    model="gemini-2.5-flash",
    tools=[google_search],
    tool_confirmation=True  # Requires approval before execution
)
```

## Deployment Options

### Cloud Run Deployment

```bash
# Containerize agent
docker build -t my-agent .

# Deploy to Cloud Run
gcloud run deploy my-agent --image my-agent
```

### Vertex AI Agent Engine

```python
# Deploy to Vertex AI for scalable agent hosting
# Integrates with Google Cloud's managed infrastructure
```

### Custom Infrastructure

```python
# Run agents locally or on custom servers
# Full control over deployment environment
```

## Model Support

**Optimized for Gemini**:
- gemini-2.5-flash
- gemini-2.5-pro
- gemini-1.5-flash
- gemini-1.5-pro

**Model Agnostic**: While optimized for Gemini, ADK supports other LLM providers through standard APIs.

## Best Practices

1. **Code-First Philosophy**: Define agents in Python for version control, testing, and flexibility
2. **Modular Design**: Create specialized agents for specific domains, compose into systems
3. **Tool Integration**: Leverage pre-built tools, extend with custom functions
4. **Evaluation**: Test agents systematically against test cases
5. **Safety**: Implement confirmation flows for sensitive operations
6. **Hierarchical Structure**: Use coordinator agents for complex multi-agent workflows
7. **Workflow Selection**: Choose workflow agents for predictable pipelines, LLM agents for dynamic routing

## Common Use Cases

- **Research Assistants**: Web search + summarization + report generation
- **Code Assistants**: Code execution + documentation + debugging
- **Customer Support**: Query routing + knowledge base + escalation
- **Content Creation**: Research + writing + editing pipelines
- **Data Analysis**: Data fetching + processing + visualization
- **Task Automation**: Multi-step workflows with conditional logic

## Development UI

ADK includes built-in interface for:
- Testing agent behavior interactively
- Debugging tool calls and responses
- Evaluating agent performance
- Iterating on agent design

## Resources

- GitHub: https://github.com/google/adk-python
- Documentation: https://google.github.io/adk-docs/
- llms.txt: https://raw.githubusercontent.com/google/adk-python/refs/h
mcp-managerSubagent

Manage MCP (Model Context Protocol) server integrations - discover tools/prompts/resources, analyze relevance for tasks, and execute MCP capabilities. Use when need to work with MCP servers, discover available MCP tools, filter MCP capabilities for specific tasks, execute MCP tools programmatically, or implement MCP client functionality. Keeps main context clean by handling MCP discovery in subagent context.

cmSlash Command

Stage all files and create a commit.

cpSlash Command

Stage, commit and push all code in the current branch

prSlash Command

Create a pull request

createSlash Command

Create a new agent skill

use-mcpSlash Command

Utilize tools of Model Context Protocol (MCP) servers

aestheticSkill

Create aesthetically beautiful interfaces following proven design principles. Use when building UI/UX, analyzing designs from inspiration sites, generating design images with ai-multimodal, implementing visual hierarchy and color theory, adding micro-interactions, or creating design documentation. Includes workflows for capturing and analyzing inspiration screenshots with chrome-devtools and ai-multimodal, iterative design image generation until aesthetic standards are met, and comprehensive design system guidance covering BEAUTIFUL (aesthetic principles), RIGHT (functionality/accessibility), SATISFYING (micro-interactions), and PEAK (storytelling) stages. Integrates with chrome-devtools, ai-multimodal, media-processing, ui-styling, and web-frameworks skills.

ai-multimodalSkill

Process and generate multimedia content using Google Gemini API. Capabilities include analyze audio files (transcription with timestamps, summarization, speech understanding, music/sound analysis up to 9.5 hours), understand images (captioning, object detection, OCR, visual Q&A, segmentation), process videos (scene detection, Q&A, temporal analysis, YouTube URLs, up to 6 hours), extract from documents (PDF tables, forms, charts, diagrams, multi-page), generate images (text-to-image, editing, composition, refinement). Use when working with audio/video files, analyzing images or screenshots, processing PDF documents, extracting structured data from media, creating images from text prompts, or implementing multimodal AI features. Supports multiple models (Gemini 2.5/2.0) with context windows up to 2M tokens.