Gen AI Solutions Documentation

Overview
Architecture
Features
Components
Setup & Installation
Usage
API Reference
Deployment
Troubleshooting

Overview

This Gen AI solution provides a comprehensive platform for deploying and interacting with Google ADK (Agent Development Kit) agents on Vertex AI Agent Engine. It includes:

Multiple AI Agents: Deploy and manage multiple BigQuery-powered agents
Web Interface: Modern React-based chat interface for interacting with agents
Query History: Firestore-based persistent query history with user isolation
Conversation Context: Session-based context retention for natural conversations
Agent Management: Automated deployment and cleanup tools

Architecture

High-Level Architecture

┌─────────────────┐
│  React Frontend │  (Chat UI)
│  (Port 3000)    │
└────────┬────────┘
         │ HTTP/SSE
         ▼
┌─────────────────┐
│  FastAPI Backend│  (Port 8000)
│  (Proxy Layer)  │
└────────┬────────┘
         │ REST API
         ▼
┌─────────────────┐
│ Vertex AI Agent │
│     Engine      │
└────────┬────────┘
         │
         ▼
┌─────────────────┐     ┌─────────────────┐
│   BigQuery      │     │   Firestore     │
│  (Data Source)  │     │  (History)      │
└─────────────────┘     └─────────────────┘

Component Flow

User Interaction: User sends message via React frontend
Context Building: Frontend includes last 6 messages (conversation history) with the new message
Backend Processing: FastAPI backend receives request with conversation context
Agent Engine: Vertex AI Agent Engine processes query with conversation context embedded in message
BigQuery Execution: Agent executes SQL queries against BigQuery
Response Streaming: Results stream back through backend to frontend
History Storage: Query/response saved to Firestore for future reference

Features

1. Multi-Agent Support

bq_agent_mick: BigQuery billing data analysis agent (Mick’s version)
bq_agent: Standard BigQuery data analysis agent
Easy addition of new agents via configuration

2. Web Chat Interface

Modern React-based UI
Real-time streaming responses
Agent selection dropdown
Message history display
Clean, responsive design

3. Query History (Firestore)

Automatic Saving: All queries saved after execution
User Isolation: Users only see their own history
History Sidebar: Clickable history items
Delete Functionality: Delete individual or all queries
Persistent Storage: History survives page refreshes

4. Conversation Context

Message History: Last 6 messages (3 exchanges) included with each query
Context Retention: Agent sees conversation history embedded in the message
Natural Conversations: Support for follow-up questions and references
History Loading: When loading a history item, context is preserved for follow-up queries
Session Tracking: Session IDs used for tracking but not sent to Agent Engine

5. Deployment Management

Automated Deployment: One-command agent deployment
Cleanup Tools: Automatic cleanup of old deployments
Multi-Agent Support: Deploy individual or all agents
Makefile Integration: Simple commands for common tasks

Components

Backend (`web/backend/`)

`main.py`

FastAPI application providing:

/agents - List available agents
/query/stream - Stream query to agent (with session support)
/query - Non-streaming query endpoint
/history - Get user’s query history
/history/{query_id} - Delete specific query
/history (DELETE) - Delete all user’s history

`history.py`

Firestore service for query history:

save_query() - Save query/response to Firestore
get_query_history() - Retrieve user’s history
delete_query() - Delete specific query
delete_all_history() - Delete all user’s queries

Frontend (`web/frontend/`)

`App.jsx`

Main React component:

Agent selection
Message display
Streaming response handling
History sidebar
Session management

Key Features:

Real-time message streaming (SSE)
Session ID generation and management
History loading and display
Delete functionality

Agents

`bq_agent_mick/agent.py`

BigQuery billing data analysis agent
Custom instructions for currency formatting
SQL query generation and execution

`bq_agent/agent.py`

Standard BigQuery agent
General data analysis capabilities

Setup & Installation

Prerequisites

Python 3.11+
Node.js 18+
Google Cloud Project with:
- Vertex AI API enabled
- BigQuery API enabled
- Firestore API enabled
- Appropriate IAM permissions

1. Clone and Setup

# Clone repository
git clone <repository-url>
cd gcp-billing-ai

# Install Python dependencies
pip install -r requirements.txt
pip install -r web/backend/requirements.txt

# Install Node.js dependencies
cd web/frontend
npm install
cd ../..

2. Configure Environment

Create .env files:

Root .env:

BQ_PROJECT=your-project-id
GCP_PROJECT_ID=your-project-id
LOCATION=us-central1

Agent-specific .env files:

bq_agent_mick/.env: REASONING_ENGINE_ID=<deployed-engine-id>
bq_agent/.env: REASONING_ENGINE_ID=<deployed-engine-id>

3. Deploy Agents

# Deploy specific agent
make deploy-bq-agent-mick

# Deploy all agents
make deploy-all-agents

4. Setup Firestore

# Enable Firestore API
gcloud services enable firestore.googleapis.com --project=YOUR_PROJECT_ID

# Create Firestore database
gcloud firestore databases create --location=us-central1 --project=YOUR_PROJECT_ID

5. Start Services

Terminal 1 - Backend:

cd web/backend
python main.py

Terminal 2 - Frontend:

cd web/frontend
npm run dev

6. Access Application

Open browser: http://localhost:3000

Usage

Web Interface

Select Agent: Choose agent from dropdown
Send Query: Type your question and press Enter
View History: Click history button (📜) to see past queries
Load History: Click any history item to reload conversation
Delete History: Use trash icon to delete queries

Conversation Context

The agent maintains context within a conversation by sending previous messages with each new query:

First Query: Creates new session automatically
Follow-up Queries: Includes last 6 messages (3 exchanges) as context in the request
Clear Chat: Starts new session
Load History: When loading a history item, the conversation context is preserved - subsequent queries include the loaded messages in the context

How it works:

Each new message includes the last 6 messages from the current chat window
The context is formatted as a conversation history in the message
This allows the agent to understand references like “the above query” or “that result”

Example Conversation:

User: "What are the top 10 projects by cost?"
Agent: [Shows results with project IDs and costs]

User: "Order them by TLA"
Agent: [Remembers previous query and orders by TLA - context sent with message]

User: "run the above query again"
Agent: [Understands "above query" refers to previous query - context includes it]

Agent Deployment

Deploy Single Agent:

make deploy-bq-agent-mick

Deploy All Agents:

make deploy-all-agents

List Deployments:

make list-deployments

Cleanup Old Deployments:

make cleanup-deployments AGENT_NAME=bq_agent_mick KEEP=1

API Reference

Backend Endpoints

`GET /agents`

List all available agents.

Response:

[
  {
    "name": "bq_agent_mick",
    "display_name": "BigQuery Agent (Mick)",
    "description": "BigQuery billing data analysis agent",
    "reasoning_engine_id": "123456789",
    "is_available": true
  }
]

`POST /query/stream`

Stream query to agent (recommended).

Request:

{
  "agent_name": "bq_agent_mick",
  "message": "What are the top 10 projects?\n\nUser: previous question\nAssistant: previous response",
  "user_id": "web_user",
  "session_id": "session-123456"  // Optional, for tracking (not sent to Agent Engine)
}

Note: The message field includes conversation context (last 6 messages) formatted as a conversation history. The session_id is used for tracking but not sent to Agent Engine REST API.

Response: Server-Sent Events (SSE) stream

data: {"text": "I'll query..."}
data: {"text": " the top 10..."}
data: {"query_id": "abc123", "done": true}

`GET /history`

Get user’s query history.

Query Parameters:

user_id (required): User identifier
limit (optional, default: 50): Max number of queries

Response:

[
  {
    "id": "doc123",
    "user_id": "web_user",
    "agent_name": "bq_agent_mick",
    "message": "What are the top 10 projects?",
    "response": "Here are the results...",
    "timestamp": "2025-11-05T18:20:33.806283Z"
  }
]

`DELETE /history/{query_id}`

Delete specific query.

Query Parameters:

user_id (required): User identifier

`DELETE /history`

Delete all user’s history.

Query Parameters:

user_id (required): User identifier

Deployment

Local Development

See GETTING_STARTED.md for local setup.

Cloud Run Deployment

Simple IAP Deployment (No DNS)

cd web/deploy
export PROJECT_ID="your-project-id"
./deploy-simple-iap.sh

Features:

No DNS configuration required
IAP security enabled
Quick deployment for course/temporary use

Full IAP + Load Balancer Deployment

cd web/deploy
export PROJECT_ID="your-project-id"
./deploy-all.sh

Features:

Custom domain support
Load balancer
Full IAP integration
Production-ready

See DEPLOYMENT_GUIDE.md for the full deployment guide.

Troubleshooting

Agent Not Responding

Symptoms: Agent returns errors or doesn’t respond

Solutions:

Check REASONING_ENGINE_ID in agent .env file
Verify agent is deployed: make list-deployments
Check backend logs for errors
Verify BigQuery permissions

History Not Loading

Symptoms: 500 error when loading history

Solutions:

Enable Firestore API: gcloud services enable firestore.googleapis.com
Create Firestore database
Check backend logs for specific errors
Verify Firestore permissions

Conversation Context Not Working

Symptoms: Agent doesn’t remember previous messages

Solutions:

Check conversation history is being sent: The frontend includes last 6 messages with each query
Verify messages are in chat window: Context is built from messages visible in the current chat
Check backend logs: Look for the conversation context in the request payload
Load history properly: When loading a history item, the messages become part of the conversation context

How context works:

Frontend sends last 6 messages (3 exchanges) with each new query
Context is embedded in the message as a conversation history
Agent Engine processes the full conversation context

Permission Errors

Common Issues:

BigQuery: Grant roles/bigquery.dataViewer and roles/bigquery.jobUser
Firestore: Ensure service account has Firestore access (roles/datastore.user)
Agent Engine: Verify service account has Vertex AI permissions
Cloud Run: Verify service account has roles/run.invoker for IAP

Fix Permissions:

# BigQuery permissions
make grant-bq-permissions

# Firestore permissions (if history saving fails with 403)
gcloud projects add-iam-policy-binding YOUR_PROJECT_ID \
  --member="serviceAccount:agent-engine-api-sa@YOUR_PROJECT_ID.iam.gserviceaccount.com" \
  --role="roles/datastore.user"

# Check service accounts
gcloud projects get-iam-policy YOUR_PROJECT_ID \
  --flatten="bindings[].members" \
  --filter="bindings.members:serviceAccount:agent-engine-api-sa@YOUR_PROJECT_ID.iam.gserviceaccount.com"

Frontend Not Connecting to Backend (Cloud Run)

Symptoms: CORS errors or “Failed to fetch” errors

Solutions:

Verify API URL is embedded correctly: The frontend JavaScript should contain the Cloud Run API URL, not localhost:8000
Hard refresh browser: Cmd+Shift+R (Mac) or Ctrl+Shift+R (Windows/Linux)

Check CORS configuration: Verify CORS_ALLOWED_ORIGINS is set on backend:

gcloud run services describe agent-engine-api \
  --region=us-central1 \
  --project=YOUR_PROJECT_ID \
  --format="value(spec.template.spec.containers[0].env)"

Rebuild frontend: If the API URL is wrong, rebuild with correct URL:

cd web/frontend
API_URL="https://agent-engine-api-xxxxx.run.app"
gcloud builds submit --config=cloudbuild.yaml \
  --substitutions=_API_URL="$API_URL" \
  . --project=YOUR_PROJECT_ID

Security Considerations

User Isolation

Each user’s history is isolated by user_id
Firestore queries filter by user_id
Delete operations verify ownership

Authentication

Local Development:

Uses default GCP credentials
gcloud auth application-default login

Production:

IAP (Identity-Aware Proxy) for web interface
Service account for backend → Agent Engine
User ID from authenticated session

Data Privacy

Query history stored in Firestore
User-scoped queries only
No cross-user data access
Session IDs are user-specific

Best Practices

Agent Development

Clear Instructions: Write explicit agent instructions
Tool Selection: Use appropriate BigQuery toolsets
Error Handling: Include error handling in agent logic
Testing: Test agents locally before deployment

Deployment

Cleanup: Regularly clean up old deployments
Monitoring: Monitor Agent Engine logs
Versioning: Track agent versions
Backup: Backup important configurations

Performance

Session Management: Reuse sessions for related queries
History Limits: Limit history retrieval to reasonable sizes
Caching: Consider caching common queries
Indexing: Create Firestore indexes for better performance

Future Enhancements

Planned Features

Multi-user authentication
Query analytics and insights
Export history functionality
Agent performance metrics
Advanced conversation management
Query templates and suggestions
Multi-language support

Integration Opportunities

Google Cloud Monitoring integration
Cloud Logging for query analytics
BigQuery Data Transfer Service
Cloud Functions for automated tasks
Cloud Scheduler for periodic queries

Gen AI Solutions Documentation

Table of Contents

Overview

Architecture

High-Level Architecture

Component Flow

Features

1. Multi-Agent Support

2. Web Chat Interface

3. Query History (Firestore)

4. Conversation Context

5. Deployment Management

Components

Backend (web/backend/)

main.py

history.py

Frontend (web/frontend/)

App.jsx

Key Features:

Agents

bq_agent_mick/agent.py

bq_agent/agent.py

Setup & Installation

Prerequisites

1. Clone and Setup

2. Configure Environment

3. Deploy Agents

4. Setup Firestore

5. Start Services

6. Access Application

Usage

Web Interface

Conversation Context

Agent Deployment

API Reference

Backend Endpoints

GET /agents

POST /query/stream

GET /history

DELETE /history/{query_id}

DELETE /history

Deployment

Local Development

Cloud Run Deployment

Simple IAP Deployment (No DNS)

Full IAP + Load Balancer Deployment

Troubleshooting

Agent Not Responding

History Not Loading

Conversation Context Not Working

Permission Errors

Frontend Not Connecting to Backend (Cloud Run)

Security Considerations

User Isolation

Authentication

Data Privacy

Best Practices

Agent Development

Deployment

Performance

Future Enhancements

Planned Features

Integration Opportunities

Support & Resources

Documentation

Google Cloud Resources

Troubleshooting

License

Contributors

Backend (`web/backend/`)

`main.py`

`history.py`

Frontend (`web/frontend/`)

`App.jsx`

`bq_agent_mick/agent.py`

`bq_agent/agent.py`

`GET /agents`

`POST /query/stream`

`GET /history`

`DELETE /history/{query_id}`

`DELETE /history`