View a markdown version of this page

AI - AWS Blocks

AI

This section covers Blocks for AI-powered features: conversational agents and semantic search.

Choosing an AI Block

Block Best for Avoid when

Agent

Conversational AI with tool calling, streaming, and human-in-the-loop approval

You only need document search without conversation (use KnowledgeBase)

KnowledgeBase

Semantic search over documents, RAG pipelines, context retrieval

You need multi-turn conversation or tool calling (use Agent, optionally with KnowledgeBase as a tool)

Agent

AI agent with streaming responses, tool calling, human-in-the-loop approval, and conversation persistence. Powered by the Strands Agents SDK. Define a system prompt and a set of tools, then stream messages to the agent. The agent can call your tools, ask for human approval before executing sensitive actions, and persist conversations across sessions.

Locally, Agent uses a canned keyword-based provider that returns predictable responses without calling any real model. No API keys or cloud costs needed. You can alternatively configure an openai-api provider pointing to Ollama or any OpenAI-compatible endpoint for testing with real models. On AWS, it connects to Amazon Bedrock.

For more information, see bb-agent on GitHub.

KnowledgeBase

Semantic document retrieval. Point it at a folder of documents and query with natural language to get the most relevant chunks ranked by relevance. Use it to build RAG (Retrieval-Augmented Generation) pipelines or contextual search features.

Locally, KnowledgeBase uses in-memory vector search. On AWS, it provisions Amazon Bedrock Knowledge Bases with automatic document ingestion, chunking, and embedding.

For more information, see bb-knowledge-base on GitHub.