Glossary

AI Backend Glossary

Clear definitions for every term you'll encounter building production AI APIs with FastAPI.

Alembic Migrations

Database schema version control for SQLAlchemy models.

Issuing, tracking, and revoking scoped API keys for programmatic access.

Building high-concurrency Python APIs with FastAPI and asyncio.

A Python background task processor for long-running async jobs.

A tool for defining and running multi-container Docker applications locally.

Numerical vector representations of text for semantic similarity search.

FastAPI's built-in system for sharing reusable logic across route handlers.

Stateless authentication using signed JSON tokens.

A neural network trained on large text corpora that generates human-like text.

A PostgreSQL extension that adds vector similarity search.

What it takes to run FastAPI reliably in production at scale.

Python's type-safe data validation and serialization library.

Enforcing per-client request quotas to prevent abuse and manage costs.

Grounding LLM responses in retrieved documents from a vector store.

A lightweight protocol for streaming data from server to browser over HTTP.

Async database access in Python using SQLAlchemy 2.0's native async API.

Choosing between Server-Sent Events and WebSockets for real-time AI streams.

Counting LLM input and output tokens per request for cost and billing visibility.

Tracking per-customer API consumption for usage-based billing.

A database optimized for storing and searching high-dimensional embedding vectors.