AI Backend Glossary

Clear definitions for every term you'll encounter building production AI APIs with FastAPI.

Alembic Migrations

Database schema version control for SQLAlchemy models.

API Key Management

Issuing, tracking, and revoking scoped API keys for programmatic access.

Async FastAPI

Building high-concurrency Python APIs with FastAPI and asyncio.

Celery Worker

A process that pulls tasks from a message broker and runs them in the background, outside the request cycle; suited to long-running jobs.

Docker Compose

A tool for defining and running multi-container Docker applications locally.

Embeddings

Numerical vector representations of text for semantic similarity search.
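As a rough illustration of "semantic similarity search", the sketch below ranks items by cosine similarity. The three-dimensional vectors and the `EMBEDDINGS` table are made up for the example; real embeddings come from a model and typically have hundreds or thousands of dimensions.

```python
import math

def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same length")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        raise ValueError("cosine similarity is undefined for a zero vector")
    return dot / (norm_a * norm_b)

# Hypothetical toy embeddings, invented for illustration only.
EMBEDDINGS = {
    "cat": [0.9, 0.1, 0.0],
    "kitten": [0.8, 0.2, 0.1],
    "invoice": [0.0, 0.1, 0.9],
}

def most_similar(query, candidates):
    """Return the candidate whose vector points closest to the query's."""
    return max(
        candidates,
        key=lambda name: cosine_similarity(EMBEDDINGS[query], EMBEDDINGS[name]),
    )

print(most_similar("cat", ["kitten", "invoice"]))
```

A vector store applies the same idea at scale, usually with an approximate index rather than a full scan.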

FastAPI Dependency Injection

FastAPI's built-in system for sharing reusable logic across route handlers.

JWT Authentication

Stateless authentication using signed JSON tokens.
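To show what "signed" means here, the following is a minimal sketch of HS256 signing and verification using only the standard library. The helper names `sign_token` and `verify_token` are assumptions for this example. It deliberately skips expiry checks and header validation, so a production service would more likely rely on a maintained JWT library.

```python
import base64
import hashlib
import hmac
import json

def _b64url(data: bytes) -> str:
    # JWTs use URL-safe base64 with the trailing "=" padding removed.
    return base64.urlsafe_b64encode(data).rstrip(b"=").decode("ascii")

def _b64url_decode(text: str) -> bytes:
    # Restore the padding that was stripped during encoding.
    return base64.urlsafe_b64decode(text + "=" * (-len(text) % 4))

def sign_token(claims: dict, secret: bytes) -> str:
    """Build header.payload.signature, signed with HMAC-SHA256."""
    header = {"alg": "HS256", "typ": "JWT"}
    signing_input = ".".join(
        _b64url(json.dumps(part, separators=(",", ":")).encode("utf-8"))
        for part in (header, claims)
    )
    signature = hmac.new(secret, signing_input.encode("utf-8"), hashlib.sha256).digest()
    return f"{signing_input}.{_b64url(signature)}"

def verify_token(token: str, secret: bytes) -> dict:
    """Return the claims if the signature matches, else raise ValueError."""
    parts = token.split(".")
    if len(parts) != 3:
        raise ValueError("malformed token")
    header_b64, payload_b64, signature_b64 = parts
    signing_input = f"{header_b64}.{payload_b64}"
    expected = hmac.new(secret, signing_input.encode("utf-8"), hashlib.sha256).digest()
    # compare_digest avoids leaking information through timing differences.
    if not hmac.compare_digest(expected, _b64url_decode(signature_b64)):
        raise ValueError("invalid signature")
    return json.loads(_b64url_decode(payload_b64))
```

Because the server only needs the secret to check a token, no session lookup is required, which is what makes the scheme stateless.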

Large Language Model (LLM)

A neural network trained on large text corpora that generates human-like text.

pgvector

A PostgreSQL extension that adds vector similarity search.

Production FastAPI

What it takes to run FastAPI reliably in production at scale.

Pydantic Validation

A Python library for data validation and serialization driven by type hints.

Rate Limiting

Enforcing per-client request quotas to prevent abuse and manage costs.
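One simple way to enforce such a quota is a fixed-window counter, sketched below. The class name `FixedWindowLimiter` and its in-memory dictionary are assumptions for illustration; a multi-process deployment would normally keep the counters in a shared store instead.

```python
import time

class FixedWindowLimiter:
    """Allow at most `limit` requests per client in each time window."""

    def __init__(self, limit, window_seconds, clock=time.monotonic):
        self.limit = limit
        self.window_seconds = window_seconds
        self.clock = clock  # injectable so tests can control time
        self._windows = {}  # client_id -> (window_start, request_count)

    def allow(self, client_id):
        now = self.clock()
        start, count = self._windows.get(client_id, (now, 0))
        # Start a fresh window once the previous one has elapsed.
        if now - start >= self.window_seconds:
            start, count = now, 0
        if count >= self.limit:
            self._windows[client_id] = (start, count)
            return False
        self._windows[client_id] = (start, count + 1)
        return True
```

Fixed windows are easy to reason about but permit short bursts at window boundaries; sliding-window and token-bucket schemes smooth that out at the cost of slightly more state.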

Retrieval-Augmented Generation (RAG)

Grounding LLM responses in retrieved documents from a vector store.

Server-Sent Events (SSE)

A lightweight protocol for streaming data from server to browser over HTTP.
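On the wire, an SSE stream is plain text: each event is a group of `field: value` lines ended by a blank line. The helper below is a small sketch of that framing; the name `format_sse` is an assumption for this example, not part of any library.

```python
from typing import Optional

def format_sse(data: str, event: Optional[str] = None) -> str:
    """Serialize one event in the text/event-stream format."""
    lines = []
    if event is not None:
        lines.append(f"event: {event}")
    # Multi-line payloads are sent as one "data:" line per line of text.
    for line in data.split("\n"):
        lines.append(f"data: {line}")
    # A blank line marks the end of the event.
    return "\n".join(lines) + "\n\n"

print(repr(format_sse("hello")))
```

A streaming endpoint would yield strings like these with the `text/event-stream` content type, and the browser's `EventSource` reassembles them into events.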

SQLAlchemy Async

Async database access in Python using SQLAlchemy 2.0's native async API.

SSE vs WebSockets

Choosing between Server-Sent Events and WebSockets for real-time AI streams.

Token Tracking

Counting LLM input and output tokens per request for cost and billing visibility.
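A minimal sketch of turning token counts into a per-request cost follows. The prices are hypothetical placeholders, and the `Usage` record is an assumption for the example; real counts would come from the usage data an LLM provider returns with each response.

```python
from dataclasses import dataclass
from decimal import Decimal

@dataclass(frozen=True)
class Usage:
    input_tokens: int
    output_tokens: int

# Hypothetical prices in USD per one million tokens, for illustration only.
INPUT_PRICE_PER_MILLION = Decimal("3.00")
OUTPUT_PRICE_PER_MILLION = Decimal("15.00")

def request_cost(usage: Usage) -> Decimal:
    """Cost of one request; Decimal avoids float rounding in billing math."""
    total = (
        usage.input_tokens * INPUT_PRICE_PER_MILLION
        + usage.output_tokens * OUTPUT_PRICE_PER_MILLION
    )
    return total / Decimal(1_000_000)

print(request_cost(Usage(input_tokens=1000, output_tokens=500)))
```

Input and output tokens are kept separate because providers commonly price them differently.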

Usage Metering

Tracking per-customer API consumption for usage-based billing.

Vector Database

A database optimized for storing and searching high-dimensional embedding vectors.