Blog

Technical writing on FastAPI, LLM integration, RAG pipelines, and building production backends.

Adding streaming LLM responses with Server-Sent Events in FastAPI

How to implement real-time streaming chat responses using SSE in FastAPI, with token counting and proper error handling.

January 8, 2025 · FastAPI AI Kit Team

FastAPI · Architecture · Python · Backend

How we structure a production FastAPI project

A practical guide to the routers, services, and repository pattern that makes FastAPI codebases easy to maintain at scale.

December 15, 2024 · FastAPI AI Kit Team

rag · fastapi · pgvector · openai · production

Building a Production RAG Pipeline with FastAPI and pgvector

A complete walkthrough of building a retrieval-augmented generation pipeline: document ingestion, embedding, vector search, and LLM context injection — all in async FastAPI.

November 15, 2024 · FastAPI AI Kit Team

fastapi · auth · jwt · api-keys · security · production

JWT Auth and API Key Management in FastAPI: A Production Guide

How to implement production-grade JWT authentication and API key issuance in FastAPI — with refresh tokens, per-key rate limiting, and secure storage.

October 28, 2024 · FastAPI AI Kit Team

fastapi · stripe · billing · metering · saas · production

Usage-Based Billing with Stripe Metering in FastAPI

How to implement token-based usage metering with Stripe's metered billing in a FastAPI backend — from per-request tracking to webhook handling.

October 10, 2024 · FastAPI AI Kit Team

fastapi · deployment · railway · render · docker · production

Deploying FastAPI to Railway and Render: A Production Guide

Step-by-step production deployment of a FastAPI app with Postgres, Redis, and Celery workers on Railway and Render — including migration automation and health checks.

September 20, 2024 · FastAPI AI Kit Team

fastapi · celery · redis · background-jobs · async · production

Async Background Jobs with Celery and FastAPI

How to offload long-running LLM tasks to Celery workers in FastAPI — job queuing, status polling, result storage, and monitoring with Flower.

August 30, 2024 · FastAPI AI Kit Team

fastapi · sqlalchemy · async · postgres · orm · production

SQLAlchemy 2.0 Async with FastAPI: Best Practices

The right way to use SQLAlchemy 2.0 async sessions in FastAPI — dependency injection, transaction management, eager loading, and common pitfalls.

August 5, 2024 · FastAPI AI Kit Team