Building fast Python APIs is a solved problem. You pick FastAPI, write clean async code with type hints, and deploy with Uvicorn or Gunicorn. For most use cases, this works perfectly well. But sometimes, it's just too SLOW!
There are real-world scenarios where Python's performance ceiling becomes a bottleneck: HIGH-throughput applications where every millisecond of latency matters, and where horizontal scaling becomes prohibitively expensive.
Example scenarios:
How would you handle 30,000+ requests per second on a single server without rewriting your entire codebase in Go or Rust?
How do you keep your FastAPI decorator syntax and Pydantic validation while getting 6x better performance?
Given: You have an existing FastAPI application with thousands of lines of code
When: You need 10x better throughput without changing your API contract
Then: What options do you have besides "rewrite everything in a compiled language"?
Even though we can optimize Python code, add more workers, or scale horizontally, we eventually hit fundamental limits of the Python runtime. These performance-critical scenarios require a different approach, one that doesn't force you to abandon Python! In this session, I will demonstrate FastrAPI, an open-source framework that delivers Rust-level performance while keeping your familiar Python syntax. I will show real benchmarks, explain the architecture, demonstrate how to migrate existing FastAPI applications with a single-line change, and, most importantly, walk through ALL THE OPTIMIZATIONS and MICRO-OPTIMIZATIONS (from branch prediction to hashing algorithms).
Performance Comparison
FastAPI + Gunicorn (1 worker): 937 req/sec, 21ms avg latency
FastAPI + Gunicorn (16 workers): 3,882 req/sec, 4.84ms avg latency
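A quick back-of-the-envelope check on the numbers above shows how sub-linear the worker scaling is, which is exactly the ceiling the session is about (the arithmetic below is mine; the throughput figures are the ones quoted above):

```python
# Scaling efficiency implied by the FastAPI + Gunicorn benchmark numbers.
single = 937       # req/sec with 1 worker
multi = 3882       # req/sec with 16 workers
workers = 16

speedup = multi / single         # ~4.1x from 16x the workers
efficiency = speedup / workers   # ~26% per-worker efficiency
print(f"speedup: {speedup:.1f}x, per-worker efficiency: {efficiency:.0%}")
```

Sixteen workers buy roughly a 4x improvement, not 16x, so past a point adding Python workers stops paying for itself.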
Drop-in Migration
- from fastapi import FastAPI
+ from fastrapi import FastrAPI

app = FastrAPI()

@app.get("/hello")
def hello():
    return {"Hello": "World"}
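Why a single import change can be enough: a FastAPI-style decorator only records (method, path) → handler in a registry, so the runtime underneath (Uvicorn workers, or a Rust/Tokio server) is free to change without touching route code. A minimal stdlib sketch of that pattern (illustrative names, not FastrAPI internals):

```python
# Sketch: the decorator's only job is to populate a route registry.
class App:
    def __init__(self):
        self.routes = {}

    def get(self, path):
        def register(handler):
            self.routes[("GET", path)] = handler
            return handler
        return register

app = App()

@app.get("/hello")
def hello():
    return {"Hello": "World"}

# The registry is all the decorator produces; dispatch happens in the runtime.
print(app.routes[("GET", "/hello")]())
```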
The ability to:
Use same FastAPI decorator syntax (@app.get, @app.post)
Achieve 6x performance improvement over FastAPI + Gunicorn in real-world benchmarks
Maintain full Pydantic integration for request/response validation
Handle 30,000+ requests per second with sub-millisecond average latency
Add middleware (CORS, GZip, TrustedHost, Session) exactly like FastAPI
Migrate existing applications by changing a single import statement
Deploy with a simple serve() method - no Uvicorn, Gunicorn, or worker management
Leverage Rust's type safety and memory efficiency without writing Rust code
Run async Python handlers on Tokio's high-performance runtime
Build production APIs with minimal memory overhead and maximum throughput