Lightning Talk
Beginner

Evals First, Code Later: A Practical Guide to Evaluations, Rerankers & Caches

Rejected

Most RAG and search systems "just work"... until they silently fail in production.

I learned that the hard way. We were indexing millions of lines of code. Average latency was 20 minutes, relevance was weak, and most issues only showed up after shipping. It was painful.

To fix this, we flipped our dev process and started treating evals like tests, not afterthoughts. That led to a 40x latency drop, a 30% relevance boost, and indexing time cut down to a mere 30 seconds.
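To make "evals like tests" concrete, here is a minimal sketch, assuming a small hand-labelled set of queries with known relevant documents. The corpus, the `keyword_search` stand-in retriever, and the 0.9 threshold are all hypothetical placeholders for illustration, not the pipeline from the talk.

```python
# Hypothetical toy corpus: doc id -> text.
CORPUS = {
    "doc1": "parse config file and load settings",
    "doc2": "open database connection pool",
    "doc3": "retry http request with backoff",
}

# Hypothetical hand-labelled eval set: query -> ids of relevant docs.
EVAL_SET = {
    "load settings from config": {"doc1"},
    "database connection": {"doc2"},
    "http retry": {"doc3"},
}


def keyword_search(query: str, k: int = 2) -> list[str]:
    """Stand-in retriever: rank docs by number of words shared with the query."""
    query_words = set(query.lower().split())
    ranked = sorted(
        CORPUS,
        # More shared words first; ties broken by doc id for determinism.
        key=lambda doc_id: (-len(query_words & set(CORPUS[doc_id].split())), doc_id),
    )
    return ranked[:k]


def recall_at_k(search, eval_set, k: int) -> float:
    """Fraction of all labelled relevant docs that appear in the top-k results."""
    hits = 0
    total = 0
    for query, relevant in eval_set.items():
        retrieved = set(search(query, k))
        hits += len(relevant & retrieved)
        total += len(relevant)
    return hits / total


def test_recall_does_not_regress():
    # Fails the build if retrieval quality drops below the agreed floor.
    assert recall_at_k(keyword_search, EVAL_SET, k=2) >= 0.9
```

A test runner such as pytest would pick up `test_recall_does_not_regress` alongside ordinary unit tests, so a relevance regression blocks a merge the same way a failing unit test does.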

This talk is a practical walkthrough of what worked for us:

  • Setting up eval pipelines

  • Combining BM25 with ANN retrieval

  • Using rerankers

  • Semantic caching strategies that keep results fast and fresh
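As an illustration of combining BM25 with ANN retrieval, here is a minimal sketch using reciprocal rank fusion (RRF). The abstract does not say which fusion method the talk uses, so RRF is an assumption here, and the document ids and the constant `k=60` are illustrative values only.

```python
from collections import defaultdict


def reciprocal_rank_fusion(rankings, k=60):
    """Merge several ranked lists of doc ids into one list.

    Each doc scores 1 / (k + rank) per list it appears in (rank starts at 1),
    and the scores are summed across lists.
    """
    scores = defaultdict(float)
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] += 1.0 / (k + rank)
    # Highest fused score first; ties broken by doc id for determinism.
    return sorted(scores, key=lambda doc_id: (-scores[doc_id], doc_id))


# Hypothetical results from a keyword (BM25) index and a vector (ANN) index.
bm25_hits = ["a", "b", "c"]
ann_hits = ["c", "a", "d"]

fused = reciprocal_rank_fusion([bm25_hits, ann_hits])
print(fused)  # ['a', 'c', 'b', 'd']
```

Documents that rank well in both lists ("a" and "c") rise to the top, while documents found by only one retriever are kept but ranked lower. The fused list is then a natural candidate set to pass to a reranker.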

We’ll look at:

  • Performance benchmarks before vs after

  • A live demo using Postgres (pgvector) + rerankers

  • A cheatsheet + open-source repo so you can plug this into your own stack

Whether you're just getting started or already building with LLMs, search, or RAG, this will give you concrete techniques to improve relevance, speed, and reliability.

You'll be able to iterate based on systematic methods, not on intuition or vibes.

Tutorial about using a FOSS project
Engineering practice - productivity, debugging
Technology architecture
Which track are you applying for?
Main track

Saksham Aggarwal
Co-founder and CTO, Iterate AI (useiterate.ai)
https://www.linkedin.com/in/sakshamagg27

Approvability: 50%
Approvals: 1
Rejections: 1
Not sure: 0

Nice talk, given the emphasis on more technical depth this time. I suggest the proposer add more code references.

Reviewer #1
Approved

Thank you for submitting your proposal for IndiaFOSS 2025. Your submission was well-received and progressed to our final review stages.

Unfortunately, due to the high volume of excellent proposals this year, we were unable to select your talk for the final program. We appreciate the effort you put into your submission and encourage you to apply again for future events.

Reviewer #2
Rejected