Workshop Beginner

Building Agentic AI Systems with Small Language Models

Review Pending

Session Description

Introduction

3-hour hands-on workshop equips participants to design, build, and optimize real-world AI systems powered by Small Language Models (SLMs). Unlike traditional LLM-heavy approaches, this workshop focuses on cost-efficient, production-aware architectures that run entirely within Google Colab’s free tier—making advanced AI engineering accessible without expensive compute infrastructure.

Participants will progress through building toward a complete multi-agent RAG-enabled AI system that solves a real-world problem. Every module includes end-to-end hands-on demos with pre-configured notebooks that participants take home after the session.

Target Audience

AI/ML engineers and data scientists seeking cost-efficient model deployment strategies
GenAI practitioners evaluating SLMs as alternatives or complements to large LLMs
Engineers building production LLM/SLM applications under compute or budget constraints

Prerequisites

Proficiency in Python (intermediate level; ability to read and write functions, classes, and scripts)
Foundational familiarity with machine learning concepts or prior exposure to LLMs/NLP is helpful but not required
A Google account for Colab and Drive access

Content Provided

Google Colab notebooks with complete, runnable code
Curated small datasets for inference, fine-tuning, and RAG experiments
Reference slides covering theory, architecture diagrams, and design pattern summaries

Infrastructure

Google Colab (Free Tier) — CPU-first design; GPU optional for fine-tuning acceleration
Google Drive for notebook persistence and model checkpoint storage
No paid cloud resources required; all demos run within free-tier constraints

Speakers:

Nitin Agarwal, Sanathraj Narayan

Key Takeaways

By the end of this workshop, participants will be able to:

Deploy and run inference with SLMs (like Phi-3 Mini, Gemma 2B) within Google Colab free-tier limits
Apply quantization techniques and parameter-efficient fine-tuning with QLoRA for domain-specific tasks
Architect an end-to-end Agentic RAG system combining retrieval, reasoning, and generation under constrained compute
Evaluation of AI system using SLM-as-a-Judge and observability

References

https://youtu.be/xX_yWsnSwnw?si=KkdQ_22XJViT0HoB

https://www.youtube.com/watch?v=xQvwJxJxSzQ

Session Categories

Other

Technology architecture

Tutorial about using a FOSS project

Which track are you applying for?

Main track

Speakers

NITIN AGARWAL PRINCIPAL DATA SCIENTIST | ATLASSIAN

Nitin Agarwal is a generative AI leader with deep expertise in Large Language Models (LLMs), Natural Language Processing, Machine Learning, and intelligent automation. Passionate about turning cutting-edge AI capabilities into practical, production-grade systems that drive measurable business value.

Extensive experience developing end-to-end AI platforms, LLM-powered applications, and intelligent systems that enable decision intelligence, knowledge discovery, and next-generation digital experiences. Skilled at translating complex AI concepts into scalable enterprise solutions that enhance productivity, improve customer engagement, and unlock new growth opportunities.

Known for leading high-impact AI initiatives, guiding cross-functional teams, and fostering a culture of innovation, experimentation, and continuous learning. Strong focus on bridging business strategy with emerging AI technologies to deliver responsible, scalable, and impactful AI solutions.

Active contributor to the AI ecosystem as a mentor, speaker, and thought leader, with a strong interest in advancing the practical adoption of Generative AI and shaping the future of intelligent systems.

Driven by a mission to push the boundaries of AI—building technologies that empower people, transform enterprises, and redefine what intelligent systems can achieve.

SANATHRAJ NARAYAN DATA SCIENCE MANAGER | LAM RESEARCH

Sanath brings over a decade of experience in AI/ML, data science, and analytics, with a strong track record of building and deploying machine learning solutions at scale. Before joining Lam, he was a Senior Data Scientist at Ericsson, where he led model development and implementation for network rollout and forecasting use cases. He has also worked at Mindtree and KPMG, focusing on predictive analytics, scalable ML models, and enterprise AI solutions. Sanath is passionate about industrializing AI/ML models and driving real-world impact. He has been an active speaker at AI/ML conferences like Cypher and DataHack Summit, sharing insights on LangChain and LLM-based applications.

Reviews

No reviews yet.