Skip to Main Content
Talk Intermediate

Privacy First: Building LLM-Powered Web Apps with client side WASM

Rejected
Session Description

It’s no secret that for a long time machine learning has been mostly a Python game, but the recent surge in popularity of Gen AI has brought many new developers into the field. With JavaScript being the most widely-used programming language, it’s no surprise that this has included many web developers.

There’s been a ton of ink spilled on building with LLMs via API calls to the likes of OpenAI, Anthropic, Google, and others but in these cases, the user is sending the data and the prompt to the servers of these tools and hence is not a 100% secure. and raises issues like cost, latency, and privacy. Moreover certain use-cases might require companies to run models on Device

This is where open source tools like WebLLM, Transformers.js come into the picture. We detail constructing a complete language model pipeline that runs fully in the browser. such has interfacing with a state-of-the-art model like Gemma.

By using lightweight packages like Transformers.js we can quantize and compile models to run efficiently on the user’s device all thanks to WASM. This allows us to build complex conversational workflows like retrieval augmented generation entirely on-device.

Key Takeaways

Ecosystem

There’s been a ton of ink spilled on building with LLMs via API calls to the likes of OpenAI, Anthropic, Google, and others but in these cases, the user is sending the data and the prompt to the servers of these tools and hence is not a 100% secure. and raises issues like cost, latency, and privacy. Moreover certain use-cases might require companies to run models on Device

Thus this talk covers such use-cases and shares real world cases for on browser AI adoption for modern Web Apps while covering the technical architecture of building such apps

References

Session Categories

Introducing a FOSS project or a new version of a popular project
Technology architecture
Engineering practice - productivity, debugging
Technology / FOSS licenses, policy
Which track are you applying for?
Main track

Speakers

Shivay Lamba Developer Experience Engineer | Couchbase

Shivay Lamba is a software developer specializing in DevOps, Machine Learning and Full Stack Development.

He is an Open Source Enthusiast and has been part of various programs like Google Code In and Google Summer of Code as a Mentor and has also been a MLH Fellow. He is actively involved in community work as well. He is a TensorflowJS SIG member, Mentor in OpenMined and CNCF Service Mesh Community, SODA Foundation and has given talks at various conferences like Github Satellite, Voice Global, Fossasia Tech Summit, TensorflowJS Show & Tell.

Shivay Lamba
https://x.com/howdevelop

Reviews

The proposal is clear, and it pushes the usual discourse around AI/GenAI using browser-based technologies and WASM. My only comment - please provide details about the use cases that will be demoed in the talk, as I'm not able to understand whether these will be toy problems or potential real-world/production-worthy use cases.

Reviewer #1 Approved

Thank you for submitting your proposal for IndiaFOSS 2025. Your submission was well-received and progressed to our final review stages.

Unfortunately, due to the high volume of excellent proposals this year, we were unable to select your talk for the final program. We appreciate the effort you put into your submission and encourage you to apply again for future events.

Reviewer #2 Rejected