Nirmalya Ghosh Applied AI | Technologist

Text-to-SQL the Naïve Way: Why Most Demos Fail in Production

The promise of Text-to-SQL is compelling: let anyone query a database using plain English. The reality is that most implementations silently return wrong data, expose sensitive information, and cost more than they should.

Continue reading ...

TTFT Optimisation: Practical Patterns

How to reduce TTFT in production: practical patterns, implementation strategies, and edge cases to watch for.

Continue reading ...

How Prompt Size Directly Impacts LLM Response Latency

Understanding the mechanics of Time to First Token (TTFT) and why those extra tokens may lead to poor user experience (UX).

Continue reading ...

A Newsletter Decluttering AI Agent Using ReAct Pattern

Our inboxes contain dozens (if not hundreds) of newsletters we subscribed to during moments of curiosity, but we seldom read most of them. Manually unsubscribing is tedious: open each email, scroll to the bottom, click unsubscribe, confirm … repeat 50+ times.

This post covers a personal project developing an AI agent using the ReAct pattern to analyse newsletters I have subscribed to and recommend the ones to unsubscribe based on my reading behaviour.

Continue reading ...

Trying Out Osmosis-Structure-0.6B

While large language models (LLMs), such as GPT-4 and Claude, are capable of extracting structured information from text, small language models (SLMs) have historically struggled to do so reliably. Previously, the only viable approach was to fine-tune a larger open-weights model using distillation. A week ago, there was an announcement, which appears to an alternative.

Continue reading ...