Contents

Technologybuildbyravirai.com

Technology

AI Agents in Production Web Apps: What We're Actually Shipping in 2026

RRRavi Rai·May 6, 2026·9 min read

In 2026, every client meeting starts the same way. Founder shows me the dashboard. Pauses. Says 'so… can we add AI to this?' Sometimes they have a specific use case. Often they don't — they just know that not having AI in a product is starting to feel like not having a website in 2010. This post is what we actually ship when that conversation happens.

“AI in 2026 is what mobile apps were in 2014. Most clients don't need their own AI. They need someone honest enough to tell them what they actually need — and what they're better off skipping.”

The five categories of AI we ship

Across 18 production AI integrations we've built or maintained in the last 12 months, every use case fits into one of five buckets. Picking the wrong bucket is what makes most AI projects fail — and what makes them cost 10× what they should.

1. Embedded chat assistants

A small chat box on a website that answers customer questions about products, pricing, and policies — pulling from your existing FAQ pages, product catalog, or knowledge base. The most common 'add AI to my site' request.

Cost to build (one-time): ₹40,000–₹90,000 depending on data complexity
Cost to run: ₹2,000–₹15,000/month, depending on traffic and which model (GPT-4o-mini, Claude Haiku 4.5, or self-hosted LLM)
What works: 70-80% of customer questions get answered without human handoff. Clear ROI when it replaces a ₹25,000/month support hire.
What doesn't: complex multi-step workflows (returns, refunds with policy edge cases). Don't promise the bot will handle these — escalate to humans.

2. Content / SEO generation pipelines

Automated blog post drafting, product description generation, meta tag writing, FAQ page generation, alt text generation. We build internal admin tools that take a brief and produce a draft for human review.

Cost to build: ₹50,000–₹1,50,000 (depends on integration depth — does it write to your CMS automatically? Does it cross-link to existing posts?)
Cost to run: ₹3,000–₹20,000/month
What works: SEO product description generation for e-commerce stores with 1000+ SKUs. Bulk meta description regeneration. First drafts of blog posts that humans then edit.
What doesn't: fully automated 'just publish whatever AI writes' content workflows. We've never seen this work well — it produces generic content that ranks, briefly, then disappears as Google's AI-detection improves.

3. Data extraction and structuring

Taking unstructured data — invoices, resumes, customer emails, support tickets, scraped product data, scanned documents — and turning it into structured database records. This is where AI is genuinely magical and probably the highest-ROI category we ship.

Cost to build: ₹60,000–₹2,00,000 depending on data complexity and accuracy requirements
Cost to run: ₹500–₹10,000/month (per-document costs are tiny)
What works: invoice parsing for accounting tools, resume parsing for HR tools, product catalog ingestion from supplier PDFs, customer feedback categorization at scale
What doesn't: anything where 100% accuracy matters and there's no human review step. Banking, healthcare diagnostics, legal compliance — these need different tooling, not LLMs

4. Personalization and recommendations

AI-powered product recommendations, personalized email subject lines, dynamic homepage content based on user behavior, intelligent upsells in checkout flow.

Cost to build: ₹80,000–₹3,00,000 (this category has the widest range — depends on data infrastructure already in place)
Cost to run: ₹5,000–₹50,000/month
What works: e-commerce product recommendations that lift AOV by 8-15%. Email subject line A/B test winners selected by AI. Cart abandonment messages personalized by user history.
What doesn't: replacing human merchandisers entirely. The best results come from AI doing the heavy lifting and a human approving the top 3-5 weekly campaigns.

5. AI-native products

The product itself is AI — image generators, voice cloning tools, AI video editors, copywriting assistants. The AI isn't an add-on; it's the entire value proposition.

Cost to build: ₹2,00,000–₹15,00,000+ depending on scope
Cost to run: significant, often ₹50,000-₹5,00,000/month at scale (LLM inference is the main cost)
What works: vertical AI tools for specific Indian markets — legal contract review, GST audit assist, FSSAI compliance checking, regional language translation
What doesn't: 'ChatGPT for X' as a thin wrapper. There's no defensible product when OpenAI launches a feature that does what you do. Build vertical depth instead.

If you're a typical Indian SME or startup asking 'should I add AI?', the most common answer we give is: start with a small embedded chat assistant or a data extraction pipeline. Both have clear ROI in 30-60 days, low setup cost, and don't require you to bet your roadmap on AI.

We avoid recommending AI for use cases where the failure mode is critical — billing, payment confirmations, legal advice, medical info, anything where a wrong answer creates liability. AI is good at being mostly right, fast. It's bad at being always right. Match the tool to the problem.

The actual stack we use in 2026

For client AI projects, our default stack:

Models: Claude Haiku 4.5 for cost-sensitive tasks (₹0.20-₹2 per request). Claude Sonnet 4.6 for most production work (₹2-₹10 per request). GPT-4o for client-side embedding and tools that need OpenAI's ecosystem. Self-hosted Llama models when client data can't leave their infrastructure (e.g., healthcare, finance).
Vector database: Postgres with pgvector for most projects. Cheap, fast enough, no extra service. Pinecone or Weaviate only when scaling past 10M vectors.
Framework: Vercel AI SDK on Next.js for streaming. LangChain only for complex multi-step agents — most projects don't need it and the abstractions add fragility.
Hosting AI features: same as the rest of the app — usually CloudNX (our own product) or Vercel for client-facing apps. Inference is on the model provider's servers.
Logging and observability: Langfuse or Helicone for prompt logging, cost tracking, and debugging. Critical for AI in production — without it, costs and accuracy drift silently.

Indian context that matters

Some details that affect AI projects specifically for Indian businesses:

Hindi and regional language support: GPT-4o and Claude both handle Hindi well in 2026. Tamil, Telugu, Marathi vary — test before committing. For Hindi-English code-mixing (Hinglish), prompt engineering matters more than model choice.
INR billing for AI APIs: most providers bill in USD. Plan for forex variance and pre-paid credits. We've seen 10-15% monthly variance on production projects.
DPDP compliance: India's Digital Personal Data Protection Act requires explicit consent for AI processing of personal data. Don't pipe customer PII to third-party AI without consent flows in place.
On-premise / VPC deployment: some Indian enterprise clients (banks, government, defense-adjacent) won't allow data to leave India. We use AWS Mumbai with Bedrock for these, or self-hosted models on their own infrastructure.

What we don't ship

Anything where the AI output is the final answer and there's no human review step in workflows that matter to the business. Build with humans in the loop until the workflow is mature.
AI-generated content auto-published to a public site. Always human-edited.
Voice agents that act on behalf of the customer financially (e.g., 'place an order'). The error rate isn't ready for production yet — it will be in 2027-2028.
Anything called an 'AI replacement for [profession]'. We've never seen this work without significant scope reduction within 6 months. Build augmentation tools, not replacement tools.

How to know if your project actually needs AI

Quick test we run with every prospective AI client:

What's the specific problem you're solving? If you can't name it in one sentence, you don't need AI yet — you need clarity.
Could a simple rule-based system solve 60% of this? If yes, build that first. AI for the remaining 40% is usually cheaper and more reliable than AI for everything.
What's the cost of the AI being wrong? If it's 'embarrassing' or 'inconvenient' — go ahead. If it's 'we lose ₹10L+' or 'we get sued' — find a different solution.
Do you have data to ground the AI with? Most useful AI projects use Retrieval-Augmented Generation (RAG) — the AI reads your data before answering. Without your data, it's just ChatGPT with extra steps.
Is there an actual user pain point this solves? 'Our investors want us to add AI' is not a user pain point. We've turned down work that fails this test.

Thinking about adding AI to your product? We'll do a 30-minute scoping call — honest assessment, no pitch unless you want one.

Book a free AI scoping call

Where AI is heading in 2026-2027

Inference costs continue to fall ~10× per year. The thing that costs ₹15K/month today might cost ₹1.5K/month next year.
Agentic AI (where the model takes actions, not just answers) becomes production-ready for narrow domains in 2026. We're already shipping agents for invoice processing, support ticket triage, and lead qualification.
Voice agents that work in Indian English and Hindi reach production reliability mid-2027 (our prediction). Not yet for transactional flows — first wave will be customer support FAQ replacement.
Multimodal (image + text + voice in one prompt) becomes default. Single-modal applications start to feel dated.
On-device LLMs become viable for non-critical features. Faster, free of API costs, but limited capability.

The agencies and product teams that win in this period are the ones who actually ship AI features that solve real problems — not the ones who put 'AI-powered' in their tagline. We're betting on the former.

Building something AI-related? We've shipped 18 AI integrations in production. Tell us what you're building.

Get a quote

Written by

Ravi Rai

Founder of buildbyRaviRai, a freelance web development agency based in Noida, India. 5+ years shipping Next.js, WordPress, Shopify, and Laravel projects for clients in India, USA, Canada, and the UK.

Working with us in your city

Web developer in Bangalore

For: Engineering-led teams

Web developer in Hyderabad

For: Product & SaaS teams

Web developer in Chennai

For: OMR product startups

Keep Reading

Technology

Next.js 16 in Production: What We Learned Shipping Real Client Projects

Six months into Next.js 16 in production across five client projects — the wins, the caching traps, and what we would do differently next time.

Technology

Claude Code vs Human Developers: Where Each Actually Wins in 2026

An honest look at where AI coding tools like Claude Code have genuinely replaced freelance developer work, where they haven't, and how we actually use them on client projects at buildbyRaviRai.

AI Agents in Production Web Apps: What We're Actually Shipping in 2026

The five categories of AI we ship

1. Embedded chat assistants

2. Content / SEO generation pipelines

3. Data extraction and structuring

4. Personalization and recommendations

5. AI-native products

The actual stack we use in 2026

Indian context that matters

What we don't ship

How to know if your project actually needs AI

Where AI is heading in 2026-2027

Working with us in your city

Keep Reading

Next.js 16 in Production: What We Learned Shipping Real Client Projects

Claude Code vs Human Developers: Where Each Actually Wins in 2026

buildbyRaviRai Assistant

Chat on WhatsApp

AI Agents in Production Web Apps: What We're Actually Shipping in 2026

The five categories of AI we ship

1. Embedded chat assistants

2. Content / SEO generation pipelines

3. Data extraction and structuring

4. Personalization and recommendations

5. AI-native products

What we recommend most often

The actual stack we use in 2026

Indian context that matters

What we don't ship

How to know if your project actually needs AI

Where AI is heading in 2026-2027

Working with us in your city

Keep Reading

Next.js 16 in Production: What We Learned Shipping Real Client Projects

Claude Code vs Human Developers: Where Each Actually Wins in 2026

buildbyRaviRai Assistant

Chat on WhatsApp