Question 1

What do you actually build?

Accepted Answer

Production LLM features, agent loops, MCP tool servers, retrieval pipelines, evals, and the product surface around them. The model is rarely the hard part; the system around it usually is.

Question 2

Where are you based, and who do you work with?

Accepted Answer

Based in India and working globally. Most recent engagements have been with US-based AI startups and US-headquartered enterprises, with overlap windows for sync time.

Question 3

How is this different from a generalist software engineer?

Accepted Answer

I came up as a generalist and still operate full-stack when needed. The focus today is the parts of LLM-powered products that most engineering teams undercount: tool design, context engineering, evals, fallbacks, latency and cost shaping, and the infra around model calls.

Question 4

What stack do you typically work in?

Accepted Answer

Python and TypeScript on the model and product side. LangChain, LlamaIndex, MCP, pgvector, Pinecone, LoRA fine-tuning, Langfuse for evals, Next.js or Astro on the product surface, AWS or Cloudflare for the runtime. The stack changes with the problem; the discipline does not.

Question 5

How do you decide whether an AI feature is shippable?

Accepted Answer

A feature is shippable when its evals are honest, its failure modes degrade gracefully, its cost and latency stay inside a budget at the traffic you actually expect, and the product around it is still usable when the model is wrong.

Question 6

What is the fastest way to figure out fit?

Accepted Answer

Read the Work page for shipped systems, skim two posts under Writing, then reach out via LinkedIn or GitHub. A short async exchange almost always tells both sides whether the next call is worth booking.

About me

Frequently asked

When this is not the right fit