Akarsh Gajbhiye

Backend & AI Engineer · India · EU + US hours

I build the AI layer that has to survive production.

RAG pipelines, agentic systems, and the backends underneath them. Six years, three startups, systems serving 10M+ requests a day.

Now independent. I take on custom backend builds and white-label engineering for agencies that hit the limit of no-code AI tools. You own the client relationship; I own the technical execution. Strictly under NDA.

Akarsh Gajbhiye

Selected work

AI architecture audit · BlitzApr 2026 · 4 wks

Blitz's AI voice agent — a Gemini caller handling thousands of customer calls a day, rescheduling deliveries and logging failed-delivery reasons for a logistics-tech company — was drifting off-script. Over four weeks I built the observability and LLM-as-judge evals it was missing, isolated where it failed, and added a pre-call intent gate plus an output validator that lifted correct-action rate from 95% to 98% in eval.

Gemini · Custom orchestration · Arize Phoenix · LLM-as-judge · SLM classifier · Python
Lead discovery & validation · Leadzo2026

2,143 usable leads at ~2.5¢ each. An async backend that discovers service-provider leads from search, crawls and validates each with an LLM, and dedups them into the marketplace's database — crawling fanned out one URL per Lambda invocation to keep per-lead cost flat. Delivered as a clean API for the client's own team to build against.

Python 3.12 · MongoDB · Taskiq + Redis · AWS Lambda + SQS · Crawl4AI · DeepSeek V3Private repo
03demo soon
HR AI · agent security testbed2026

An AI recruitment agent built as a security-research testbed: it screens candidates end to end across 15 tools — resume, LinkedIn, and web presence gathered concurrently, scored and ranked via a dedicated ATS sub-agent — then deliberately surfaces the prompt-injection, data-poisoning, and cross-tenant-leakage failure modes production LLM agents hit, and how to harden them. 145 tests.

LangGraph · DeepSeek (OpenRouter) · PostgreSQL · FastAPI · Streamlit · AlembicGitHub ↗
04demo soon
jakk · black-box MCP security scanner2026

A security scanner that probes AI agents (MCP servers) for whether they can be tricked into leaking data or running unintended commands. It enumerates the server's tools, fires curated probes, and classifies each response deterministically — no LLM in the loop, so every “vulnerable” verdict is real, not noise.

Python · MCP / FastMCP · Pydantic · httpx · Rich · pytestGitHub ↗
↗ Loom
ReelLab · video DNA engine2026

An AI pipeline that pulls the psychological hooks, visual mechanics, and clean transcriptions out of Instagram Reels, so creators can read what's actually working on a profile.

FastAPI · Next.js · SQLite · Apify · OpenRouterPrivate repo

How I can help

01 · AuditTechnical audit

5–10 day deep dive on a stuck or fragile workflow. Architecture, failure points, a risk map, and a build estimate you can price against.

from $3,000
02 · BuildBuild sprint

Fixed-scope technical delivery. You sell the AI solution to your enterprise clients; I build the custom Python/Go backend, agentic workflows, and infrastructure that actually makes it work. Predictable margins for your agency.

$5,000 – $12,000
03 · EmbedWhite-label partner

Ongoing senior capacity behind your agency. Async-first, NDA-ready, invisible to your client if that's the engagement. Operates entirely behind your @agency email.

Custom retainer

References

Akarsh gave our AI voice agent the observability and eval discipline it was missing. He pinned down exactly where it drifted off-script and added guardrails that measurably tightened it. Sharp, fast, and he ships.
GauravCTO · Blitz
Akarsh built our AI lead engine end to end: sourcing, scoring, and outreach that moves faster than a team of SDRs. He takes the hard backend problems off our plate and just makes them work.
AdarshCEO · Leadzo

Writing

All writing on Substack →

Stack

LanguagesPython · Go · Java · TypeScript
AI / MLLangGraph · custom RAG · agents · MCP · NLP / NER · embeddings · eval & red-team
BackendFastAPI · Django · Spring Boot · CQRS · async · queues
DataPostgreSQL · MongoDB · Redis · OpenSearch · vector DBs
InfraAWS · Docker · Terraform · Fargate · GitHub Actions

Let's talk

A 30-minute scoping call. No pitch. Tell me where your client's project is stuck, or what you're trying to quote. I'll tell you the exact architecture needed and what it costs to build it quietly behind the scenes. If we're a fit, you'll have a scoping doc the same week.

or email me