Booking engagements for Q3 2026

AI-Native Products,
Shipped at Startup Speed.

An engineering team for ambitious AI-era companies. We've shipped products to 30+ countries and run engagements for fintech, agencies, and consultancies in the UK, EU, and India.

Start a project See our work

Domains we've shipped in

UK fintech· Founders networks· Field-service ops· Edge personalization· Multi-channel ecom· Connected vehicles· EdTech & UPSC· 3D K-12 learning· IoT & edge ML· AI newsroom· UK fintech· Founders networks· Field-service ops· Edge personalization· Multi-channel ecom· Connected vehicles· EdTech & UPSC· 3D K-12 learning· IoT & edge ML· AI newsroom·

Client names withheld under NDA · references on request

Grounded AI & RAG

Citation-enforced retrieval, multi-model routing, and continuous evals on Langfuse — so the model stops being a guess.

Real-time backends

Kafka Streams, Postgres + pgvector, Cloudflare Workers, AWS SST. Architectures that survive the IPL match-day spike, not just the demo.

Multimodal frontends

Three.js explainers, voice-first UIs in Hindi/English/Hinglish, Next.js 15 with strict TypeScript and design that doesn't look generated.

Prototype to production

Idea to revenue.
End-to-end.

We don't hand off a half-finished prototype. Every engagement ends in production code, observability, and a runbook your team can operate. Founder-tier engineers from day one — no juniors hidden behind the brand.

Audit & Architecture

Two-week paid audit ending in a written diagnosis with a sequenced fix list. You own the document.

Build & Ship

Two-week sprints with a Friday demo. Production-grade tests and observability shipped from sprint one.

Operate or Hand-off

We can run it for you, or hand the keys to your team with full documentation when ready.

No churn theatre

Boring tech where boring works, new tech where new genuinely wins. We don't bill for slide decks.

01 · audit

02 · design

03 · ship

Engineering depth

RAG that refuses to hallucinate.

Our pattern: cite, verify, refuse. Every claim ships with a chunk-ID. Answers without retrieval support are rejected at the streaming boundary, not surfaced to the user.

# RAG with citation enforcement
async def answer(query: str) -> Answer:
    chunks = await retrieve(query, k=12, rerank="cohere-v3")
    if len(chunks) < 3:
        return Answer.refuse("insufficient grounding")

    model = router.choose(query, budget="basic")  # claude / gpt-4o / gemini
    stream = await model.stream(prompt(query, chunks))

    async for token in stream:
        if citation_violated(token, chunks):
            return Answer.abort("hallucination detected")
        yield token

live evals (last 1k queries)

langfuse

Citation accuracy 98.7%

Refusal precision 94.2%

Cost vs. single-model −67%

p95 streaming latency 1.2s

Built & running

Three flagship products.

Sunja's own products — the proof that we build and operate AI-native software at scale, not just consult on it.

Deep Mentor

deepmentor.co

live 10K+ users

AI tutor for India's competitive exams. Citation-grounded answers in Hindi, English, and Hinglish — used by 10,000+ aspirants.

UPSCVoice-first

Sunja Learning

sunja.ai

live 30+ countries

K-12 textbook chapters reimagined as cinematic 3D worlds, with an embedded AI tutor that grades open-ended answers. Used by learners across 30+ countries.

K-12Interactive 3D

briefly · news live feed

auto-published conf 92

editor review conf 64

simhash dedup · 27 sources

Briefly

News platform

live v2 in build

An AI-powered news platform that ingests, deduplicates, and editorially grades news with a confidence-scored publishing gate.

NewsAI editorial

Capabilities

The stack we know cold.

Every line below is something we've shipped to production users — not a list copied from a job board.

AI / RAG systems

Citation-enforced RAG. Multi-model routing across Claude, GPT-4, and Gemini. Semantic caching over Postgres + pgvector. Continuous evals on Langfuse that gate every prompt change in CI.

Refusal-first generation: no chunk, no answer
Multi-model routing — pick the cheapest model that answers
Eval datasets in CI; no model swap without a regression check
Voice-first dictation: Hindi · English · Hinglish

stack retrieval-augmented generation

Claude GPT-4 / 4o Gemini pgvector Cohere Rerank Langfuse LlamaParse SimHash dedup

How we engage

Three shapes.
No theatre.

We're a small senior team. Pick the shape that fits the work. Each option has a clear scope, deliverables, and exit criteria — no hourly billing surprises, no slide decks billed for.

Talk to us

Strike team

2–4 senior engineers, full ownership of a product surface, daily standups with your team.

Fixed-scope project

Clearly bounded deliverable — RAG pipeline, mobile app, infra rebuild — for a fixed fee and timeline.

Fractional CTO

Founder-level architecture, hiring loops, and unblocking — without the full-time hire.

Two-week audit

A paid diagnostic with a sequenced fix list. You own the document either way.

Selected work

Six engagements.
Real users, real outcomes.

We don't ship demos. Every project below went to production with paying users, regulated workloads, or both. Client names withheld under NDA.

UK fintech senior contract · ongoing

UK Fintech Cash-Flow Platform

Backend architect and database owner for an SMB cash-flow platform — bank aggregation via Open Banking, transaction reconciliation, and forecasting with audit trails. Currently shipping a new accounting-software integration and analytics enhancements, at startup speed via an AI-assisted coding workflow.

Node + TSMySQLMongoDBJWT · OWASPMoneyhubSentry

via creative agency pair

HNW Members Portal

Migrated a founders-network members directory into a hardened, audit-trailed portal with EU data residency for GDPR compliance.

SupabaseEU residency

via creative agency pair

Three-System Sync Engine

A three-system integration for a UK field-services operator — keeps Salesforce, JobLogic, and a workforce-locations system in sync without duplication or loops.

SalesforceJobLogic

via ecom consultancy featured

fractional CTO

Edge Personalization Platform

An A/B testing and personalization platform built on Cloudflare's edge — sub-50ms responses globally, capacity-planned for India banking-scale traffic during major event campaigns.

CloudflareEdgeChrome Extn

<50ms

p95 latency

10K

RPS load-tested

300+

edge POPs

17 KB

SDK gzip

via ecom consultancy fractional CTO

Multi-Channel Profit-Ops

Real-time profit dashboards for multi-channel sellers — unifies Shopify, Google Ads, and Meta Ads into one view that updates in seconds, not hours.

GCPBigQuery

hiring tooling CTF

IoT Sensor Hiring CTF

A 2-hour capture-the-flag assessment for junior full-stack engineers — replaced an IoT startup's generic LeetCode loop with something that actually predicts on-the-job ability.

Hiring tooling2 hr · 5 flags

What you get

Senior team.
Sane rates.

Most of what's billed in this market is junior labour and slide decks. We charge for senior engineers shipping production code. No padding, no bench, no farmed-out subcontractors.

starting at

2 weeks

for a paid audit + sequenced fix list

no commitment

Two co-founders engaged personally on every project
Weekly Friday demos and a written status note
You own the IP, code, infra, and runbooks at exit
NDA-bound — your client list and code stay yours
Tests, observability, and CI from sprint one — never bolted on later

Book the audit

Founders

Two co-founders.
Senior from day one.

BIT Mesra computer scientists with a combined 15+ years across IoT, mobile, ML, real-time data, and AI platforms.

Shashank Mohan

Co-Founder & Head of Product · 8+ yrs

Mobile and IoT engineer with eight years of taking systems from prototype to production scale — most recently as AI Engineer at TVS Motor, earlier at TagBox (acquired by TVS, National Startup Award 2021). At Sunja, leads product across iOS and Android — from concept and brand through native engineering.

iOS · SceneKitAndroid · KotlinKafka StreamsML / FLIR

in/shashankm82 shashank@sunja.ai

Shivam Srivastava

Co-Founder & Engineering Director · 7+ yrs

Platform engineer with seven years of cloud-native and AI work across TVS Motor and TagBox (acquired by TVS, National Startup Award 2021). At Sunja, leads engineering on Deep Mentor's RAG, multi-model AI routing, and the platforms behind it. Also serves as fractional CTO for an e-commerce optimization consultancy.

RAG / multi-modelVert.x · FastAPINext.js 15Kubernetes

in/shivam31093 shivam@sunja.ai

Modern engineering.
Built for the AI era.

We reply to every inbound within one business day. If we're not the right fit, we'll point you to a team that is.

Email us Founder direct

Currently booking Q3 2026 · Bengaluru, working globally

AI-Native Products,Shipped at Startup Speed.