Booking engagements for Q3 2026

AI-Native Products,
Shipped at Startup Speed.

An engineering team for ambitious AI-era companies. We've shipped products to 30+ countries and run engagements for fintech, agencies, and consultancies in the UK, EU, and India.

Domains we've shipped in

UK fintech · Founders networks · Field-service ops · Edge personalization · Multi-channel ecom · Connected vehicles · EdTech & UPSC · 3D K-12 learning · IoT & edge ML · AI newsroom

Client names withheld under NDA · references on request

Grounded AI & RAG

Citation-enforced retrieval, multi-model routing, and continuous evals on Langfuse — so the model stops being a guess.

Real-time backends

Kafka Streams, Postgres + pgvector, Cloudflare Workers, AWS SST. Architectures that survive the IPL match-day spike, not just the demo.

Multimodal frontends

Three.js explainers, voice-first UIs in Hindi/English/Hinglish, Next.js 15 with strict TypeScript and design that doesn't look generated.

Prototype to production

Idea to revenue.
End-to-end.

We don't hand off a half-finished prototype. Every engagement ends in production code, observability, and a runbook your team can operate. Founder-tier engineers from day one — no juniors hidden behind the brand.

Audit & Architecture

Two-week paid audit ending in a written diagnosis with a sequenced fix list. You own the document.

Build & Ship

Two-week sprints with a Friday demo. Production-grade tests and observability shipped from sprint one.

Operate or Hand-off

We can run it for you, or hand the keys to your team with full documentation when ready.

No churn theatre

Boring tech where boring works, new tech where new genuinely wins. We don't bill for slide decks.

01 · audit
02 · design
03 · ship

Engineering depth

RAG that refuses to hallucinate.

Our pattern: cite, verify, refuse. Every claim ships with a chunk-ID. Answers without retrieval support are rejected at the streaming boundary, not surfaced to the user.

# RAG with citation enforcement
# retrieve, router, prompt, citation_violated, and Answer are defined elsewhere.
from typing import AsyncIterator, Union

async def answer(query: str) -> AsyncIterator[Union[str, Answer]]:
    chunks = await retrieve(query, k=12, rerank="cohere-v3")
    if len(chunks) < 3:
        # An async generator cannot return a value, so yield the refusal and stop.
        yield Answer.refuse("insufficient grounding")
        return

    model = router.choose(query, budget="basic")  # claude / gpt-4o / gemini
    stream = await model.stream(prompt(query, chunks))

    async for token in stream:
        if citation_violated(token, chunks):
            yield Answer.abort("hallucination detected")
            return
        yield token

Live evals (last 1k queries, Langfuse)
Citation accuracy: 98.7%
Refusal precision: 94.2%
Cost vs. single-model: −67%
p95 streaming latency: 1.2s
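One way a check like `citation_violated` could work, as a minimal sketch rather than the production implementation: buffer streamed tokens into sentences and require every finished sentence to carry a chunk-ID marker that points at a retrieved chunk. The `[c:ID]` marker format and the `CitationGuard` class are assumptions made for illustration.

```python
import re

# Hypothetical marker format: the model is prompted to tag claims like "[c:42]".
CITATION = re.compile(r"\[c:(\w+)\]")
SENTENCE_END = re.compile(r"[.!?]\s*$")


class CitationGuard:
    """Buffers streamed tokens and checks each finished sentence for a valid citation."""

    def __init__(self, chunk_ids):
        self.chunk_ids = set(chunk_ids)
        self.buffer = ""

    def violated(self, token: str) -> bool:
        self.buffer += token
        if not SENTENCE_END.search(self.buffer):
            return False  # sentence still in progress
        sentence, self.buffer = self.buffer, ""
        cited = CITATION.findall(sentence)
        # Violation: no citation at all, or a citation to a chunk we never retrieved.
        return not cited or any(c not in self.chunk_ids for c in cited)
```

The naive sentence boundary here would misfire on decimals and abbreviations; a real splitter would be needed before relying on it.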

Built & running

Three flagship products.

Sunja's own products — the proof that we build and operate AI-native software at scale, not just consult on it.

Deep Mentor: AI tutor for UPSC prep

deepmentor.co
live · 10K+ users

Citation-enforced RAG over 10K+ exam questions. Multi-model routing across Claude / GPT-4 / Gemini cuts AI cost 67%. Voice-first TutorPortal in Hindi, English, and Hinglish.

Next.js 15, FastAPI, pgvector

Sunja Learning: interactive 3D K-12 platform

sunja.ai
live · 30+ countries

K-12 textbook chapters as cinematic Three.js worlds — electromagnetism, mitosis, planetary motion — with an embedded AI tutor that grades open-ended answers and resolves doubts mid-lesson.

Three.js, SceneKit (iOS), Asset stream

briefly · news live feed
auto-published conf 92
editor review conf 64
simhash dedup · 27 sources

Briefly

News platform
live · v2 in build

Six-stage LLM pipeline ingests, deduplicates, and editorially grades news. Three-engine adaptive crawler (Static → Playwright → Stealth) with per-domain learned profiles. Confidence-scored auto-publish gate.
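For readers unfamiliar with SimHash-based deduplication, here is a minimal sketch of the general technique. It is not Briefly's pipeline code: the word 3-gram features, the 64-bit width, and the Hamming threshold of 3 are all assumptions chosen for illustration.

```python
import hashlib


def simhash(text: str, bits: int = 64) -> int:
    """Fingerprint where similar texts differ in only a few bit positions."""
    weights = [0] * bits
    words = text.lower().split()
    # Use word 3-grams (shingles) as features; short texts yield a single shingle.
    shingles = [" ".join(words[i:i + 3]) for i in range(max(1, len(words) - 2))]
    for shingle in shingles:
        digest = hashlib.blake2b(shingle.encode("utf-8"), digest_size=bits // 8).digest()
        h = int.from_bytes(digest, "big")
        for bit in range(bits):
            # Each feature votes +1 or -1 on every bit position.
            weights[bit] += 1 if (h >> bit) & 1 else -1
    return sum(1 << bit for bit in range(bits) if weights[bit] > 0)


def hamming(a: int, b: int) -> int:
    """Number of bit positions where two fingerprints differ."""
    return bin(a ^ b).count("1")


def is_duplicate(a: str, b: str, max_distance: int = 3) -> bool:
    return hamming(simhash(a), simhash(b)) <= max_distance
```

Two articles count as near-duplicates when their fingerprints differ in at most `max_distance` bits, so the comparison stays cheap even across many sources.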

Kotlin · Vert.x, Koog, SimHash

Capabilities

The stack we know cold.

Every line below is something we've shipped to production users — not a list copied from a job board.

AI / RAG systems

Citation-enforced RAG. Multi-model routing across Claude, GPT-4, and Gemini. Semantic caching over Postgres + pgvector. Continuous evals on Langfuse that gate every prompt change in CI.

  • Refusal-first generation: no chunk, no answer
  • Multi-model routing — pick the cheapest model that answers
  • Eval datasets in CI; no model swap without a regression check
  • Voice-first dictation: Hindi · English · Hinglish
Stack (retrieval-augmented generation): Claude, GPT-4 / 4o, Gemini, pgvector, Cohere Rerank, Langfuse, LlamaParse, SimHash dedup
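The "cheapest model that answers" idea can be sketched roughly as follows. This is an illustration under stated assumptions, not our router: the `Model` dataclass, the cost figures, and the `accept` callback (for example, a citation check) are hypothetical names introduced here.

```python
from dataclasses import dataclass
from typing import Callable, Optional


@dataclass
class Model:
    name: str
    cost_per_1k_tokens: float        # illustrative numbers, not real pricing
    generate: Callable[[str], str]   # stand-in for a provider SDK call


def route(prompt: str, models: list[Model],
          accept: Callable[[str], bool]) -> Optional[tuple[str, str]]:
    """Try models from cheapest to priciest; return (name, answer) for the first accepted answer."""
    for model in sorted(models, key=lambda m: m.cost_per_1k_tokens):
        answer = model.generate(prompt)
        if accept(answer):
            return model.name, answer
    return None  # nothing passed the check: refuse rather than guess
```

Escalating only on rejection means the expensive model is paid for only when the cheaper one fails the acceptance check.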

How we engage

Four shapes.
No theatre.

We're a small senior team. Pick the shape that fits the work. Each option has a clear scope, deliverables, and exit criteria: no surprise hourly bills and no charges for slide decks.

01

Strike team

2–4 senior engineers, full ownership of a product surface, daily standups with your team.

02

Fixed-scope project

Clearly bounded deliverable — RAG pipeline, mobile app, infra rebuild — for a fixed fee and timeline.

03

Fractional CTO

Founder-level architecture, hiring loops, and unblocking — without the full-time hire.

04

Two-week audit

A paid diagnostic with a sequenced fix list. You own the document either way.

Selected work

Six engagements.
Real users, real outcomes.

We don't ship demos. Every project below went to production with paying users, regulated workloads, or both. Client names withheld under NDA.

UK fintech featured deep-dive
senior contract

UK Fintech
Cash-Flow Platform

An SMB cash-flow forecasting platform — bank aggregation, transaction analytics, runway calculation, variance analysis. We led the architecture through three iterations: JavaScript Express, TypeScript port, then a polyglot v2 splitting hot paths into SST-on-AWS serverless functions and a Python analytics engine triggered by SQS after every bank sync.

UK Open Banking via Moneyhub. MySQL schema entirely under Liquibase with branch-environment alignment. LLM transaction-name cleaning evaluated on Langfuse. Five test suites with a "critical" pre-commit gate.
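As a rough illustration of the runway and variance arithmetic mentioned above, here is a sketch using the common definitions (cash divided by average net burn, and signed deviation from forecast). The function names and the choice of a simple average are assumptions, not the platform's actual analytics engine.

```python
def runway_months(cash_balance: float, monthly_net_flows: list[float]) -> float:
    """Months of cash left at the average net burn; infinite if cash-flow positive."""
    if not monthly_net_flows:
        raise ValueError("need at least one month of net cash flow")
    avg_net = sum(monthly_net_flows) / len(monthly_net_flows)
    if avg_net >= 0:
        return float("inf")
    return cash_balance / -avg_net


def variance(forecast: float, actual: float) -> float:
    """Signed variance as a fraction of forecast (positive means actual exceeded forecast)."""
    if forecast == 0:
        raise ValueError("forecast must be non-zero")
    return (actual - forecast) / abs(forecast)
```

For instance, a balance of 60,000 with net flows of -10,000, -20,000, and -30,000 averages a 20,000 monthly burn.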

Node + TS, Python (Poetry), SST on AWS, MySQL · Liquibase, Drizzle, Moneyhub, Langfuse
backend rewrites · 5 service repos · 30 Liquibase tables · OB UK Open Banking

via creative agency pair

HNW Members Portal

Migrated a Sheets-backed founders-network portal to a hardened Supabase stack hosted in EU Frankfurt. 19 SQL migrations covering OWASP hardening, performance indexes, and audit sessions; PII gated behind a server-side Edge Function.

Supabase, RLS, Resend

via creative agency pair

Three-System Sync Engine

Salesforce ↔ JobLogic ↔ workforce-locations integration on Hono + Deno Edge Functions. Content-hash idempotency, source-destination loop detection, partial-unique indexes for one-primary-per-case.
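Content-hash idempotency and loop detection can be sketched as below. This is a simplified in-memory illustration in Python, not the Hono/Deno code: the `SyncGate` class and its method names are hypothetical, and a real deployment would persist the hashes in a database.

```python
import hashlib
import json


def content_hash(record: dict) -> str:
    """Stable hash of a record's payload; key order does not affect the result."""
    canonical = json.dumps(record, sort_keys=True, separators=(",", ":"))
    return hashlib.sha256(canonical.encode("utf-8")).hexdigest()


class SyncGate:
    """Decides whether a change should be forwarded to a destination system."""

    def __init__(self):
        # (system, record id) -> hash of the last payload we wrote to that system
        self.last_written = {}

    def should_forward(self, source: str, destination: str,
                       record_id: str, record: dict) -> bool:
        digest = content_hash(record)
        # Loop detection: the source is echoing back exactly what we last wrote to it.
        if self.last_written.get((source, record_id)) == digest:
            return False
        # Idempotency: the destination already holds this exact content.
        if self.last_written.get((destination, record_id)) == digest:
            return False
        self.last_written[(destination, record_id)] = digest
        return True
```

A retried webhook and an echoed update both hash to content already seen, so neither triggers another write.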

Hono · Deno, Drizzle, Fly.io

via ecom consultancy featured
fractional CTO

Edge Personalization Platform

A self-hosted GrowthBook alternative on four Cloudflare Workers across 300+ edge POPs. Three-tier cache (edge / KV / R2) keeps p95 under 50ms. Capacity-planned for IPL match-day push campaigns delivering 8–10K peak RPS.
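The three-tier lookup can be pictured with a small read-through sketch. Plain dicts stand in for the edge cache, KV, and R2 here, and the `TieredCache` class is a hypothetical name; TTLs and invalidation are left out to keep the idea visible.

```python
class TieredCache:
    """Read-through lookup across tiers ordered fastest to slowest."""

    def __init__(self, *tiers: dict):
        self.tiers = tiers  # e.g. edge cache, KV, R2; plain dicts stand in here

    def get(self, key: str):
        for depth, tier in enumerate(self.tiers):
            if key in tier:
                value = tier[key]
                # Backfill the faster tiers so the next read stops earlier.
                for faster in self.tiers[:depth]:
                    faster[key] = value
                return value, depth
        return None, -1  # miss in every tier
```

After the first read from the slowest tier, repeat reads are served from the fastest one, which is what keeps tail latency low under a traffic spike.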

CF Workers, KV + R2 + D1, Cloud Run, React 18, Chrome Extn
<50ms p95 latency · 10K RPS load-tested · 300+ edge POPs · 17 KB SDK gzip

via ecom consultancy fractional CTO

Multi-Channel Profit-Ops

Real-time profit ops for multi-channel sellers. 44+ Cloud Functions feeding ETL into BigQuery, Terraform IaC, Cloud Armor DDoS protection. Profit dashboards that update in seconds.

GCP, BigQuery, Terraform

hiring tooling CTF

IoT Sensor Hiring CTF

Capture-the-flag assessment for junior full-stack engineers — IoT sensor dashboard with planted bugs, a stubbed AI route, 5 flags. Comes with rubric and confidential evaluator guide. Replaced their generic LeetCode loop entirely.

React, Node, PostgreSQL

What you get

Senior team.
Sane rates.

Most of what's billed in this market is junior labour and slide decks. We charge for senior engineers shipping production code. No padding, no bench, no farmed-out subcontractors.

Starting at 2 weeks for a paid audit + sequenced fix list. No commitment.
  • Two co-founders engaged personally on every project
  • Weekly Friday demos and a written status note
  • You own the IP, code, infra, and runbooks at exit
  • NDA-bound — your client list and code stay yours
  • Tests, observability, and CI from sprint one — never bolted on later

Founders

Two co-founders.
Senior from day one.

BIT Mesra computer scientists with a combined 15+ years across IoT, mobile, ML, real-time data, and AI platforms.

Shashank Mohan

Co-Founder & Head of Product · 8+ yrs

Mobile and IoT engineer with eight years of taking systems from prototype to production scale — most recently as AI Engineer at TVS Motor, earlier at TagBox (acquired by TVS, National Startup Award 2021). At Sunja, leads product across iOS and Android — from concept and brand through native engineering.

iOS · SceneKit, Android · Kotlin, Kafka Streams, ML / FLIR

Shivam Srivastava

Co-Founder & Engineering Director · 7+ yrs

Platform engineer with seven years of cloud-native and AI work across TVS Motor and TagBox (acquired by TVS, National Startup Award 2021). At Sunja, leads engineering on Deep Mentor's RAG, multi-model AI routing, and the platforms behind it. Also serves as fractional CTO for an e-commerce optimization consultancy.

RAG / multi-model, Vert.x · FastAPI, Next.js 15, Kubernetes

Modern engineering.
Built for the AI era.

We reply to every inbound within one business day. If we're not the right fit, we'll point you to a team that is.

Currently booking Q3 2026 · Bengaluru, working globally