PORTFOLIO — 2026 · KITCHENER, ONTARIO, CA

Lead Data Scientist & Applied AI Architect — building production AI where hallucinations carry legal consequences.

2,000+

USERS

30K+

DAILY QUERIES

$10M+

IMPACT

7+ YRS

DEPTH


I think in systems,
not notebooks.

Three things that separate a production AI architect from someone who can run a notebook.

01

Zero-to-One GenAI Architecture

I don't just prompt-engineer; I build deterministic, multi-agent microservices from scratch. When off-the-shelf tools like LangChain fail at scale, I design custom orchestration layers on top of PydanticAI and FastMCP that hold up in production.

02

High-Performance System Optimization

I bridge the gap between Data Science and Data Engineering. I optimize vector indices (Azure Cosmos DB IVF) to cut query latencies from minutes to seconds, avoiding hundreds of thousands of dollars in cloud scale-out costs.

03

Measurable Enterprise Impact

My architectures don't just live in notebooks. I've directed teams of 10–14 engineers to deploy systems that drive $10M+ in operational efficiency and process tens of thousands of queries daily across 4 global regions.

TECHNICAL PHILOSOPHY

Moving a GenAI prototype to a regulated production environment exposes the limits of wrapper libraries. Within 30 days of running standard RAG on 100+ page insurance documents, I diagnosed catastrophic context collapse and citation failure. The fix wasn't a bigger context window. It was a custom runtime schema transpilation layer and a hierarchical node retrieval engine, both built from scratch.

“The best GenAI architecture is the one that can provably tell you exactly why it gave the answer it did, every single time.”

FEATURED WORK

What I've built.

Production systems. Published research. Shipped products.

PRODUCTION · NDA

CoverAI — Zero-Hallucination Retrieval Engine

Lead Data Scientist · 2024–Present · Confidential Employer

Problem. Off-the-shelf LangChain RAG failed catastrophically on 700+ page insurance policies — hallucinating citations with legal consequences.

Build. Custom hierarchical JSON-tree retrieval with runtime schema transpilation. Deterministic citation validation streams output character-by-character. New carriers onboard via config — zero code changes.
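A minimal sketch of deterministic citation checking over a hierarchical tree, not the production code (which is under NDA). The node layout, the `[[node-id]]` citation marker, and every function name here are assumptions made for illustration. The sketch validates per streamed chunk rather than per character, and it only checks that a cited node exists.

```python
import re
from typing import Dict, Iterable, Iterator, Optional

# Hypothetical policy tree: each node has an id, its text, and child nodes.
POLICY = {
    "id": "policy",
    "text": "",
    "children": [
        {"id": "sec-4", "text": "Section 4: Exclusions", "children": [
            {"id": "sec-4.2", "text": "Flood damage is excluded unless endorsed.",
             "children": []},
        ]},
    ],
}

CITATION = re.compile(r"\[\[([^\[\]]+)\]\]")


def index_nodes(node: dict, index: Optional[Dict[str, str]] = None) -> Dict[str, str]:
    """Flatten the tree into {node_id: text} so citation lookups are O(1)."""
    index = {} if index is None else index
    index[node["id"]] = node["text"]
    for child in node["children"]:
        index_nodes(child, index)
    return index


def stream_validated(chunks: Iterable[str], index: Dict[str, str]) -> Iterator[str]:
    """Yield streamed text, holding back any partial [[citation]] until it is
    complete, and raising if a citation names a node that is not in the tree."""
    buffer = ""
    for chunk in chunks:
        buffer += chunk
        cut = len(buffer)
        open_at = buffer.rfind("[[")
        if open_at != -1 and "]]" not in buffer[open_at:]:
            cut = open_at              # unclosed marker: keep it buffered
        elif buffer.endswith("["):
            cut = len(buffer) - 1      # might be the start of a marker
        ready, buffer = buffer[:cut], buffer[cut:]
        for match in CITATION.finditer(ready):
            if match.group(1) not in index:
                raise ValueError(f"unknown citation: {match.group(1)}")
        if ready:
            yield ready
    if buffer:
        raise ValueError(f"unterminated citation: {buffer!r}")
```

Because the check is a dictionary lookup rather than a model call, the same output always passes or fails the same way, which is the property the "deterministic" claim depends on.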

PydanticAI · FastMCP · Hierarchical Node Retrieval · Cosmos DB (IVF) · Real-Time Citation Validation · Azure OpenAI / GPT-4o

10×

Latency

<30s

E2E Response

$500K+

Cloud Saved

100%

Citations Valid

PRODUCTION

Autonomous Multi-Line Claims Routing

XGBoost pipeline for claim triage. SHAP for 100% regulatory audit trail. 45% cost reduction · $200K annual savings · 35% faster settlements.

XGBoost · SHAP · Django · RabbitMQ

INTERNAL · PRE-LLM

PriML — Natural Language → SQL

Team lead of 10. Fine-tuned Rat-SQL transformer. NL query → SQL → Plotly dashboard, self-serve. 87% accuracy on complex multi-table JOINs, before general-purpose LLMs were widely available.

Rat-SQL · Fine-tuning · Plotly · Postgres

PRODUCTION · NDA

FNOL Classification Agent System

3-service microarchitecture (FastAPI + FastMCP + Azure Service Bus). PydanticAI agent generates type-safe models from per-tenant schemas at runtime. 95% alignment · 2% hallucination.

PydanticAI · FastMCP · Multi-Tenant
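A rough sketch of building a typed model from a per-tenant schema at runtime. It uses the standard library's `dataclasses.make_dataclass` as a stand-in so it runs without dependencies; Pydantic's `create_model` plays a similar role. The schema format, `TYPE_MAP`, and `build_model` are hypothetical and are not taken from the production system.

```python
from dataclasses import fields, make_dataclass
from typing import Dict

# Assumed mapping from schema type names to Python types.
TYPE_MAP = {"string": str, "integer": int, "number": float, "boolean": bool}


def _check_types(self) -> None:
    """Runs after __init__ and rejects values that do not match the schema."""
    for field in fields(self):
        value = getattr(self, field.name)
        if not isinstance(value, field.type):
            raise TypeError(
                f"{field.name}: expected {field.type.__name__}, "
                f"got {type(value).__name__}"
            )


def build_model(tenant: str, schema: Dict[str, str]) -> type:
    """Create a frozen, type-checked record class for one tenant's schema."""
    unknown = sorted(set(schema.values()) - set(TYPE_MAP))
    if unknown:
        raise ValueError(f"unsupported schema types: {unknown}")
    spec = [(name, TYPE_MAP[kind]) for name, kind in schema.items()]
    return make_dataclass(
        f"{tenant.capitalize()}Claim",
        spec,
        namespace={"__post_init__": _check_types},
        frozen=True,
    )


# Usage: a new tenant is a new schema dict, not a new class in the codebase.
AcmeClaim = build_model("acme", {"claim_id": "string", "loss_amount": "number"})
claim = AcmeClaim(claim_id="C-1", loss_amount=1200.0)
```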

PUBLISHED · IEEE

Selective EEG Anonymization

Multi-objective autoencoders for Brain-Computer Interfaces. Selective anonymization preserves clinical signal while eliminating re-identification. PST 2023, Copenhagen.

View on IEEE Xplore ↗

LIVE · OPEN SOURCE · DOCUMENTED

Work you can click.

My production work is under NDA. So I ship the same architecture in the open — running systems, readable source, and the build notes behind each one.

LIVE · OPEN SOURCE

HireOS — Agentic Job Application OS

hireos.girijesh.ca · Multi-LLM pipeline · Deployed on Fly.io

HireOS dashboard — pipeline stats, action items and AI recommendations

Problem. A serious application is 90 minutes of work — assess, tailor, beat the ATS, prep stories, follow up. Trackers give you a Kanban board and let you do all the thinking.

Build. Three-pass resume pipeline where the critic runs on a different model than the writer, because self-critique is theatre. Per-task LLM routing. ChromaDB memory that surfaces the gaps that recur across evaluations.
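A small sketch of per-task routing with a writer and a critic on different models. The three passes are assumed here to be draft, critique, and revise; the task names, prompts, and function names are illustrative, and the models are plain callables so the example runs without any LLM SDK.

```python
from typing import Callable, Dict

LLM = Callable[[str], str]  # any function that maps a prompt to a completion


def make_router(models: Dict[str, LLM], routes: Dict[str, str]) -> Callable[[str, str], str]:
    """Return run(task, prompt), refusing configs where the critic is the writer."""
    if routes["critique"] == routes["write"]:
        raise ValueError("critic must run on a different model than the writer")

    def run(task: str, prompt: str) -> str:
        return models[routes[task]](prompt)

    return run


def three_pass(run: Callable[[str, str], str], resume: str, job: str) -> str:
    """Draft, critique on a second model, then revise using the critique."""
    draft = run("write", f"Tailor this resume to the job.\nJOB:\n{job}\nRESUME:\n{resume}")
    notes = run("critique", f"List weaknesses of this draft for the job.\nJOB:\n{job}\nDRAFT:\n{draft}")
    return run("revise", f"Revise the draft to address the notes.\nDRAFT:\n{draft}\nNOTES:\n{notes}")
```

Putting the writer/critic check in the router turns the design rule into a startup error instead of a convention someone has to remember.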

FastAPI · Multi-LLM Router · ChromaDB · Playwright · React · Fly.io

OPEN THE APP ↗ SOURCE ↗ BUILD NOTES →

LIVE ENDPOINT · MCP SERVER

hireos-mcp — Making an App Agent-Native

20+ MCP tools · Multi-tenant OAuth 2.0 · Dynamic client registration

I didn't build a chat box into HireOS — I exposed it as tools and let Claude be the interface. It runs as a claude.ai custom connector: any HireOS user connects their own account via OAuth, no credentials in a config file.

The interesting problem: resume generation takes 60s, and MCP is request/response. There's no native async job pattern — so the tool returns a hint the model can act on.

Claude invoking the hireos_list_jobs tool

Model Context Protocol · TypeScript · OAuth 2.0 + DCR · Fly.io

SOURCE ↗ BUILD NOTES →

OPEN SOURCE

YOU ARE HERE

This Portfolio

24,000 particles, one draw call, zero build step. Every chapter is the same buffer morphing to a new target. The text is rasterized on a 2D canvas and sampled into a point cloud.
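The sampling idea can be sketched outside the browser. This is an illustration in Python rather than the site's three.js code: a list of strings stands in for the rasterized canvas, and `sample_points` and `morph` are hypothetical names. A fixed particle count is drawn from whichever pixels are lit, so every target shape reuses the same buffer size.

```python
import random
from typing import List, Tuple

Point = Tuple[float, float]


def sample_points(bitmap: List[str], count: int, seed: int = 0) -> List[Point]:
    """Pick `count` jittered points from the lit cells ('#') of a bitmap."""
    lit = [(x, y) for y, row in enumerate(bitmap)
           for x, cell in enumerate(row) if cell == "#"]
    if not lit:
        raise ValueError("bitmap has no lit pixels")
    rng = random.Random(seed)
    points = []
    for _ in range(count):
        x, y = rng.choice(lit)  # cells repeat when count > len(lit)
        points.append((x + rng.random(), y + rng.random()))
    return points


def morph(source: List[Point], target: List[Point], t: float) -> List[Point]:
    """Linearly interpolate each particle from its source to its target."""
    return [(sx + (tx - sx) * t, sy + (ty - sy) * t)
            for (sx, sy), (tx, ty) in zip(source, target)]


# Two tiny "rasterized" shapes sharing one particle count.
BAR = ["#####", ".....", "....."]
DOT = [".....", "..#..", "....."]
start = sample_points(BAR, 200, seed=1)
end = sample_points(DOT, 200, seed=2)
halfway = morph(start, end, 0.5)
```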

three.js · WebGL · Canvas 2D · One HTML file

GitHub ↗ Build notes →

BROWSER EXTENSION

AI Conversation Exporter

Export ChatGPT, Claude and Gemini chats as TXT, Markdown, JSON or HTML. Two permissions, zero network requests. Privacy you can verify from the manifest, not the policy.

Manifest V3 · Chrome + Firefox · Local-first

GitHub ↗ Build notes →

DEVELOPER TOOL

Mermaid Diagram Creator

Render Mermaid diagrams locally and export sharp PNGs at any resolution — SVG rasterized to a canvas at an arbitrary scale factor. 7KB, one file, no dependency graph.

Mermaid.js · Canvas API · Single file

GitHub ↗ Build notes →

EXPERIMENT

DirectorAI

Browser-native video editor. Chat-first interface over FFmpeg.wasm — edits compile and execute entirely client-side. No uploads, no backend, no upload limits.

React · FFmpeg.wasm · Vite

GitHub ↗

WRITING

Five build notes — one per system. Not tutorials: the decision I'd actually defend in an interview. Why the resume critic must be a different model. Why a tool's error string is really a prompt. Why 24,000 particles cost the same as one.

READ THE BUILD NOTES →

BY THE NUMBERS

Real impact at scale.

Every number is earned, not estimated from a demo.

2,000+

Active Users

US · UK · AU · EU

30K+

Daily AI Queries

Multi-carrier · Multi-tenant

10×

Latency Reduction

Minutes → under 30 seconds

$10M+

Annual Savings

In adjuster time

4

Global Regions

US · UK · AU · EU

6×

Throughput Gain

Same hardware

TECHNICAL ARSENAL

AI Specializations

Machine Learning · Deep Learning · Generative AI · Agentic AI · Large Language Models · Computer Vision · Transformers · Explainable AI (SHAP)

LLM Orchestration

FastMCP · PydanticAI · Hierarchical Node Retrieval · Citation Validation · Azure OpenAI / GPT-4o · Cohere Reranking · Embedding Models

Core ML / AI

PyTorch · Transformers · XGBoost · SHAP · Vision OCR · NLP / Fine-tuning · Prophet · Scikit-learn · Plotly · Matplotlib · Seaborn

Languages

Python · SQL

Data & Cloud

PySpark · Microsoft Azure (Cosmos DB IVF · Service Bus) · Google Cloud Platform · AWS (Lambda · SageMaker · S3) · RabbitMQ · PostgreSQL · MongoDB · Docker

Delivery

FastAPI · OpenTelemetry · Arize Phoenix · Django · Flask · Adobe PDF Services · Team Leadership (10–14)

SIX YEARS · US · UK · AU · EU

Four regions.
One architecture.

CAREER TIMELINE

Jan 2024 — Present

Lead Data Scientist

Primus Software Corporation · Waterloo, ON

Led cross-functional team of 10–14. Scaled enterprise AI to 2,000+ users globally. Resolved two production crises. Built zero-code carrier onboarding. Promoted Senior → Lead in 12 months.

Jan 2023 — Jan 2024

Senior Data Scientist

Primus Software Corporation · Waterloo, ON

Diagnosed LangChain's fundamental limits on multi-document policies. Designed hierarchical RAG architecture solo in 3 months. Latency crisis: 2.5 min → 40 sec.

Sep 2021 — Apr 2023

M.Sc. Computer Science

Lakehead University · Thunder Bay, ON

Project-based master's degree, supervised by Dr. Garima Bajwa. Published privacy-preserving ML research at PST 2023, Copenhagen. Continued AI development at Primus concurrently.

May — Aug 2022

Data Science Intern

Ciena · Ottawa, ON

PySpark pipelines, divisive clustering, and manufacturing batch anomaly detection.

Jun 2018 — Dec 2022

ML Engineer → Senior ML Engineer

Primus · Noida, India → Canada (2021)

Built FNOL classification for Crawford & Company solo. Led PriML NL-to-SQL project (team of 10). CTO recognition + bonus. Six years of insurance domain expertise starts here.

PUBLICATIONS

PEER-REVIEWED · IEEE

Selective EEG Signal Anonymization using Multi-Objective Autoencoders

PST 2023 · Copenhagen, Denmark

Autoencoder architectures for securing biological telemetry — preserving clinical signal while eliminating re-identification vectors. Supervised by Dr. Garima Bajwa.

View on IEEE Xplore ↗

PEER-REVIEWED · SPRINGER

In-Memory Computation for Real-time Face Recognition

ICICT 2019 · Springer

Optimized edge-compute inference for computer vision on resource-constrained hardware. In-memory strategies significantly reduce latency for real-time face recognition.

View on Springer ↗

LEADERSHIP & COMMUNICATION

Data scientists who can communicate build better systems. The evidence:

Toastmasters International

Competitive public speaking that directly informs how I present technical findings to non-technical stakeholders — executives, clients, and insurance carriers.

7× Best Impromptu · 4× Best Evaluator · 3× Best Prepared Speech

Cross-Functional Team Lead

Ran day-to-day technical and delivery decisions for a team of 10–14. Direct stakeholder requirement gathering, refinement, and brainstorming. Second-most senior on the team.

10–14 person team · Multi-region delivery · Client-facing ownership

OPEN TO LEAD & STAFF DS ROLES · KITCHENER, ONTARIO OR REMOTE

Big Tech, AI-native, or Enterprise AI. If your engineering bar is high and you need an architect who thinks in systems, let's talk.

LINKEDIN ↗ GITHUB ↗

GIRIJESH SINGH · LEAD DATA SCIENTIST · KITCHENER, ON · 2026

I think in systems, not notebooks.