Available for Architecture & Advisory roles

I'm Navneet Singh

My path runs from building web crawlers serving 80 newspapers, to designing Big Data architectures at American Express, to leading Deep Learning teams deploying on edge devices, and now to shaping enterprise GenAI platforms at Cigna. Two decades of evolution, one constant: making machines understand.

4 Technology Eras Mastered

Four Eras of Evolution

Each era didn't replace the last — it built on it. The foundation engineer became the data architect, who became the deep learning lead, who became the GenAI architect.

2006 — 2012

The Foundation

Senior Software Engineer @ Eterno Infotech

Built DailyHunt's backend: web crawlers aggregating news from 80 publishers across RSS, XML, HTML, and JSON feeds. Learned to think at scale when millions of mobile users depended on my code.

Java · Servlets · Web Crawlers
2012 — 2016

The Data Era

Module Lead @ Impetus (American Express)

Architected SERT — the Speed Engagement and Relevance Tool for Amex open card merchants. Processed terabytes of transactional data. Discovered that data, not code, is the real product.

Hadoop · Hive · HBase · Elasticsearch
2016 — 2022

The Deep Learning Era

DS Lead @ Inkers → Practice Lead @ Trigyn

Led CV teams building face recognition, video analytics across thousands of streams, and edge deployment on Jetson Nano. Made machines see.

75% fewer false alarms · 35% FP improvement · 20% cost reduction
PyTorch · OpenCV · Edge AI · CUDA
2022 — Present

The GenAI Era

NLP Architect @ Hexad (VW) → ML Advisor @ Cigna

Designing enterprise GenAI platforms — RAG pipelines, LLM guardrails, document intelligence. Making machines understand and reason.

2x throughput · +37% accuracy · Enterprise scale
LLMs · RAG · LoRA · vLLM · RLHF

Innovation Through Research

Published research and production innovations in cross-modal retrieval and document intelligence

Featured Research

XMRetriever: Token-Efficient Vision-Language Inference

Cross-modal retrieval with dual-head projections, triplet loss, and FAISS indexing. Achieves 75% token reduction while reducing hallucination rates from 8.6% to 1.2% in production document intelligence pipelines.

PDF Page Input
Text Embed (1536-d) → Projection (512-d)
Vision Embed (768-d) → Projection (512-d)
Concat (1024-d) → FAISS Index

Cross-Modal Retrieval · Metric Learning · FAISS · Production ML
75% Token Reduction
53% Latency Reduction
79.4% Exact Match
1.2% Hallucination Rate
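
The pipeline described above (two projection heads, concatenation, nearest-neighbour lookup, triplet loss) can be sketched in a few lines of standard-library Python. This is an illustrative sketch under stated assumptions, not the production XMRetriever code: the projection heads are random rather than trained, the page embeddings are synthetic, a brute-force inner-product search stands in for the FAISS index, and every function name and the margin value are hypothetical. Only the dimensions (1536-d text, 768-d vision, 512-d projections, 1024-d fused vector) come from the description.

```python
import math
import random

TEXT_DIM, VISION_DIM, PROJ_DIM = 1536, 768, 512  # sizes taken from the pipeline description


def make_head(in_dim, out_dim, rng):
    """Random linear projection head (assumption: a real head would be trained)."""
    scale = 1.0 / math.sqrt(in_dim)
    return [[rng.gauss(0.0, scale) for _ in range(in_dim)] for _ in range(out_dim)]


def project(head, vec):
    """Matrix-vector product: map an embedding into the shared 512-d space."""
    return [sum(w * x for w, x in zip(row, vec)) for row in head]


def l2_normalize(vec):
    norm = math.sqrt(sum(x * x for x in vec)) or 1.0
    return [x / norm for x in vec]


def fuse(text_vec, vision_vec, text_head, vision_head):
    """Project each modality to 512-d, normalize, and concatenate to 1024-d."""
    t = l2_normalize(project(text_head, text_vec))
    v = l2_normalize(project(vision_head, vision_vec))
    return t + v


def euclidean(a, b):
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def triplet_loss(anchor, positive, negative, margin=0.2):
    """Standard triplet margin loss; the margin of 0.2 is an assumed value."""
    return max(0.0, euclidean(anchor, positive) - euclidean(anchor, negative) + margin)


def search(index, query, k=1):
    """Brute-force inner-product search, a stand-in for a FAISS index lookup."""
    scored = [(sum(q * d for q, d in zip(query, doc)), i) for i, doc in enumerate(index)]
    scored.sort(reverse=True)
    return [i for _, i in scored[:k]]


rng = random.Random(0)
text_head = make_head(TEXT_DIM, PROJ_DIM, rng)
vision_head = make_head(VISION_DIM, PROJ_DIM, rng)

# Synthetic (text, vision) embeddings for three pages.
pages = [
    ([rng.gauss(0, 1) for _ in range(TEXT_DIM)], [rng.gauss(0, 1) for _ in range(VISION_DIM)])
    for _ in range(3)
]
index = [fuse(t, v, text_head, vision_head) for t, v in pages]

# Query: a slightly perturbed copy of page 1, so page 1 should be retrieved.
noisy_text = [x + rng.gauss(0, 0.01) for x in pages[1][0]]
noisy_vision = [x + rng.gauss(0, 0.01) for x in pages[1][1]]
query = fuse(noisy_text, noisy_vision, text_head, vision_head)

hits = search(index, query, k=1)
print("nearest page:", hits[0])
print("triplet loss:", triplet_loss(query, index[1], index[0]))
```

Normalizing each half before concatenation keeps either modality from dominating the inner product, which is one plausible reason to fuse after projection rather than before.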

Impact That Matters

75% False alarm reduction across 1000s of video streams
2x Throughput in document processing
37% Extraction accuracy improvement via semantic merging
20% Deployment cost reduction through edge optimization

Expertise

Generative AI & LLMs

Enterprise GenAI platforms, RAG with VectorDB/FAISS, prompt engineering (CoT, ToT, few-shot), LLM guardrails, PII safety, evaluation gates, vLLM serving

GPT-4o · Llama · FLAN-T5 · BLOOM · BERT · LangChain · ROUGE/BLEU

Deep Learning & Vision

Object detection, face recognition, medical imaging (MRI/X-Ray), video analytics, model pruning, quantization, edge deployment on Jetson

PyTorch · OpenCV · ResNet · MobileNet · EfficientNet · U-Net · Grad-CAM

Model Optimization & Serving

LoRA/QLoRA fine-tuning, RLHF alignment, knowledge distillation, DDP/FSDP distributed training, ZeRO, FlashAttention, GPU performance at scale

PEFT · CUDA · vLLM · ONNX · TensorRT · wandb · DDP/FSDP

Leadership & Architecture

Team building, architecture standards, cost/latency SLOs, security-by-design, experiment tracking, release discipline, stakeholder management

Docker · K8s · AWS · GCP · CI/CD · JIRA · GitLab

Education & Certifications

Education

2023 — 2025

M.Tech — AI & Machine Learning

BITS Pilani, India

Deep Neural Networks, Deep Reinforcement Learning, NLP, Statistics, Information Retrieval

2001 — 2005

B.E. — Computer Science & Engineering

VTU, India — First Class

Operating Systems, Computer Networks, Parallel Programming, Algorithms & Data Structures

Certifications

Udacity

Nanodegree — Self-Driving Car

Vehicle Detection & Tracking, Traffic Sign Classification

Udacity

Nanodegree — NLP

PoS Tagging (HMM), Machine Translation (DNN), Speech Recognition

Coursera

14+ Verified Certificates

Generative AI with LLMs, Neural Networks & DL, CNNs, AI for Medical Prognosis, Hyperparameter Tuning, and more

Sun/Oracle

SCJP & SCWCD

Sun Certified Java Programmer, Sun Certified Web Component Developer

Get in Touch

Open to architecture consulting, advisory roles, and research collaborations
