Building Scalable AI Systems
from First Principles
About Me
The story behind the code.
I’m a Data Scientist at Indium Software, currently working as an external consultant with Uber.
I build and deploy production-grade AI systems, with a focus on transformer models, large language models, and inference efficiency. My work sits at the intersection of research and real-world constraints, turning ideas from papers into systems that are reliable, scalable, and cost-aware.
I graduated from IIT Palakkad, where I learned to approach problems from first principles: understanding why something works before deciding how to build it. I keep up closely with current developments in AI, regularly experimenting with new architectures, tooling, and workflows.
Alongside building systems, I write extensively about machine learning and create small tools to better visualize and understand how models learn and make decisions.
Professional Experience
Building impactful ML systems at scale, from startup velocity to enterprise reliability.
Indium (Client: Uber)
Data Scientist (Current)
Working as an external data scientist for Uber, building production-grade AI systems for global earner document processing and onboarding.
- Built a Transformer-based document auto-transcription system for driver and vehicle documents.
- Fine-tuned and deployed LLMs for low-latency, real-time inference in production.
- Automated manual document review workflows, reducing operational effort and cost.
LTIMindtree
Sr. Software Engineer
Worked on applied AI and Generative AI systems for enterprise automation and analytics use cases.
- Built multimodal AI systems to analyze technical videos and generate summaries.
- Developed speech-based natural language interfaces for database querying.
- Implemented RAG-based GenAI chatbots and improved ML model accuracy in production systems.
- Applied transfer learning to achieve high accuracy with limited training data.
Building Systems That Scale
From research implementations to production-grade ML systems serving millions.
Language Identification Model
83.5M-parameter model optimized for source-code classification. Fine-tuned across 25+ programming languages to deliver high-accuracy language detection at scale. Adopted by 350k+ developers worldwide.
Vision Transformer (ViT) From Scratch
A from-scratch PyTorch implementation of Vision Transformer with intuitive explanations of self-attention, patch embeddings, and training logic — focused on learning fundamentals and clean engineering. Documented end-to-end on Medium.
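To make the patch-embedding idea concrete, here is a minimal sketch of the step, assuming the common strided-Conv2d patchifier (class and parameter names are illustrative, not taken from the repo):

```python
import torch
import torch.nn as nn

class PatchEmbedding(nn.Module):
    """Split an image into fixed-size patches and project each to an embedding.
    A Conv2d with stride == kernel_size is equivalent to slicing the image into
    non-overlapping patches and applying one shared linear projection."""
    def __init__(self, img_size=224, patch_size=16, in_chans=3, embed_dim=768):
        super().__init__()
        self.num_patches = (img_size // patch_size) ** 2
        self.proj = nn.Conv2d(in_chans, embed_dim,
                              kernel_size=patch_size, stride=patch_size)

    def forward(self, x):                      # x: (B, 3, 224, 224)
        x = self.proj(x)                       # (B, 768, 14, 14)
        return x.flatten(2).transpose(1, 2)    # (B, 196, 768): one token per patch

tokens = PatchEmbedding()(torch.randn(1, 3, 224, 224))
print(tokens.shape)  # torch.Size([1, 196, 768])
```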
Transformer From Scratch
A clean PyTorch implementation of Transformer encoder and decoder from first principles — deeply exploring attention mechanics and causal masking. Guided code and narrative help demystify how Transformer blocks actually work.
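As a taste of the causal-masking mechanics the write-up explores, a short sketch (tensor names are illustrative):

```python
import torch

T = 5  # sequence length
# Boolean upper-triangular mask: True marks future positions,
# so token i may attend only to tokens 0..i.
causal_mask = torch.triu(torch.ones(T, T, dtype=torch.bool), diagonal=1)

scores = torch.randn(T, T)                        # raw attention scores
scores = scores.masked_fill(causal_mask, float("-inf"))
weights = torch.softmax(scores, dim=-1)           # masked positions get weight 0
```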
Distributed Training with PyTorch DDP
A practical deep dive into multi-GPU training using PyTorch DistributedDataParallel (DDP). Explains how gradient synchronization works under the hood, how to scale batch sizes correctly, and how to avoid common pitfalls when moving from single-GPU to distributed training.
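A minimal sketch of the setup pattern the deep dive walks through, assuming a single-node launch via torchrun (the model and data here are placeholders):

```python
import os
import torch
import torch.distributed as dist
from torch.nn.parallel import DistributedDataParallel as DDP

def main():
    # torchrun sets RANK, LOCAL_RANK, and WORLD_SIZE for each process.
    dist.init_process_group(backend="nccl")
    local_rank = int(os.environ["LOCAL_RANK"])
    torch.cuda.set_device(local_rank)

    model = torch.nn.Linear(10, 1).cuda(local_rank)  # placeholder model
    model = DDP(model, device_ids=[local_rank])      # wraps gradient all-reduce

    opt = torch.optim.SGD(model.parameters(), lr=1e-2)
    x = torch.randn(32, 10).cuda(local_rank)         # per-process batch; the
    y = torch.randn(32, 1).cuda(local_rank)          # global batch is 32 * WORLD_SIZE
    loss = torch.nn.functional.mse_loss(model(x), y)
    loss.backward()                                  # DDP syncs gradients here
    opt.step()

    dist.destroy_process_group()

if __name__ == "__main__":
    main()  # launch with: torchrun --nproc_per_node=NUM_GPUS this_script.py
```

Note the batch-size point from the article: each process sees its own shard, so the effective global batch (and often the learning rate) scales with the number of GPUs.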
Self-Attention Explained (From Intuition to Math)
A deep yet intuitive breakdown of self-attention, explaining how tokens interact, why attention works, and how queries, keys, and values emerge — using simple language, diagrams, and minimal math to build real understanding.
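To ground the queries/keys/values story in code, a tiny sketch of scaled dot-product self-attention (dimensions chosen only for illustration):

```python
import torch

B, T, d = 2, 4, 8                             # batch, tokens, model width
x = torch.randn(B, T, d)

# Q, K, and V are just three learned projections of the same token embeddings.
Wq, Wk, Wv = (torch.randn(d, d) for _ in range(3))
Q, K, V = x @ Wq, x @ Wk, x @ Wv

scores = Q @ K.transpose(-2, -1) / d ** 0.5   # how strongly each token attends to each other
weights = torch.softmax(scores, dim=-1)       # each row sums to 1
out = weights @ V                             # every token becomes a weighted mix of values
```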
Knowledge Graph + LLM Chatbot
A RAG-powered conversational system that combines Neo4j knowledge graphs with large language models to answer natural-language queries over structured and unstructured data. Includes Cypher integration and vector search strategies.
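A hedged sketch of the Cypher-retrieval half of that flow, using the official neo4j Python driver; the connection details, graph schema, and `ask_llm` stub are all placeholders, not the project's actual code:

```python
from neo4j import GraphDatabase

# Connection details are placeholders for illustration.
driver = GraphDatabase.driver("bolt://localhost:7687", auth=("neo4j", "password"))

def ask_llm(prompt: str) -> str:
    # Placeholder: swap in the actual LLM call used by the chatbot.
    return "<answer generated from: " + prompt[:40] + "...>"

def graph_context(question: str) -> list[str]:
    # Structured retrieval: a Cypher query over the knowledge graph.
    # The (:Product)-[:MENTIONED_IN]->(:Doc) schema is hypothetical.
    cypher = (
        "MATCH (p:Product)-[:MENTIONED_IN]->(d:Doc) "
        "WHERE toLower(p.name) CONTAINS toLower($q) "
        "RETURN d.text AS text LIMIT 5"
    )
    with driver.session() as session:
        return [record["text"] for record in session.run(cypher, q=question)]

def answer(question: str) -> str:
    context = "\n".join(graph_context(question))
    # In the full system, vector-search hits over unstructured text would be
    # merged into the context here before prompting.
    return ask_llm(f"Context:\n{context}\n\nQuestion: {question}")
```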
Recent Articles
Thoughts on ML systems, engineering practices, and research insights.