Shubhankar
Tiwari

Building intelligent ML systems that observe, detect, and adapt in real time.

I focus on:

  • [01]Runtime Intelligence — controlling and stabilizing LLM behavior during generation
  • [02]Model Lifecycle Intelligence — detecting drift and governing ML models in production
Software Engineer (Backend Systems)

Project Spotlight

Built a full-stack system to monitor and control LLM generation at the token level, with adaptive instability detection and real-time intervention

LLM Control System — Runtime Reliability for Generative Models

LLM Control System — Runtime Reliability for Generative Models

LLM Control System — Runtime Reliability for Generative Models

  • Token-level observability via manual decoding loop with per-token probability and entropy extraction
  • Three-category stability detector system: entropy_collapse, repetition_loop, low_entropy_lock
  • Formal policy controller with explicit intervention rules mapped to instability signals
  • Adaptive regeneration and temperature adjustment — mid-generation intervention without restart

Core Insight

Detect and correct unstable token generation in real time using entropy-based control.

82%
Instability Drop
0.0%
False Positives
+0.07
Conf. Uplift
~35%
Intervention

How I Think About ML Systems

Most ML systems fail silently.

I design systems that:

  • measure behavior (entropy, drift, performance)
  • detect failure modes (instability, degradation)
  • take corrective action automatically

This results in ML systems that are not just predictive — but self-aware and controllable.

My Journey

I went from curious undergrad to backend engineer at Bank of America. The path wasn't linear — four years of production systems, a Kaggle Expert rank, and now building evaluation infrastructure for LLMs. Each of those things informed the others more than I expected.

B.Tech CSE

SRM Institute of Science & Technology

2018-2022

Graduated with 94.4% GPA in Computer Science. Built foundation in algorithms, data structures, and software engineering.

Software Engineer

Bank of America

2022-Present

Building and maintaining Java backend systems for corporate banking. I own the full lifecycle — design, CI/CD, deployments, and being on-call when something breaks. Four years of that teaches you things about software that writing code in isolation doesn't.

Kaggle Expert

Top 4.1% Globally

2022-Present

Notebooks Expert ranked #2,441 / 59,663 — personal best #707. 34 notebooks and 10 bronze medals across ML, DL, NLP, Computer Vision, and regression.

Backend Excellence

Currently at Bank of America, I focus on building microservices that power critical banking workflows. From API design to deployment validation, I own the full lifecycle.

Competitive ML

Outside of work, I sharpen my skills through Kaggle — ranked top 4.1% globally, personal best #707. Currently researching LLM behavioral reliability, cultural alignment, and building production safeguard systems that measure deployment-readiness, not just benchmark accuracy.

Engineering at Scale

At Bank of America I work on backend systems that process real financial transactions. My job isn't just writing code — it's release validation, production triage, and being accountable when a deployment goes wrong at an inconvenient hour.

I've been doing this for four years. The thing that changes is your relationship with failure. You stop being surprised by it and start building systems that tell you clearly when and why they're failing.

4+ Years
Production Experience
Kaggle
Notebooks Expert

Engineering Principles

/01 Reliability over hype
/02 Deterministic systems over black-box magic
/03 Production readiness over prototype excitement
/04 Root-cause analysis over surface patching

Projects & Experiments

Selected projects and experiments

Drift-Aware Fraud Detection — ML Lifecycle & Model Governance

Built an end-to-end ML system that detects data drift, evaluates model degradation, and governs retraining decisions.

Core Insight

Link distributional drift to model degradation and automatically recover performance via retraining.

284K
Transactions
0.17%
Fraud rate
< 0.2
Drift (PSI)
Live
Shadow deploy
FastAPIscikit-learnNext.js+3
View details
Conservative Auto-Regeneration Policy for AI-Generated Financial Narratives

Production validation system for AI-generated outputs in financial services — threshold optimisation, conservative AND escalation policy, priority-scored human review queue, and audit-grade provenance.

AND
Policy
0.0%
False Pos.
< 5%
Auto-regen
Live
Audit
PythonNLPFinTech+2
View details
Indian Desi Multilingual LLM — Training Pipeline

End-to-end multilingual LLM training pipeline targeting Hindi/English code-switching. Dataset curation, LoRA fine-tuning, inference evaluation, and deployment packaging across 6 Kaggle notebooks.

PythonLLMNLP+3
View details
Song Recommender System

ML-based workout song recommender using BPM and VADER sentiment analysis. Co-authored research with K-Means clustering on Billboard Top 100 to match songs to exercise intensity.

PythonMLNLP+3
View details
Kaggle Portfolio

Notebooks Expert rank #2,441 / 59,663 — personal best #707. 34 notebooks, 11 datasets, 3 models, 1 competition entry. 10 bronze medals across ML, DL, NLP, Computer Vision, and regression.

Data ScienceMLDeep Learning+2
View details

System-Level Focus

These projects represent three pillars of a coherent ML systems narrative, with decision-aware control and auditability:

  • [01]Inference: LLM Control System → Runtime behavioral guardrails
  • [02]Evaluation: Fraud System → Drift detection and automated lifecycle management
  • [03]Governance: LLM Output Validation System → Post-deployment validation layer for LLM outputs

Together, they reflect a focus on building ML systems that are: observable, controllable, and production-ready.

Technical Focus

ML Systems

Evaluation pipelinesControl policiesDrift detectionFailure-mode metricsShadow deployment

Modeling

XGBoostTransformersProbabilistic metricsStatistical modelingEntropy analysis

Infrastructure

FastAPICI gatesVersioned registriesObservabilityModel serving

Credentials

Kaggle Notebooks Expert

Rank 2,441 / 59,663 · 10 Bronze Medals

B.Tech CSE — SRM IST

94.4% GPA · 2018–2022

ML, Deep Learning, CV

Kaggle Certified · 2025

Interactive Terminal

shubhankar.sh
Welcome to shubhankar.sh v1.0.0 Type 'help' for available commands.
visitor@portfolio $

Try: help, about, ls projects/, cat skills, neofetch

Get in touch

Open to roles in backend systems, platform engineering, and applied AI. If something here resonates — I'd like to hear from you.

tiwarishubhankar@gmail.com