Shubhankar
Tiwari

I build backend systems that hold in production — and evaluate whether AI ones will too.

Backend Engineering

Production Java microservices for enterprise fintech — designed, deployed, and kept alive under real load.

JavaSpring BootOpenShiftREST APIs

Production Reliability

Incident triage, CI/CD ownership, and release validation in high-availability distributed systems.

CI/CDIncident ResponseRoot Cause Analysis

LLM Evaluation

Behavioral consistency research — measuring whether a model is trustworthy enough to deploy, not just whether it can get the right answer once.

Monte Carlo SamplingHuggingFaceQwen 2.5-7B

ML Research

Kaggle Notebooks Expert, top 4.1% globally. Applied ML across NLP, CV, and regression — now focused on evaluation infrastructure.

PythonScikit-learnKaggle ExpertNLP

My Journey

I went from curious undergrad to backend engineer at Bank of America. The path wasn't linear — four years of production systems, a Kaggle Expert rank, and now building evaluation infrastructure for LLMs. Each of those things informed the others more than I expected.

B.Tech CSE

SRM Institute of Science & Technology

2018-2022

Graduated with 94.4% GPA in Computer Science. Built foundation in algorithms, data structures, and software engineering.

Software Engineer

Bank of America

2022-Present

Building and maintaining Java backend systems for corporate banking. I own the full lifecycle — design, CI/CD, deployments, and being on-call when something breaks. Four years of that teaches you things about software that writing code in isolation doesn't.

Kaggle Expert

Top 4.1% Globally

2022-Present

Notebooks Expert ranked #2,441 / 59,663 — personal best #707. 34 notebooks and 10 bronze medals across ML, DL, NLP, Computer Vision, and regression.

Backend Excellence

Currently at Bank of America, I focus on building microservices that power critical banking workflows. From API design to deployment validation, I own the full lifecycle.

Competitive ML

Outside of work, I sharpen my skills through Kaggle — ranked top 4.1% globally, personal best #707. Currently researching LLM behavioral reliability and building evaluation infrastructure that measures production-readiness, not just benchmark accuracy.

Engineering at Scale

At Bank of America I work on backend systems that process real financial transactions. My job isn't just writing code — it's release validation, production triage, and being accountable when a deployment goes wrong at an inconvenient hour.

I've been doing this for four years. The thing that changes is your relationship with failure. You stop being surprised by it and start building systems that tell you clearly when and why they're failing.

4+ Years
Production Experience
Kaggle
Notebooks Expert

Engineering Principles

/01 Reliability over hype
/02 Deterministic systems over black-box magic
/03 Production readiness over prototype excitement
/04 Root-cause analysis over surface patching

Featured Work

Selected projects and experiments

LLM Reliability Evaluation Platform

Production-grade system for measuring LLM behavioral consistency via Monte Carlo sampling. Benchmarked Qwen 2.5-7B across 240 inference calls — v2 grid sweep found temperature IS a lever, but top-p dominates 3.1×.

3.1×
top-p vs temp
0.0%
Null gen. rate
0.163
Coding instability
86.5%
Escalation rate
PythonFastAPINext.js+3
View details
Indian Desi Multilingual LLM — Training Pipeline

End-to-end multilingual LLM training pipeline targeting Hindi/English code-switching. Dataset curation, LoRA fine-tuning, inference evaluation, and deployment packaging across 6 Kaggle notebooks.

PythonLLMNLP+3
View details
Song Recommender System

ML-based workout song recommender using BPM and VADER sentiment analysis. Co-authored research with K-Means clustering on Billboard Top 100 to match songs to exercise intensity.

PythonMLNLP+3
View details
Kaggle Portfolio

Notebooks Expert rank #2,441 / 59,663 — personal best #707. 34 notebooks, 11 datasets, 3 models, 1 competition entry. 10 bronze medals across ML, DL, NLP, Computer Vision, and regression.

Data ScienceMLDeep Learning+2
View details
Conservative Auto-Regeneration Policy for AI-Generated Financial Narratives

Production quality-gating system for AI-generated outputs in financial services — threshold optimisation, conservative AND escalation policy, priority-scored human review queue, and audit-grade provenance.

PythonNLPFinTech+2
View details

Technical Arsenal

Backend Systems

JavaSpring BootMicroservicesREST APIsSystem Design

Data & Persistence

Oracle SQLTOADData ModelingQuery Optimization

Platform & DevOps

OpenShiftJenkinsCI/CDContainerizationEnv Config

ML & AI

PythonMachine LearningDeep LearningComputer VisionNLPLLMs

Credentials

Kaggle Notebooks Expert

Rank 2,441 / 59,663 · 10 Bronze Medals

B.Tech CSE — SRM IST

94.4% GPA · 2018–2022

ML, Deep Learning, CV

Kaggle Certified · 2025

Interactive Terminal

shubhankar.sh
Welcome to shubhankar.sh v1.0.0 Type 'help' for available commands.
visitor@portfolio $

Try: help, about, ls projects/, cat skills, neofetch

Get in touch

Open to roles in backend systems, platform engineering, and applied AI. If something here resonates — I'd like to hear from you.

tiwarishubhankar@gmail.com