Senior Data Engineer with 4+ years architecting production data platforms across Azure, Microsoft Fabric, and Databricks. Delivered 3-month to 14-day deployment cycles, 67% ETL cost reduction, and 98% uptime SLA on enterprise payroll infrastructure at scale.
I started in analytics, building search engines at Zomato and forecasting supply at Udaan, then moved fully into engineering the infrastructure that makes analytics possible at scale. Today I design streaming data platforms, AI automation agents, and self-healing pipelines that run without hand-holding.
At ExponentHR I reengineered CDC-based ETL to cut batch runtimes from 30 min to under 8 min, compressed deployment cycles from 3 months to 14 days with CI/CD automation, and shipped AI agents that eliminated 15+ hours of manual sprint work, all while maintaining 98% uptime on production payroll systems.
Outside work I build production systems that scratch my own itch: JobScout scrapes 109 companies every 5 minutes and auto-matches jobs to 95+ tailored resumes, and AutoApply AI chains 5 LLM providers behind a Chrome extension so I never hit a rate limit. I hold a 4.0 GPA Master's from Missouri S&T and a peer-reviewed publication in Taylor & Francis 2025.
Currently at ExponentHR · Open to Senior Data Engineer, ML Engineer, and AI Platform roles · Dallas TX or Remote
Missouri University of Science and Technology
Gayatri Vidya Parishad College of Engineering
ExponentHR
Addison, TX
Missouri University of Science and Technology
Rolla, MO
C2FO
Leawood, KS
udaan.com
India
Zomato
Hyderabad, India
Production-grade systems demonstrating enterprise-level engineering
Data Engineering Foundation: Built end-to-end streaming data pipelines with Apache Kafka (47.8 TPS) and Spark Structured Streaming for 5-second windowed aggregations. ML Integration: VaR calculations at 95% and 99% confidence levels with historical simulation methodology. Production-Ready: FastAPI REST API, Streamlit dashboard, and containerized infrastructure.
ML Engineering Showcase: Production-grade ML platform with Apache Kafka event streaming (100+ TPS), LightGBM classifier, and MLflow experiment tracking. Data Pipeline: End-to-end pipeline from ingestion to prediction with exactly-once processing. DevOps: Prometheus + Grafana monitoring, containerized infrastructure, and Airflow orchestration.
NLP Research: Comparative analysis of sentiment classification methods including VADER lexicon and RoBERTa transformers on Yelp/TripAdvisor reviews. ML Pipeline: Full preprocessing pipeline with text cleaning, feature extraction, and model evaluation. Research Publication: Published findings on ensemble approach combining rule-based and deep learning methods.
Statistical Modeling: Time series forecasting of mobile game downloads using R, comparing ARIMA, exponential smoothing, and regression models. Data Pipeline: Automated data collection and preprocessing pipeline. Business Impact: Actionable insights for marketing campaign timing and inventory planning.
Zero-cost production platform scraping 109 company career pages across 6 ATS platforms (Workday, Greenhouse, Lever, iCIMS, Taleo, SmartRecruiters) every 5 minutes. Multi-signal relevance engine scores jobs on skills, location, experience level, and H1B sponsorship. 95+ tailored resumes indexed with TF-IDF cosine similarity — best-match auto-selected per job. Instant Discord + Telegram alerts for dream roles. Full application lifecycle tracker.
AI job application assistant combining a Chrome MV3 extension with a FastAPI backend on Fly.io + Supabase + Upstash Redis. 5-provider LLM chain: Anthropic Claude → OpenAI → Kimi → Ollama → keyword fallback — zero-downtime AI regardless of provider outages. TF-IDF cosine similarity engine matches job descriptions to the best resume from a 95+ PDF vault. Full auth via Clerk RS256 JWT. CI/CD via GitHub Actions.
Microsoft
2026
Microsoft
2024
Databricks
2026
Microsoft Applied Skills
HackerRank
Atlassian
Scrum Alliance
Thoughts on data engineering, ML systems, and career growth
With expertise spanning streaming pipelines, cloud-native data platforms, and ML infrastructure, I focus on systems that deliver measurable outcomes at enterprise scale. If you're building something meaningful in Data Engineering, ML Engineering, or AI Platform development, I'm open to the conversation. Response time is under 24 hours.
Prefer a document?
Download My Resume