Data Engineer building scalable analytics platforms and AI automation solutions. I transform complex data challenges into production-grade systems that deliver intelligence directly to users.
Data Engineer with expertise in designing and deploying scalable data platforms. Passionate about leveraging AI to tackle complex challenges. Specialized in transforming fragmented data ecosystems into unified analytics infrastructure that enables cross-functional teams to drive data-driven decision making.
With a Master's degree in Information Science & Technology from Missouri University of Science and Technology (4.0 GPA), I combine engineering depth with business acumen. My experience spans from Zomato in India to ExponentHR in the US, giving me a unique perspective on data challenges at scale.
Recently, I've focused on MLOps and AI automation, building end-to-end ML pipelines with Spark, Kafka, and Airflow, and designing AI agents that democratize data access for technical and non-technical teams alike.
Reduced SDLC from 3 months to 14 days (21× faster) through architectural design, Adopting Agile Project management and DevOps automation.
Missouri University of Science and Technology
Gayatri Vidya Parishad College of Engineering
ExponentHR
Addison, TX
Missouri University of Science and Technology
Rolla, MO
C2FO
Leawood, KS
udaan.com
India
Zomato
Hyderabad, India
Production-grade systems demonstrating enterprise-level engineering
Data Engineering Foundation: Built end-to-end streaming data pipelines with Apache Kafka (47.8 TPS) and Spark Structured Streaming for 5-second windowed aggregations. ML Integration: VaR calculations at 95% and 99% confidence levels with historical simulation methodology. Production-Ready: FastAPI REST API, Streamlit dashboard, and containerized infrastructure.
ML Engineering Showcase: Production-grade ML platform with Apache Kafka event streaming (100+ TPS), LightGBM classifier, and MLflow experiment tracking. Data Pipeline: End-to-end pipeline from ingestion to prediction with exactly-once processing. DevOps: Prometheus + Grafana monitoring, containerized infrastructure, and Airflow orchestration.
NLP Research: Comparative analysis of sentiment classification methods including VADER lexicon and RoBERTa transformers on Yelp/TripAdvisor reviews. ML Pipeline: Full preprocessing pipeline with text cleaning, feature extraction, and model evaluation. Research Publication: Published findings on ensemble approach combining rule-based and deep learning methods.
Enterprise Data Engineering: Architected semantic layer over complex multi-schema SQL Server data warehouse enabling natural language querying. AI Integration: Built AI agents that translate business questions into optimized SQL, reducing report development overhead by 40%. Production Impact: Accelerated SDLC from 3 months to 14 days (21× improvement).
Statistical Modeling: Time series forecasting of mobile game downloads using R, comparing ARIMA, exponential smoothing, and regression models. Data Pipeline: Automated data collection and preprocessing pipeline. Business Impact: Actionable insights for marketing campaign timing and inventory planning.
Data Engineering Fundamentals: Automated data collection pipelines using Python, BeautifulSoup, and Selenium for large-scale web scraping. ETL Pipeline: Data cleaning, transformation, and storage workflows. Foundation: Core skills enabling data-driven ML projects through reliable data acquisition.
Microsoft
2026
Microsoft
2024
Databricks
2026
Microsoft Applied Skills
HackerRank
Atlassian
Scrum Alliance
Thoughts on data engineering, ML systems, and career growth
I'm actively exploring Data Engineer and ML Engineer opportunities where I can build next-generation analytics and AI platforms. If you share a passion for building systems that matter, I'd love to connect!
Prefer a document?
Download My Resume