AI Data Engineer
United States
AI Data Engineer with 6 years of experience in production-grade data infrastructure and AI systems. Specialized in real-time ETL/ELT pipelines, Big Data systems, Medallion Architecture, and Semantic Layers. Experience in Fintech, Manufacturing, Banking, and AI Startups. Proven track record at Batlabs AI, Visa, and Infosys Limited.
Batlabs AI
Dec 2025 โ Present
Powering Manufacturing Operations - standardizing instrument data to increase productivity in QA and R&D decision-making. Restored XGBoost defect classifier from 62% to 93% accuracy using GCP Data Lakehouse and Medallion Architecture. Reduced downstream AI agent data prep time by 10x. Built a production LangGraph SQL Agent on BigQuery with Vertex AI. Eliminated deployment failures across 20+ ETL/ELT pipelines using Docker, Kubernetes, and CI/CD. Mentored 4 junior engineers.
VCloud
Feb 2025 โ Nov 2025
Data platform modernization engagement replacing third-party BI infrastructure with in-house analytics pipelines. Built a single source of truth by designing dimensional models and 100 dbt/Spark ETL pipelines on Snowflake, enabling self-serve ad-hoc analytics for DAU/WAU stakeholder reporting.
Visa
Jul 2023 โ Oct 2024
Global Payments platform engineering reliability and observability for 600M+ daily financial transactions. Implemented LSTM and time-series forecasting on historical load data. Improved pipeline reliability from 94% to 99.8% across 60+ Kafka and Spark pipelines supporting real-time OLTP and OLAP workloads.
Infosys Limited
Jan 2020 โ Dec 2021
UK Banking client - cloud migration and streaming modernization for real-time fraud detection at enterprise scale. Migrated 8B+ banking event records to GCP (BigQuery, Cloud Storage). Decreased data latency from hours to minutes by leading architecture transition to event-driven distributed streaming using Kafka, Spark, and CDC.
Masters of Professional Studies ยท Data Sciences and Applications
2022 โ 2023
Bachelors ยท Computer Science Engineering
2015 โ 2019