Data Engineer | Data Scientist | Data Analyst
Passionate about building end-to-end ML systems, scalable data pipelines, and AI-driven solutions that transform data into impactful business decisions.
Hello! I'm Hyma Roshini Gompa, a Data Scientist with 4+ years of experience delivering end-to-end predictive analytics and machine learning solutions for high-volume consumer platforms.
I specialize in building scalable data pipelines, developing and deploying ML models (Random Forest, XGBoost, LightGBM), and translating complex data into actionable business insights through customer segmentation, retention strategies, and KPI-driven dashboards.
My expertise spans cloud data engineering, MLOps, and generative AI, with a proven track record of driving measurable impact on churn reduction and revenue optimization. I'm passionate about leveraging data and AI to solve complex business problems and deliver value at scale.
May 2025 - Present
Aug 2024 - May 2025
Apr 2021 - Jul 2023
Aug 2023 - May 2025
Baltimore, MD
Aug 2018 - May 2022
Punjab, India
Designed and implemented a real-time voting platform processing 500K+ events per minute using Kafka, Spark Streaming, and PostgreSQL with optimized throughput and live analytics visualization.
Built a scalable big data processing pipeline that handles massive datasets using distributed computing frameworks, optimizing data processing workflows for improved performance and efficiency.
Built a transformer-based legal document summarization platform using Legal-BERT, LED-16384, and RAG, automating multi-format document processing and reducing review time by 70%.
Developed a comprehensive end-to-end machine learning pipeline with data preprocessing, feature engineering, model training, hyperparameter tuning, and deployment using MLOps best practices.
Implemented an advanced anomaly detection system using Robust Graphical Lasso to learn sparse precision matrices, identifying outliers in high-dimensional data with improved accuracy.
Conducted comprehensive statistical analysis of T20 cricket player performance using data analytics and visualization techniques to derive insights on player efficiency and match outcomes.
I'm always open to new opportunities and interesting conversations. Feel free to reach out!