Greetings! I'm a meticulous, purposeful, and driven individual with a profound passion for the transformative power of data. My strengths include a keen eye for detail, a strategic mindset, and a natural knack for effective problem-solving.
Through participating in AI, LLM, and Alteryx-centered hackathons, I’ve discovered my stride, where strategic thinking, technical skills, and creativity intersect. Looking ahead to full-time opportunities starting immediately, my focus remains firmly anchored in the fields of AI, machine learning, and data science, where I can contribute from day one and keep learning at the frontier of innovation.
Certified in Applied Machine Learning, Azure, Google Data Analytics, and Alteryx Core, I bring proficiency across a wide range of tools and platforms. Intrigued by intelligent systems, I continuously explore how data and AI can be applied to solve real-world challenges, drive strategic decisions, and create meaningful impact.
Beyond deciphering intricate data puzzles, I bring a sense of equilibrium to the data landscape as a dedicated former member of the university's tennis team. In my leisure, I savor the exhilaration of motor vehicles, adding a dynamic layer to my interests.
Remembri: Symbolic Memory Pipeline Without Embeddings
Remembri is a lightweight, logic driven memory system that enables LLMs to remember user specific facts—like name and location, without embeddings or chat history. Built with FastAPI and SQLite, it reduced repeated input by 73%, achieved 95% tool-calling accuracy, and boosted context relevance by injecting symbolic memory directly into prompts.
FitFusion AI - Gen AI Hackathon
Built a real-time AI wellness assistant using Hugging Face and Gradio to generate personalized meal/workout plans and detect user emotions through NLP. Integrated Tavily API for smart-search research bot, enabling real-time wellness recommendations with visual and contextual outputs.
AI-Powered Teaching Assistant (Build with AI Google Event)
Built a multi-agent AI teaching assistant using LangGraph, Vertex AI, and Gemini LLMs to automate lesson planning, quizzes, and personalized learning. Implemented an event-driven architecture with Pub/Sub and Eventarc to boost agent collaboration and real-time content generation. Integrated RESTful APIs via Cloud Run to deliver scalable, multimodal educational content, developed through an interactive course and direct guidance from Googlers.
Electric Vehicles Data Analysis - Tableau Project
This project leverages Tableau to analyze and visualize key aspects of electric vehicle (EV) data, with a focus on Battery Electric Vehicles (BEVs) and Plug-in Hybrid Electric Vehicles (PHEVs). The interactive dashboard uncovers trends in the EV market, highlights geographical distribution, and evaluates manufacturer performance.
Website Analytics Dashboard - Power BI Project
Developed an interactive and visually intuitive Power BI dashboard to monitor website performance metrics. The dashboard was tailored for stakeholders with non-technical backgrounds to make data-driven decisions effectively.
Time Series Analysis: Retail & Racing 🏪🏎️
This project applies time series analysis to two datasets—Walmart sales (seasonal) and Formula 1 results (non-seasonal)—using the Box-Jenkins methodology to uncover patterns, forecast trends, and enhance decision-making.
Chat With Multiple PDFs - AI Bot
This project is a sophisticated tool that allows users to interact with multiple PDF documents through an AI-powered chat interface. Leveraging advanced technologies like Gemini AI and LangChain, the system extracts, processes, and stores text for intelligent and context-aware conversations.
NBA Analysis: Predicting Player Performance with Deep Learning
This project employs advanced deep learning techniques to forecast NBA player scoring output. By analyzing player statistics from the 2023-2024 season, we developed and compared six neural network architectures, ultimately identifying the BiLSTM model as the most effective predictor.
Credit Default Prediction for American Express using XGBoost and Neural Networks (Capstone Project)
Built a scalable model on 458K+ customer records to predict credit card defaults. Applied SHAP for feature reduction (190 → 14), tuned hyperparameters for generalization, and optimized thresholds for risk–reward tradeoffs. Delivered an interpretable, retrainable pipeline aligned with financial compliance standards.
Credit Card Eligibility Prediction using Random Forest
Built a machine learning pipeline to automate Global Bank’s credit card approval process using 2.4K+ historical applications. Cleaned and encoded 18+ features, handled missing edge cases, and deployed a Random Forest model that reduced manual reviews by 80% and accelerated decision time 5x.
Cardiovascular Risk Prediction
This project aims to predict the 10-year risk of future coronary heart disease (CHD) in patients using machine learning techniques. By analyzing various health indicators and lifestyle factors, we develop a model that can assist healthcare professionals in identifying high-risk individuals and implementing preventive measures.
Sentiment Analysis using NLP
This project implements a robust sentiment analysis model using Natural Language Processing (NLP) and machine learning techniques. The model classifies text into positive or negative sentiments with high accuracy, providing valuable insights for various business applications.
Anime Face Generation with Gen-AI GANs
Developed a Generative Adversarial Network (GAN) model to generate anime-style faces using Python libraries, including TensorFlow, NumPy, and Matplotlib. This project involved designing and training the GAN to produce high-quality, realistic anime faces by implementing advanced techniques and fine-tuning methods to enhance performance and output quality
SQL-Based HR Analytics Accelerator for IBM (Capstone Project)
Designed and built a fully normalized SQL database (1NF–3NF) from a flat HR dataset, boosting complex query performance by 50%. Developed ER diagrams, applied indexing and partitioning, and ensured referential integrity across tables. Authored advanced SQL queries using CTEs and window functions to analyze attrition trends, pay equity gaps, and education-performance relationships. Optimized the schema for seamless BI tool integration and future scalability.
Real Time Accident Detection
This project focuses on developing and advancing a system for real-time monitoring and alerting of potential vehicle accidents.
The repository is divided into two major phases:
1. Initial Project: The foundational work done during the internship at Electronics Corporation of India Limited (ECIL), which involved real-time vehicle data collection and basic alert mechanisms.
2. Advanced Project: An enhanced version with advanced data processing, real-time alert integration, and further optimizations.
PLC-CNC Data Analysis
The project is focused on analyzing data collected from Programmable Logic Controllers (PLC) in CNC (Computer Numerical Control) machines. The goal of this project is not only to monitor the operational performance of these machines and detect anomalies but also to provide actionable insights and suggestions for improving machine efficiency and reliability.
Skills
Certifications
Applied Machine Learning, Microsoft Azure Fundamentals, Google Data Analytics, Alteryx Designer Core & Foundational, Google Analytics
Programming Languages
Python, R, SQL, C, JavaScript, HTML, CSS
Machine Learning & Data Science
Supervised & Unsupervised Learning: Linear & Logistic Regression, Ridge/Lasso, Decision Trees, Random Forest, AdaBoost, Gradient Boosting (XGBoost, LightGBM, CatBoost), k-NN, Clustering (k-means, HDBSCAN, DBSCAN), Bayesian Belief Networks, Markov Chains, Reinforcement Learning
Deep Learning
Neural Networks, CNNs, RNNs, LSTMs, Transformers, Computer Vision, TensorFlow, PyTorch, Keras
Time-Series
ARIMA, SARIMA, ARMA, Box-Jenkins, LSTMs for Sequence Modeling
NLP & LLM Frameworks
Retrieval-Augmented Generation (RAG), Advanced RAG, RAG Evaluation, Embedding Models, HuggingFace Transformers, LangChain, LlamaIndex, NLTK, SpaCy, Pretraining, Finetuning, Deepspeed, Knowledge Graphs, Explainable AI (SHAP, LIME), Vector Databases (FAISS, Weaviate, Pinecone), NER, Topic Modeling, Sentiment Analysis, LLMOps, Attention Mechanism, Diffusion Models, GANs, Prompt Engineering
LLM & Foundation Models
GPT series, LLaMA, T5, GPT-J, wav2vec, data2vec, Whisper, Mixture of Experts (MoE)
Libraries & Frameworks
TensorFlow, PyTorch, Pandas, NumPy, OpenCV, Flask, Scikit-learn, Matplotlib, WandB, CUDA
Data Analysis & Experimentation
A/B Testing, Multivariate Testing, Hypothesis Testing (z-test, t-test, ANOVA, F-test, Chi-Square), Predictive Modeling, Feature Selection (PCA, Lasso, Ridge), Experiment Design & Causal Inference
Data Engineering & Big Data
ETL Pipelines, Data Warehousing, Snowflake, Hadoop, Spark, Hive, MapReduce, Delta Lake, API Integrations, Real-Time Data Processing
Cloud Platforms & MLOps
AWS, Azure, GCP (Vertex AI, Compute Engine, Cloud Functions), Databricks, CI/CD, MLOps
Business Intelligence & Visualization
Tableau, Power BI, Qlik, Looker, Plotly
Other Tools & Technologies
MongoDB, Git, Web Scraping, Flask, FastAPI, Adobe Analytics, Streamlit, Ollama, Excel, Data Structures and Algorithms (DSA)
Mathematics & Foundations
Linear Algebra, Probability, Statistics, Calculus, Optimization, Discrete Mathematics, Graph Theory, Numerical Methods
Research Papers
Conference
Year: 2021
Awards
- Scholar with Distinction
- UT Dallas Women's Tennis Team - NCAA Division III ASC Champions for two consecutive years
- 2022-23 & 2023-24 Academic All-ASC - Student-Athlete Recognition
Relevant Coursework
- Advanced Large Language Model Agents
- Advance Statistics for Data Science
- Algorithm Design and Analysis
- Applied Deep Learning
- Applied Econometrics and Time Series Analysis
- Applied Machine Learning
- Big Data
- Business Analytics with R
- C Programming
- Calculus
- Cloud Computing Fundamentals (AWS)
- Computer Organisation and Operating Systems
- Database Foundations for Business Analytics
- Database Management Systems
- Foundations of Data Structures and Algorithms Specialization
- Introduction to Internet of Things
- Introduction to Robotics
- JAVA Programming
- Large Language Model Agents
- Mathematics
- Predictive Analytics for Data Science
- Prescriptive Analytics
- Programming for Data Science
- Web Analytics