Shaping the Future, One at a Time.

Kriti C Parikh

I learn at high velocity, often understanding and applying new concepts within 24 hours. One of my core strengths is my relentless curiosity and my ability to operate as a self-driven learning machine, constantly pushing to expand my knowledge.

A data storyteller passionate about transforming raw information into meaningful insights that fuel strategic decisions and business growth. With hands-on experience in Python, Machine Learning, GenAI, LLMs, Hugging Face, Transformers, LangChain, R, SQL, and data visualization tools, I navigate the evolving landscape of AI, Data Science, and Analytics to uncover hidden patterns and craft impactful narratives.

Looking for Full-Time Data Roles Actively

Greetings! I'm a meticulous, purposeful, and driven individual with a profound passion for the transformative power of data. My strengths include a keen eye for detail, a strategic mindset, and a natural knack for effective problem-solving.

Through participating in AI, LLM, and Alteryx-centered hackathons, I’ve discovered my stride, where strategic thinking, technical skills, and creativity intersect. Looking ahead to full-time opportunities starting immediately, my focus remains firmly anchored in the fields of AI, machine learning, and data science, where I can contribute from day one and keep learning at the frontier of innovation.

Certified in Applied Machine Learning, Azure, Google Data Analytics, and Alteryx Core, I bring proficiency across a wide range of tools and platforms. Intrigued by intelligent systems, I continuously explore how data and AI can be applied to solve real-world challenges, drive strategic decisions, and create meaningful impact.
Beyond deciphering intricate data puzzles, I bring a sense of equilibrium to the data landscape as a dedicated former member of the university's tennis team. In my leisure, I savor the exhilaration of motor vehicles, adding a dynamic layer to my interests.

Remembri: Symbolic Memory Pipeline Without Embeddings

Remembri is a lightweight, logic driven memory system that enables LLMs to remember user specific facts—like name and location, without embeddings or chat history. Built with FastAPI and SQLite, it reduced repeated input by 73%, achieved 95% tool-calling accuracy, and boosted context relevance by injecting symbolic memory directly into prompts.

FitFusion AI - Gen AI Hackathon

Built a real-time AI wellness assistant using Hugging Face and Gradio to generate personalized meal/workout plans and detect user emotions through NLP. Integrated Tavily API for smart-search research bot, enabling real-time wellness recommendations with visual and contextual outputs.

AI-Powered Teaching Assistant (Build with AI Google Event)

Built a multi-agent AI teaching assistant using LangGraph, Vertex AI, and Gemini LLMs to automate lesson planning, quizzes, and personalized learning. Implemented an event-driven architecture with Pub/Sub and Eventarc to boost agent collaboration and real-time content generation. Integrated RESTful APIs via Cloud Run to deliver scalable, multimodal educational content, developed through an interactive course and direct guidance from Googlers.

Electric Vehicles Data Analysis - Tableau Project

This project leverages Tableau to analyze and visualize key aspects of electric vehicle (EV) data, with a focus on Battery Electric Vehicles (BEVs) and Plug-in Hybrid Electric Vehicles (PHEVs). The interactive dashboard uncovers trends in the EV market, highlights geographical distribution, and evaluates manufacturer performance.

Chat With Multiple PDFs - AI Bot

This project is a sophisticated tool that allows users to interact with multiple PDF documents through an AI-powered chat interface. Leveraging advanced technologies like Gemini AI and LangChain, the system extracts, processes, and stores text for intelligent and context-aware conversations.

Credit Card Eligibility Prediction using Random Forest

Built a machine learning pipeline to automate Global Bank’s credit card approval process using 2.4K+ historical applications. Cleaned and encoded 18+ features, handled missing edge cases, and deployed a Random Forest model that reduced manual reviews by 80% and accelerated decision time 5x.

Cardiovascular Risk Prediction

This project aims to predict the 10-year risk of future coronary heart disease (CHD) in patients using machine learning techniques. By analyzing various health indicators and lifestyle factors, we develop a model that can assist healthcare professionals in identifying high-risk individuals and implementing preventive measures.

Sentiment Analysis using NLP

This project implements a robust sentiment analysis model using Natural Language Processing (NLP) and machine learning techniques. The model classifies text into positive or negative sentiments with high accuracy, providing valuable insights for various business applications.

Anime Face Generation with Gen-AI GANs

Developed a Generative Adversarial Network (GAN) model to generate anime-style faces using Python libraries, including TensorFlow, NumPy, and Matplotlib. This project involved designing and training the GAN to produce high-quality, realistic anime faces by implementing advanced techniques and fine-tuning methods to enhance performance and output quality

SQL-Based HR Analytics Accelerator for IBM (Capstone Project)

Designed and built a fully normalized SQL database (1NF–3NF) from a flat HR dataset, boosting complex query performance by 50%. Developed ER diagrams, applied indexing and partitioning, and ensured referential integrity across tables. Authored advanced SQL queries using CTEs and window functions to analyze attrition trends, pay equity gaps, and education-performance relationships. Optimized the schema for seamless BI tool integration and future scalability.

Real Time Accident Detection

This project focuses on developing and advancing a system for real-time monitoring and alerting of potential vehicle accidents.
The repository is divided into two major phases:
1. Initial Project: The foundational work done during the internship at Electronics Corporation of India Limited (ECIL), which involved real-time vehicle data collection and basic alert mechanisms.
2. Advanced Project: An enhanced version with advanced data processing, real-time alert integration, and further optimizations.

PLC-CNC Data Analysis

The project is focused on analyzing data collected from Programmable Logic Controllers (PLC) in CNC (Computer Numerical Control) machines. The goal of this project is not only to monitor the operational performance of these machines and detect anomalies but also to provide actionable insights and suggestions for improving machine efficiency and reliability.

Skills

Certifications

Applied Machine Learning, Microsoft Azure Fundamentals, Google Data Analytics, Alteryx Designer Core & Foundational, Google Analytics

Programming Languages

Python, R, SQL, C, JavaScript, HTML, CSS

Machine Learning & Data Science

Supervised & Unsupervised Learning: Linear & Logistic Regression, Ridge/Lasso, Decision Trees, Random Forest, AdaBoost, Gradient Boosting (XGBoost, LightGBM, CatBoost), k-NN, Clustering (k-means, HDBSCAN, DBSCAN), Bayesian Belief Networks, Markov Chains, Reinforcement Learning

Deep Learning

Neural Networks, CNNs, RNNs, LSTMs, Transformers, Computer Vision, TensorFlow, PyTorch, Keras

Time-Series

ARIMA, SARIMA, ARMA, Box-Jenkins, LSTMs for Sequence Modeling

NLP & LLM Frameworks

Retrieval-Augmented Generation (RAG), Advanced RAG, RAG Evaluation, Embedding Models, HuggingFace Transformers, LangChain, LlamaIndex, NLTK, SpaCy, Pretraining, Finetuning, Deepspeed, Knowledge Graphs, Explainable AI (SHAP, LIME), Vector Databases (FAISS, Weaviate, Pinecone), NER, Topic Modeling, Sentiment Analysis, LLMOps, Attention Mechanism, Diffusion Models, GANs, Prompt Engineering

LLM & Foundation Models

GPT series, LLaMA, T5, GPT-J, wav2vec, data2vec, Whisper, Mixture of Experts (MoE)

Libraries & Frameworks

TensorFlow, PyTorch, Pandas, NumPy, OpenCV, Flask, Scikit-learn, Matplotlib, WandB, CUDA

Data Analysis & Experimentation

A/B Testing, Multivariate Testing, Hypothesis Testing (z-test, t-test, ANOVA, F-test, Chi-Square), Predictive Modeling, Feature Selection (PCA, Lasso, Ridge), Experiment Design & Causal Inference

Data Engineering & Big Data

ETL Pipelines, Data Warehousing, Snowflake, Hadoop, Spark, Hive, MapReduce, Delta Lake, API Integrations, Real-Time Data Processing

Cloud Platforms & MLOps

AWS, Azure, GCP (Vertex AI, Compute Engine, Cloud Functions), Databricks, CI/CD, MLOps

Business Intelligence & Visualization

Tableau, Power BI, Qlik, Looker, Plotly

Other Tools & Technologies

MongoDB, Git, Web Scraping, Flask, FastAPI, Adobe Analytics, Streamlit, Ollama, Excel, Data Structures and Algorithms (DSA)

Mathematics & Foundations

Linear Algebra, Probability, Statistics, Calculus, Optimization, Discrete Mathematics, Graph Theory, Numerical Methods

Research Papers

Abnormal Activity Detection using Deep Learning
Published in: Springer LNNS - Link
Year: 2022

Conference

Digital Participation
Presented the Abnormal Activity Detection using Deep Learning paper at the World Conference on Smart Trends in Systems, Security & Sustainability, UK
Year: 2021

Awards

  • Scholar with Distinction
  • UT Dallas Women's Tennis Team - NCAA Division III ASC Champions for two consecutive years
  • 2022-23 & 2023-24 Academic All-ASC - Student-Athlete Recognition

Relevant Coursework

  • Advanced Large Language Model Agents
  • Advance Statistics for Data Science
  • Algorithm Design and Analysis
  • Applied Deep Learning
  • Applied Econometrics and Time Series Analysis
  • Applied Machine Learning
  • Big Data
  • Business Analytics with R
  • C Programming
  • Calculus
  • Cloud Computing Fundamentals (AWS)
  • Computer Organisation and Operating Systems
  • Database Foundations for Business Analytics
  • Database Management Systems
  • Foundations of Data Structures and Algorithms Specialization
  • Introduction to Internet of Things
  • Introduction to Robotics
  • JAVA Programming
  • Large Language Model Agents
  • Mathematics
  • Predictive Analytics for Data Science
  • Prescriptive Analytics
  • Programming for Data Science
  • Web Analytics