Projects — Jatin Mehra

Competition Projects

MAP Competition: Predicting Student Math Misconceptions

🥈 Silver Medal · Top 2%

Built an NLP system that identifies and classifies mathematical misconceptions in students' written explanations. The solution is an ensemble of six large language models (7B to 14B parameters) trained with a mix of LoRA, QLoRA, and full fine-tuning. It scored 0.947 MAP@3, ranking 45th out of 2,500+ participants.

NLP · LLM Fine-tuning · LoRA/QLoRA · Ensemble Learning · PyTorch · HuggingFace · Multi-GPU Training

Code Kaggle Writeup
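To make the MAP@3 figure concrete, here is a minimal sketch of the metric, together with one simple way of combining several models' outputs. It assumes each response has exactly one correct label and that the models are combined by plain probability averaging. The source does not say how the six models were actually blended, and the function names and labels below are hypothetical.

```python
def average_probabilities(model_outputs):
    """Average per-label probabilities across models.

    model_outputs: list of {label: probability} dicts, one per model.
    Plain averaging is an assumption, not the author's confirmed method.
    """
    labels = set().union(*model_outputs)
    count = len(model_outputs)
    return {
        label: sum(out.get(label, 0.0) for out in model_outputs) / count
        for label in labels
    }


def top_k_labels(probabilities, k=3):
    """Return the k most probable labels, breaking ties alphabetically."""
    ranked = sorted(probabilities, key=lambda label: (-probabilities[label], label))
    return ranked[:k]


def map_at_3(true_labels, ranked_predictions):
    """Mean Average Precision at 3 with a single correct label per row.

    A correct label at rank 1, 2, or 3 earns 1, 1/2, or 1/3 respectively;
    a miss earns 0. The score is the mean over all rows.
    """
    if len(true_labels) != len(ranked_predictions):
        raise ValueError("true_labels and ranked_predictions differ in length")
    total = 0.0
    for truth, predictions in zip(true_labels, ranked_predictions):
        for rank, predicted in enumerate(predictions[:3], start=1):
            if predicted == truth:
                total += 1.0 / rank
                break
    return total / len(true_labels)


# Hypothetical labels for illustration only.
model_a = {"A": 0.6, "B": 0.3, "C": 0.1}
model_b = {"A": 0.2, "B": 0.6, "C": 0.2}
blended = top_k_labels(average_probabilities([model_a, model_b]))
score = map_at_3(["A", "B", "C"], [["A", "B", "C"], ["A", "B", "C"], ["A", "B", "D"]])
```

Because only the first correct hit counts, a model that often places the right label second still earns half credit, which is why blending rankings from several models can raise the score.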

Nexar Dashcam Collision Prediction

🏆 11th Place · Top 1%

Computer vision project that predicts collisions from dashcam footage using the VideoMAE-2 architecture. Transformer-based video understanding models flag potential collisions, targeting real-time safety applications in autonomous driving systems.

Computer Vision · VideoMAE-2 · Transformers · PyTorch · Video Analysis

Code Model Demo

Automated Essay Scoring System

Top 10%

Built a model for automated essay scoring as part of the Kaggle AES competition, reaching a Quadratic Weighted Kappa (QWK) of 0.79. The project aimed to reduce manual grading effort and improve feedback for students and educators.

PyTorch · HuggingFace · LLM Fine-tuning · NLP

Code Model Demo
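For readers unfamiliar with QWK, the sketch below computes Quadratic Weighted Kappa from two lists of integer scores. It is a plain-Python illustration of the standard formula, not the project's evaluation code, and the rating range in the example is hypothetical.

```python
def quadratic_weighted_kappa(rater_a, rater_b, min_rating, max_rating):
    """Agreement between two integer ratings, penalising large gaps more.

    Returns 1.0 for perfect agreement, about 0.0 for chance-level
    agreement, and negative values for systematic disagreement.
    """
    if max_rating <= min_rating:
        raise ValueError("max_rating must be greater than min_rating")
    if len(rater_a) != len(rater_b) or not rater_a:
        raise ValueError("ratings must be non-empty and of equal length")

    n = max_rating - min_rating + 1
    total = len(rater_a)

    # Observed confusion matrix between the two sets of ratings.
    observed = [[0] * n for _ in range(n)]
    for a, b in zip(rater_a, rater_b):
        observed[a - min_rating][b - min_rating] += 1

    # Marginal histograms, used to build the chance-agreement matrix.
    hist_a = [sum(row) for row in observed]
    hist_b = [sum(observed[i][j] for i in range(n)) for j in range(n)]

    numerator = 0.0
    denominator = 0.0
    for i in range(n):
        for j in range(n):
            weight = (i - j) ** 2 / (n - 1) ** 2
            expected = hist_a[i] * hist_b[j] / total
            numerator += weight * observed[i][j]
            denominator += weight * expected

    if denominator == 0.0:
        # Both raters used one identical rating everywhere.
        return 1.0
    return 1.0 - numerator / denominator


# Hypothetical essay scores on a 1-3 scale.
perfect = quadratic_weighted_kappa([1, 2, 3], [1, 2, 3], 1, 3)
reversed_scores = quadratic_weighted_kappa([1, 2, 3], [3, 2, 1], 1, 3)
```

The quadratic weight means that predicting a 1 for a true 3 costs four times as much as predicting a 2, which suits ordinal targets such as essay grades.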

Fake Scene Detector

Top 10% · 0.93 AUC

Computer vision project that detects and classifies fake or manipulated scenes in images. It uses deep neural networks and image processing techniques to identify digitally altered content for media authenticity verification.

Computer Vision · Deep Learning · Image Processing

Code
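The AUC reported for the Fake Scene Detector can be read as the probability that a randomly chosen fake image receives a higher score than a randomly chosen real one. The sketch below computes it by direct pair counting. It assumes label 1 means "fake", which the source does not specify, and the example data is hypothetical.

```python
def roc_auc(labels, scores):
    """Area under the ROC curve via pairwise comparison.

    labels: 1 for the positive class, 0 for the negative class.
    scores: model scores, higher meaning "more likely positive".
    Ties between a positive and a negative count as half a win.
    """
    positives = [s for y, s in zip(labels, scores) if y == 1]
    negatives = [s for y, s in zip(labels, scores) if y == 0]
    if not positives or not negatives:
        raise ValueError("need at least one positive and one negative example")

    wins = 0.0
    for pos_score in positives:
        for neg_score in negatives:
            if pos_score > neg_score:
                wins += 1.0
            elif pos_score == neg_score:
                wins += 0.5
    return wins / (len(positives) * len(negatives))


# Hypothetical scores for four images.
auc = roc_auc([0, 0, 1, 1], [0.1, 0.4, 0.35, 0.8])
```

Pair counting is quadratic in the number of examples, so it suits small illustrations; rank-based implementations are the usual choice for large evaluation sets.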

BrisT1D Blood Glucose Prediction

Top 10%

Machine learning project that predicts blood glucose levels for people with Type 1 diabetes using the BrisT1D dataset. Time series forecasting models help patients and healthcare providers manage diabetes through predictive analytics.

Machine Learning · Time Series · Healthcare AI · Predictive Analytics

Code

AI Applications & RAG Systems

CrawlGPT

Web content crawler with LLM-based RAG (Retrieval-Augmented Generation). It extracts content from URLs, summarizes it, and lets users query the crawled material in natural language.

Generative AI · Web Crawling · RAG · Vector DB · CI/CD · Docker

Code Demo
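The retrieval step of a RAG pipeline like CrawlGPT's can be sketched in a few lines. This toy version is an assumption-laden stand-in: it uses word counts instead of learned embeddings and a sorted list instead of a vector database, and all function names and sample texts are hypothetical.

```python
import math
import re
from collections import Counter


def chunk_text(text, max_words=50):
    """Split text into consecutive chunks of at most max_words words."""
    words = text.split()
    return [" ".join(words[i:i + max_words]) for i in range(0, len(words), max_words)]


def bag_of_words(text):
    """Lowercased word counts; a crude stand-in for an embedding vector."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(counts_a, counts_b):
    """Cosine similarity between two sparse word-count vectors."""
    dot = sum(counts_a[term] * counts_b[term] for term in counts_a if term in counts_b)
    norm_a = math.sqrt(sum(v * v for v in counts_a.values()))
    norm_b = math.sqrt(sum(v * v for v in counts_b.values()))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def retrieve(query, chunks, top_k=2):
    """Return the top_k chunks most similar to the query."""
    query_vec = bag_of_words(query)
    ranked = sorted(chunks, key=lambda c: cosine(query_vec, bag_of_words(c)), reverse=True)
    return ranked[:top_k]


# Hypothetical crawled snippets.
crawled = [
    "LoRA adapts large models with low-rank matrices",
    "FastAPI serves HTTP endpoints",
    "Docker packages applications into containers",
]
best = retrieve("how does docker package applications", crawled, top_k=1)
```

In a full pipeline the retrieved chunks would be placed in the LLM prompt as context; swapping `bag_of_words` for an embedding model changes the quality of matches but not the overall flow.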

PDF Insight Pro: Agentic RAG App

Agentic RAG app built with FastAPI, FAISS, LangChain, and Groq that answers questions about PDFs and validates answers against the web in real time via Tavily. Reached a mean semantic similarity of 0.852 with ~86% evaluation accuracy. Includes an Android app built in Java.

Agentic RAG · LangChain · FAISS · FastAPI · Docker · Android

Code Demo
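A mean semantic similarity score like the one quoted for PDF Insight Pro is commonly computed as the average cosine similarity between embeddings of generated answers and reference answers. The source does not state the exact procedure or embedding model, so the sketch below is one plausible reading; the vectors are hypothetical placeholders for real embeddings.

```python
import math


def cosine_similarity(u, v):
    """Cosine similarity between two dense vectors of equal length."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def mean_semantic_similarity(answer_vectors, reference_vectors):
    """Average cosine similarity over paired answer/reference embeddings."""
    if len(answer_vectors) != len(reference_vectors) or not answer_vectors:
        raise ValueError("inputs must be non-empty and of equal length")
    scores = [
        cosine_similarity(answer, reference)
        for answer, reference in zip(answer_vectors, reference_vectors)
    ]
    return sum(scores) / len(scores)


# Hypothetical 2-dimensional "embeddings" for two question/answer pairs.
answers = [[1.0, 0.0], [1.0, 1.0]]
references = [[1.0, 0.0], [1.0, 0.0]]
mean_score = mean_semantic_similarity(answers, references)
```

Reporting the mean alongside an accuracy figure is useful because similarity rewards answers that are close in meaning, while accuracy depends on whatever threshold or judge decides an answer is correct.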

AI-Agent-Based Deep Research System

Multi-agent AI system for researching complex topics. Collaborating agents with specialized roles gather, analyze, and synthesize information from multiple sources into in-depth research reports.

Multi-Agent AI · Research Automation · LangChain · LangGraph · Knowledge Synthesis

Code Demo

NLP & Fine-tuned Models

Plagiarism Detector using Fine-tuned SmolLM2

1000+ Downloads/Month

Fine-tuned the SmolLM2 135M Instruct model on the MIT Plagiarism Detection Dataset to identify textual similarity. Achieved an F1 score, recall, and precision of 0.96 each. The model outputs a binary classification.

Generative AI · NLP · LLM Fine-tuning · PyTorch · HuggingFace

Code Model Demo
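The precision, recall, and F1 figures reported for the plagiarism detector follow the standard binary-classification definitions, sketched below. The encoding of 1 as "plagiarised" is an assumption, and the example labels are hypothetical.

```python
def binary_classification_report(y_true, y_pred):
    """Precision, recall, and F1 for binary labels (1 = positive class)."""
    pairs = list(zip(y_true, y_pred))
    true_pos = sum(1 for truth, pred in pairs if truth == 1 and pred == 1)
    false_pos = sum(1 for truth, pred in pairs if truth == 0 and pred == 1)
    false_neg = sum(1 for truth, pred in pairs if truth == 1 and pred == 0)

    # Guard against division by zero when a class is never predicted or present.
    predicted_pos = true_pos + false_pos
    actual_pos = true_pos + false_neg
    precision = true_pos / predicted_pos if predicted_pos else 0.0
    recall = true_pos / actual_pos if actual_pos else 0.0

    # F1 is the harmonic mean of precision and recall.
    denominator = precision + recall
    f1 = 2 * precision * recall / denominator if denominator else 0.0
    return {"precision": precision, "recall": recall, "f1": f1}


# Hypothetical labels for five text pairs.
report = binary_classification_report([1, 1, 1, 0, 0], [1, 1, 0, 1, 0])
```

When precision and recall are equal, F1 equals both, which is consistent with all three reported scores sharing the same value.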

AI-Powered Podcast to Blog Generator

Converts podcasts into blog posts using AI, and generates FAQs, social media posts, newsletters, and SEO elements. Uses LangChain, Llama 4, OpenAI Whisper-large-v3-turbo, Pydantic, and FastAPI.

Generative AI · NLP · Tavily Search · OpenAI Whisper · FastAPI · Docker

Code Demo

AI-Powered YouTube Video Summarizer & Fact-Checker

Web app that extracts captions from YouTube videos, generates summaries and text embeddings, and lets users search within the transcripts. It also refines context and fact-checks claims using AI models and web crawlers.

Generative AI · NLP · Web Scraping · FastAPI · Pandas

Code Demo Video

Smart Resume Generator using AI

AI-powered application that automatically generates professional resumes tailored to specific job descriptions. Uses natural language processing to analyze job requirements and optimize resume content for better job matching and ATS compatibility.

Generative AI · NLP · LangChain · Document Generation

Code Demo

Deep Learning & Time Series

Gas Turbine Electricity Prediction with LSTM

Built an LSTM neural network that forecasts gas turbine electricity output from time-series data. The model reaches an RMSE below 370 and outperforms traditional prediction methods.

TensorFlow · LSTM · Time Series · Deep Learning

Code
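Two pieces of the gas turbine project are easy to illustrate without a deep learning framework: turning a series into sliding windows that a sequence model such as an LSTM can train on, and scoring forecasts with RMSE. The window size and sample values below are hypothetical, and the source does not describe the project's actual preprocessing.

```python
import math


def make_windows(series, window_size):
    """Build (input window, next value) pairs for sequence forecasting."""
    pairs = []
    for start in range(len(series) - window_size):
        window = series[start:start + window_size]
        target = series[start + window_size]
        pairs.append((window, target))
    return pairs


def rmse(actual, predicted):
    """Root mean squared error, in the same units as the target."""
    if len(actual) != len(predicted) or not actual:
        raise ValueError("inputs must be non-empty and of equal length")
    squared_errors = [(a - p) ** 2 for a, p in zip(actual, predicted)]
    return math.sqrt(sum(squared_errors) / len(squared_errors))


# Hypothetical output readings and forecasts.
windows = make_windows([1, 2, 3, 4, 5], window_size=3)
error = rmse([100.0, 200.0, 300.0], [110.0, 190.0, 300.0])
```

Because RMSE is expressed in the target's own units, a threshold such as the one quoted above is only meaningful relative to the typical magnitude of the output being predicted.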

15+ Projects

Top 2% Best Kaggle Rank

1000+ Monthly Downloads

Live Demos