Computer Vision & Machine Learning Developer looking for Summer 2026 internship opportunities
Introduction
I'm a Computer Vision graduate student at Carnegie Mellon University (Robotics Institute, SCS), passionate about AR/VR, 3D Vision, Vision-Language Models (VLMs), multimodal understanding, and generative visual intelligence. My journey spans both research and industry—shaping ideas into impactful systems.
Previously, I've worked as a research intern at CMU and VJTI, and as a Software Developer Intern at Myntra. My projects explore diverse CV problems—from fine-tuning Stable Diffusion for zero-shot 3D segmentation in cryo-ET, to modeling cell morphology via unsupervised learning, to scaling virtual try-on systems with garment segmentation and LLM-powered recommendation. My work has been published in Springer LNNS and bioRxiv.
Beyond research, I've built production-ready systems: phishing detection pipelines as Chrome extensions, LLM-augmented search at Myntra, and OCR for low-resource scripts using Kolmogorov–Arnold Networks. I enjoy blending classical CV, deep learning, and prompting to solve real-world challenges.
I'm particularly excited about advancing semantically aligned perception, OCR for underrepresented scripts, and applied VLMs in image-based search, AR-powered experiences, and privacy-conscious, real-time vision systems—especially in e-commerce, accessibility, and consumer tech.

What I have done so far
Carnegie Mellon University
Myntra Designs Private Limited
Veermata Jijabai Technological Institute
Dwarkadas J Sanghvi College of Engineering
Nextgen Techno Ventures Private Ltd
My work
This section showcases my academic contributions, highlighting my research efforts and findings. Each paper has been published, accepted, or is under review. Links to the respective PDFs are provided, illustrating my ability to conduct in-depth research and present findings effectively.


Published in Journal of Electrical Systems 20-10 s (2024): 4772-4787, SCOPUS Indexed
#VGG19
#Mel-Spectrograms
#ANN


To be published in Lecture Notes in Networks and Systems (Springer book series), indexed by SCOPUS. Accepted and presented at the 6th International Conference on Data & Information Sciences (ICDIS-2024).
#BlendingEnsemble
#ML
#DL


Accepted and presented at AISD 2024. Awarded 2nd Best Paper at the workshop.
#LSTM
#ZeroInflated
#PowerTransform


Presented at the 3rd International Conference on Machine Learning and Data Engineering (ICMLDE 2024). Under review in the Procedia Computer Science journal.
#LLM
#NLP
#DL


Published in International Journal of Engineering Research in Computer Science and Engineering.
#BERT
#RoBERTa
#GraphNetworkAnalysis

Under review in the International Journal of Information Technology.
#MaskR-CNN
#FeatureExtraction
#DL


Preprint: bioRxiv
#Stable Diffusion
#Unsupervised Learning
#LoRA
The following projects showcase my skills and experience through real-world examples of my work. Each project is briefly described, with links to its code repository and live demo. Together they reflect my ability to solve complex problems, work with different technologies, and manage projects effectively.


It enhances online shopping by letting users visualize clothing on themselves and receive personalized outfit recommendations based on trend analysis and human-in-the-loop reinforcement learning. Developed for Myntra's HackerRamp WeForShe Hackathon, the project reached pre-finalist status, and I received a six-month internship opportunity.
#SegMind-Diffusion
#ChromaDB
#React JS


Client-Attorney Matchmaking Platform is an MLOps-driven solution that connects clients with legal experts efficiently. It matches clients to attorneys based on the clients' specific legal needs and the attorneys' specialized skills. We won the MLOps Track at the DataHack 2.0 Datathon.
#FalconAI
#Yake
#BERT


An AI-driven carpooling platform that connects commuters to reduce traffic congestion and carbon footprints. Designed to improve urban travel, it features user matching, fare calculation, route optimization, and safety measures, and it incentivizes eco-friendly commuting with carbon credit rewards. We finished as second runner-up at Technovate 2023.
#Django
#ARIMA
#TravellingSalesman


HungerZero connects donors with NGOs and assists people who genuinely need food, reducing both food waste and hunger. Donors can make a donation quickly, and the platform guides them through the process. We won Code Odyssey 2.0 (2023) with this project.
#ReactJS
#Django
#LogMealAPI


It is designed for suicide detection and mental health monitoring, enabling timely intervention and support for at-risk individuals. The platform features a 24-hour chatbot for mental health assistance, stress relief games, mood-based music playlists, and guided meditation exercises, all aimed at enhancing user well-being.
#ReactJS
#uAgents
#Flask


Shopping Chatbot with Web Scraping simplifies online shopping by using web scraping to gather product information from various e-commerce platforms. Powered by Natural Language Processing, the chatbot helps users find the best deals through natural language interactions.
#Selenium
#Flask
#NLP


Event Tracker System tackles information overload with a real-time news extraction platform. It uses NLP techniques, large language models, and network analysis to categorize significant events and build an interactive network graph that visualizes the connections between them.
#NextJS
#Supabase
#VectorDB