Area | Title | Authors | Mentor TA |
AI for science |
Highlight: Accelerating Two-Photon Calcium Imaging Segmentation with Convolutional Neural Networks |
Anthony Joseph Riley, Emanuel Lars Ingvar Herberthson, Frank Charles DeGuire |
Lucas |
AI for science |
Highlight: Atmospheric Point-spread Function Estimation of Galactic Survey Images |
Ishaan K Singh, Riley Dayne Carlson, Tirth Dharmesh Surti |
Samir |
AI for science |
Highlight: Learning CO2 Plume Migration in Storage Reservoir using Neural Operators |
Alaa Alahmed, Isaac Ju |
Ishikaa |
Bio-medical |
Highlight: Enhancing 3D Lesion Segmentation in PET-CT Scans Using MIP-Segmentation-Reconstruction Techniques |
Jirayu Burapacheep, Sandy Chen, Thodsawit Tiyarattanachai |
Samir |
Graphics |
Highlight: Improving the Efficiency of 3D Pose Estimation Model |
Quan Minh Ho |
Ishikaa |
Graphics |
Highlight: ShapeCraft: Body-Aware and Semantics-Aware 3D Object Design |
Hannah Cha |
Ishikaa |
Graphics |
Highlight: StyleScape: Stylized and Depth-Consistent 3D Scene Generation |
Chetan Rajagopal Nair, Emily Sihan Zhang |
Cem |
Multi-modality |
Highlight: HilAIt: Automatic Video Highlighting System Leveraging Audio, Text, Facial, and Semantic AI |
Danica Xiong |
Chengshu |
Others |
Highlight: Using Pose Estimation to Analyze Rock Climbing Technique |
Jerry R Qu |
Tiange |
Robotics |
Highlight: Manipulation of Soft Cloths using DenseTact Optical Tactile Sensors |
Ankush Kundan Dhawan, Sunny Singh |
Sanjana |
Theory |
Highlight: A Mean-Field Theory of Training Deep Neural Networks |
Iris Yi-Xian Zhou, Raj V Pabari |
Saumya |
Vision |
Highlight: Automating Powerlifting Judging through Keypoint Detection |
Rahul Krishna Thomas, Rishi Alluri, Tarun Kumar Martheswaran |
Chaitanya |
Vision |
Highlight: Chessboard Understanding with Convolutional Learning for Object Recognition and Detection |
Alex Zhang Shan, Brent Ju |
Cem |
Vision |
Highlight: Dynamic Billboard Replacement in Videos |
Shweta Agrawal, Yawen Guo |
Nikil |
Vision |
Highlight: Img2SumGlyphs: Transformer-based OCR of Sumerian Cuneiform |
Cole Simmons |
Sanjana |
Vision |
Highlight: Latent Diffusion-based Art Style Transfer Model |
Jenny Xu, Xianchen Yang |
Sanjana |
Vision |
Highlight: Movement Hacks in Video Games using Visual Input Only |
William Song Liu, Yvette Yinyin Lin |
Cem |
Vision |
Highlight: Novel Knowledge Distillation Techniques for Visual Skin Disease Detection |
Rachel Park, Rohan Reddy Davidi |
Chengshu |
Vision |
Highlight: Poker Game State Detection |
Jack Jin Hung, Luke McLeod Moberly, Michael Samouel Ghatas Souliman |
Cem |
Vision |
Highlight: SliceViT |
Andriy Sergiyenko |
Nikil |
Vision |
Highlight: Transferring Vision: Teaching CNNs to See with ViT Wisdom |
Kris Jeong, Pauline Arnoud |
Anwesha |
Vision |
Highlight: Zero-shot Prompt-based Partial 3D Point Cloud Creation of the Specified Object from an Unlabeled 2D Image |
Sagar Manglani |
Ishikaa |
AI for science |
Basketball Detection: From Images to Videos |
Justin Chang, Pin-Hsuan Tseng |
Sanjana |
AI for science |
Computer Vision Approaches to Burned Area Image Segmentation |
Iris Xia, Matthew Jordan Villescas, Serena Zhang |
Lucas |
AI for science |
Counting Convolutional Neural Networks for Classification for Wildlife Conservation |
Anna Catherine Edmonds, Cheyenne Ali Sadeghi, Vivek Brahmatewari |
Jenny |
AI for science |
EcoEye: Classification on Plant Pathology Image Dataset Using Deep Learning |
Adam Lida Zhao, Nomin-Erdene Bayarsaikhan |
Lucas |
AI for science |
Estimating Aboveground Carbon Density in Sabah, Malaysia using Deep Learning and Sentinel-2 Satellite Imagery |
Alice Zhaoyi Chen, Amy Shilin Guan, Ryne Zen-Zhi Reger |
Lucas |
AI for science |
Predicting Winning Captions for Weekly New Yorker Comics |
Sonny Sano Young, Stanley Cao |
Nikil |
AI for science |
Rooftop HVAC equipment detection from aerial imagery |
Neil Wang Chen |
Raghav Garg |
AI for science |
Segmentation of Dune Crestlines Using Convolutional Neural Networks |
Timmy Chee Cheng Lui |
Abhijit |
AI for science |
Vision Transformer-Based Routability Prediction with DINOv2 Transfer Learning Using the CircuitNet 2.0 Dataset |
Hangfei Lin, Shundan Xiao |
Bohan |
AI for science |
γGAMMAS: Improving Mathematical Reasoning in Vision Language Models Through Synthetic Data Generation |
Ramgopal Venkateswaran, Shubhra Mishra |
Nikil |
Bio-medical |
A Survey of Synthetic Medical Image Generation For Improving Disease Classification |
Bradley Hu, Brendan Lee Adams McLaughlin, Michael Maffezzoli |
Jenny |
Bio-medical |
Automated Polyp Segmentation in Gastrointestinal Tract Images: A |
Nicolas Friley |
Abhijit |
Bio-medical |
Brain Imaging Foundation Model with DINOv2 for Image Registration |
Kevin Chen |
Lucas |
Bio-medical |
Cell Segmentation of Bright field Microscope Images |
Potchara Boonrat |
Raghav Garg |
Bio-medical |
Chest X-ray synthetic data for better testing and evaluation of ML models |
Alexis Geslin, Elsa Bismuth |
Bohan |
Bio-medical |
Classification of Uncertainty in Nuclei Segmentation From H&E Images |
Conor Messer, Rohit Khurana, Suhana Bedi |
Raghav Garg |
Bio-medical |
Comparative Performance Analysis of CNN-Based Feature Extraction and Classification Techniques for Histopathological Lung Cancer Image Classification |
Carlo Amio Dino, Nicole Garcia, Yu Han Daisy Wang |
Lucas |
Bio-medical |
Cortex-Level Brain MRI Generation Using Diffusion Models |
Isabel Michel, Parnian Azizian, Sidra Nadeem |
Samir |
Bio-medical |
Diagnosis of Alzheimer’s Disease Using 3D and 2D Convolutional Neural Networks |
Nikhil Sharma, Siya Goel, Zhi Zheng |
Raghav Ganesh |
Bio-medical |
End-To-End Pediatric Bone Fracture Localization with Data Augmentation |
Jingwen Wu |
Tiange |
Bio-medical |
Enhanced Motion Correction in Magnetic Resonance Fingerprinting via a Deep Learning Approach |
Mengze Gao, Xuetong Zhou |
Saumya |
Bio-medical |
Exploring Deep Learning Methods for Head CT Triaging |
Grant Sheen, Jonathan Nathaniel Coronado |
Jenny |
Bio-medical |
Generating 3D Brain MRIs by Parameter Efficient Finetuning of Diffusion Transformer (DiT) |
Yingbo Li |
Tiange |
Bio-medical |
Generating Synthetic Chest X-Rays with generative modeling |
Abhishek Kumar, Juan Pablo Triana Martinez |
Saumya |
Bio-medical |
IGSR: Iterative Gaussian Splatting Refinement for Limited-View Scenarios |
Yash Taneja |
Samir |
Bio-medical |
Image Colorization using GAN and Inception-ResNet-v2 |
Mia Penfold, Olivia Beyer Bruvik |
Abhijit |
Bio-medical |
Partial Eclipse of the Heart: Left Ventricle Segmentation on the EchoNet Dataset |
Emma Olivia Cruz, Michael Jonathan Yu |
Raghav Garg |
Bio-medical |
ProstateZone Classifier: Enhancing Transition Zone Lesion Classification with Deep Learning |
Abhi Kumar, Hannah Gail Prausnitz-Weinbaum, Samrat Thapa |
Nikil |
Bio-medical |
Skin Synthesis: A Comparative Analysis of AI and Traditional Methods in Skin Type Classification |
Gabe Gaw, Sherine M Ismail |
Abhijit |
Bio-medical |
Spot On: Unraveling Patterns in Colorectal Cancer Tumors with Spatial Transcriptomics and Image-based Deep Learning |
Avash Shrestha, Priyanka Shrestha, Viraj Mehta |
Abhijit |
Bio-medical |
StenosisSeg: Automatic Stenosis Segmentation for Coronary Artery Disease |
Andy Tianqi Wang, Emily Ruoyu Liu, Tim Jing |
Bohan |
Bio-medical |
Text-Enhanced Medical Visual Question Answering |
Chih-Ying Liu, Fan Diao |
Lucas |
Bio-medical |
The Singapore Cycling Path Dataset |
Wai Lun Suen |
Abhijit |
Bio-medical |
Using SSRL Models to Classify T-cell Autofluorescent Images |
Gabe Eduardo Seir, Jennifer Jing Xu, Trevor William Carrell |
Abhijit |
Graphics |
A Conditional Generative Image Model |
He Nan Li, Lucas Lu |
Bohan |
Graphics |
An Artistic Art-ificial Intelligence: A More Intelligent Way to AI Art Style Transfer |
Helen April He, Regina T.H. Ta, William Isaac Shabecoff |
Samir |
Graphics |
An exploration on Wasserstein GANs and a f-GANs |
Chaoqun Jia |
Ishikaa |
Graphics |
Automatic Prediction of Affordance-Preserving 3D Meshes to Improve Interactive Robotics Simulation |
Emily Broadhurst, Sean Bai, Tommy Anthony Bruzzese |
Cem |
Graphics |
GIMME: 3D Gaussian Inverse Rendering for Mobile Mesh Extraction |
Hannah Norman, Neil Nie |
Tiange |
Graphics |
Generating Spatial Images using Monocular Depth Estimation |
Karan Singh Soin, Nick John Riedman |
Raghav Ganesh |
Graphics |
GlamTry: Advancing Virtual Try-On for High-End Accessories |
Khabane Khabane Lekena, Mothana Alsoofi, Ting-Yu Chang |
Ishikaa |
Graphics |
Hybrid Neural Network-Monte Carlo Approach for Efficient PDE Solvers |
Ethan Hsu, Hong Meng Yam, Ivan Ge |
Bohan |
Graphics |
On the Detection of GAN-Generated Facial Imagery |
Colin Patrick Sullivan, Harshal Rajesh Agrawal, Ricky Anthony Parada |
Raghav Ganesh |
Graphics |
SaveFace: Controlling Face in Image Diffusion Models |
Ashna Khetan, Isabel Paz Reyes Sieh, Laya Balaji Iyer |
Raghav Ganesh |
Graphics |
Semantic Segmentation of Agricultural Anomalies in Aerial Imagery: Analysis of Segmentation Models |
Malvyn Lai |
Raghav Garg |
Multi-modality |
3D SDFusion Animal Shape Generation |
Grace Yang, Siqi Ma, Yunong Liu |
Chengshu |
Multi-modality |
Automated Product Description Generation for E-commerce via Vision-Language Model Fine-tuning |
Ericka Liu, Wei Zhao, Xinyan He |
Saumya |
Multi-modality |
CuratorAI: Enhancing Art Appreciation through AI-Powered Insights |
Aryan Chaudhary, James William Stevens, Kevin K Yang |
Chaitanya |
Multi-modality |
Dress-UP: A Deep Unique, Personalized Fashion Recommender |
Ananya Siri Vasireddy, Evelyn Hejin Choi, Poonam Sahoo |
Anwesha |
Multi-modality |
Emotion Recognition in Videos Through Deep Neural Network Models |
Senyang Jiang, Suxi Li, Yichen Jiang |
Anwesha |
Multi-modality |
Exploring Enhancements to Text-to-drawing Methods |
Ching Hsuan Ho |
Wenlong |
Multi-modality |
Exploring Richer Feature Embeddings for Efficient Video Moment Localization |
Jay Martin, Ranajit Gangopadhyay, Rao Prathik |
Chaitanya |
Multi-modality |
Figure2Code: Enhancing Vision-Language Models with Synthetic Figure-Code Pairs for Improved Figure Understanding |
Abhinav Lalwani, Johnny Chang |
Nikil |
Multi-modality |
Generating High Quality Anime Videos with Diffusion |
Jonathan Lee, Justin Peter Lim, Winston Shum |
Anwesha |
Multi-modality |
Investigating a Shared Embedding Space for Image to Audio Object-Centric Data |
Suzannah Dalton Wistreich |
Nikil |
Multi-modality |
Med-Idefics: A Two-Stage Fine-Tuning Approach for Enhanced Medical Visual Question Answering |
Brendan Murphy |
Raghav Ganesh |
Multi-modality |
Multimodal Retrieval Augmented Generation for Instruction Manual Understanding via Contrastive Learning |
Ali Hindy, William Toby Denton |
Nikil |
Multi-modality |
NEUROCIFE - Implementation of Generative Models on Brain Signals in the context of Civil Engineering |
Guilherme Simioni Bonfim |
Chengshu |
Multi-modality |
Query based Image Synthesizer and multi document summarizer |
Prescilla Pragasam |
Jenny |
Multi-modality |
Representation Fine-Tuning on Vision Tasks |
Zheng Wang |
Tiange |
Multi-modality |
Sweden or Switzerland – GeoLocation on Noisy Datasets |
Bjorn Engdahl, Matthias Heubi |
Wenlong |
Multi-modality |
Titan-ification using Vision LLMs on Custom Attack on Titan Dataset |
Gaurav Kiran Rane, Kavin Anand |
Chengshu |
Multi-modality |
Towards Optimal Convolutional Transfer Learning Architectures for Breast Lesion Classification and ACL Tear Detection |
Aditri Bhagirath, Daniel Michael Frees, Moritz Alexander Bolling |
Nikil |
Multi-modality |
Understanding How Vision-Language Models Reason when Solving Visual Math Problems |
Joseph Tey |
Wenlong |
Multi-modality |
Vision is Language: Visual Understanding via LLM |
Xiangyu Liu |
Tiange |
Multi-modality |
Visual Question and Answering Preference Alignment with ORPO and DPO |
Gerardus de Bruijn, Manasven Grover |
Chaitanya |
Multi-modality |
What Was That?: Lip-Reading Silent Videos |
Linyin Lyu, Miguel Gerena Rivera, Akayla Hackson |
Anwesha |
Others |
Discrete Diffusion for Image Generation |
Anthony Zhan |
Saumya |
Others |
Estimating Water Quality from Satellite Imagery Using the SustainBench Dataset |
Diego Zancaneli, Ernesto Sung Woo Nam Song, Rikhil Paresh Vagadia |
Anwesha |
Others |
Leveraging 3D CNNs and YOLO for Tennis Stroke Classification |
Cameron Allen Camp, Dominic Egidio Borg |
Raghav Ganesh |
Others |
PNTING: Detecting AI in the pAInting world |
Mhar Eisen Santos Tenorio |
Saumya |
Others |
Parallel U-Net: Improving Image Colorization Using Bounding Boxes with a Modified U-Net Architecture |
Shrey Verma |
Raghav Garg |
Others |
Stalking Farms: Predicting Agricultural Yield with Satellite Imagery |
Peter Albert John Ming, Suhas Pradyumna Chundi |
Lucas |
Robotics |
A Study of Visuomotor Behavior Cloning: Performance Considering Real-Time Requirements |
Jared S Weissberg, Vignesh Anand |
Sanjana |
Robotics |
Analyzing the Performance of 3D Pose Estimators to Optimize Humanoid Robot Control |
Ethan Robert Whitmer |
Wenlong |
Robotics |
Editing Neuron Hidden Activations for Specific Visual Properties in ViT |
Bi Tian Yuan, Sunny Sun |
Sanjana |
Robotics |
Making 3D Scenes Interactive |
Addison Reese Jadwin, Mu-sheng Lin, Tatiana Veremeenko |
Cem |
Robotics |
Semantic Segmentation for Robot Vision with Synthetic Training Data |
Alberto Guiggiani |
Sanjana |
Robotics |
Synthetic Dataset Generation Toolbox and Enhanced 6D Pose Estimation and Object Detection on YCB Objects |
Gabriel M SantaCruz, Hsinhua Lu, Pin-Hua Huang |
Sanjana |
Robotics |
Task-Driven Reasoning from and for Fine-Grained Visual Representations |
Yingke Wang |
Wenlong |
Theory |
Do Experts Specialize? A Mechanistic Exploration of Mixture of Experts Models |
James Poetzscher |
Saumya |
Theory |
What Can Activation Patching Tell Us About Adversarial Examples? |
Katherine Yang Yu, Neil Pagarkar Rathi |
Saumya |
Vision |
3D Gaussian Splatting for Intelligence, Surveillance, and Reconnaissance |
James K Park, Jean Rodmond Junior Laguerre |
Kyle |
Vision |
3D Reconstruction in the wild |
Niranjan Thanikachalam |
Chaitanya |
Vision |
A Comparative Study of CNN Models in Alzheimer’s Detection |
Jiahui Chen, Olivia Hannah Weiner, Yvonne Hong |
Lucas |
Vision |
A Mix-and-Mask Approach to Self-Supervised Image Pretraining |
Christos Polzak, Joy Yun, Samuel Shuo Xing |
Chengshu |
Vision |
A Novel Sign Language Translation Model: Gloss-Free Video-to-Sequence Translation Using Transformer Encoder and LLM Decoder |
Evy Zhu Shen, Jeff Liu, Mac Ya |
Samir |
Vision |
AI-Enhanced Lighting Activation for Awakening Passengers |
Dayoung Kim, Wanbin Song |
Chengshu |
Vision |
ASTRO-G: Advancing Style Transfer with Robust Object Detection and Generative Models |
Miguel Angel Fuentes Hernandez, Youssef Faragalla |
Jenny |
Vision |
Adaptation of OCR Models for LaTeX Vision |
Nikash Ankur Chhadia, Sambhav Gupta |
Samir |
Vision |
Aerial Wildfire Detection using Image Classification |
Firat Taxpulat |
Cem |
Vision |
AirBlender: Enhancing Interactive 3D Environment Generation with Text-driven Modifications |
Agam Mohan Singh Bhatia |
Cem |
Vision |
AirWall: Malicious Drones Detection using YOLO |
Alexey Alexandrovich Tuzikov |
Raghav Garg |
Vision |
Album Covers Deserve Some Attention |
Natalie Kam Greenfield, Ngorli Fiifi Paintsil |
Ishikaa |
Vision |
An Evaluation of the Application of Various Models in Autonomous Vehicles |
Bruno de Moraes Dumont, Emily Chengxi Xia, Eric Chen |
Jenny |
Vision |
Automatic Soccer Game Highlight Detection |
Fang Shu, Mike Yang |
Nikil |
Vision |
Blackjack Card Counting |
Andrew Wooyong Chung, Pranav Sai Ravella |
Cem |
Vision |
Building a General Purpose Fruit-Detector with Faster R-CNN |
Kasen Stephensen |
Kyle |
Vision |
CNN and Transformer-Based Segmentation to Classify Deforestation Drivers |
Ayesha Khawaja, Claire Macnamara Morton, Yasmine Fatima Mabene |
Anwesha |
Vision |
Catching Fire: Predicting Wildfire Progress with Computer Vision |
Jack Francis Michaels |
Chaitanya |
Vision |
Dancing in Style: Classifying Dance Videos By Style |
Esteban Nathan Guzman Jimenez, Han Vu Trieu Dao, Sophie Wu |
Saumya |
Vision |
De-raining Natural Scenes using Diffusion Models |
Emil Biju, Sidharth Tadeparti, Sneha Jayaganthan |
Kyle |
Vision |
Deep Learning Deepfake Detection |
Hlumelo Notshe, Shannon Xiao, Tycho Augustus Svoboda |
Jenny |
Vision |
Development of WaldoNet: A Novel Approach to Solving “Where’s Waldo” |
Andrew Carter Lesh, Barney Haoyun Miao |
Tiange |
Vision |
EduCartoonizer: Leveraging InST, IPT, and DCT-Net for High-Quality Cartoon Style Transfer in Educational Images and Videos |
Liuxin Yang |
Ishikaa |
Vision |
Efficient Multi-Stream Fusion for Violence Detection: Integrating RGB, Optical Flow, and Joint Features on Low-Memory Devices |
Jermaine Zhao, Nathan Le Tran, Robin Li |
Chengshu |
Vision |
Enhancing Chart-to-Text Conversion |
Theodore Yu |
Chengshu |
Vision |
Enhancing Privacy: Automated Detection and Blurring of Sensitive Information in Images and Video Feeds |
Paul Woringer, Yanis Najy Miraoui |
Saumya |
Vision |
Enhancing Traffic Sign Detection with ToDayGAN and YOLOv5: A Two-Step Data Augmentation Approach for Small Datasets |
Benson Zu |
Bohan |
Vision |
Enhancing Word-Level Translation of American Sign Language Using Modified 3D Convolutional Networks |
Jadon Geathers, Nikhil Suresh |
Anwesha |
Vision |
Ensemble Transformer Architecture for Plant Traits Prediction |
Annabelle Aurelia Jayadinata, Lisa Fung |
Anwesha |
Vision |
Evaluating CNN* Architectures and Training Paradigms for Visual Commonsense Reasoning Using the SVRT Dataset |
Jasmine Selin Bilir, Megan Dass, Riya Dulepet |
Nikil |
Vision |
Exploring Color Modification in Multi-view Feature Fusion Networks |
Kaden Tien Nguyen, Nathan J Zhao, Patrick Ruibin Li |
Bohan |
Vision |
Fast R-CNN and Multimodal Attention Architecture for Image Captioning |
Armeen Ahmed, Kyle Ian Schmoyer |
Anwesha |
Vision |
FractuVision: Enhancing Bone Fracture Diagnostics |
Aryan Siddiqui, Jun Wang |
Jenny |
Vision |
GAS - A Visual Navigation Framework for Producing 2D and 3D Semantic Maps from Video |
Alexander Kuznetsov, Ghanshyam Bhutra, Swaroop Pal |
Nikil |
Vision |
Hand Segmentation and Depth Estimation |
Marina Qian, Yuxuan Wu |
Ishikaa |
Vision |
Image Augmentations For Satellite Imagery Fire Risk Assessment |
Adrian Antonio Saldana, Mohamed A Owda |
Anwesha |
Vision |
Improved Recoloring of SAR Satellite Images |
Da Sun, Sukrut Oak, Young Chol Song |
Chaitanya |
Vision |
Improving Camouflage Object Detection |
Max Serge Meyberg, Vinay Kumar Awasthi, Yin-Li Liu |
Wenlong |
Vision |
Interchange Interventions on Vision Models |
Emily Bunnapradist |
Bohan |
Vision |
JASMUR: Estimating the Absolute Pose of Vehicles in Real-World Traffic Images |
Haldun Umur Darbaz, Jasmine Saerom Park |
Jenny |
Vision |
Late Fusion Multivariate Stock Market Prediction |
Markus Armbruster |
Raghav Ganesh |
Vision |
Learn To Climb By Seeing: Climbing Grade Classification with Computer Vision |
Julian Rodriguez Cardenas, Kathryn Velez Garcia |
Kyle |
Vision |
Leveraging Lightweight AI for Video Querying in a RAG Framework |
Adam Hyungsuk Chun, Emily Angel Hsu |
Nikil |
Vision |
Live Non-isolated Sign Language Recognition Using Transformers |
Daniel Li Yang, Ethan Tamer Farah |
Chengshu |
Vision |
Low-Data Deep Learning for License Plate Blurring |
Parth Sarin, Patricia Wei |
Raghav Ganesh |
Vision |
MatchPoint: Your Computer Vision Tennis Coach |
Evan Cheng, Isaac I. Gorelik, Rishi Dange |
Abhijit |
Vision |
Mix and Match: ByteTrack with DETR |
Alice Ku, Simba Xu, Sureen Heer |
Nikil |
Vision |
Multi-Domain Transfer Learning for Image Classification |
Yijia Wang |
Ishikaa |
Vision |
Neural Aesthetic Portrait Image Assessment |
Cici Hou, Xiyuan Wu |
Tiange |
Vision |
Object Detection and Classification for Waste Disposal |
Ethan Yi Ko, Peng Hao Lu |
Samir |
Vision |
On Fairness of Low-Rank Adaptation of Vision Models |
Qianzhong Chen, Zhoujie Ding |
Sanjana |
Vision |
Optimizing NeRFs for Dynamic Scenes with Image Inpainting and Motion Segmentation |
Lilian Naing Chen |
Tiange |
Vision |
Person Re-Identification in a Video Sequence |
Jiayang Wang, Zhiyuan Li |
Tiange |
Vision |
Real-Time Fire Detection in Video Stream Using Deep Learning |
Dan Pilewski, Omer Doron, Sravan Patchala |
Raghav Ganesh |
Vision |
Real-Time Pokémon Card Detection from Tournament Footage |
Edwin Antonio Pua |
Ishikaa |
Vision |
Reducing Bias in a Facial Gender and Age Predictor |
Dhruv Tandon, Jack Irish |
Raghav Ganesh |
Vision |
Refining Residual Mappings Using Regions of High Attention |
Charles Wang |
Cem |
Vision |
Related Task Self-Supervised Learning for Rock Climbing Route Rating |
Ethan Hans Harianto, Jack Hlavka |
Chengshu |
Vision |
Remote Sensing Multi-class Object Counting: A YOLO Approach |
Zach Peter Rotzal |
Raghav Ganesh |
Vision |
Scoring with Few Shots: Applying Few-Shot Learning to Basketball Analytics |
Joshua Christopher Francis |
Chengshu |
Vision |
Semantic Segmentation of Cropland with Satellite Imagery |
Fred Addy, James Van Kirk |
Chengshu |
Vision |
Sign Language Recognition with Convolutional Neural Networks |
Anusha Aditi Kuppahally, Arnav Gangal, Malavi Ravindran |
Raghav Ganesh |
Vision |
Solar Panel Detection on Satellite Images: From Faster R-CNN to YOLOv10 |
Camila Nicollier Sanchez, Stefan Elbl Droguett |
Lucas |
Vision |
Stable Image Colorization via Reference Image Cross-Attention |
Anshu Bansal |
Jenny |
Vision |
StrawberrAI: Strawberry Classification using a Convolutional Neural Network |
Erik Luna, Ivan Miranda Liongson |
Chengshu |
Vision |
Transfer Learning for Fish Detection in Underwater Images |
Humishka Zope, Ishvi Mathai |
Wenlong |
Vision |
Transfer Learning for Identifying Land Use and Land Cover from Satellite Imagery |
Chenchen Gu |
Lucas |
Vision |
Uncertainty Quantification of Neural Radiance Fields for Enhanced Safety Validation |
Jacob Alexander Frausto |
Bohan |
Vision |
Using Geoembeddings to Predict Image Geolocations |
Kenneth Ruigeng Ma, Parker Joseph Stewart, Wesley Tjangnaka |
Ishikaa |
Vision |
Vision Transformers for Optical Music Recognition of Monophonic Scores |
Christo Dimitrov Hristov, Maddox de Bretteville |
Sanjana |
Vision |
Vision Transformers for Robust Analysis of Satellite Imagery |
Adrian L Gamarra Lafuente, Mai Khanh Hoang, Nathan Gyoohyun Kim |
Jenny |
Vision |
Weather Forecasting UNET |
Jesus E Meza Rosales, Noah J Anderson, Tenzin Tsultrim |
Samir |