Published
2026 (13 Papers)
CVPR: 4, ICLR: 2, ECCV: 1, AAAI: 1, IROS: 1, SCIE(2%): 1, SCIE(8%): 1, SCIE: 2

Path-level Hindsight Instructions for Semantic Exploration in Vision-Language Navigation
Sung June Kim, Sangpil Kim*, Honglak Lee*
Collaboration : University of Michigan
ECCV 2026
# Physical AI
# Vision-Language Navigation

BiGraspAfford: Collision-Aware Bimanual Grasp Affordance Learning for Dual-Arm Manipulation
SungJun Kim, MinJi Seo, Yisoo Lee, Sangpil Kim*, KangGeon Kim*
Collaboration : KIST
IROS 2026
# Physical AI
# Bimanual Manipulation

LVMark: Robust Watermark for latent video diffusion models
Youngdong Jang†, MinHyuk Jang†, JaeHyeok Lee, Feng Yang, Gyeongrok Oh , Jongheon Jeong, Sangpil Kim*
Collaboration : Google DeepMind
IEEE Transactions on Information Forensics & Security
2026
JCR IF Top 8%
[Paper]
[Code]
# AI Safety
# Video Watermarking

Decoupled Generative Modeling for Human-Object Interaction Synthesis
Hwanhee Jung, Seunggwan Lee, Jeongyoon Yoon, SeungHyeon Kim, Giljoo Nam, Qixing Huang, Sangpil Kim*
Collaboration : META Realty Lab, University of Texas At Austin
CVPR 2026
[Paper]
[Project]
[Code]
# Physical AI
# Human-Object Interaction



M3KG-RAG: Multi-hop Multimodal Knowledge Graph-enhanced Retrieval-Augmented Generation
Hyeongcheol Park, Jiyoung Seo, Jaewon Mun, Hogun Park, Wonmin Byeon, Sung June Kim, Hyeonsoo Im, JeungSub Lee, Sangpil Kim*
Collaboration : Hanwha, NVIDIA Research, Sungkyunkwan University
CVPR 2026
[Paper]
[Project]
# Agentic AI
# Multimodal RAG



Single Image-based Gaussian Splatting for 3D Reconstruction of Movable Articulated Objects
Hwanhee Jung, Seunggwan Lee, Jeongyoon Yoon, Qixing Huang, Sangpil Kim*
Collaboration : University of Texas At Austin
Advanced Engineering Informatics 2026, JCR IF Top 2%
[Paper]
[Code]
# 3D Vision
# 3D Reconstruction

Egocentric Hand Activity Video Dataset and Bidirectional Motion-Priors for Hand Action Recognition
Jiyoung Seo, Dong In Lee, Pilhyeon Lee, Jiwoo Lee, YounHee Gil, Karthik Ramani, Sangpil Kim*
Collaboration : Purdue University, Inha University
IEEE Acess 2026
[Paper]
[Code]
# Physical AI
# Action Recognition

Reconstruction Using the Invisible: Intuition from NIR and Metadata for Enhanced 3D Gaussian Splatting
Gyusam Chang, Tuan-Anh Vu, Vivek Alumootil, Harris Song, Deanna Pham, Sangpil Kim*, M. Khalid Jawed*
Collaboration : UCLA
AAAI 2026
[Paper]
# 3D Vision
# Gaussian Splatting

High-Precision 6DOF Pose Estimation via Global Phase Retrieval in Fringe Projection Profilometry for 3D Mapping
Sehoon Tak, Keunhee Cho, Sangpil Kim*, Jae-Sang Hyun*
Collaboration : Yonsei University
IEEE Transactions on Instrumentation and Measurement 2026
[Paper]
# 3D Vision
# 6DOF Pose Estimation
2025 (18 Papers)
CVPR: 6, ICCV: 3, NeurIPS: 2, ICML: 1, MM: 1, EG: 1, SCIE(3%): 2


BlurGuard: A Simple Approach for Robustifying Image Protection Against AI-Powered Editing
Jinsu Kim, Yunhun Nam, Minseon Kim, Sangpil Kim
, Jongheon Jeong*
Collaboration : KAIST, Microsoft Research Montréal
[Paper]
NeurIPS 2025
# AI Safety
# Deepfake Defense



CATSplat: Context-Aware Transformer with Spatial Guidance for Generalizable 3D Gaussian Splatting from A Single-View Image
Wonseok Roh, Hwanhee Jung, Jong Wook Kim, Seunggwan Lee , Innfarn Yoo, Andreas Lugmayr, Seunggeun Chi, Karthik Ramani, Sangpil Kim*
Collaboration : Google Research, Purdue University
ICCV 2025
[Paper]
[Project]
[Code]
# 3D Vision
# Gaussian Splatting

Test-Time Adaptation for Online Vision-Language Navigation with Feedback-based Reinforcement Learning
Sungjune Kim†, Gyeongrok Oh†, Heeju Ko, Daehyun Ji, Dongwook Lee, Byung-Jun Lee, Sujin Jang*, Sangpil Kim*
Collaboration : Samsung Advanced Institute of Technology
ICML 2025
[Paper]
# Physical AI
# Vision-Language Navigation


EditSplat: Multi-View Fusion and Attention-Guided Optimization for View-Consistent 3D Scene Editing with 3D Gaussian Splatting
Dong In Lee, Hyeongcheol Park, Jiyoung Seo, Eunbyung Park, Hyunje Park, Ha Dam Baek, Sangheon Shin, Sangmin Kim, Sangpil Kim*
Collaboration : Hanhwa, Yonsei University
CVPR 2025
[Paper]
[Project]
[Code]
# 3D Vision
# 3D Scene Editing


3D Occupancy Prediction with Low-Resolution Queries via Prototype-aware View Transformation
Gyeongrok Oh†, Sungjune Kim†, Heeju Ko,
Hyung-gun Chi, Jinkyu Kim, Dongwook Lee, Daehyun Ji, Sungjoon Choi, Sujin Jang*,
Sangpil Kim*
Collaboration : Samsung Advanced Institute of Technology, Purdue University
CVPR 2025
[Paper]
[Project]
# Autonomous Driving
# Occupancy Prediction

Cropper: Vision-Language Model for Image Cropping through In-Context Learning
Seung Hyun Lee, Jijun jiang, Yiran Xu, Zhuofang Li, Junjie Ke, Yinxiao Li, Junfeng He, Steven Hickson, Katie Datsenko, Sangpil Kim, Ming-Hsuan Yang, Irfan Essa, Feng Yang*
Collaboration : Google Research, Georgia Tech University
CVPR 2025
[Paper]
# Multimodal Learning
# Visual In-Context Learning


Semantically Complex Audio to Video Generation with Audio Source Separation
Sieun Kim, Jaehwan Jeong, Sumin In, Seung Hyun Lee, ,
Seungryong Kim, Saerom Kim, Wooyeol Baek, Sang Ho Yoon, Eugenio Culurciello, Sangpil Kim*
Collaboration : KAIST, Purdue University
Engineering Applications of Artificial Intelligence
(Elsevier,
JCR IF Top 2.7%) 2025
[Paper]
[Code]
# Generative AI
# Audio-guided Video Generation

High-quality three-dimensional cartoon avatar reconstruction with Gaussian splatting
MinHyuk Jang, Jong Wook Kim, Youngdong Jang,
Donghyun Kim, Wonseok Roh , InYong Hwang, Guang Lin,
Sangpil Kim*
Collaboration : Hanhwa, Purdue University
Engineering Applications of Artificial Intelligence
(Elsevier,
JCR IF Top 2.7%) 2025
[Paper]
[Code]
# 3D Vision
# Gaussian Splatting

Occlusion-Aware and Consistent Amodal Completion for 3D Human-Object Interaction Reconstruction
Hyungjun Doh†, Dong In Lee†, Seunggeun Chi, Pin-Hao Huang, Kwonjoon Lee, Sangpil Kim, Karthik Ramani*
Collaboration : Purdue University
ACM MultiMedia(MM) 2025
[Paper]
[Project]
[Code]
# 3D Vision
# Human-Object Interaction

Prosthetic Keypoint Estimation for Alternative Exercise Recommendation in Lower Limb Disabilities
Taebeom Lee, Dongyoon Seo, Chiho Park, Sunghee Hong, and Sangpil Kim*
Human-centric Computing and Information Sciences(Springer, Q2) 2025
[Paper]
# Physical AI
# Pose Estimation

Lightweight Test-time Adaptation for Robust Out-of-Distribution Face Recognition in Web Services
Dongyoon Seo, Taebeom Lee, Jeongyoon Yoon, Chiho Park , Sangpil Kim, Miyoung Kim, Byoungsoo Koh*
Journal of Web Engineering 2025
[Paper]
# Trustworthy AI
# Robust Face Recognition

Perspective Crop Based Egocentric Hand Pose Estimation via Fisheye Stereo Vision
Hyejin Hur, Seongmin Beak, Younhee Gil, and Sangpil Kim*
Collaboration : ETRI
EuroGraphics(Poster) 2025
[Paper]
# Physical AI
# Pose Estimation
2024 (17 Papers)
NeurIPS: 2, CVPR: 3, ECCV: 4, AAAI: 1, SCIE(1%):1, SCIE(10%): 3, SAC: 1, ICPR: 1, ICPRAI: 1

Unified Domain Generalization and Adaptation for
Multi-View 3D Object Detection
Gyusam Chang,
Jiwon Lee, Donghyun Kim, Jinkyu Kim, Dongwook Lee, Daehyun Ji, Sujin Jang*,
Sangpil Kim*
Collaboration : Samsung Advanced Institute of Technology
NeurIPS 2024
[Paper]
[Code]
# Autonomous Driving
# Domain Adaptation



Enhanced Motion Forecasting with Visual Relation Reasoning
Sungjune Kim, Ha Dam Baek, Seunggwan Lee,
Hyung-gun Chi, Hyerin Lim, Jinkyu Kim*,
Sangpil Kim*
Collaboration : Hyndai, Purdue University
ECCV 2024
[Paper]
# Autonomous Driving
# Motion Forecasting

Parrot: Pareto-optimal Multi-Reward Reinforcement Learning Framework for Text-to-Image Generation
Seung Hyun Lee,
Yinxiao Li, Junjie Ke, Innfarn Yoo, Han Zhang, Jiahui Yu, Qifei Wang, Fei Deng, Glenn Entis, Junfeng He, Gang Li,
Sangpil Kim,
Irfan Essa, Feng Yang*
Collaboration : Google Resarch, Georgia Tech University
ECCV 2024, Oral
[Paper]
# Multimodal Learning
# Text-to-Image Generation

VisionTrap: Vision-Augmented Trajectory Prediction Guided by Textual Descriptions
Seokha Moon, Hyun Woo, Hongbeen Park, Haeji Jung, Hyung-gun Chi, Hyerin Lim,
Sangpil Kim*,
Jinkyu Kim*
Collaboration : Hyndai, Purdue University
ECCV 2024
[Paper]
# Autonomous Driving
# Trajectory Prediction



Higher-order Relational Reasoning for Pedestrian Trajectory Prediction
Sungjune Kim,
Hyung-gun Chi, Hyerin Lim, Karthik Ramani,
Jinkyu Kim*, Sangpil Kim*
Collaboration : Hyndai, Purdue University
CVPR 2024
[Paper]
# Autonomous Driving
# Trajectory Prediction


CMDA: Cross-Modal and Domain Adversarial Adaptation for LiDAR-based 3D Object Detection
Gyusam Chang†, Wonseok Roh†,
Sujin Jang, Dongwook Lee, Daehyun Ji,
Gyeongrok Oh,
Jinsun Park, Jinkyu Kim*, Sangpil Kim*
Collaboration : Samsung Advanced Institute of Technology, Busan University
AAAI 2024
[Paper]

Robust Sound-Guided Image Manipulation
Seunghyun Lee†, Hyung-gun Chi†, Wonmin Byeon, Gyeongrok Oh, Sang Ho Yoon,
Hyunje Park, Wonjun Cho, Jinkyu Kim*, Sangpil Kim*
Collaboration : NVIDIA Research, Hanhwa, KAIST
Neural Networks(Elsevier,
JCR IF Top 10%) 2024
[Paper]
[Project]
[Code]
# Multimodal Learning
# Audio-guided Image Manipulation

Audio-Guided Implicit Neural Representation for Local Image Stylization
Seung Hyun Lee†, Sieun Kim†,
Wonmin Byeon, Gyeongrok Oh, Sumin In, Hyeongcheol Park, Sang Ho Yoon, Sung-hee Hong, Jinkyu Kim*, and
Sangpil Kim*
Collaboration : NVIDIA Research, KAIST
Computational Visual Media(Springer,
JCR IF Top 1%) 2024
[Paper]
[Code]
# Multimodal Learning
# Audio-guided Image Stylization

Self-Supervised Multimodal Graph Convolutional Network for Collaborative Filtering
Sungjune Kim, Seongjun Yun, Jongwuk Lee, Gyusam Chang, Wonseok Roh, Dae-Neung Sohn, Jung-Tae Lee, Hogun Park*, Sangpil Kim*
Collaboration : NAVER, Sungkyunkwan University
Information Sciences(Elsevier,
JCR IF Top 10%) 2024
[Paper]
# Agentic AI
# Recommender Systems

Egocentric View Hand Action Recognition by Leveraging Hand Surface and Hand Grasp Type
Dong Yoon Seo, Hyunggun Chi, Sunghee Hong, Byoung Soo Koh, Karthik Ramani, Sangpil Kim*
Collaboration : Purdue University
ICPRAI 2024(Oral)
[Paper]
# Physical AI
# Hand Action Recognition


Bridging the Domain Gap by Clustering-based Image-Text Graph Matching
Daewon Chae
Nokyung Park, Daewon Chae, Jeong Yong Shim,
Sangpil Kim,
Eun-Sol Kim*,
Jinkyu Kim*
Collaboration : HanYang University
International Conference on Pattern Recognition (2024)
[Paper]
# Multimodal Learning
# Vision-Language
2023 (5 Papers)
ICCV: 1, WWW: 1, JCR 10%: 1, BMVC: 1, Q1 Journal: 1

HapMotion: Motion-to-Tactile Framework with Wearable Haptic Devices for Immersive VR Performance Experience
Kyungeun Jung, Sangpil Kim, Seungjae Oh*, Sang Ho Yoon*
Collaboration : KAIST
Virtual Reality(Springer, Q1) 2023
[Paper]
# Physical AI
# Haptic Interaction

Functional Hand Type Prior for 3D Hand Pose Estimation and Action Recognition from Egocentric View Monocular Videos
Wonseok Roh, Seung Hyun Lee, Wonjeong Ryoo, Gyeongrok Oh, Soo Yeon Hwang, Hyung-gun Chi, Sangpil Kim*
Collaboration : Purdue University
BMVC 2023 (Oral)
[Paper]
[Code]
# Physical AI
# Hand Pose Estimation



Dual Policy Learning for Aggregation Optimization in Recommender Systems
Heesoo Jung, Sangpil Kim, Hogun Park*
Collaboration : Sungkyunkwan University
WWW 2023
[Paper]
# Agentic AI
# Recommender Systems
2022 (3 Papers)
CVPR: 1, ECCV: 1, BMVC: 1



2021

Sound-Guided Semantic Image Manipulation
Seunghyun Lee, SangHo Yoon, Jinkyu Kim*, Sangpil Kim*
Collaboration : KAIST
NeurIPSW 2021
[Paper]
# Generative AI
# Image Manipulation

Audio-Guided Image Manipulation for Artistic Paintings
Seunghyun Lee, Nahyuk Lee, Chanyoung Kim, Wonjeong Ryoo, Jinkyu Kim, SangHo Yoon*, Sangpil Kim*
Collaboration : KAIST
NeurIPSW 2021
# Generative AI
# Image Manipulation
2020

A Large-scale Annotated Mechanical Components Benchmark for Classification and Retrieval Tasks with Deep Neural Networks
Sangpil Kim†, Hyung-gun Chi†, Xiao Hu, Qixing Huang, Karthik Ramani
ECCV 2020
[Paper]
[Dataset A]
[Dataset B]

First-Person View Hand Segmentation of Multi-Modal Hand Activity Video Dataset
Sangpil Kim, Hyung-gun Chi, Xiao Hu, Anirudh Vegesana, Karthik Ramani
BMVC 2020
[Paper]

Object Synthesis by Learning Part Geometry with Surface and Volumetric Representations
Sangpil Kim, Hyung-gun Chi, Karthik Ramani
Computer-Aided Design 2020
[Paper]

Enet: A deep neural network architecture for real-time semantic segmentation
Adam Paszke, Abhishek Chaurasia, Sangpil Kim, Eugenio Culurciello
arxiv 2016
[Paper]

Learning hand articulations by hallucinating heat distribution
Chiho Choi, Sangpil Kim, Karthik Ramani
ICCV 2017
[Paper]

Latent transformations neural network for object view synthesis
Sangpil Kim, Winovich Nick, Hyung-gun Chi, Guang Lin, Karthik Ramani
Visual Computer 36 1663-1677 2020
[Paper]

Computer Vision Lab
Department of Artificial Intelligence, Korea University
603, Woojung Hall of Informatics, Korea University, 145 Anam-ro, Seongbuk-gu, Seoul, 02841
