Pre-print

MTVG : Multi-text Video Generation with Text-to-Video Models
Gyeongrok Oh, Jaehwan Jeong, Sieun Kim, Wonmin Byeon, Jinkyu Kim, Hyeokmin Kwon, Sungwoong Kim, Sangpil Kim
Collaboration : NVIDIA Research USA
Under reivew (2024) – #8,#C5

Parrot: Pareto-optimal Multi-Reward Reinforcement Learning Framework for Text-to-Image Generation
Seung Hyun Lee, Yinxiao Li, Junjie Ke, Innfarn Yoo, Han Zhang, Jiahui Yu, Qifei Wang, Fei Deng, Glenn Entis, Junfeng He, Gang Li, Sangpil Kim, Irfan Essa, Feng Yang
Collaboration : Google Resarch USA, Georgia Tech Univ.
Under reivew (2024) – #7,#C4

Enhanced Motion Forecasting with Visual Relation Reasoning
Sungjune Kim, Ha Dam Baek, Seunggwan Lee, Hyung-gun Chi, Hyerin Lim, Jinkyu Kim*, Sangpil Kim*
Collaboration : Hyndai, Purdue Univ.
Under reivew (2024) – #6,#C3

Cross-Modal Domain Generalization for Multi-view 3D Object Detection
Gyusam Chang, Wonjeong Ryoo, Donghyun Kim, Dongwook Lee, Daehyun Ji, Jinkyu Kim, Sujin Jang*, Sangpil Kim*
Collaboration : Samsung Advanced Institute of Technology
Under reivew (2024) – #5,#J3

V-Trap: Vision-augmented Trajectory Prediction with Text Supervision
Seokha Moon, Hyun Woo, Hongbeen Park, Haeji Jung, Hyung-gun Chi, Hyerin Lim, Sangpil Kim*, Jinkyu Kim*
Collaboration : Hyndai, Purdue Univ.
Under reivew (2024) – #4,#C2

Bridging the Domain Gap by Clustering-based Image-Text Graph Matching
Daewon Chae Nokyung Park, Daewon Chae, Jeong Yong Shim, Sangpil Kim, Eun-Sol Kim*, Jinkyu Kim*
Collaboration : HanYang Univ.
Under reivew (2024) – #3,#C1

Soundini: Sound-Guided Diffusion for Natural Video Editing
Seung Hyun Lee, Sieun Kim, Innfarn Yoo, Feng Yang, Donghyeon Cho, Youngseo Kim, Huiwen Chang, Jinkyu Kim*, Sangpil Kim*
Collaboration : Google Resarch USA
Under reivew (2023) – #2,#J2
[Paper]

FPANet: Frequency-based Video Demoireing using Frame-level Post Alignment
Gyeongrok Oh, Heon Gu, Jinkyu Kim*, Sangpil Kim*
Collaboration : LG Display
Under reivew (2023) – #1,#J1
[Paper]