I'm interested in computer vision and computer graphics.
My research focuses on physics-informed models that can
(1) reconstruct the digital twins of our Multimodal World from multimodal data,
and (2) leverage shared representations across modalities to enable cross-modal learning.
FreeFix: Boosting 3D Gaussian Splatting via Fine-Tuning-Free Diffusion Models Hongyu Zhou,
Zisen Shao,
Sheng Miao, Pan Wang, Dongfeng Bai, Bingbing Liu,
Yiyi Liao3DV, 2026
project page /
arXiv /
code
A fine-tuning-free approach designed to eliminate artifacts and boost the rendering quality of 3D Gaussian Splatting (3DGS) in extrapolated views
WeRef: An Open-source and Extensible Dataset for Referee Gesture Recognition in RoboCup Zisen Shao,
Josiah P. Hanna,
RoboCup-2025: Robot Soccer World Cup XXVIII, 2025   (Oral Presentation) code
WeRef is an open-source synthetic data generation pipeline for RoboCup Standard Platform League (SPL) referee gestures recognition.
A novel architecture integrating RL within a classical robotics stack,
while employing a multi-fidelity sim2real approach and decomposing behavior into learned sub-behaviors with heuristic selection.