I'm interested in computer vision and computer graphics.
My research focuses on physics-informed models that can
(1) reconstruct digital twins of our multimodal world from multimodal data,
and (2) leverage shared representations across modalities to enable cross-modal learning.
A novel architecture that integrates reinforcement learning (RL) within a classical robotics stack,
uses a multi-fidelity sim2real approach, and decomposes behavior into learned sub-behaviors selected by heuristics.
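
As a rough illustration of decomposing behavior into sub-behaviors with heuristic selection, here is a minimal sketch. It is not the actual system: the behavior names, observation keys, thresholds, and the lambda policies (stand-ins for learned RL policies) are all hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Tuple

Observation = Dict[str, float]
Action = List[float]


@dataclass
class SubBehavior:
    """One sub-behavior; `policy` stands in for a learned RL policy."""
    name: str
    policy: Callable[[Observation], Action]


# Hypothetical sub-behaviors. In a real system each policy would be a
# trained network; simple lambdas are used here so the sketch runs.
BEHAVIORS: Dict[str, SubBehavior] = {
    "approach": SubBehavior("approach", lambda obs: [1.0, 0.0]),
    "align": SubBehavior("align", lambda obs: [0.0, -obs["heading_error"]]),
    "interact": SubBehavior("interact", lambda obs: [0.0, 0.0]),
}


def select_behavior(obs: Observation) -> str:
    """Hand-written heuristic selector (thresholds are assumptions)."""
    if obs["target_distance"] > 1.0:
        return "approach"
    if abs(obs["heading_error"]) > 0.3:
        return "align"
    return "interact"


def step(obs: Observation) -> Tuple[str, Action]:
    """Pick a sub-behavior heuristically, then query its policy."""
    name = select_behavior(obs)
    return name, BEHAVIORS[name].policy(obs)


if __name__ == "__main__":
    print(step({"target_distance": 2.0, "heading_error": 0.0}))
    print(step({"target_distance": 0.5, "heading_error": 0.5}))
```

Keeping the selector as plain, inspectable rules while the sub-behaviors are learned is one way such a design could slot into a classical robotics stack; the actual selection logic may differ.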