Daoyi Gao

I am a PhD student at the 3D AI Lab at the Technical University of Munich (TUM).

Previously, I graduated from the Robotics, Cognition, Intelligence program at TUM.

My research interests lie in 3D computer vision, CAD-driven scene understanding, and 3D generative modeling.

Email / Google Scholar / Github / LinkedIn

Publications

	MeshART: Generating Articulated Meshes with Structure-guided Transformers Daoyi Gao, Yawar Siddiqui, Lei Li, Angela Dai CVPR, 2025 project page / paper We proposed a hierarchical transformer-based approach to generate articulated 3D meshes with clean, compact geometry.
	DiffCAD: Weakly-Supervised Probabilistic CAD Model Retrieval and Alignment from an RGB Image Daoyi Gao, Dávid Rozenberszki, Stefan Leutenegger, Angela Dai SIGGRAPH, 2024 project page / paper / code We proposed a weakly-supervised approach for CAD model retrieval and alignment from an RGB image. Our approach utilizes diffusion models to tackle the ambiguities in monocular perception, and achieves robust cross-domain performance while trained only on synthetic data.
	Polarimetric Pose Prediction Daoyi Gao, Yitong Li, Patrick Ruhkamp, Iuliia Skobleva, Magdalena Wysocki, HyunJun Jung, Pengyuan Wang, Arturo Guridi, Benjamin Busam ECCV, 2022 project page / paper / code We proposed a hybrid model that utilizes polarization information with physical priors in a data-driven learning strategy to improve the accuracy of pose predictions for photometrically challenging objects. Equal contribution. Alphabetical order.
	Attention meets Geometry: Geometry Guided Spatial-Temporal Attention for Consistent Self-Supervised Monocular Depth Estimation Patrick Ruhkamp, Daoyi Gao, Hanzhi Chen, Nassir Navab, Benjamin Busam 3DV, 2021 project page / paper / code We proposed a self-supervised monocular depth estimation pipeline that aims to improve consistency while preserving accuracy. We also proposed a new Temporal Consistency Metric (TCM) to quantify depth consistency across frames. Equal contribution. Order determined randomly.

Workshops

1st Workshop on Urban Scene Modeling: Where Vision Meets Photogrammetry and Graphics

CVPR, 2024

Teaching

Teaching Assistant, Machine Learning for 3D Geometry (IN2392) WS 2023/2024, SS 2024, WS 2024/2025

Teaching Assistant, Techniques in Artificial Intelligence (IN2062) WS 2021/2022

Teaching Assistant, Introduction to Deep Learning (IN2346) SS 2020, WS 2020/2021

Teaching Assistant, Block Seminar Hands-on Deep Learning SS 2020

Credits: Jon Barron