Daoyi Gao

I am a PhD student at the 3D AI Lab at Technical University of Munich.

Previously, I graduated from Robotics, Cognition, Intelligence at TUM. Prior to this, I obtained my B.E. from Beihang University (BUAA) majored in mechatronics.

My research interests are in the field of 3D computer vision, retrieval-based scene understanding.

Email  /  Google Scholar /  Github /  LinkedIn

profile photo
Publications
DiffCAD: Weakly-Supervised Probabilistic CAD Model Retrieval and Alignment from an RGB Image
Daoyi Gao, Dávid Rozenberszki, Stefan Leutenegger, Angela Dai
SIGGRAPH, 2024
project page / bibtex / code

We proposed a weakly-supervised approach for CAD model retrieval and alignment from an RGB image. Our approach utilzes diffusion models to tackle the ambiguities in the monocular perception, and achives robuts cross-domain performance while only trained on synthetic dataset

Polarimetric Pose Prediction
Daoyi Gao*, Yitong Li*, Patrick Ruhkamp*, Iuliia Skobleva*, Magdalena Wysock*, HyunJun Jung, Pengyuan Wang, Arturo Guridi, Benjamin Busam
ECCV, 2022
project page / paper / bibtex / code

We proposed a hybrid model that utilizes polarizaiotn information with physical priors in a data-driven learning strategy to improve the accuracy of pose predictions for photometric challenging objects.

* Equal contribution. Alphabetical order.

Attention meets Geometry: Geometry Guided Spatial-Temporal Attention for Consistent Self-Supervised Monocular Depth Estimation
Patrick Ruhkamp*, Daoyi Gao*, Hanzhi Chen*, Nassir Navab, Benjamin Busam
3DV, 2021
project page / paper / bibtex / code

We proposed a self-supervised monocular depth estimation pipeline that aims to improve consistency while preserving accuracy. We also proposed a new Temporal Consistency Metric (TCM) to quantify depth consistency across frames.

* Equal contribution. Order determined randomly.

Spatial-Temporal Attention through Self-Supervised Geometric Guidance
Patrick Ruhkamp*, Daoyi Gao*, Hanzhi Chen*, Nassir Navab, Benjamin Busam
ICCV Workshop: Self-supervised Learning for Next-Generation Industry-level Autonomous Driving, 2021
project page / paper / code

We proposed a spatial-temporal attention mechanism guided by geometric constraints to aggregate geometrically meaningful attention and improve temporal depth stability and accuracy compared to previous methods.

* Equal contribution. Order determined randomly.

Teaching
Teaching Assistant, Techniques in Artificial Intelligence (IN2062) WS 2021/2022


Teaching Assistant, Introduction to Deep Learning (IN2346) SS2020, WS 2020/2021


Teaching Assistant, Block Seminar Hands-on Deep Learning SS2020

Credicts: Jon Barron