UniT3D: A Unified Transformer for 3D Dense Captioning and Visual Grounding
2022
D3Net: A Speaker-Listener Architecture for Semi-supervised Dense Captioning and Visual Grounding in RGB-D Scans
2021
Deep convolutional priors for indoor scene synthesis
ACM Transactions on Graphics
2018
37
4
1-14
PartNet: A Large-scale Benchmark for Fine-grained and Hierarchical Part-level 3D Object Understanding
2018
Scan2Cap: Context-aware Dense Captioning in RGB-D Scans
2020
ScanRefer: 3D Object Localization in RGB-D Scans using Natural Language
2020
PlanIT
ACM Transactions on Graphics
2019
38
4
1-15
Scan2CAD: Learning CAD Model Alignment in RGB-D Scans
2018