Hestia: Voxel-Face-Aware Hierarchical Next-Best-View Acquisition for Efficient 3D Reconstruction

Lu, Cheng-You; Zhuang, Zhuoli; Le, Nguyen Thanh Trung; Xiao, Da; Chang, Yu-Cheng; Do, Thomas; Sridhar, Srinath; Lin, Chin-teng

arXiv:2508.01014 (cs)

[Submitted on 1 Aug 2025 (v1), last revised 25 Nov 2025 (this version, v3)]

Abstract: Advances in 3D reconstruction and novel view synthesis have enabled efficient and photorealistic rendering. However, image acquisition for reconstruction is still either largely manual or constrained by simple preplanned trajectories. To address this issue, recent works propose generalizable next-best-view planners that do not require online learning. Nevertheless, their robustness and performance remain limited across diverse object shapes. Hence, this study introduces Voxel-Face-Aware Hierarchical Next-Best-View Acquisition for Efficient 3D Reconstruction (Hestia), which addresses the shortcomings of reinforcement-learning-based generalizable approaches for five-degree-of-freedom viewpoint prediction. Hestia systematically improves the planners through four components: a more diverse dataset to promote robustness, a hierarchical structure to manage the high-dimensional continuous action search space, a close-greedy strategy to mitigate spurious correlations, and a face-aware design to avoid overlooking geometry. Experimental results show that Hestia achieves non-marginal improvements, with at least a 4% gain in coverage ratio, while reducing Chamfer Distance by 50% and maintaining real-time inference. In addition, Hestia outperforms prior methods by at least 12% in coverage ratio with a 5-image budget and remains robust to object placement variations. Finally, we demonstrate that Hestia, as a next-best-view planner, is feasible for real-world application. Our project page is at this https URL.
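
The abstract reports results in terms of coverage ratio and Chamfer Distance but does not define either metric. As a rough illustration only, the sketch below uses one common point-cloud formulation of each; it is an assumption and may differ from the definitions used in the paper. The function names, the sample points, and the `threshold` parameter are all hypothetical.

```python
import math


def nearest_distances(src, dst):
    """For each point in src, distance to its nearest neighbour in dst (brute force)."""
    return [min(math.dist(p, q) for q in dst) for p in src]


def chamfer_distance(a, b):
    """Symmetric Chamfer Distance (assumed form): sum of the two
    directional mean nearest-neighbour distances."""
    d_ab = nearest_distances(a, b)
    d_ba = nearest_distances(b, a)
    return sum(d_ab) / len(d_ab) + sum(d_ba) / len(d_ba)


def coverage_ratio(ground_truth, observed, threshold):
    """Fraction of ground-truth points that have an observed point
    within `threshold` (assumed definition of coverage)."""
    d = nearest_distances(ground_truth, observed)
    return sum(1 for x in d if x <= threshold) / len(d)


# Hypothetical toy data: four ground-truth surface points, two of them observed.
ground_truth = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0), (0.0, 1.0, 0.0), (0.0, 0.0, 1.0)]
observed = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0)]

print(coverage_ratio(ground_truth, observed, threshold=0.1))
print(chamfer_distance(ground_truth, observed))
```

Under these assumed definitions, a higher coverage ratio and a lower Chamfer Distance both indicate a reconstruction closer to the ground-truth surface. The brute-force nearest-neighbour search is quadratic in the number of points; a spatial index would be the usual choice for real point clouds.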

Comments:	Accepted to the IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) 2026
Subjects:	Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2508.01014 [cs.RO]
	(or arXiv:2508.01014v3 [cs.RO] for this version)
	https://doi.org/10.48550/arXiv.2508.01014

Submission history

From: Cheng-You Lu
[v1] Fri, 1 Aug 2025 18:27:23 UTC (32,960 KB)
[v2] Tue, 11 Nov 2025 09:17:20 UTC (33,301 KB)
[v3] Tue, 25 Nov 2025 06:20:26 UTC (33,302 KB)
