Author: Wang, Fu-En
Thesis Title: 360 Perception for Indoor Depth and Layout Estimation
Advisor: Sun, Min
Oral Defense Committee: Lin, Chia-Wen
Lee, Chi-Chun
Chen, Yi-Ting
Chiu, Wei-Chen
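In recent years, as consumer-grade panoramic cameras have become increasingly widespread, deep learning algorithms for panoramic cameras have begun to receive considerable attention in computer vision. In addition, because a panoramic camera can capture the full 360 degrees of its surroundings at once, indoor autonomous systems have also begun to use panoramic cameras for indoor localization and navigation. However, scene understanding for panoramic images still lacks mature algorithms that can be applied efficiently. This thesis therefore addresses two tasks that are critical to indoor autonomous systems, (1) indoor depth estimation and (2) indoor layout estimation, and proposes novel, efficient algorithms to make future applications of indoor autonomous systems more feasible. For depth estimation, we combine information from different projections to mitigate the blurring that existing methods tend to produce in predicted depth maps, and we propose two new network architectures, BiFuse and BiFuse++, that substantially improve the accuracy of panoramic depth estimation. For layout estimation, we combine BiFuse++ with LED2-Net to exploit both the information from different projections and a one-dimensional representation, and to predict indoor layouts precisely.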
In recent years, as consumer-level 360 cameras have become popular and affordable, deep learning algorithms for panoramas have become an important topic in computer vision. Moreover, since 360 cameras are capable of capturing the entire surroundings of the camera at once, indoor autonomous systems have started to adopt these sensors for indoor localization and navigation. However, efficient approaches for handling these tasks with panoramas have not yet been well studied in computer vision. Hence, in this thesis, we focus on two important tasks in indoor autonomous systems: 1) Indoor Depth Estimation, and 2) Indoor Layout Estimation. For Indoor Depth Estimation, we utilize the information from different projections of panoramas and propose two novel frameworks, BiFuse and BiFuse++, to significantly reduce the blurring that commonly appears in the depth maps predicted by previous works. For Indoor Layout Estimation, we combine BiFuse++ and LED$^2$-Net to simultaneously use the information from different projections and a 1D representation to precisely estimate layouts from panoramas.
