Author: Lee, Yi-An
Thesis title: Simultaneous Video Generation and Remote Photoplethysmography Estimation with Spatial Attention Mechanism
Advisor: Hsu, Chiou-Ting
Oral defense committee: Wang, Sheng-Jyh
Shao, Hao-Chiang
Keywords: Video generation; Remote photoplethysmography; Computer vision; Deep learning; Artificial intelligence; Heart rate estimation; Biomedical imaging; Data augmentation; Attention mechanism
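Remote photoplethysmography (rPPG) is a technique for measuring biomedical signals. By analyzing optical images of human skin, rPPG can extract pulse information, and compared with conventional heart rate monitoring tools it has the advantage of being non-invasive and contact-free. With the rapid development of deep learning and convolutional neural networks (CNNs) in recent years, these techniques have been applied to rPPG estimation. Nevertheless, measuring rPPG accurately remains very difficult. We believe data augmentation has the potential to improve the accuracy of rPPG measurement. In this thesis, we propose two deep learning networks, an rPPG estimation network and an rPPG synthesizing network, which respectively estimate rPPG from face videos and generate synthetic videos for training. In addition, the attention module we design within the rPPG estimation network not only improves the training of the estimation network but also reduces distortion in the synthetic videos. According to the experimental results, our method successfully generates synthetic videos that are nearly identical to real videos and free of distortion, thereby improving the rPPG estimation network. The experimental results of this thesis also surpass all existing methods.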
Remote photoplethysmography (rPPG) is a non-invasive method for estimating biomedical signals from optically obtained videos of human skin. With the rapid progress of deep learning and convolutional neural networks (CNNs), CNN-based learning frameworks have proven effective for estimating such signals. Because accurate rPPG estimation requires a large amount of training data, we believe data augmentation has strong potential to improve performance. In this thesis, we propose two serial, jointly learned frameworks: an rPPG estimation network, which estimates rPPG signals from face videos, and an rPPG synthesizing network, which generates synthetic data for augmentation. Additionally, we implement novel attention modules within the rPPG estimation network to boost the training process and reduce artifacts in the synthetic videos. According to experimental results on benchmark datasets, our method generates realistic videos that improve the proposed model, and it achieves state-of-the-art performance.
Abstract (in Chinese)
Abstract
1 Introduction
2 Related Work
2.1 Remote Photoplethysmography Estimation
2.2 Data Augmentation
2.3 Attention Mechanism
3 Proposed Method
3.1 Overview
3.2 rPPG Estimation Network
3.3 rPPG Attention Module
3.4 rPPG Synthesizing Network
3.5 Overall Framework
4 Experiment
4.1 Overview
4.2 Datasets
4.3 Evaluation Metrics
4.4 Implementation Settings
4.5 Network Architecture
4.6 Ablation Study
4.7 Result and Comparison
4.8 Visualization
5 Conclusion
References
