|
A. Toshev, C. Szegedy. DeepPose: human pose estimation via deep neural networks. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2014 [2] J. Tompson, A. Jain, Y. LeCun, and C. Bregler. Joint training of a convolutional network and a graphical model for human pose estimation. In Conference on Neural Information Processing Systems (NIPS), 2014. [3] A. Newell, K. Yang, and J. Deng. Stacked hourglass networks for human pose estimation. In European Conference on Computer Vision (ECCV), 2016. [4] K. He, G. Gkioxari, P. Dollár, and R. Girshick. Mask R-CNN. In IEEE International Conference on Computer Vision (ICCV), 2017. [5] S. Wei, V. Ramakrishna, T. Kanade, and Y. Sheikh. Convolutional pose machines. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2016. [6] F. Mueller, F. Bernard, O. Sotnychenko, D. Mehta, S. Sridhar, D. Casas, and C. Theobalt. GANerated hands for real-time 3D hand tracking from monocular RGB. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2018. [7] G. Garcia-Hernando, S. Yuan, S. Baek, T. Kim. First-Person hand action benchmark with RGB-D videos and 3D hand pose annotations. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2018. [8] C. Zimmermann and T. Brox. Learning to estimate 3D hand pose from single RGB images. In IEEE International Conference on Computer Vision (ICCV), 2017. [9] K. He, X. Zhang, S. Ren, and J. Sun. Deep residual learning for image recognition. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2016. [10] F. Mueller, D. Mehta, O. Sotnychenko, S. Sridhar, D. Casas, and C. Theobalt. Real-time hand tracking under occlusion from an egocentric RGB-D sensor. In IEEE International Conference on Computer Vision (ICCV), 2017. [11] J. Zhang, J. Jiao, M. Chen, L. Qu, X. Xu, Q. Yang. A hand pose tracking benchmark from stereo matching. In IEEE International Conference on Image Processing (ICIP), 2017 [12] T. Simon, H. Joo, I. Matthews, and Y. Sheikh. Hand keypoint detection in single images using multiview bootstrapping. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2017. [13] G. Moon, J. Chang, and K. Lee. V2V-PoseNet: Voxel-to-Voxel prediction network for accurate 3D hand and human pose estimation from a single depth map. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2018. [14] L. Ge, H. Liang, J. Yuan, and D. Thalmann. Robust 3D hand pose estimation in single depth images: from single-view CNN to multi-view CNNs. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2016. [15] X. Sun, B. Xiao, F. Wei, S. Liang, and Y. Wei. Integral human pose regression. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2017. [16] J. Deng, W. Dong, R. Socher, L. Li, K. Li and F. Li. ImageNet: A large-scale hierarchical image database. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2009. [17] C. Ionescu, D. Papava, V. Olaru, and C. Sminchisescu. Human 3.6m: Large scale datasets and predictive methods for 3d human sensing in natural environments. In Proceeding of TPAMI, 2014. [18] J. Martinez, R. Hossain, J. Romero, and J. J. Little. A simple yet effective baseline for 3d human pose estimation. In IEEE International Conference on Computer Vision (ICCV), 2017. [19] Unreal Engine 4. [Online]. Available: https://www.unrealengine.com [20] P. Martinez-Gonzalez, S. Oprea, A. Garcia-Garcia, A. Jover-Alvarez, S. Orts-Escolano, J. Garcia-Rodriguez. UnrealROX: An eXtremely photorealistic virtual reality environment for robotics simulations and synthetic data generation. arXiv preprint arXiv:1810.06936 [21] B. Xiao, H. Wu, and Y. Wei. Simple baselines for human pose estimation and tracking. In European Conference on Computer Vision (ECCV), 2018. [22] Y. Zhou, J. Lu, K. Du, X. Lin, Y. Sun, and X. Ma. HBE: Hand branch ensemble network for real-time 3d hand pose estimation. In European Conference on Computer Vision (ECCV), 2018. [23] Keras. [Online]. Available: https://www.tensorflow.org/guide/keras [24] K. He, X. Zhang, S. Ren, and J. Sun. Delving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification. arXiv preprint arXiv:1502.01852v1 [25] Y. Chen, Z. Wang, Y. Peng, Z. Zhang, G. Yu, and J. Sun. Cascaded pyramid network for multi-person pose estimation. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2018. [26] D. Pavllo, C. Feichtenhofer, D. Grangier, and M. Auli. 3D human pose estimation in video with temporal convolutions and semi-supervised training. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2019. [27] L. Pishchulin, E. Insafutdinov, S. Tang, B. Andres, M. Andriluka, P. Gehler, and B. Schiele. DeepCut: Joint subset partition and labeling for multi person pose estimation. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2016. [28] Y. Cai, L. Ge, J. Cai, and Junsong Yuan. Weakly-supervised 3d hand pose estimation from monocular RGB Images. In European Conference on Computer Vision (ECCV), 2018. [29] X. Zhou, X. Sun, W. Zhang, S. Liang, and Y. Wei. Deep kinematic pose regression. In European Conference on Computer Vision (ECCV) Workshop, 2016. [30] T. Lin, M. Maire, S. Belongie, J. Hays, P. Perona, D. Ramanan, P. Dollar, and C. Zitnick. Microsoft COCO: Common objects in context. In European Conference on Computer Vision (ECCV), 2014 [31] T. Lin, P. Dollár, R. Girshick, K. He, B. Hariharan, and S. Belongie. Feature pyramid networks for object detection. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2017. [32] Procrustes analysis. [Online]. Available: https://en.wikipedia.org/wiki/Procrustes_analysis [33] J. Tobin, R. Fong, A. Ray, J. Schneider, W. Zaremba, and P. Abbeel. Domain randomization for transferring deep neural networks from simulation to the real world. In IEEE International Conference on Intelligent Robots and Systems (IROS), 2017 [34] J. Yosinski, J. Clune, Y. Bengio, and H. Lipson. How transferable are features in deep neural networks? In Advances in Neural Information Processing Systems (NIPS), 2014. [35] H. Fang, G. Lu, X. Fang, J. Xie, Y. Tai, and C. Lu. Weakly and semi supervised human body part parsing via pose-guided knowledge transfer. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2018.
|