|
[1] Yifu Zhang, Chunyu Wang, Xinggang Wang, Wenjun Zeng, and Wenyu Liu. Fairmot: On the fairness of detection and re-identification in multiple object tracking. International Journal of Computer Vision, pages 1–19, 2021. [2] Oliver Styles, Victor Sanchez, and Tanaya Guha. Multiple object forecasting: Predicting future object locations in diverse environments. In Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, pages 690–699, 2020. [3] Agrim Gupta, Justin Johnson, Li Fei-Fei, Silvio Savarese, and Alexandre Alahi. Social gan: Socially acceptable trajectories with generative adversarial networks. In Proceedings of the IEEE conference on computer vision and pattern recognition, pages 2255–2264, 2018. [4] Junaid Ahmed Ansari and Brojeshwar Bhowmick. Simple means faster: Real- time human motion forecasting in monocular first person videos on cpu. In 2020 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), pages 10319–10326. IEEE, 2020. [5] Hai-Yan Yao, Wang-Gen Wan, and Xiang Li. End-to-end pedestrian trajec- tory forecasting with transformer network. ISPRS International Journal of Geo-Information, 11(1):44, 2022. [6] Boris Ivanovic and Marco Pavone. The trajectron: Probabilistic multi-agent trajectory modeling with dynamic spatiotemporal graphs. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pages 2375– 2384, 2019. [7] Osama Makansi, Ozgun Cicek, Kevin Buchicchio, and Thomas Brox. Multi- modal future localization and emergence prediction for objects in egocentric view with a reachability prior. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 4354–4363, 2020. [8] Zhongdao Wang, Liang Zheng, Yixuan Liu, Yali Li, and Shengjin Wang. Towards real-time multi-object tracking. In Computer Vision–ECCV 2020: 16th European Conference, Glasgow, UK, August 23–28, 2020, Proceedings, Part XI 16, pages 107–122. Springer, 2020. [9] Xingyi Zhou, Vladlen Koltun, and Philipp Kr ̈ahenbu ̈hl. Tracking objects as points. In European Conference on Computer Vision, pages 474–490. Springer, 2020. [10] Pavel Tokmakov, Jie Li, Wolfram Burgard, and Adrien Gaidon. Learning to track with object permanence. arXiv preprint arXiv:2103.14258, 2021. [11] Bing Shuai, Andrew G Berneshawi, Davide Modolo, and Joseph Tighe. Multi- object tracking with siamese track-rcnn. arXiv preprint arXiv:2004.07786, 2020. [12] Yichao Yan, Jinpeng Li, Jie Qin, Song Bai, Shengcai Liao, Li Liu, Fan Zhu, and Ling Shao. Anchor-free person search. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 7690–7699, 2021. [13] Alexandre Alahi, Kratarth Goel, Vignesh Ramanathan, Alexandre Robicquet, Li Fei-Fei, and Silvio Savarese. Social lstm: Human trajectory prediction in crowded spaces. In Proceedings of the IEEE conference on computer vision and pattern recognition, pages 961–971, 2016. [14] Oily Styles, Arun Ross, and Victor Sanchez. Forecasting pedestrian trajectory with machine-annotated training data. In 2019 IEEE Intelligent Vehicles Symposium (IV), pages 716–721. IEEE, 2019. [15] Takuma Yagi, Karttikeya Mangalam, Ryo Yonetani, and Yoichi Sato. Fu- ture person localization in first-person videos. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pages 7593–7602, 2018. [16] Amir Rasouli, Iuliia Kotseruba, and John K Tsotsos. Are they going to cross? a benchmark dataset and baseline for pedestrian crosswalk behavior. In Proceedings of the IEEE International Conference on Computer Vision Workshops, pages 206–213, 2017. [17] Alon Lerner, Yiorgos Chrysanthou, and Dani Lischinski. Crowds by example. In Computer graphics forum, volume 26, pages 655–664. Wiley Online Library, 2007. [18] Alexandre Robicquet, Amir Sadeghian, Alexandre Alahi, and Silvio Savarese. Learning social etiquette: Human trajectory understanding in crowded scenes. In European conference on computer vision, pages 549–565. Springer, 2016. [19] A. Ess, B. Leibe, K. Schindler, , and L. van Gool. A mobile vision system for robust multi-person tracking. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR’08). IEEE Press, June 2008. [20] Yuan Liu, Ruoteng Li, Yu Cheng, Robby T Tan, and Xiubao Sui. Object tracking using spatio-temporal networks for future prediction location. In European Conference on Computer Vision, pages 1–17. Springer, 2020. 27 [21] Lukas Neumann and Andrea Vedaldi. Pedestrian and ego-vehicle trajectory prediction from monocular camera. In Proceedings of the IEEE/CVF Confer- ence on Computer Vision and Pattern Recognition, pages 10204–10212, 2021. [22] SHI Xingjian, Zhourong Chen, Hao Wang, Dit-Yan Yeung, Wai-Kin Wong, and Wang-chun Woo. Convolutional lstm network: A machine learning ap- proach for precipitation nowcasting. In Advances in neural information pro- cessing systems, pages 802–810, 2015. [23] Rico Jonschkowski, Austin Stone, Jonathan T Barron, Ariel Gordon, Kurt Konolige, and Anelia Angelova. What matters in unsupervised optical flow. In Computer Vision–ECCV 2020: 16th European Conference, Glasgow, UK, August 23–28, 2020, Proceedings, Part II 16, pages 557–572. Springer, 2020. [24] Zhichao Yin and Jianping Shi. Geonet: Unsupervised learning of dense depth, optical flow and camera pose. In Proceedings of the IEEE conference on computer vision and pattern recognition, pages 1983–1992, 2018. [25] Anurag Ranjan, Varun Jampani, Lukas Balles, Kihwan Kim, Deqing Sun, Jonas Wulff, and Michael J Black. Competitive collaboration: Joint unsuper- vised learning of depth, camera motion, optical flow and motion segmenta- tion. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 12240–12249, 2019. [26] Yang Wang, Peng Wang, Zhenheng Yang, Chenxu Luo, Yi Yang, and Wei Xu. Unos: Unified unsupervised optical-flow and stereo-depth estimation by watching videos. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 8071–8081, 2019. [27] Yifu Zhang, Peize Sun, Yi Jiang, Dongdong Yu, Zehuan Yuan, Ping Luo,Wenyu Liu, and Xinggang Wang. Bytetrack: Multi-object tracking by asso- ciating every detection box. arXiv preprint arXiv:2110.06864, 2021. [28] Alex Kendall, Yarin Gal, and Roberto Cipolla. Multi-task learning using uncertainty to weigh losses for scene geometry and semantics. In Proceedings of the IEEE conference on computer vision and pattern recognition, pages 7482–7491, 2018. [29] A. Milan, L. Leal-Taix ́e, I. Reid, S. Roth, and K. Schindler. MOT16: A benchmark for multi-object tracking. arXiv:1603.00831 [cs], March 2016. arXiv: 1603.00831. [30] P. Dendorfer, H. Rezatofighi, A. Milan, J. Shi, D. Cremers, I. Reid, S. Roth, K. Schindler, and L. Leal-Taix ́e. Mot20: A benchmark for multi object track- ing in crowded scenes. arXiv:2003.09003[cs], March 2020. arXiv: 2003.09003. [31] Rudolph Emil Kalman. A new approach to linear filtering and prediction problems. 1960. |