|
[1] Agarwala A., Dontcheva M., Agrawala M., Drucker S., Colburn A., Curless B., Salesin D., Cohen M. (2004) Interactive digital photomontage. ACM Transactions on Graphics (Proceedings of SIGGRAPH) 23(3):294–302 [2] Ambardekar A., Nicolescu M., Dascalu S. (2009) Ground truth verification tool (gtvt) for video surveillance systems. In: International Conferences on Advances in Computer-Human Interactions [3] Autodesk (2009) 123D Catch. URL http://www.123dapp.com/catch [4] Bai X., Wang J., Simons D., Sapiro G. (2009) Video snapcut: Robust video object cutout using localized classifiers. ACM Transactions on Graphics (Proceedings of SIGGRAPH) 28(3):70:1–70:11 [5] Boykov Y., Veksler O., Zabih R. (2001) Fast approximate energy minimization via graph cuts. IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI) 23(11):1222–1239 [6] Chan F.-H., Chen Y.-T., Xiang Y., Sun M. (2016) Anticipating accidents in dash- cam videos. In: Asian Conference on Computer Vision (ACCV), Springer, pp 136– 153 [7] Chang C.-S., Chu H.-K., Mitra N. J. (2016) Interactive videos: Plausible video editing using sparse structure points. Computer Graphics Forum (Proceedings of EUROGRAPHICS) 35 [8] Chang C.-S., Sun M., Chu H.-K. (2018) An interactive system for robust and ef- ficient 2d/3d annotation of dashcam videos. submitted to International Journal of Computer Vision (IJCV) [9] Chaurasia G., Duchene S., Sorkine-Hornung O., Drettakis G. (2013) Depth syn- thesis and local warps for plausible image-based navigation. ACM Transactions on Graphics (ToG) 32(3):30:1–30:12 [10] Chen X., Kundu K., Zhang Z., Ma H., Fidler S., Urtasun R. (2016) Monocular 3d object detection for autonomous driving. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp 2147–2156, doi: 10.1109/CVPR.2016.236 [11] Chopra A. (2012) Introduction to google sketchup [12] Chu H.-K., Hsu W.-H., Mitra N. J., Cohen-Or D., Wong T.-T., Lee T.-Y. (2010) Camouflage images. ACM Transactions on Graphics (Proceedings of SIGGRAPH) 29:51:1–51:8 [13] Chuang Y.-Y., Agarwala A., Curless B., Salesin D. H., Szeliski R. (2002) Video matting of complex scenes. ACM Transactions on Graphics (Proceedings of SIG- GRAPH) 21(3):243–248 [14] Comaniciu D., Meer P. (2002) Mean shift: A robust approach toward feature space analysis. IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI) 24(5):603–619 [15] Comaschi F., Stuijk S., Basten T., Corporaal H. (2014) A tool for fast ground truth generation for object detection and tracking from video. In: IEEE International Conference on Image Processing (ICIP), pp 368–372 [16] Criminisi A., Reid I. D., Zisserman A. (2000) Single view metrology. vol 40, pp 123–148 [17] DavisA.,LevoyM.,DurandF.(2012)Unstructuredlightfields.ComputerGraph- ics Forum (Proceedings of EUROGRAPHICS) 31(2pt1):305–314 [18] Doennann D., Mihalcik D. (2000) Tools and techniques for video performance evaluation. In: Proceedings of International Conference on Pattern Recognition, vol 4 [19] Dosovitskiy A., Ros G., Codevilla F., Lopez A., Koltun V. (2017) CARLA: An open urban driving simulator. In: Proceedings of the 1st Annual Conference on Robot Learning, pp 1–16 [20] Engel J., Schöps T., Cremers D. (2014) Lsd-slam: Large-scale direct monocular slam. In: European Conference on Computer Vision (ECCV) [21] Ester M., Kriegel H.-P., Sander J., Xu X. (1996) A density-based algorithm for discovering clusters in large spatial databases with noise. In: Proceedings of In- ternational Conference on Knowledge Discovery and Data Mining (KDD ’96), pp 226–231 [22] Fan Q., Zhong F., Lischinski D., Cohen-Or D., Chen B. (2015) Jumpcut: Non- successive mask transfer and interpolation for video cutout. ACM Transactions on Graphics (Proceedings of SIGGRAPH Asia) 34(6) [23] Farbman Z., Lischinski D. (2011) Tonal stabilization of video. ACM Transactions on Graphics (Proceedings of SIGGRAPH) 30(4):89:1–89:10 [24] Gaidon A., Wang Q., Cabon Y., Vig E. (2016) Virtualworlds as proxy for multi- object tracking analysis. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp 4340–4349 [25] Geiger A., Lenz P., Urtasun R. (2012) Are we ready for autonomous driving? the kitti vision benchmark suite. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR) [26] Girshick R. (2015) Fast R-CNN. In: Proceedings of the International Conference on Computer Vision (ICCV) [27] Goldman D. B., Gonterman C., Curless B., Salesin D., Seitz S. M. (2008) Video object annotation, navigation, and composition. In: Proceedings of the 21st Annual ACM Symposium on User Interface Software and Technology (UIST), pp 3–12 [28] HartleyR.,ZissermanA.(2003)MultipleViewGeometryinComputerVision,2nd edn. Cambridge University Press [29] He K., Rhemann C., Rother C., Tang X., Sun J. (2011) A global sampling method for alpha matting. In: IEEE Conference on Computer Vision and Pattern Recogni- tion (CVPR), pp 2049–2056 [30] van den Hengel A., Dick A., Thormählen T., Ward B., Torr P. H. S. (2007) Video- trace: Rapid interactive scene modelling from video. ACM Transactions on Graph- ics (Proceedings of SIGGRAPH) 26(3) [31] Hennessey J. W., Mitra N. J. (2015) An image degradation model for depth- augmented image editing. Computer Graphics Forum (Proceedings of SGP) [32] IgarashiT.,MoscovichT.,HughesJ.F.(2005)As-rigid-as-possibleshapemanipu- lation. ACM Transactions on Graphics (Proceedings of SIGGRAPH) 24(3):1134– 1141 [33] Jiang N., Tan P., Cheong L.-F. (2009) Symmetric architecture modeling with a single image. ACM Transactions on Graphics (Proceedings of SIGGRAPH Asia) pp 113:1–113:8 [34] KarschK.,SunkavalliK.,HadapS.,CarrN.,JinH.,FonteR.,SittigM.,ForsythD. (2014) Automatic scene inference for 3d object compositing. ACM Transactions on Graphics (Proceedings of SIGGRAPH) 33(3) [35] Kavasidis I., Palazzo S., Salvo R. D., Giordano D., Spampinato C. (2012) A semi- automatic tool for detection and tracking ground truth generation in videos. In: Proceedings of International Workshop on Visual Interfaces for Ground Truth Col- lection in Computer Vision Applications [36] Klein G., Murray D. (2007) Parallel tracking and mapping for small AR workspaces. In: Proceedings of IEEE and ACM International Symposium on Mixed and Augmented Reality (ISMAR), Nara, Japan [37] KloseF.,WangO.,BazinJ.-C.,MagnorM.,Sorkine-HornungA.(2015)Sampling based scene-space video processing. ACM Transactions on Graphics (Proceedings of SIGGRAPH) 34(4):67:1–67:11 [38] Kopf J., Cohen M. F., Szeliski R. (2014) First-person hyper-lapse videos. ACM Transactions on Graphics (Proceedings of SIGGRAPH) 33(4):78:1–78:10 [39] Lee S. C., Nevatia R. (2011) Robust camera calibration tool for video surveillance camera in urban environment. In: CVPR Workshops [40] Lepetit V., Moreno-Noguer F., Fua P. (2009) Epnp: An accurate o(n) solution to the pnp problem. International Journal Computer Vision (IJCV) 81(2):155–166 [41] Li C., Zia Z., Tran Q.-H., Yu X., Hager G. D., Chandraker M. (2017) Deep su- pervision with shape concepts for occlusion-aware 3d object parsing. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR) [42] Li G., Liu L., Zheng H., Mitra N. J. (2010) Analysis, reconstruction and manip- ulation using arterial snakes. ACM Transactions on Graphics (ToG) 29(6):152:1– 152:10 [43] Li Y., Zheng Q., Sharf A., Cohen-Or D., Chen B., Mitra N. J. (2011) 2d-3d fusion for layer decomposition of urban facades. In: Proceedings of the International Conference on Computer Vision (ICCV) [44] Liu F., Gleicher M., Jin H., Agarwala A. (2009) Content-preserving warps for 3d video stabilization. ACM Transactions on Graphics (Proceedings of SIGGRAPH) 28(3):44:1–44:9 [45] Liu S., Wang J., Cho S., Tan P. (2014) Trackcam: 3d-aware tracking shots from consumer video. ACM Transactions on Graphics (Proceedings of SIGGRAPH Asia) 33(6):198:1–198:11 [46] Lourakis M. I. (Jul. 2004) levmar: Levenberg-marquardt nonlinear least squares algorithms in C/C++. URL http://www.ics.forth.gr/~lourakis/ levmar/ [47] Milan A., Leal-Taixe L., Reid I., Roth S., Schindler K. (2016) MOT16: A bench- mark for multi-object tracking. arXiv:1603.00831 [cs] [48] MousavianA.,AnguelovD.,FlynnJ.,KoseckaJ.(2017)3dboundingboxestima- tion using deep learning and geometry. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR) [49] Mur-Artal R., Montiel J. M. M., Tardós J. D. (2015) Orb-slam: a versatile and accurate monocular slam system. IEEE Transactions on Robotics (TRO) 31(5):1147–1163, doi: 10.1109/TRO.2015.2463671 [50] Newcombe R., Fox D., Seitz S. (2015) Dynamicfusion: Reconstruction and track- ing of non-rigid scenes in real-time. IEEE Conference on Computer Vision and Pattern Recognition (CVPR) [51] Pollefeys M., Van Gool L., Vergauwen M., Verbiest F., Cornelis K., Tops J., Koch R. (2004) Visual modeling with a hand-held camera. International Journal Com- puter Vision (IJCV) 59(3):207–232 [52] Ren S., He K., Girshick R., Sun J. (2015) Faster r-cnn: Towards real-time object detection with region proposal networks. IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI) 39:1137–1149 [53] Ros G., Sellart L., Materzynska J., Vazquez D., Lopez A. M. (2016) The synthia dataset: A large collection of synthetic images for semantic segmentation of urban scenes. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR) [54] Rüegg J., Wang O., Smolic A., Gross M. (2013) Ducttake: Spatiotemporal video compositing. Computer Graphics Forum (Proceedings of EUROGRAPH- ICS) 32(2pt1):51–61 [55] Schaefer S., McPhail T., Warren J. (2006) Image deformation using moving least squares. ACM Transactions on Graphics (Proceedings of SIGGRAPH) 25(3):533– 540 [56] SchödlA.,EssaI.A.(2002)Controlledanimationofvideosprites.In:Proceedings of ACM SIGGRAPH/Eurographics Symposium on Computer Animation, SCA ’02, pp 121–127 [57] Snavely N., Seitz S. M., Szeliski R. (2006) Photo tourism: Exploring photo col- lections in 3d. ACM Transactions on Graphics (Proceedings of SIGGRAPH) pp 835–846, doi: 10.1145/1179352.1141964 [58] Sunkavalli K., Johnson M. K., Matusik W., Pfister H. (2010) Multi-scale im- age harmonization. ACM Transactions on Graphics (Proceedings of SIGGRAPH) 29(4):125:1–125:10 [59] Sweeney C. (2013) Theia multiview geometry library: Tutorial & reference. URL http://theia-sfm.org [60] Thormählen T., Broszio H. (2007) Voodoo camera tracker [61] Vi3Dim (2011) Vi3dimv2. URL http://www.vi3dim.com [62] Vondrick C., Patterson D., Ramanan D. (2013) Efficiently scaling up crowd- sourced video annotation. International Journal Computer Vision (IJCV) 101 [63] Wang J., Bhat P., Colburn R. A., Agrawala M., Cohen M. F. (2005) Interac- tive video cutout. ACM Transactions on Graphics (Proceedings of SIGGRAPH) 24(3):585–594 [64] Wang T. Y., Kohli P., Mitra N. J. (2015) Dynamic sfm: Detecting scene changes from image pairs. Computer Graphics Forum (Proceedings of SGP) [65] Wong Y.-S., Chu H.-K., Mitra N. J. (2015) Smartannotator: An interactive tool for annotating indoor rgbd images. Computer Graphics Forum (Proceedings of EUROGRAPHICS) 34 [66] Xiang Y., Mottaghi R., Savarese S. (2014) Beyond pascal: A benchmark for 3d object detection in the wild. In: IEEE Winter Conference on Applications of Com- puter Vision (WACV) [67] Xiang Y., Alahi A., Savarese S. (2015) Learning to track: Online multi-object tracking by decision making. In: Proceedings of the International Conference on Computer Vision (ICCV), pp 4705–4713, doi: 10.1109/ICCV.2015.534 [68] Xiang Y., Choi W., Lin Y., Savarese S. (2017) Subcategory-aware convolutional neural networks for object proposals and detection. In: IEEE Winter Conference on Applications of Computer Vision (WACV) [69] Xiao J., Cao X., Foroosh H. (2006) 3d object transfer between non-overlapping videos. In: Proceedings of IEEE Virtual Reality Conference, pp 127–134 [70] Xu F., Liu Y., Stoll C., Tompkin J., Bharaj G., Dai Q., Seidel H.-P., Kautz J., Theobalt C. (2011) Video-based characters: Creating new human performances from a multi-view video database. ACM Transactions on Graphics (Proceedings of SIGGRAPH) 30(4):32:1–32:10 [71] Yang C. M., Choo Y., Park S. (2018) Semi-automatic image and video annotation system for generating ground truth information. In: International Conference on Information Networking (ICOIN) [72] YuF.,FoleyS.,ChenH.,BaiH.,XianW.,ChenY.,WangX.,DarrellT.,Gonzalez J., Hays J. (2018) Scalabel. URL http://www.scalabel.ai [73] Zhang G., Dong Z., Jia J., Wan L., Wong T.-T., Bao H. (2009) Refilming with depth-inferred videos. IEEE Transactions on Visualization and Computer Graph- ics (TVCG) 15(5):828–840 [74] Zheng Y., Chen X., Cheng M.-M., Zhou K., Hu S.-M., Mitra N. J. (2012) Interac- tive images: Cuboid proxies for smart image manipulation. ACM Transactions on Graphics (Proceedings of SIGGRAPH) 31(4):99:1–99:11 [75] Zheng Y., Liu H., Dorsey J., Mitra N. J. (2016) Smartcanvas: Context-inferred in- terpretation of sketches for preparatory design studies. Computer Graphics Forum (Proceedings of EUROGRAPHICS) 35(2):37–48, doi: 10.1111/cgf.12809 [76] Zhong F., Yang S., Qin X., Lischinski D., Cohen-Or D., Chen B. (2014) Slippage- free background replacement for hand-held video. ACM Transactions on Graphics (Proceedings of SIGGRAPH Asia) 33(6):199:1–199:11 |