|
|