|
[1] Rafael Ubal, Julio Sahuquillo, Salvador Petit, and Pedro Lopez, \Multi2sim: A simu- lation framework to evaluate multicore-multithreaded processors", in Computer Archi- tecture and High Performance Computing, 2007. SBAC-PAD 2007. 19th International Symposium on, 2007, pp. 62{68. [2] Niket Agarwal, Tushar Krishna, Li-Shiuan Peh, and Niraj K Jha, \Garnet: A detailed on- chip network model inside a full-system simulator", in Performance Analysis of Systems and Software, 2009. ISPASS 2009. IEEE International Symposium on. IEEE, 2009, pp. 33{42. [3] Joel Hestness, Boris Grot, and Stephen W Keckler, \Netrace: dependency-driven trace- based network-on-chip simulation", in Proceedings of the Third International Workshop on Network on Chip Architectures. ACM, 2010, pp. 31{36. [4] \http://developer.amd.com/tools-and-sdks/opencl-zone/amd-accelerated-parallel- processing-app-sdk/". 33 [5] Nan Jiang, Daniel U Becker, George Michelogiannakis, James Balfour, Brian Towles, David E Shaw, Jung-Ho Kim, and William J Dally, \A detailed and exible cycle- accurate network-on-chip simulator", in Performance Analysis of Systems and Software (ISPASS), 2013 IEEE International Symposium on. IEEE, 2013, pp. 86{96. [6] Ali Bakhoda, George L Yuan, Wilson WL Fung, Henry Wong, and Tor M Aamodt, \Analyzing cuda workloads using a detailed gpu simulator", in Performance Analysis of Systems and Software, 2009. ISPASS 2009. IEEE International Symposium on. IEEE, 2009, pp. 163{174. [7] Karen MacDonald, Christopher Nitta, Matthew Farrens, and Venkatesh Akella, \Pdg gen: A methodology for fast and accurate simulation of on-chip networks", Com- puters, IEEE Transactions on, vol. 63, no. 3, pp. 650{663, 2014. |