|
[1] Jaleel, Aamer. "Memory characterization of workloads using instrumentation-driven simulation." Web Copy: http://www. glue. umd. edu/ajaleel/workload (2010). [2] Alipour, Mehdi, Kamran Moshari, and Mohammad Reza Bagheri. "Performance per power optimum cache architecture for embedded applications, a design space exploration." Networked Embedded Systems for Enterprise Applications (NESEA), 2011. [3] Alipour, Mehdi, and Mostafa E. Salehi. "Design Space Exploration to Find the Optimum Cache and Register File Size for Embedded Applications." Internatinal Conference on Embedded System and Applications , 2012. [4] Nawinne, Isuru, and Sri Parameswaran. "A survey on exact cache design space exploration methodologies for application specific SoC memory hierarchies." Industrial and Information Systems (ICIIS), 2013. [5] Mattson, Richard L., et al. "Evaluation techniques for storage hierarchies." IBM Systems journal 9.2 (1970): 78-117. [6] Berg, Erik, and Erik Hagersten. "StatCache: a probabilistic approach to efficient and accurate data locality analysis." Performance Analysis of Systems and Software, 2004 IEEE International Symposium on-ISPASS. IEEE, 2004. [7] Niu, Qingpeng, et al. "PARDA: A fast parallel reuse distance analysis algorithm." Parallel & Distributed Processing Symposium (IPDPS), 2012 IEEE 26th International. IEEE, 2012. [8] Qureshi, Moinuddin K., et al. "Adaptive insertion policies for high performance caching." ACM SIGARCH Computer Architecture News. Vol. 35. No. 2. ACM, 2007. [9] Jaleel, Aamer, et al. "High performance cache replacement using re-reference interval prediction (RRIP)." ACM SIGARCH Computer Architecture News. Vol. 38. No. 3. ACM, 2010. [10] Qureshi, Moinuddin K., and Yale N. Patt. "Utility-based cache partitioning: A low-overhead, high-performance, runtime mechanism to partition shared caches." Proceedings of the 39th Annual IEEE/ACM International Symposium on Microarchitecture. IEEE Computer Society, 2006. [11] Sanchez, Daniel, and Christos Kozyrakis. "Vantage: scalable and efficient fine-grain cache partitioning." ACM SIGARCH Computer Architecture News. Vol. 39. No. 3. ACM, 2011. [12] Sundararajan, Karthik T., et al. "Cooperative partitioning: Energy-efficient cache partitioning for high-performance CMPs." High Performance Computer Architecture (HPCA), 2012 IEEE 18th International Symposium on. IEEE, 2012. [13] Jouppi, Norman P., and Steven JE Wilton. "Tradeoffs in two-level on-chip caching." Computer Architecture, 1994., Proceedings the 21st Annual International Symposium on. IEEE, 1994. [14] Hamerly, Greg, et al. "Simpoint 3.0: Faster and more flexible program phase analysis." Journal of Instruction Level Parallelism 7.4 (2005): 1-28. [15] Luk, Chi-Keung, et al. "Pin: building customized program analysis tools with dynamic instrumentation." ACM Sigplan Notices. Vol. 40. No. 6. ACM, 2005. [16] Burger, Doug, and Todd M. Austin. "The SimpleScalar tool set, version 2.0." ACM SIGARCH Computer Architecture News 25.3 (1997): 13-25. [17] Henning, John L. "SPEC CPU2006 benchmark descriptions." ACM SIGARCH Computer Architecture News 34.4 (2006): 1-17. [18] Guan, Nan, et al. "WCET analysis with MRU cache: Challenging LRU for predictability." ACM Transactions on Embedded Computing Systems (TECS) 13.4s (2014): 123. [19] Wu, C. J., Jaleel, A., Martonosi, M., Steely Jr, S. C., & Emer, J. (2011, December). PACMan: prefetch-aware cache management for high performance caching. In Proceedings of the 44th Annual IEEE/ACM International Symposium on Microarchitecture (pp. 442-453). ACM. [20] [1] Zhang, Chuanjun, Frank Vahid, and Walid Najjar. "A highly configurable cache architecture for embedded systems." Computer Architecture, 2003. Proceedings. 30th Annual International Symposium on. IEEE, 2003. [21] [2] Sharifi, Akbar, et al. "PEPON: performance-aware hierarchical power budgeting for NoC based multicores." Proceedings of the 21st international conference on Parallel architectures and compilation techniques. ACM, 2012. [22] Wilton, Steven JE, and Norman P. Jouppi. "CACTI: An enhanced cache access and cycle time model." Solid-State Circuits, IEEE Journal of 31.5 (1996): 677-688. [23] http://www.intel.com/content/www/us/en/processors/core/core-i7-processor.html
|