|
[1] Garcia Lopez, P., Montresor, A., Epema, D., Datta, A., Higashino, T., Iamnitchi, A., Adriana, I., Marinho, B., Pascal, F. & Riviere, E, “Edge-centric Computing: Vision and Challenges,” ACM SIGCOMM Computer Communication Review, vol. 45, no. 5, pp. 37–42, 2015. [2] Lockerman, E., Feldmann, A., Bakhshalipour, M., Stanescu, A., Gupta, S., Sanchez, D., & Beckmann, N., “Livia: Data-Centric Computing Throughout the Memory Hierarchy,” in Proceedings of the Twenty-Fifth International Conference on Architectural Support for Programming Languages and Operating Systems, pp. 417–433, March 2020. [3] Chen, C. P., & Zhang, C. Y., “Data-intensive applications, challenges, techniques and technologies: A survey on Big Data,” Information sciences, vol. 275, pp. 314–347, August 2014. [4] Caulfield, A. M., Grupp, L. M., & Swanson, S., “Gordon: Using Flash Memory to Build Fast, Power-efficient Clusters for Data-intensive Applications,” ACM SIGPLAN Notices, vol. 44, no. 3, pp. 217–228, 2019. [5] Fevgas, A., Akritidis, L., Bozanis, P., & Manolopoulos, Y., “Indexing in flash storage devices: a survey on challenges, current approaches, and future trends,” The VLDB Journal, vol. 29, no. 1, pp. 273–311, 2020. [6] Lee, W., Kang, M., Hong, S., & Kim, S., “Interpage-Based Endurance-Enhancing Lower State Encoding for MLC and TLC Flash Memory Storages,” IEEE Transactions on Very Large Scale Integration (VLSI) Systems, vol. 27, no. 9, pp. 2033–2045, September 2019. [7] Cai, Y., Luo, Y., Haratsch, E. F., Mai, K., & Mutlu, O., “Data Retention in MLC NAND Flash Memory: Characterization, Optimization, and Recovery,” in 2015 IEEE 21st International Symposium on High Performance Computer Architecture (HPCA), pp. 551–563, February 2015. [8] Liang, Y. P., Chen, T. Y., Chang, Y. H., Chen, S. H., Wei, H. W., & Shih, W. K., “B^*-Sort: Enabling Write-once Sorting for Nonvolatile Memory,” IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, vol. 39, no. 12, pp. 4549–4562, 2020. [9] Yu, S., Sun, X., Peng, X., & Huang, S., “Compute-in-Memory with Emerging Nonvolatile-Memories: Challenges and Prospects,” in 2020 IEEE Custom Integrated Circuits Conference (CICC), March 2020, pp. 1–4. [10] Venkataraman, S., Tolia, N., Ranganathan, P., & Campbell, R. H., “Consistent and Durable Data Structures for Non-Volatile Byte-Addressable Memory,” in 9th USENIX Conference on File and Storage Technologies (FAST 11), 2011. [11] Le Gallo, M., & Sebastian, A., “An overview of phase-change memory device physics,” Journal of Physics D: Applied Physics, vol. 53, no. 21, pp. 213002, 2020. [12] Zhou, W., Feng, D., Hua, Y., Liu, J., Huang, F., & Chen, Y., “An Efficient Parallel Scheduling Scheme on Multi-partition PCM Architecture,” in Proceedings of the 2016 International Symposium on Low Power Electronics and Design, August 2016, pp. 344–349. [13] Wang, R., Jiang, L., Zhang, Y., Wang, L., & Yang, J., “Exploit Imbalanced Cell Writes to Mitigate Write Disturbance in Dense Phase Change Memory,” in 52nd ACM/EDAC/IEEE Design Automation Conference (DAC), June 2015, pp. 1–6. [14] Chen, Y., Wong, W. F., Li, H., & Koh, C. K., “Processor Caches Built Using Multi-Level Spin-Transfer Torque RAM Cells,” in IEEE/ACM International Symposium on Low Power Electronics and Design, August 2011, pp. 73–78. [15] Mao, M., Li, H., Jones, A. K., & Chen, Y., “Coordinating Prefetching and STT-RAM based Last-level Cache Management for Multicore Systems,” in Proceedings of the 23rd ACM International Conference on Great Lakes Symposium on VLSI (GLSVLSI), May 2013, pp. 55–60. [16] Kültürsay, E., Kandemir, M., Sivasubramaniam, A., & Mutlu, O., “Evaluating STT-RAM as an Energy-Efficient Main Memory Alternative,” in 2013 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS), April 2013, pp. 256–267. [17] Xu, C., Niu, D., Zheng, Y., Yu, S., & Xie, Y., “Reliability-Aware Cross-Point Resistive Memory Design,” in Proceedings of the 24th Edition of the Great Lakes Symposium on VLSI (GLSVLSI), May 2014, pp. 145–150. [18] Wu, H., Wang, X. H., Gao, B., Deng, N., Lu, Z., Haukness, B., Bronner, G., & Qian, H., “Resistive Random Access Memory for Future Information Processing System,” in Proceedings of the IEEE, vol. 105, no. 9, pp. 1770–1789, 2017. [19] Fong, S. W., Neumann, C. M., & Wong, H. S. P., “Phase-Change Memory—Towards a Storage-Class Memory,” in IEEE Transactions on Electron Devices, vol. 64, no. 11, pp. 4374–4385, November 2017. [20] Oukid, I., & Lersch, L., “On the Diversity of Memory and Storage Technologies,” Datenbank-Spektrum, vol. 18, no. 2, pp. 121–127, June 2018. [21]Parkin, S. S., Hayashi, M., & Thomas, L., “Magnetic Domain-Wall Racetrack Memory,” Science, vol. 320, no. 5873, pp. 190–194, 2008. [22] Parkin, S., & Yang, S. H., “Memory on the racetrack,” Nature nanotechnology, vol. 10, no. 3, pp. 195–198, 2015. [23] Parkin, S. S., “Data in the Fast Lanes of RACETRACK MEMORY,” Scientific American, vol. 300, no. 6, pp. 76–81, 2009. [24] Thomas, L., Yang, S. H., Ryu, K. S., Hughes, B., Rettner, C., Wang, D. S., Tsai, C. H., Shen, K. H., & Parkin, S. S., “Racetrack Memory: a high-performance, low-cost, non-volatile memory based on magnetic domain walls,” in 2011 International Electron Devices Meeting, December 2011, pp. 24.2.1–24.2.4. [25] Hu, Q., Sun, G., Shu, J., & Zhang, C., “Exploring Main Memory Design Based on Racetrack Memory Technology,” in Proceedings of the 26th edition on Great Lakes Symposium on VLSI (GLSVLSI), May 2016, pp. 397–402. [26] Kang, W., Zheng, C., Huang, Y., Zhang, X., Zhou, Y., Lv, W., & Zhao, W., “Complementary Skyrmion Racetrack Memory With Voltage Manipulation,” in IEEE Electron Device Letters, vol. 37, no. 7, pp. 924–927, 2016. [27] Kang, W., Huang, Y., Zheng, C., Lv, W., Lei, N., Zhang, Y., Zhang, X., Zhou, Y., & Zhao, W., “Voltage Controlled Magnetic Skyrmion Motion for Racetrack Memory,” Scientific reports, vol. 6, no. 23164, pp. 1–11, 2016. [28] Xu, R., Sha, E. H. M., Zhuge, Q., Gu, S., & Shi, L., “Optimizing Data Placement for Hybrid SPM with SRAM and Racetrack Memory,” in 2020 IEEE 38th International Conference on Computer Design (ICCD), October 2020, pp. 409–416. [29] Sun, Z., Bi, X., Jones, A. K., & Li, H., “Design Exploration of Racetrack Lower-level Caches,” in 2014 IEEE/ACM International Symposium on Low Power Electronics and Design (ISLPED), August 2014, pp. 263–266. [30] Sun, Z., Wu, W., & Li, H., “Cross-Layer Racetrack Memory Design for Ultra High Density and Low Power Consumption,” in 2013 50th ACM/EDAC/IEEE Design Automation Conference (DAC), May 2013, pp. 1–6. [31] Chang, C. H., & Chang, C. W., “Adaptive Memory and Storage Fusion on Non-Volatile One-Memory System,” in 2019 IEEE Non-Volatile Memory Systems and Applications Symposium (NVMSA), August 2019, pp. 1–6. [32] Gu, S., Sha, E. H. M., Zhuge, Q., Chen, Y., & Hu, J., “A Time, Energy, and Area Efficient Domain Wall Memory-Based SPM for Embedded Systems,” IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, vol. 35, no. 12, pp. 2008–2017, December 2016. [33] Zhu, D., Kang, W., Li, S., Huang, Y., Zhang, X., Zhou, Y., & Zhao, W., “Skyrmion Racetrack Memory With Random Information Update/Deletion/Insertion,” in IEEE Transactions on Electron Devices, vol. 65, no. 1, pp. 87–95, January 2018. [34] Kang, W., Chen, X., Zhu, D., Zhang, X., Zhou, Y., Qiu, K., Zhang, Y., & Zhao, W., “A Comparative Study on Racetrack Memories: Domain Wall vs Skyrmion,” in 2018 IEEE 7th Non-Volatile Memory Systems and Applications Symposium (NVMSA), August 2018, pp. 7–12. [35] Kang, W., Wu, B., Chen, X., Zhu, D., Wang, Z., Zhang, X., Zhou, Y., Zhang, Y., & Zhao, W., “A Comparative Cross-layer Study on Racetrack Memories: Domain Wall vs Skyrmion,” ACM Journal on Emerging Technologies in Computing Systems (JETC), vol. 16, no. 1, pp. 1–17, 2019. [36] Chen, F., Li, Z., Kang, W., Zhao, W., Li, H., & Chen, Y., “Process Variation Aware Data Management for Magnetic Skyrmions Racetrack Memory,” in 2018 23rd Asia and South Pacific Design Automation Conference (ASP-DAC), January 2018, pp. 221–226. [37] Parkin, S., “Racetrack Memory: A storage class memory based on current controlled magnetic domain wall motion,” in 2009 Device Research Conference, June 2009, pp. 3–6. [38] Tomasello, R., Martinez, E., Zivieri, R., Torres, L., Carpentieri, M., & Finocchio, G., “A strategy for the design of skyrmion racetrack memories,” Scientific reports, vol. 4, no. 6784, pp. 1–7, 2014. [39] Bose, R. C., & Nelson, R. J., “A Sorting Problem,” Journal of the ACM (JACM), vol. 9, no. 2, pp. 282–296, 1962. [40] Venkatesan, R., Kozhikkottu, V., Augustine, C., Raychowdhury, A., Roy, K., & Raghunathan, A., “TapeCache: A high Density, Energy Efficient Cache Based on Domain Wall Memory,” in Proceedings of the 2012 ACM/IEEE International Symposium on Low Power Electronics and Design (ISLPED), July 2012, pp. 185–190. [41] Nolte, T., Hansson, H., Norström, C., & Punnekkat, S., “Using bit-stuffing distributions in CAN analysis,” in IEEE Real-Time Embedded Systems Workshop at the Real-Time Systems Symposium, December 2001. [42] Alaei, R., Moallem, P., & Bohlooli, A., “Statistical based algorithm for reducing bit stuffing in the Controller Area Networks,” Microelectronics Journal, vol. 101, p. 104794, 2020. [43] Tang, D. T., & Bahl, L. R., “Block Codes for a Class of Constrained Noiseless Channels,” Information and Control, vol. 17, no. 5, pp. 436– 461, 1970. [44] Le, D. D., Nguyen, D. P., Tran, T. H., & Nakashima, Y., “Joint polar and run-length limited decoding scheme for visible light communication systems,” IEICE Communications Express, vol. 7, no. 1, pp. 19–24, 2018. [45] Yang, T. Y., Yang, M. C., Li, J., & Kang, W., “Permutation-Write: Optimizing Write Performance and Energy for Skyrmion Racetrack Memory,” in 2020 57th ACM/IEEE Design Automation Conference (DAC), July 2020, pp. 1–6. [46] Zhang, C., Sun, G., Zhang, X., Zhang, W., Zhao, W., Wang, T., Liang, Y., Liu, Y., Wang, Y., & Shu, J., “Hi-fi Playback: Tolerating Position Errors in Shift Operations of Racetrack Memory,” in Proceedings of the 42nd Annual International Symposium on Computer Architecture (ISCA), vol. 43, no. 3, pp. 694–706, June 2015. [47] Mao, H., Zhang, C., Sun, G., & Shu, J., “Exploring Data Placement in Racetrack Memory based Scratchpad Memory,” in 2015 IEEE Non-Volatile Memory System and Applications Symposium (NVMSA), August 2015, pp. 1–5. [48] Gu, S., Sha, E. H. M., Zhuge, Q., Chen, Y., & Hu, J., “Area and Performance Co-optimization for Domain Wall Memory in Application-specific Embedded Systems,” in Proceedings of the 52nd Annual Design Automation Conference, June 2015, pp. 1–6. [49] Mühlbauer, S., Binz, B., Jonietz, F., Pfleiderer, C., Rosch, A., Neubauer, A., Georgii, R., & Böni, P., “Skyrmion Lattice in a Chiral Magnet,” Science, vol. 323, no. 5916, pp. 915–919, 2009. [50] Liang, Z., Sun, G., Kang, W., Chen, X., & Zhao, W., “ZUMA: Enabling Direct Insertion/Deletion Operations with Emerging Skyrmion Racetrack Memory,” in 2019 56th ACM/IEEE Design Automation Conference (DAC), June 2019, pp. 1–6. [51] Cormen, T. H., Leiserson, C. E., Rivest, R. L., & Stein, C., Introduction to Algorithms, MIT Press, 2022. [52] Jadoon, S., Solehria, S. F., & Qayum, M., “Optimized Selection Sort Algorithm is faster than Insertion Sort Algorithm: a Comparative Study,” International Journal of Electrical & Computer Sciences IJECS-IJENS, vol. 11, no. 2, pp. 19–24, 2011. [53] Zhang, C., Sun, G., Zhang, W., Mi, F., Li, H., & Zhao, W., “Quantitative Modeling of Racetrack Memory, A Tradeoff among Area, Performance, and Power,” in 20th Asia and South Pacific Design Automation Conference (ASP-DAC), January 2015, pp. 100–105. [54] Khan, A. A., Hameed, F., Bläsing, R., Parkin, S., & Castrillon, J., “RTSim: A Cycle-Accurate Simulator for Racetrack Memories,” in IEEE Computer Architecture Letters, vol. 18, no. 1, pp. 43–46, 2019. [55] Hsieh, Y. S., Huang, P. C., Chen, P. X., Chang, Y. H., Kang, W., Yang, M. C., & Shih, W. K., “Shift-limited Sort: Optimizing Sorting Performance on Skyrmion Memory-Based Systems,” in IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, vol. 39, no. 11, pp. 4115–4128, 2020. [56] Chen, T. Y., Chang, Y. H., Kuan, Y. H., Yang, M. C., Chang, Y. M., & Hsiu, P. C., “Enhancing Flash Memory Reliability by Jointly Considering Write-back Pattern and Block Endurance,” ACM Transactions on Design Automation of Electronic Systems (TODAES), vol. 23, no. 5, pp. 1–24, 2018. [57] Chen, S. H., Yang, M. C., & Chang, Y. H., “Optimizing Lifetime Capacity and Read Performance of Bit-Alterable 3-D NAND Flash,” in IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, vol. 40, no. 2, pp. 218–231, 2020. [58] Chen, S. H., Tsao, C. W., & Chang, Y. H., “Beyond address mapping: A user-oriented multiregional space management design for 3-D NAND flash memory,” in IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, vol. 39, no. 6, pp. 1286–1299, June 2020. [59] Jin, H., Cheng, P., & Zhang, J., “Buffer System for Optical Storage System,” in 1997 IEEE Pacific Rim Conference on Communications, Computers and Signal Processing, PACRIM, 10 Years Networking the Pacific Rim, 1987-1997, August 1997, pp. 134–137 vol. 1. [60] Archer, S., Mappouras, G., Calderbank, R., & Sorin, D., “Foosball Coding: Correcting Shift Errors and Bit Flip Errors in 3D Racetrack Memory,” in 2020 50th Annual IEEE/IFIP International Conference on Dependable Systems and Networks (DSN), June 2020, pp. 331–342. [61] Cho, S., & Lee, H., “Flip-N-Write: A Simple Deterministic Technique to Improve PRAM Write Performance, Energy and Endurance,” in Proceedings of the 42nd Annual IEEE/ACM International Symposium on Microarchitecture, December 2009, pp. 347–357. [62] Alsuwaiyan, A., & Mohanram, K., “MFNW: An MLC/TLC Flip-N-Write Architecture,” ACM Journal on Emerging Technologies in Computing Systems (JETC), vol. 14, no. 2, pp. 1–32, 2018. [63] Palangappa, P. M., & Mohanram, K., “Flip-Mirror-Rotate: An Architecture for Bit-write Reduction and Wear Leveling in Non-volatile Memories,” in Proceedings of the 25th Edition on Great Lakes Symposium on VLSI (GLSVLSI), May 2015, pp. 221–224. [64] Von Puttkamer, E., “A Simple Hardware Buddy System Memory Allocator,” in IEEE Transactions on Computers, vol. C-24, no. 10, pp. 953– 957, 1975. [65] Bonwick, J., “The Slab Allocator: An Object-Caching Kernel Memory Allocator,” in USENIX Summer 1994 Technical Conference, vol. 16, June 1994. [66] Cooper, B. F., Silberstein, A., Tam, E., Ramakrishnan, R., & Sears, R., “Benchmarking Cloud Serving Systems with YCSB,” in Proceedings of the 1st ACM Symposium on Cloud Computing, June 2010, pp. 143–154. [67] Zhang, X., Ezawa, M., & Zhou, Y., “Magnetic skyrmion logic gates: conversion, duplication and merging of skyrmions,” Scientific reports, vol. 5, no. 1, pp. 1–8, 2015. [68] Chauwin, M., Hu, X., Garcia-Sanchez, F., Betrabet, N., Paler, A., Moutafis, C., & Friedman, J. S., “Skyrmion Logic System for Large-Scale Reversible Computation,” Physical Review Applied, vol. 12, p. 064053, 2019. [69] Zhang, H., Zhu, D., Kang, W., Zhang, Y., & Zhao, W., “Stochastic Computing Implemented by Skyrmionic Logic Devices,” Physical Review Applied, vol. 13, p. 054049, 2020. [70] Zokaee, F., Chen, F., Sun, G., & Jiang, L., “Sky-Sorter: A Processing-in-Memory Architecture for Large-Scale Sorting,” in IEEE Transactions on Computers, 2022. [71] Choong, B. C. M., Luo, T., Liu, C., He, B., Zhang, W., & Zhou, J. T., “Hardware-software co-exploration with racetrack memory based in-memory computing for CNN inference in embedded systems,” Journal of Systems Architecture, vol. 128, p. 102507, 2022. [72] Ollivier, S., Zhang, X., Tang, Y., Choudhuri, C., Hu, J., & Jones, A. K., “FPIRM: Floating-point Processing in Racetrack Memories,” arXiv preprint arXiv:2204.13788, 2022. [73] Lo, T. S., Wu, C. F., Chang, Y. H., Kuo, T. W., & Wang, W. C., “Space-efficient Graph Data Placement to Save Energy of ReRAM Crossbar,” in 2021 IEEE/ACM International Symposium on Low Power Electronics and Design (ISLPED), July 2021, pp. 1–6. [74] Kang, Y. W., Wu, C. F., Chang, Y. H., Kuo, T. W., & Ho, S. Y., “On Minimizing Analog Variation Errors to Resolve the Scalability Issue of ReRAM-based Crossbar Accelerators,” in IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, vol. 39, no. 11, pp. 3856–3867, 2020. [75] Shafiee, A., Nag, A., Muralimanohar, N., Balasubramonian, R., Strachan, J. P., Hu, M., Williams, R. S. & Srikumar, V., “ISAAC: A Convolutional Neural Network Accelerator with In-Situ Analog Arithmetic in Crossbars,” ACM SIGARCH Computer Architecture News, vol. 44, no. 3, pp. 14–26, June 2016. [76] Chi, P., Li, S., Xu, C., Zhang, T., Zhao, J., Liu, Y., Wang, Y., & Xie, Y., “PRIME: A Novel Processing-in-memory Architecture for Neural Network Computation in ReRAM-based Main Memory,” ACM SIGARCH Computer Architecture News, vol. 44, no.3, pp. 27–39, June 2016. [77] Ahn, J., Yoo, S., Mutlu, O., & Choi, K., “PIM-Enabled Instructions: A Low-Overhead, Locality-Aware Processing-in-memory Architecture,” in 2015 ACM/IEEE 42nd Annual International Symposium on Computer Architecture (ISCA), pp. 336–348, June 2015. [78] Wang, S., Cao, J., & Yu, P., “Deep Learning for Spatio-Temporal Data Mining: A Survey,” in IEEE Transactions on Knowledge and Data Engineering, 2020. [79] Najafabadi, M. M., Villanustre, F., Khoshgoftaar, T. M., Seliya, N., Wald, R., & Muharemagic, E., “Deep learning applications and challenges in big data analytics,” Journal of Big Data, vol. 2, no. 1, pp. 1–21, 2015. [80] Nisbet, R., Elder, J., & Miner, G., Handbook of Statistical Analysis and Data Mining Applications, Academic press, 2009. [81] Burr, G. W., Shelby, R. M., Sebastian, A., Kim, S., Kim, S., Sidler, S., Virwani, K., Ishii, M., Narayanan, P., Fumarola, A., Sanches, L. L., Boybat, I., Gallo, M. L., Moon, K., Woo, J., Hwang, H. & Leblebici, Y., “Neuromorphic computing using non-volatile memory,” Advances in Physics: X, vol. 2, no. 1, pp. 89–124, 2017. [82] Jang, J. W., Park, S., Jeong, Y. H., & Hwang, H., “ReRAM-based Synaptic Device for Neuromorphic Computing,” in 2014 IEEE International Symposium on Circuits and Systems (ISCAS), June 2014, pp. 1054–1057. [83] Schrauwen, B., Verstraeten, D., & Van Campenhout, J., “An overview of reservoir computing: theory, applications and implementations,” in Proceedings of the 15th European Symposium on Artificial Neural Networks (ESANN), 2007, pp. 471–482. [84] Li, S., Kang, W., Zhang, X., Nie, T., Zhou, Y., Wang, K. L., & Zhao, W. “Magnetic Skyrmions for Unconventional Computing,” Materials Horizons, vol. 8, no. 3, pp. 854–868, 2021. [85] Zhang, Y., Qu, P., Ji, Y., Zhang, W., Gao, G., Wang, G., Song, S., Li, G., Chen, W., Zheng, W., Chen, F., Pei, J., Zhao, R., Zhao, M., & Shi, L., “A system hierarchy for brain-inspired computing,” Nature, vol. 586, no. 7829, pp. 378–384, 2020. [86] Khan, A. A., Ollivier, S., Longofono, S., Hempel, G., Castrillon, J., & Jones, A. K., “Brain-inspired Cognition in Next Generation Racetrack Memories,” arXiv preprint arXiv:2111.02246, 2021.
|