帳號:guest(3.17.77.42)          離開系統
字體大小: 字級放大   字級縮小   預設字形  

詳目顯示

以作者查詢圖書館館藏以作者查詢臺灣博碩士論文系統以作者查詢全國書目
作者(中文):徐浩哲
作者(外文):Hsu, Hao Che
論文名稱(中文):HSA 模擬平台支援記憶體共享與多個運算單元
論文名稱(外文):An HSA Emulation Platform with SVM and Multiple Computing Devices Support
指導教授(中文):鍾葉青
金仲達
指導教授(外文):Chung, Yeh Ching
King, Chung Ta
口試委員(中文):徐慰中
洪士灝
口試委員(外文):Hsu, Wei Chung
Hung, Shih Hao
學位類別:碩士
校院名稱:國立清華大學
系所名稱:資訊工程學系
學號:103062701
出版年(民國):105
畢業學年度:104
語文別:英文
論文頁數:38
中文關鍵詞:HSA異質架構記憶體共享模擬器平行化運算
外文關鍵詞:HSASVMMultiple Computing DevicesEmulatorParallel Computing
相關次數:
  • 推薦推薦:0
  • 點閱點閱:310
  • 評分評分:*****
  • 下載下載:0
  • 收藏收藏:0
隨著計算的複雜度提高以及資料量的增加,我們需要進一步的研發下一個世代的 計算機架構以提升計算機的運算能力,異質運算是近年常被研究及提出來的解決方 案之一,特別是在行動式裝置興起後,因為 SoC 的大量應用與發展,異質架構的整 合變為重要的議題。
HSA (Heterogeneous System Architecture) 是一系列的開放標準,定義了軟體 層與硬體架構,其致力於提供一個異質架構整合的平台,使得異質運算能充份展它 的優勢,也讓應用程式開發者能夠以相對於以往更簡潔的方式運用異質運算加速。
HSA 提供了許多特性,例如:訊號的傳遞、全域共享記憶體的存取、工作分派 的機制,並定義了硬體必須支援的層面,其中有兩個特點:一是共享記憶體的機 制,希望透過記憶體的共享減少設備間的溝通負擔,其二是多個運算設備的平行運 算以達到多工的優勢。
根據 HSA 和 OpenCL 2.0 的標準我們開發了 HSAemu 2.0 ,模擬 HSA 架構並提 供給前期的應用程式開發,這篇論文使 HSAemu 2.0 更進一步模擬 HSA 記憶體管 理單元,透過此單元探討記憶體共享的機制,除此之外也探討了多個運算設備在 多工處理及共享記憶體上使用的優勢。
To solve the increasing computing complexity and proliferation of data, there is much research about the next-generation architectures. Heterogeneous computing is one of the solutions to achieve the goals of high performance and efficient power consumption.
Heterogeneous System Architecture (HSA) is a series of standards including the software stack and hardware architecture for heterogeneous computing. HSA has goals to build a friendly platform for multiple devices by providing Shared Virtual Memory (SVM) to reduce the overhead of the data transfer between devices. Not only does the platform improve performance on the heterogeneous architecture but it reduces complexity of programming on heterogeneous computing.
To simulate the new architecture in detail and provide a platform for developing applicationsintheearlystage,webuiltHSAemu2.0,anHSAemulationplatform. In this thesis, we enhance HSAemu 2.0 by implementing the HSA soft-MMU (Software Memory Management Unit) to achieve the goal of shared virtual memory (SVM). In addition, we make HSAemu 2.0 to support multiple computing devices including multi-core CPU and three GPU emulators. The emulation platform proves that utilizing the HSA MMU gets better performance than without HSA MMU and provides a friendly interface for multiple computing devices.
摘要 ...... 2
Abstract ...... 3
Content ...... 4
List of Figures ...... 5
List of Tables ...... 6
Chapter 1 Introduction ...... 7
Chapter 2 Related Works ...... 10
Chapter 3 Background ...... 13
Chapter 4 HSA MMU and Multiple Computing Devices ...... 20
Chapter 5 Experimental Results ...... 28
Chapter 6 Conclusion and Future Works ...... 36
Reference ...... 37
[1] P. Ross, "Why CPU Frequency Stalled," in IEEE Spectrum, 2008.
[2] John Nickolls and William J. Dally, "The GPU Computing Era," in IEEE Micro,
2010.
[3] W.Tang, B.Duan and C. Zhang, "Accelerating Millions of Short Reads Mapping
on a Heterogeneous Architecture with FPGA Accelerator," in FCCM, 2012.
[4] G. Teodoro et al., "Coordinating the use of GPU and CPU for improving performance of compute intensive applications," in Cluster Computing and Workshops, 2009.
[5] Enhua Wu and Youquan Liu, "Emerging technology about GPGPU," in APCCAS, 2008.
[6] HSA Foundation, "HSA System Architecture Specification 1.0," 2015. [Online]. Available: http://www.hsafoundation.com/standards/.
[7] HSA Foundation, "HSA Programmer Reference Manual Specification 1.01," 2015. [Online]. Available: http://ww.hsafoundation.com/standards/.
[8] HSA Foundation , "HSA Runtime Specification 1.0. 2015," [Online]. Available: http://www.hsafoundation.com/standards/.
[9] B.-C. Jeng, "HSAemu Framework," 2014.
[10] W.-C. Sun, "An OpenCL 2.0 Runtime Based on HSA Runtime," 2015.
37
[11] Jui Hsiao, "An OpenCL 2.0 Compilation Framework for HSA," 2015.
[12] F. Peng, "OpenCL 2.0 Enabled HSA Hardware Emulation," 2015.
[13] Fabrice Bellard, "QEMU, a fast and portable dynamic translator," in USENIX, 2005.
[14] J.H. Ding et al, "PQEMU: a parallel system emulator based on QEMU," in ICPADS, 276-283, 2004.
[15] Z. Wang et al., "COREMU: a scalable and portable parallel full-system emulator.," in PPoPP, 2011.
[16] Nathan Binkert et al, "The gem5 simulator," in ACM SIGRACH Computer Architecture News, 1-7, 2011.
[17] Ali Bakhoda et al, "Analyzing CUDA workloads using a detailed GPU simulator.," pp. 163-174, 2009.
[18] R. Ubal et al, "Multi2Sim: a simulation framework to evaluate multicore- multithread processors.," in HPCA, 2007.
[19] Hestness J. et al., "gem5-gpu: A Heterogeneous CPU-GPU Simulator," in IEEE Computer Architecture Letters 14, 2014.
[20] C.-Y. Kuo, "Implementation of Image Feature Supports in HSAemu Framework," 2015.
[21] Intel, "OpenCL 2.0 Shared Virtual Memory Overview," 2014. [Online]. Available: https://software.intel.com/en-us/articles/opencl-20-shared-virtual-memory- overview.
(此全文限內部瀏覽)
電子全文
摘要檔
 
 
 
 
第一頁 上一頁 下一頁 最後一頁 top
* *