【Abstract】:
Efficient resource utilization requires that emerging datacenter interconnects support both high-performance communication and efficient remote resource sharing. Th
【Institution】:
State Key Laboratory of Computer Architecture
【Funding】:
This work was supported by the Strategic Priority Research Program of the Chinese Academy of Sciences; the National Natural Science Foundation of China
Excerpt from the paper
Efficient resource utilization requires that emerging datacenter interconnects support both high-performance communication and efficient remote resource sharing. These goals require that the network be more tightly coupled with the CPU chips. Designing a new interconnection technology thus requires considering not only the interconnection itself, but also the design of the processors that will rely on it. In this paper, we study memory hierarchy implications for the design of high-speed datacenter interconnects, particularly as they affect remote memory access, and we use PCIe as the vehicle for our investigations. To that end, we build three complementary platforms: a PCIe-interconnected prototype server with which we measure and analyze current bottlenecks; a software simulator that lets us model microarchitectural and cache hierarchy changes; and an FPGA prototype system with a streamlined, switchless, customized protocol, Thunder, with which we study hardware optimizations outside the processor. We highlight several architectural modifications to better support remote memory access and communication, and quantify their impact and limitations.