Farewell my Shared LLC! A Case for Private Die-Stacked DRAM Caches for Servers
Venue: MICRO 2018 Authors: Amna Shahab, Boris Grot, et. al. Introduction. Orthogonal to the conventional wisdom that the last level cache (LLC) in large-scale datacenters need to be shared among cores, this paper argues a case for private-LLC. They conduct experiments to analyze the effect of size, access latency, and inter-thread data sharing on LLC and conclude the following: 1. At a fixed latency, increasing LLC size from 8MB to 64MB results in less than 6% performance improvement, whereas increasing the size to 256MB results in 10%-20% improvement. This improvement is because the secondary working set stats to fit in the LLC. 2. Compared to an 8MB LLC at baseline latency, increasing LLC size to 512MB at 100% increase in latency results in a slowdown compared to the baseline. From this analysis they conclude that higher access latency is detrimental even if the LLC size is increased. 3. To analyze the sensitivity of data shared across threads, they analyze the different type...