System level cache prefetching algorithms for complex GPU workloads
Prefetching is a well known concept for CPUs but for GPUs it is fairly unexplored. The memory management of a GPU plays a crucial role in its performance, and cache prefetching has the potential to lower the overall latency. This thesis compares different types of prefetching methods for GPUs and remaking some CPU prefetchers to fit the GPU architecture. All these prefetchers were then put inside