Memory Request Priority Based Warp Scheduling for GPUs

ZHANG Jun; HE Yanxiang; SHEN Fanfan; LI Qing'an; TAN Hai

doi:10.1049/cje.2018.05.003

ZHANG Jun, HE Yanxiang, SHEN Fanfan, LI Qing'an, TAN Hai. Memory Request Priority Based Warp Scheduling for GPUs[J]. Chinese Journal of Electronics, 2018, 27(5): 985-994. DOI: 10.1049/cje.2018.05.003

Citation:

ZHANG Jun, HE Yanxiang, SHEN Fanfan, LI Qing'an, TAN Hai. Memory Request Priority Based Warp Scheduling for GPUs[J]. Chinese Journal of Electronics, 2018, 27(5): 985-994. DOI: 10.1049/cje.2018.05.003

Citation:

ZHANG Jun, HE Yanxiang, SHEN Fanfan, LI Qing'an, TAN Hai. Memory Request Priority Based Warp Scheduling for GPUs[J]. Chinese Journal of Electronics, 2018, 27(5): 985-994. DOI: 10.1049/cje.2018.05.003

Memory Request Priority Based Warp Scheduling for GPUs

Graphical Abstract

Graphical Abstract

Abstract

Abstract

High performance of GPGPU comes from its super massive multithreading, which makes it more and more widely used especially in the field of throughputoriented. Data locality is one of the important factors affecting the performance of GPGPU. Although GPGPU can exploit intra/inter-warp locality by itself in part, there is still large improvement space for that. In our work, we analyze the characteristics of different applications and propose memory request based warp scheduling to better exploit inter-warp spatial locality. This method can make some warps with good inter-warp locality run faster, which is beneficial to improve the whole performance. Our experimental results show that our proposed method can achieve 24.7% and 11.9% average performance improvement over LRR and MRPB respectively.