Volgen
Xu Liu
Titel
Geciteerd door
Geciteerd door
Jaar
Evaluating modern gpu interconnect: Pcie, nvlink, nv-sli, nvswitch and gpudirect
A Li, SL Song, J Chen, J Li, X Liu, NR Tallent, KJ Barker
IEEE Transactions on Parallel and Distributed Systems 31 (1), 94-110, 2019
2222019
OMPT: An OpenMP tools application programming interface for performance analysis
AE Eichenberger, J Mellor-Crummey, M Schulz, M Wong, N Copty, ...
OpenMP in the Era of Low Power Devices and Accelerators: 9th International …, 2013
1292013
A tool to analyze the performance of multithreaded programs on NUMA architectures
X Liu, J Mellor-Crummey
ACM Sigplan Notices 49 (8), 259-272, 2014
882014
Locality-aware CTA clustering for modern GPUs
A Li, SL Song, W Liu, X Liu, A Kumar, H Corporaal
ACM SIGARCH Computer Architecture News 45 (1), 297-311, 2017
852017
Flep: Enabling flexible and efficient preemption on gpus
B Wu, X Liu, X Zhou, C Jiang
ACM SIGPLAN Notices 52 (4), 483-496, 2017
842017
Cvr: Efficient vectorization of spmv on x86 processors
B Xie, J Zhan, X Liu, W Gao, Z Jia, X He, L Zhang
Proceedings of the 2018 International Symposium on Code Generation and …, 2018
772018
memif Towards Programming Heterogeneous Memory Asynchronously
FX Lin, X Liu
ACM SIGPLAN Notices 51 (4), 369-383, 2016
732016
A data-centric profiler for parallel programs
X Liu, J Mellor-Crummey
Proceedings of the International Conference on High Performance Computing …, 2013
702013
Tartan: evaluating modern GPU interconnect via a multi-GPU benchmark suite
A Li, SL Song, J Chen, X Liu, N Tallent, K Barker
2018 IEEE International Symposium on Workload Characterization (IISWC), 191-202, 2018
622018
Scaanalyzer: A tool to identify memory scalability bottlenecks in parallel programs
X Liu, B Wu
Proceedings of the International Conference for High Performance Computing …, 2015
572015
Pinpointing data locality problems using data-centric analysis
X Liu, J Mellor-Crummey
International Symposium on Code Generation and Optimization (CGO 2011), 171-180, 2011
562011
Towards efficient spmv on sunway manycore architectures
C Liu, B Xie, X Liu, W Xue, H Yang, X Liu
Proceedings of the 2018 International Conference on Supercomputing, 363-373, 2018
532018
Cudaadvisor: Llvm-based runtime profiling for modern gpus
D Shen, SL Song, A Li, X Liu
Proceedings of the 2018 International Symposium on Code Generation and …, 2018
502018
Redspy: Exploring value locality in software
S Wen, M Chabbi, X Liu
Proceedings of the Twenty-Second International Conference on Architectural …, 2017
432017
Watching for software inefficiencies with witch
S Wen, X Liu, J Byrne, M Chabbi
Proceedings of the Twenty-Third International Conference on Architectural …, 2018
392018
Pinpointing data locality bottlenecks with low overhead
X Liu, J Mellor-Crummey
2013 IEEE International Symposium on Performance Analysis of Systems and …, 2013
382013
Call paths for pin tools
M Chabbi, X Liu, J Mellor-Crummey
Proceedings of Annual IEEE/ACM International Symposium on Code Generation …, 2014
372014
DR-BW: identifying bandwidth contention in NUMA architectures with supervised learning
H Xu, S Wen, A Gimenez, T Gamblin, X Liu
2017 IEEE International Parallel and Distributed Processing Symposium (IPDPS …, 2017
352017
Redundant loads: A software inefficiency indicator
P Su, S Wen, H Yang, M Chabbi, X Liu
2019 IEEE/ACM 41st International Conference on Software Engineering (ICSE …, 2019
342019
Characterizing emerging heterogeneous memory
D Shen, X Liu, FX Lin
ACM SIGPLAN Notices 51 (11), 13-23, 2016
342016
Het systeem kan de bewerking nu niet uitvoeren. Probeer het later opnieuw.
Artikelen 1–20