Understanding the GPU Microarchitecture to Achieve Bare-Metal Performance Tuning X Zhang, G Tan, S Xue, J Li, K Zhou, M Chen PPoPP, 2017 | 67 | 2017 |
A performance analysis framework for exploiting GPU microarchitectural capability K Zhou, G Tan, X Zhang, C Wang, N Sun Proceedings of the International Conference on Supercomputing, 1-10, 2017 | 21 | 2017 |
A Reliable Distributed Convolutional Neural Network for Biology Image Segmentation X Zhang, G Tan, M Chen CCGRID, 2015 | 8 | 2015 |
Vlock: Lock virtualization mechanism for exploiting fine-grained parallelism in graph traversal algorithms J Yan, G Tan, X Zhang, E Yao, N Sun Proceedings of the 2013 IEEE/ACM International Symposium on Code Generation …, 2013 | 6 | 2013 |
选择稀疏矩阵乘法最优存储格式的研究 李佳佳, 张秀霞, 谭光明, 陈明宇 计算机研究与发展 51 (4), 882-894, 2014 | 3 | 2014 |
A Case Study of 3D RTM-TTI Algorithm on Multicore and Many-core Platforms X Zhang, G Tan, M Chen High Technology Letters, 2017 | | 2017 |