Author

Bo Wu

Colorado School of Mines - Cited by 1,138 - Compiler Optimization - Heterogeneous System - Profiling

Biography

Dr. Bo Wu is a Professor at the Key Laboratory of Ministry of Education for Microbial and Plant Genetic Engineering and the College of Life Science and Technology, Guangxi University, 100 Daxue East Road, Nanning, Guangxi 530004, China.
Title
Cited by
Year
Complexity Analysis and Algorithm Design for Reorganizing Data to Minimize Non-Coalesced GPU Memory Accesses
B Wu, Z Zhao, E Zhang, Y Jiang, X Shen. ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2013
127
2013
Enabling and Exploiting Flexible Task Assignment on GPU through SM-Centric Program Transformations
B Wu, G Chen, D Li, X Shen, J Vetter. The 29th International Conference on Supercomputing, 2015
97
2015
PORPLE: An Extensible Optimizer for Portable Data Placement on GPU
G Chen, B Wu, D Li, X Shen. The 47th Annual IEEE/ACM International Symposium on Microarchitecture, 2014
81
2014
Can PCM Benefit GPU? Reconciling Hybrid Memory Design with GPU Massive Parallelism for Energy Efficiency
B Wang, B Wu, D Li, X Shen, W Yu, Y Jiao, J Vetter. The 22nd International Conference on Parallel Architectures and Compilation …, 2013
80
2013
FLEP: Enabling flexible and efficient preemption on GPUs
B Wu, X Liu, X Zhou, C Jiang. ACM SIGPLAN Notices 52 (4), 483-496, 2017
76
2017
Automine: harmonizing high-level abstraction and high performance for graph mining
D Mawhirter, B Wu. Proceedings of the 27th ACM Symposium on Operating Systems Principles, 509-523, 2019
62
2019
Graphie: Large-scale asynchronous graph traversals on just a GPU
W Han, D Mawhirter, B Wu, M Buland. 2017 26th International Conference on Parallel Architectures and Compilation …, 2017
61
2017
FinePar: Irregularity-aware fine-grained workload partitioning on integrated architectures
F Zhang, B Wu, J Zhai, B He, W Chen. 2017 IEEE/ACM International Symposium on Code Generation and Optimization …, 2017
55
2017
ScaAnalyzer: A Tool to Identify Memory Scalability Bottlenecks in Parallel Programs
X Liu, B Wu. The International Conference for High Performance Computing, Networking …, 2015
53
2015
GRNN: Low-latency and scalable RNN inference on GPUs
C Holmes, D Mawhirter, Y He, F Yan, B Wu. Proceedings of the Fourteenth EuroSys Conference 2019, 1-16, 2019
49
2019
Challenging the "embarrassingly sequential": parallelizing finite state machine-based computations through principled speculation
Z Zhao, B Wu, X Shen. ACM SIGARCH Computer Architecture News 42 (1), 543-558, 2014
48
2014
Laius: Towards latency awareness and improved utilization of spatial multitasking accelerators in datacenters
W Zhang, W Cui, K Fu, Q Chen, DE Mawhirter, B Wu, C Li, M Guo. Proceedings of the ACM international conference on supercomputing, 58-68, 2019
36
2019
Co-run scheduling with power cap on integrated CPU-GPU systems
Q Zhu, B Wu, X Shen, L Shen, Z Wang. 2017 IEEE International Parallel and Distributed Processing Symposium (IPDPS …, 2017
30
2017
Enhancing Data Locality for Dynamic Simulations through Asynchronous Data Transformations and Adaptive Control
B Wu, EZ Zhang, X Shen. The Twentieth International Conference on Parallel Architectures and …, 2011
25
2011
Automatic irregularity-aware fine-grained workload partitioning on integrated architectures
F Zhang, J Zhai, B Wu, B He, W Chen, X Du. IEEE Transactions on Knowledge and Data Engineering 33 (3), 867-881, 2019
24
2019
Graphzero: Breaking symmetry for efficient graph mining
D Mawhirter, S Reinehr, C Holmes, T Liu, B Wu. arXiv preprint arXiv:1911.12877, 2019
24
2019
Graphphi: efficient parallel graph processing on emerging throughput-oriented architectures
Z Peng, A Powell, B Wu, T Bicer, B Ren. Proceedings of the 27th International Conference on Parallel Architectures …, 2018
23
2018
Enabling scalability-sensitive speculative parallelization for FSM computations
J Qiu, Z Zhao, B Wu, A Vishnu, SL Song. Proceedings of the International Conference on Supercomputing, 1-10, 2017
18
2017
Simple Profile Rectifications Go a Long Way: Statistically Exploring and Alleviating the Effects of Sampling Errors for Program Optimizations
B Wu, M Zhou, X Shen, Y Gao, R Silvera, G Yiu. ECOOP 2013–Object-Oriented Programming: 27th European Conference …, 2013
18
2013
Optimizing data placement on GPU memory: A portable approach
G Chen, X Shen, B Wu, D Li. IEEE Transactions on Computers 66 (3), 473-487, 2016
17
2016