| Title | Authors | Venue | Cited by | Year |
| --- | --- | --- | --- | --- |
| Compiler-assisted workload consolidation for efficient dynamic parallelism on GPU | H Wu, D Li, M Becchi | 2016 IEEE International Parallel and Distributed Processing Symposium (IPDPS …, 2016 | 26 | 2016 |
| Nested parallelism on GPU: Exploring parallelization templates for irregular loops and recursive computations | D Li, H Wu, M Becchi | 2015 44th International Conference on Parallel Processing, 979-988, 2015 | 23 | 2015 |
| A Compiler Framework for Fixed-Topology Non-Deterministic Finite Automata on SIMD Platforms | M Nourian, H Wu, M Becchi | 2018 IEEE 24th International Conference on Parallel and Distributed Systems …, 2018 | 8 | 2018 |
| Exploiting dynamic parallelism to efficiently support irregular nested loops on GPUs | D Li, H Wu, M Becchi | Proceedings of the 2015 International Workshop on Code Optimisation for …, 2015 | 8 | 2015 |
| Compiling SIMT Programs on Multi- and Many-Core Processors with Wide Vector Units: A Case Study with CUDA | H Wu, J Ravi, M Becchi | 2018 IEEE 25th International Conference on High Performance Computing (HiPC …, 2018 | 3 | 2018 |
| Evaluating thread coarsening and low-cost synchronization on Intel Xeon Phi | H Wu, M Becchi | 2020 IEEE International Parallel and Distributed Processing Symposium (IPDPS …, 2020 | 1 | 2020 |
| An Analytical Study of Recursive Tree Traversal Patterns on Multi- and Many-Core Platforms | H Wu, M Becchi | 2017 IEEE 23rd International Conference on Parallel and Distributed Systems …, 2017 | 1 | 2017 |
| Facilitating the Deployment of Irregular Applications on Parallel Manycore Architecture by Identifying Irregular Patterns | H Wu | North Carolina State University, 2021 | | 2021 |
| Compiler-assisted workload consolidation to efficiently exploit dynamic parallelism for recursive applications | H Wu | University of Missouri-Columbia, 2015 | | 2015 |