Follow
Xulong Tang
Title
Cited by
Cited by
Year
Scheduling techniques for GPU architectures with processing-in-memory capabilities
A Pattnaik, X Tang, A Jog, O Kayiran, AK Mishra, MT Kandemir, O Mutlu, ...
Proceedings of the 2016 International Conference on Parallel Architectures …, 2016
2422016
Yolobile: Real-time object detection on mobile devices via compression-compilation co-design
Y Cai, H Li, G Yuan, W Niu, Y Li, X Tang, B Ren, Y Wang
Proceedings of the AAAI conference on artificial intelligence 35 (2), 955-963, 2021
1132021
Controlled kernel launch for dynamic parallelism in GPUs
X Tang, A Pattnaik, H Jiang, O Kayiran, A Jog, S Pai, M Ibrahim, ...
2017 IEEE International Symposium on High Performance Computer Architecture …, 2017
622017
Data movement aware computation partitioning
X Tang, O Kislal, M Kandemir, M Karakoy
Proceedings of the 50th Annual IEEE/ACM International Symposium on …, 2017
582017
Opportunistic computing in gpu architectures
A Pattnaik, X Tang, O Kayiran, A Jog, A Mishra, MT Kandemir, ...
Proceedings of the 46th international symposium on computer architecture …, 2019
562019
Algorithm-hardware co-design of attention mechanism on FPGA devices
X Zhang, Y Wu, P Zhou, X Tang, J Hu
ACM Transactions on Embedded Computing Systems (TECS) 20 (5s), 1-24, 2021
512021
Improving bank-level parallelism for irregular applications
X Tang, M Kandemir, P Yedlapalli, J Kotra
2016 49th Annual IEEE/ACM International Symposium on Microarchitecture …, 2016
512016
μC-States: Fine-grained GPU datapath power management
O Kayiran, A Jog, A Pattnaik, R Ausavarungnirun, X Tang, MT Kandemir, ...
Proceedings of the 2016 International Conference on Parallel Architectures …, 2016
512016
Automated runtime-aware scheduling for multi-tenant dnn inference on gpu
F Yu, S Bray, D Wang, L Shangguan, X Tang, C Liu, X Chen
2021 IEEE/ACM International Conference On Computer Aided Design (ICCAD), 1-9, 2021
482021
Memory row reuse distance and its role in optimizing application performance
M Kandemir, H Zhao, X Tang, M Karakoy
Proceedings of the 2015 ACM SIGMETRICS International Conference on …, 2015
332015
Oversubscribed command queues in GPUs
S Puthoor, X Tang, J Gross, BM Beckmann
Proceedings of the 11th Workshop on General Purpose GPUs, 50-60, 2018
302018
Optimizing off-chip accesses in multicores
W Ding, X Tang, M Kandemir, Y Zhang, E Kultursay
Proceedings of the 36th ACM SIGPLAN Conference on Programming Language …, 2015
272015
Parallelizing DNN training on GPUs: Challenges and opportunities
W Xu, Y Zhang, X Tang
Companion Proceedings of the Web Conference 2021, 174-178, 2021
212021
Enhancing computation-to-core assignment with physical location information
O Kislal, J Kotra, X Tang, MT Kandemir, M Jung
ACM SIGPLAN Notices 53 (4), 312-327, 2018
202018
Improving address translation in multi-gpus via sharing and spilling aware tlb design
B Li, J Yin, Y Zhang, X Tang
MICRO-54: 54th Annual IEEE/ACM International Symposium on Microarchitecture …, 2021
192021
Enabling latency-aware data initialization for integrated CPU/GPU heterogeneous platform
Z Wang, Z Jiang, Z Wang, X Tang, C Liu, S Yin, Y Hu
IEEE Transactions on Computer-Aided Design of Integrated Circuits and …, 2020
192020
FlexBFS: a parallelism-aware implementation of breadth-first search on GPU
G Liu, H An, W Han, X Li, T Sun, W Zhou, X Wei, X Tang
Proceedings of the 17th ACM SIGPLAN symposium on Principles and Practice of …, 2012
182012
Q-gpu: A recipe of optimizations for quantum circuit simulation using gpus
Y Zhao, Y Guo, Y Yao, A Dumi, DM Mulvey, S Upadhyay, Y Zhang, ...
2022 IEEE International Symposium on High-Performance Computer Architecture …, 2022
172022
Enhancing address translations in throughput processors via compression
X Tang, Z Zhang, W Xu, MT Kandemir, R Melhem, J Yang
Proceedings of the ACM International Conference on Parallel Architectures …, 2020
152020
DEMM: a Dynamic Energy-saving mechanism for Multicore Memories
A Sharifi, W Ding, D Guttman, H Zhao, X Tang, M Kandemir, C Das
2017 IEEE 25th International Symposium on Modeling, Analysis, and Simulation …, 2017
152017
The system can't perform the operation now. Try again later.
Articles 1–20