Tensor comprehensions: Framework-agnostic high-performance machine learning abstractions N Vasilache, O Zinenko, T Theodoridis, P Goyal, Z DeVito, WS Moses, ... arXiv preprint arXiv:1802.04730, 2018 | 530 | 2018 |
The next 700 accelerated layers: From mathematical expressions of network computation graphs to accelerated GPU kernels, automatically N Vasilache, O Zinenko, T Theodoridis, P Goyal, Z Devito, WS Moses, ... ACM Transactions on Architecture and Code Optimization (TACO) 16 (4), 1-26, 2019 | 70 | 2019 |
Finding missed optimizations through the lens of dead code elimination T Theodoridis, M Rigger, Z Su Proceedings of the 27th ACM International Conference on Architectural …, 2022 | 29 | 2022 |
Understanding and exploiting optimal function inlining T Theodoridis, T Grosser, Z Su Proceedings of the 27th ACM International Conference on Architectural …, 2022 | 22 | 2022 |
Inlining-benefit prediction with interprocedural partial escape analysis ME Weingarten, T Theodoridis, A Prokopec Proceedings of the 14th ACM SIGPLAN International Workshop on Virtual …, 2022 | 4 | 2022 |
Fast linear programming through transprecision computing on small and sparse data T Grosser, T Theodoridis, M Falkenstein, A Pitchanathan, M Kruse, ... Proceedings of the ACM on Programming Languages 4 (OOPSLA), 1-28, 2020 | 3 | 2020 |
Refined Input, Degraded Output: The Counterintuitive World of Compiler Behavior T Theodoridis, Z Su Proceedings of the ACM on Programming Languages 8 (PLDI), 671-691, 2024 | 1 | 2024 |
Boosting Compiler Testing by Injecting Real-World Code S Li, T Theodoridis, Z Su Proceedings of the ACM on Programming Languages 8 (PLDI), 223-245, 2024 | 1 | 2024 |