Effective compiler support for predicated execution using the hyperblock SA Mahlke, DC Lin, WY Chen, RE Hank, RA Bringmann ACM SIGMICRO Newsletter 23 (1-2), 45-54, 1992 | 888 | 1992 |
The superblock: An effective technique for VLIW and superscalar compilation WMW Hwu, SA Mahlke, WY Chen, PP Chang, NJ Warter, RA Bringmann, ... Instruction-Level Parallelism: A Special Issue of The Journal of …, 2011 | 855 | 2011 |
{COMET}: Code offload by migrating execution transparently MS Gordon, DA Jamshidi, S Mahlke, ZM Mao, X Chen Presented as part of the 10th {USENIX} Symposium on Operating Systems Design …, 2012 | 529 | 2012 |
IMPACT: An architectural framework for multiple-instruction-issue processors PP Chang, SA Mahlke, WY Chen, NJ Warter, WW Hwu ACM SIGARCH Computer Architecture News 19 (3), 266-275, 1991 | 505 | 1991 |
Scalpel: Customizing dnn pruning to the underlying hardware parallelism J Yu, A Lukefahr, D Palframan, G Dasika, R Das, S Mahlke ACM SIGARCH Computer Architecture News 45 (2), 548-560, 2017 | 409 | 2017 |
Using profile information to assist classic code optimizations PP Chang, SA Mahlke, WMW Hwu Software: Practice and Experience 21 (12), 1301-1321, 1991 | 344 | 1991 |
Sage: Self-tuning approximation for graphics engines M Samadi, J Lee, DA Jamshidi, A Hormati, S Mahlke Proceedings of the 46th Annual IEEE/ACM International Symposium on …, 2013 | 340 | 2013 |
Shoestring: probabilistic soft error reliability on the cheap S Feng, S Gupta, A Ansari, S Mahlke ACM SIGARCH Computer Architecture News 38 (1), 385-396, 2010 | 336 | 2010 |
Soda: A low-power architecture for software radio Y Lin, H Lee, M Woh, Y Harel, S Mahlke, T Mudge, C Chakrabarti, ... ACM SIGARCH Computer Architecture News 34 (2), 89-101, 2006 | 324 | 2006 |
Processor acceleration through automated instruction set customization N Clark, H Zhong, S Mahlke Proceedings. 36th Annual IEEE/ACM International Symposium on …, 2003 | 289 | 2003 |
Reliable systems on unreliable fabrics T Austin, V Bertacco, S Mahlke, Y Cao IEEE Design & Test of Computers 25 (4), 322-332, 2008 | 278 | 2008 |
Paraprox: Pattern-based approximation for data parallel applications M Samadi, DA Jamshidi, J Lee, S Mahlke Proceedings of the 19th international conference on Architectural support …, 2014 | 271 | 2014 |
Orchestrating the execution of stream programs on multicore platforms M Kudlur, S Mahlke ACM SIGPLAN Notices 43 (6), 114-124, 2008 | 269 | 2008 |
A comparison of full and partial predicated execution support for ILP processors SA Mahlke, RE Hank, JE McCormick, DI August, WMW Hwu Proceedings of the 22nd annual international symposium on Computer …, 1995 | 263 | 1995 |
BulletProof: A defect-tolerant CMP switch architecture K Constantinides, S Plaza, J Blome, B Zhang, V Bertacco, S Mahlke, ... The Twelfth International Symposium on High-Performance Computer …, 2006 | 252 | 2006 |
Edge-centric modulo scheduling for coarse-grained reconfigurable architectures H Park, K Fan, SA Mahlke, T Oh, H Kim, H Kim Proceedings of the 17th international conference on Parallel architectures …, 2008 | 249 | 2008 |
Application-specific processing on a general-purpose core via transparent instruction set customization N Clark, M Kudlur, H Park, S Mahlke, K Flautner 37th international symposium on microarchitecture (MICRO-37'04), 30-40, 2004 | 240 | 2004 |
Profile‐guided automatic inline expansion for C programs PP Chang, SA Mahlke, WY Chen, WMW Hwu Software: Practice and Experience 22 (5), 349-369, 1992 | 238 | 1992 |
An architecture framework for transparent instruction set customization in embedded processors N Clark, J Blome, M Chu, S Mahlke, S Biles, K Flautner 32nd International Symposium on Computer Architecture (ISCA'05), 272-283, 2005 | 212 | 2005 |
Composite cores: Pushing heterogeneity into a core A Lukefahr, S Padmanabha, R Das, FM Sleiman, R Dreslinski, ... 2012 45th Annual IEEE/ACM international symposium on microarchitecture, 317-328, 2012 | 211 | 2012 |