Chao Wang
Chao Wang
Scientist, National Center for Computational Sciences, Oak Ridge National Laboratory
在 ornl.gov 的电子邮件经过验证
标题引用次数年份
Proactive process-level live migration in HPC environments
C Wang, F Mueller, C Engelmann, SL Scott
Proceedings of the 2008 ACM/IEEE conference on Supercomputing, 43, 2008
1902008
A job pause service under LAM/MPI+ BLCR for transparent fault tolerance
C Wang, F Mueller, C Engelmann, SL Scott
Parallel and Distributed Processing Symposium, 2007. IPDPS 2007. IEEE …, 2007
982007
NVMalloc: Exposing an aggregate SSD store as a memory partition in extreme-scale machines
C Wang, SS Vazhkudai, X Ma, F Meng, Y Kim, C Engelmann
2012 IEEE 26th International Parallel and Distributed Processing Symposium …, 2012
622012
Hybrid checkpointing for MPI jobs in HPC environments
C Wang, F Mueller, C Engelmann, SL Scott
Parallel and Distributed Systems (ICPADS), 2010 IEEE 16th International …, 2010
492010
Proactive process-level live migration and back migration in HPC environments
C Wang, F Mueller, C Engelmann, SL Scott
Journal of Parallel and Distributed Computing 72 (2), 254-267, 2012
402012
Proactive Process-Level Live Migration and Back Migration in HPC Environments
C Wanga, F Muellera, C Engelmannb, SL Scottb
40*
Optimizing center performance through coordinated data staging, scheduling and recovery
Z Zhang, C Wang, SS Vazhkudai, X Ma, GG Pike, JW Cobb, F Mueller
Proceedings of the 2007 ACM/IEEE conference on Supercomputing, 55, 2007
352007
Scalable, fault tolerant membership for MPI tasks on HPC systems
J Varma, C Wang, F Mueller, C Engelmann, SL Scott
Proceedings of the 20th annual international conference on Supercomputing …, 2006
332006
Hybrid full/incremental checkpoint/restart for MPI jobs in HPC environments
C Wang, F Mueller, C Engelmann, SL Scott
Dept. of Computer Science, North Carolina State University, 2009
182009
Improving the availability of supercomputer job input data using temporal replication
C Wang, Z Zhang, X Ma, SS Vazhkudai, F Mueller
Computer Science-Research and Development 23 (3-4), 149-157, 2009
162009
MOLAR: Adaptive runtime support for high-end computing operating and runtime systems
C Engelmann, SL Scott, DE Bernholdt, NR Gottumukkala, C Leangsuksun, ...
ACM SIGOPS Operating Systems Review 40 (2), 63-72, 2006
152006
A tunable holistic resiliency approach for high-performance computing systems
SL Scott, C Engelmann, GR Vallée, T Naughton, A Tikotekar, ...
ACM Sigplan Notices 44 (4), 305-306, 2009
122009
On-the-fly recovery of job input data in supercomputers
C Wang, Z Zhang, SS Vazhkudai, X Ma, F Mueller
Parallel Processing, 2008. ICPP'08. 37th International Conference on, 620-627, 2008
52008
Transparent fault tolerance for job healing in hpc environments
C Wang
North Carolina State University, 2009
42009
Understanding object-level memory access patterns across the spectrum
X Ji, C Wang, N El-Sayed, X Ma, Y Kim, SS Vazhkudai, W Xue, ...
Proceedings of the International Conference for High Performance Computing …, 2017
22017
Transparent Fault Tolerance for Job Input Data in HPC Environments
C Wang, SS Vazhkudai, X Ma, F Mueller
22014
Hybrid Full/Incremental Checkpoint/Restart for MPI Jobs in HPC Environments
W Chao, F Mueller, C Engelmann
Proc. of the 16th International Conference on Parallel and Distributed …, 2011
22011
GPFS Evaluation Report
C Wang
Technical Report, National Center for Computational Sciences, Oak Ridge …, 2016
2016
A Study on Application Heap Object-level Memory Access Patterns
X Ji, C Wang, X Ma, S Vazhkudai, Y Kim
Technical Report, National Center for Computational Sciences, Oak Ridge …, 2016
2016
Back-Migration for MPI Jobs in HPC Environments
C Wang, F Mueller, C Engelmann, SL Scott
Forum to Address Scalable Technology for Runtime and Operating Systems (FastOS), 2009
2009
系统目前无法执行此操作,请稍后再试。
文章 1–20