Alexander Heinecke
Senior Principal Engineer at Intel Labs
Verified email at intel.com
Title
Cited by
Year
The ELPA library: scalable parallel eigenvalue solutions for electronic structure theory and computational science
A Marek, V Blum, R Johanni, V Havu, B Lang, T Auckenthaler, A Heinecke, ...
Journal of Physics: Condensed Matter 26 (21), 213201, 2014
Cited by: 259, Year: 2014
Design and implementation of the Linpack benchmark for single and multi-node systems based on Intel® Xeon Phi coprocessor
A Heinecke, K Vaidyanathan, M Smelyanskiy, A Kobotov, R Dubtsov, ...
2013 IEEE 27th International Symposium on Parallel and Distributed …, 2013
Cited by: 212, Year: 2013
A study of BFLOAT16 for deep learning training
D Kalamkar, D Mudigere, N Mellempudi, D Das, K Banerjee, S Avancha, ...
arXiv preprint arXiv:1905.12322, 2019
Cited by: 198, Year: 2019
Mixed precision training of convolutional neural networks using integer operations
D Das, N Mellempudi, D Mudigere, D Kalamkar, S Avancha, K Banerjee, ...
arXiv preprint arXiv:1802.00930, 2018
Cited by: 163, Year: 2018
LIBXSMM: accelerating small matrix multiplications by runtime code generation
A Heinecke, G Henry, M Hutchinson, H Pabst
SC'16: Proceedings of the International Conference for High Performance …, 2016
Cited by: 162, Year: 2016
Petascale high order dynamic rupture earthquake simulations on heterogeneous supercomputers
A Heinecke, A Breuer, S Rettenberger, M Bader, AA Gabriel, C Pelties, ...
SC'14: Proceedings of the International Conference for High Performance …, 2014
Cited by: 141, Year: 2014
ls1 mardyn: The Massively Parallel Molecular Dynamics Code for Large Systems
C Niethammer, S Becker, M Bernreuther, M Buchholz, W Eckhardt, ...
Journal of Chemical Theory and Computation 10 (10), 4455-4464, 2014
Cited by: 134, Year: 2014
591 TFLOPS multi-trillion particles simulation on SuperMUC
W Eckhardt, A Heinecke, R Bader, M Brehm, N Hammer, H Huber, ...
Supercomputing: 28th International Supercomputing Conference, ISC 2013 …, 2013
Cited by: 98, Year: 2013
Anatomy of high-performance deep learning convolutions on SIMD architectures
E Georganas, S Avancha, K Banerjee, D Kalamkar, G Henry, H Pabst, ...
SC18: International Conference for High Performance Computing, Networking …, 2018
Cited by: 93, Year: 2018
From GPGPU to many-core: Nvidia Fermi and Intel Many Integrated Core architecture
A Heinecke, M Klemm, HJ Bungartz
Computing in Science & Engineering 14 (2), 78-83, 2012
Cited by: 87, Year: 2012
Sustained petascale performance of seismic simulations with SeisSol on SuperMUC
A Breuer, A Heinecke, S Rettenberger, M Bader, AA Gabriel, C Pelties
Supercomputing: 29th International Conference, ISC 2014, Leipzig, Germany …, 2014
Cited by: 76, Year: 2014
Efficient shared-memory implementation of high-performance conjugate gradient benchmark and its application to unstructured matrices
J Park, M Smelyanskiy, K Vaidyanathan, A Heinecke, DD Kalamkar, X Liu, ...
SC'14: Proceedings of the International Conference for High Performance …, 2014
Cited by: 62, Year: 2014
Performance optimizations for scalable implicit RANS calculations with SU2
TD Economon, D Mudigere, G Bansal, A Heinecke, F Palacios, J Park, ...
Computers & Fluids 129, 146-158, 2016
Cited by: 50, Year: 2016
DistGNN: Scalable distributed training for large-scale graph neural networks
V Md, S Misra, G Ma, R Mohanty, E Georganas, A Heinecke, D Kalamkar, ...
Proceedings of the International Conference for High Performance Computing …, 2021
Cited by: 49, Year: 2021
Leveraging the bfloat16 artificial intelligence datatype for higher-precision computations
G Henry, PTP Tang, A Heinecke
2019 IEEE 26th Symposium on Computer Arithmetic (ARITH), 69-76, 2019
Cited by: 48, Year: 2019
Petascale local time stepping for the ADER-DG finite element method
A Breuer, A Heinecke, M Bader
2016 IEEE International Parallel and Distributed Processing Symposium (IPDPS …, 2016
Cited by: 45, Year: 2016
Parallel matrix multiplication based on space-filling curves on shared memory multicore platforms
A Heinecke, M Bader
Proceedings of the 2008 workshop on Memory access on future processors: a …, 2008
Cited by: 45, Year: 2008
High order seismic simulations on the Intel Xeon Phi processor (Knights Landing)
A Heinecke, A Breuer, M Bader, P Dubey
High Performance Computing: 31st International Conference, ISC High …, 2016
Cited by: 40, Year: 2016
Option pricing with a direct adaptive sparse grid approach
HJ Bungartz, A Heinecke, D Pflüger, S Schraufstetter
Journal of Computational and Applied Mathematics 236 (15), 3741-3750, 2012
Cited by: 38, Year: 2012
Extending a Highly Parallel Data Mining Algorithm to the Intel® Many Integrated Core Architecture
A Heinecke, M Klemm, D Pflüger, A Bode, HJ Bungartz
Euro-Par 2011: Parallel Processing Workshops: CCPI, CGWS, HeteroPar, HiBB …, 2012
Cited by: 38, Year: 2012