The company Numascale is one of the few companies offering shared memory systems with thousands of cores. To produce machines of such a size, a proprietary cache-coherent interconnect is used to couple standard servers into a large single system. In this work we investigate the ability of such a huge system to run OpenMP applications in an efficient way. Therefore, we use kernel benchmarks to investigate basic performance characteristics of the machine and we present a real world application from the Institute of Combustion Technology at RWTH Aachen University, called TrajSearch, which has been optimized to run efficient on such a large shared memory machine.
IOS Press, Inc.
6751 Tepper Drive
Clifton, VA 20124
Tel.: +1 703 830 6300
Fax: +1 703 830 2300 firstname.lastname@example.org
(Corporate matters and books only) IOS Press c/o Accucoms US, Inc.
For North America Sales and Customer Service
West Point Commons
Lansdale PA 19446
Tel.: +1 866 855 8967
Fax: +1 215 660 5042 email@example.com