Task support was introduced into OpenMP to address irregular parallelism in shared memory architectures. Creating tasks that are extremely fine granular in applications, however, impedes performance. In this paper, a methodology for analyzing the performance of task-based OpenMP programs and its implementation in Periscope is presented. The paper unveils and concentrates on the newly formulated high-level performance properties that formalize typical performance bottlenecks of task-based programs. In addition, the paper reports on the experimental results which were accomplished for the codes of the Barcelona OpenMP Tasks Suite (BOTS) using Periscope in the SuperMUC supercomputing machine at Garching, Germany.
IOS Press, Inc.
6751 Tepper Drive
Clifton, VA 20124
Tel.: +1 703 830 6300
Fax: +1 703 830 2300 firstname.lastname@example.org
(Corporate matters and books only) IOS Press c/o Accucoms US, Inc.
For North America Sales and Customer Service
West Point Commons
Lansdale PA 19446
Tel.: +1 866 855 8967
Fax: +1 215 660 5042 email@example.com