メッシュ操作の機能と性能の分離を可能にするタスク処理フレームワークを提案する。このフレームワークにより、スレッド管理とスケジューリングの決定を一般的で再利用可能なタスク処理フレームワークに引き上げることができる。
The presented tasking framework enables separation of concerns between functionality and performance aspects of parallel mesh generation, leading to improved performance and portability.
GPU parallel computing can significantly accelerate the computationally intensive iterative reconstruction process in photoacoustic imaging, enabling faster image processing and broader adoption of this promising medical imaging technology.
INSPIRIT, an efficient and lightweight scheduling framework with adaptive priority, is proposed to optimize task scheduling in task-based runtime systems on heterogeneous hardware. INSPIRIT introduces two novel task attributes - inspiring ability and inspiring efficiency - to determine task priorities, eliminating the need for application domain knowledge. INSPIRIT also jointly considers runtime information such as the number of ready tasks in worker queues to guide task scheduling, exposing more performance opportunities in heterogeneous hardware while reducing overhead.
The article presents optimally fast O(log p) time algorithms for computing round-optimal broadcast schedules for message-passing parallel computing systems, where n indivisible blocks of data need to be broadcast from a root processor to all other processors in a fully connected network of p processors.
Stackless coroutines in C++20 enable fully-portable continuation stealing, leading to optimal time/memory scaling in parallel computing.
Libfork enables fully-portable continuation stealing using stackless coroutines, achieving optimal time/memory scaling in parallel computing.