Simulation of multicore vector dataflow processor with shared memory

Автор: Dikarev Nikolay Ivanovich, Shabanov Boris Mikhaylovich, Shmelev Aleksandr Sergeyevich

Журнал: Программные системы: теория и приложения @programmnye-sistemy

Рубрика: Программное и аппаратное обеспечение для супер ЭВМ

Статья в выпуске: 1 (36) т.9, 2018 года.

Бесплатный доступ

A Dataflow processor can execute up to 16 instructions per cycle compared to 4 to 6 instructions of the best von Neumann processors. Simulation of the vector dataflow processor (VDP) showed that it is possible to raise its core vector performance up to 256 flops per clock, and using modern manufacturing process to implement up to 4 such cores on a single die. Simulation results of the matrix multiplication program and 2D Stencil on double core VDP with shared memory are given in this paper. It is shown that the matrix multiplication program scales well on VDP, while the performance of 2D Stencil is limited by the shared memory bandwidth. (In Russian). (in Russian).

Еще

2d stancil, 2d stencil, dataflow architecture, matrix multiplication, performance evaluation, supercomputer, vector processor

Короткий адрес: https://sciup.org/143164297

IDR: 143164297   |   DOI: 10.25209/2079-3316-2017-9-1-37-52

Статья научная