Low Overhead Remote Procedure Call System for Saturn DSP
Christiaan Banister
EECS Department, University of California, Berkeley
Technical Report No. UCB/EECS-2022-144
May 19, 2022
http://www2.eecs.berkeley.edu/Pubs/TechRpts/2022/EECS-2022-144.pdf
System-on-a-Chip designs with specialized processors and domain-specific accelerators have grown in popularity over the last decade to meet ever-increasing compute demands. This is due to the superior performance-per-watt that can be obtained from such designs as well as the inherent limitations in scaling up traditional out-of-order superscalar CPUs. However, a heavy reliance on microarchitectural details and a lack of coordination between software writers and hardware designers makes targeting these specialized IPs highly non-trivial. In addition, offloading tasks from the CPU to an accelerator introduces overheads that may nullify the advantage of utilizing it in the first place. This paper investigates a transparent method for allowing programmers to run user code on digital signal processing units (DSPs) and quantifies the impact of such an offload.
We propose a lightweight framework at the system software level in the style of message passing Asymmetric Multiprocessing kernels. By pairing a large symmetric multiprocessing kernel with a lightweight real-time operating system, we find that we can achieve low-latency accelerator communication while maintaining a relatively straightforward programming model.
Advisors: Krste Asanović
BibTeX citation:
@mastersthesis{Banister:EECS-2022-144, Author= {Banister, Christiaan}, Title= {Low Overhead Remote Procedure Call System for Saturn DSP}, School= {EECS Department, University of California, Berkeley}, Year= {2022}, Month= {May}, Url= {http://www2.eecs.berkeley.edu/Pubs/TechRpts/2022/EECS-2022-144.html}, Number= {UCB/EECS-2022-144}, Abstract= {System-on-a-Chip designs with specialized processors and domain-specific accelerators have grown in popularity over the last decade to meet ever-increasing compute demands. This is due to the superior performance-per-watt that can be obtained from such designs as well as the inherent limitations in scaling up traditional out-of-order superscalar CPUs. However, a heavy reliance on microarchitectural details and a lack of coordination between software writers and hardware designers makes targeting these specialized IPs highly non-trivial. In addition, offloading tasks from the CPU to an accelerator introduces overheads that may nullify the advantage of utilizing it in the first place. This paper investigates a transparent method for allowing programmers to run user code on digital signal processing units (DSPs) and quantifies the impact of such an offload. We propose a lightweight framework at the system software level in the style of message passing Asymmetric Multiprocessing kernels. By pairing a large symmetric multiprocessing kernel with a lightweight real-time operating system, we find that we can achieve low-latency accelerator communication while maintaining a relatively straightforward programming model.}, }
EndNote citation:
%0 Thesis %A Banister, Christiaan %T Low Overhead Remote Procedure Call System for Saturn DSP %I EECS Department, University of California, Berkeley %D 2022 %8 May 19 %@ UCB/EECS-2022-144 %U http://www2.eecs.berkeley.edu/Pubs/TechRpts/2022/EECS-2022-144.html %F Banister:EECS-2022-144