Latency-tolerant virtual cluster architecture for VLIW DSP

Pi Chen Hsiao*, Tay Jyi Lin, Chih-Wei Liu, Chein Wei Jen

*此作品的通信作者

    研究成果: Conference article同行評審

    摘要

    This paper proposes a virtual cluster architecture, which executes multi-cluster VLIW programs with a reduced number of clusters in a time-sharing fashion. The interleaved sub-VLIWs help to hide instruction latencies significantly, and thus the proposed virtual cluster will have advantages of (1) reduced forwarding complexity in the processor datapath, (2) improved programming model for further code optimizations, and (3) supporting composite instructions without any extra functional unit. In our experiments with a 4-cluster VLIW DSP, the 28 forwarding paths inside a cluster are completely eliminated, which contributes to savings of 21.71% delay and 17.56% silicon area. Moreover, the virtual cluster has been verified to have better efficiency on its code sizes and execution times for its improved programming model for various DSP kernels.

    原文English
    文章編號4253436
    頁(從 - 到)3506-3509
    頁數4
    期刊Proceedings - IEEE International Symposium on Circuits and Systems
    DOIs
    出版狀態Published - 2007
    事件2007 IEEE International Symposium on Circuits and Systems, ISCAS 2007 - New Orleans, LA, United States
    持續時間: 27 5月 200730 5月 2007

    指紋

    深入研究「Latency-tolerant virtual cluster architecture for VLIW DSP」主題。共同形成了獨特的指紋。

    引用此