Optimizing dual-core execution for power efficiency and transient-fault recovery

Authors

    Authors

    Y. Ma; H. L. Gao; M. Dimitrov;H. Y. Zhou

    Comments

    Authors: contact us about adding a copy of your work at STARS@ucf.edu

    Abbreviated Journal Title

    IEEE Trans. Parallel Distrib. Syst.

    Keywords

    multiple data stream architectures; fault tolerance; low-power design; Computer Science, Theory & Methods; Engineering, Electrical & Electronic

    Abstract

    Dual-core execution (DCE) is an execution paradigm proposed to utilize chip multiprocessors to improve the performance of single-threaded applications. Previous research has shown that DCE provides a complexity-effective approach to building a highly scalable instruction window and achieves significant latency-hiding capabilities. In this paper, we propose to optimize DCE for power efficiency and/or transient-fault recovery. In DCE, a program is first processed (speculatively) in the front processor and then reexecuted by the back processor. Such reexecution is the key to eliminating the centralized structures that are normally associated with very large instruction windows. In this paper, we exploit the computational redundancy in DCE to improve its reliability and its power efficiency. The main contributions include: 1) DCE-based redundancy checking for transient-fault tolerance and a complexity-effective approach to achieving full redundancy coverage and 2) novel techniques to improve the power/energy efficiency of DCE-based execution paradigms. Our experimental results demonstrate that, with the proposed simple techniques, the optimized DCE can effectively achieve transient-fault tolerance or significant performance enhancement in a power/energy-efficient way. Compared to the original DCE, the optimized DCE has similar speedups (34 percent on average) over single-core processors while reducing the energy overhead from 93 percent to 31 percent.

    Journal Title

    Ieee Transactions on Parallel and Distributed Systems

    Volume

    18

    Issue/Number

    8

    Publication Date

    1-1-2007

    Document Type

    Article

    Language

    English

    First Page

    1080

    Last Page

    1093

    WOS Identifier

    WOS:000247541500006

    ISSN

    1045-9219

    Share

    COinS