Error Correction Code protected Data Processing Units

Nicoleta Cucu Laurenciu, Tushar Gupta, Valentin Savin, Sorin Cotofana

Research output: Chapter in Book/Conference proceedings/Edited volumeConference contributionScientificpeer-review

5 Citations (Scopus)

Abstract

The significant uncertainty associated with current nanodevices fabrication and operation, calls for a circuit design paradigm change, which ought to actively embrace the inherently nanodevice unreliability to generate overall circuit architectures able to perform reliable computation. While for data storage units viable solutions exist, Data Processing Units (DPUs) are not amenable to a similar line of reasoning. The typical approach undertaken for fault-tolerant DPUs relies on modular redundancy (e.g., spatial, temporal), which while being effective from an error tolerance perspective, generally involves high area and/or performance impairments. This paper proposes a generic methodology to obtain reliable DPU implementations built with unreliable components by intimately intertwining Error Correcting Codes (ECCs) codecs with the DPU functionality. The ECC protected DPU architecture is derived cluster-wise with area and reliability constraints, by exploiting dependence relations (logical and w.r.t. shared area) between internal signals pertaining to the DPU and the ECC codec. To evaluate the error rate and performance implications, a multitude of test corners were considered (e.g., gate criticality, ECC type and structure, faulty and low complexity decoder, time-space redundancy) for an ECC protected 6-bit adder architecture. Simulation results reveal that the ECC embedding approach can be effective from both error rate and area perspective, for the Pareto designs with performance figures of merit situated in-between consecutive modular redundancy based design corresponding curves. The proposed approach is generic from the coding point of view, scalable, and enables a fine grained control of the DPU desired reliability degree and area overhead.

Original languageEnglish
Title of host publication2016 IEEE/ACM International Symposium on Nanoscale Architectures (NANOARCH)
EditorsW. Zhao, C.A. Moritz
Place of PublicationNew York
PublisherAssociation for Computing Machinery (ACM)
Pages37-42
Number of pages6
ISBN (Electronic)978-1-4503-4330-5
ISBN (Print)978-1-4673-8927-3
Publication statusPublished - 2016
Event2016 IEEE/ACM International Symposium on Nanoscale Architectures (NANOARCH) - Beijing, China
Duration: 18 Jul 201620 Jul 2016

Conference

Conference2016 IEEE/ACM International Symposium on Nanoscale Architectures (NANOARCH)
Abbreviated titleNANOARCH 2016
Country/TerritoryChina
CityBeijing
Period18/07/1620/07/16

Keywords

  • reliable computing
  • unreliable devices
  • ECC
  • reliable functional units

Fingerprint

Dive into the research topics of 'Error Correction Code protected Data Processing Units'. Together they form a unique fingerprint.

Cite this