Impact of power management on temperature and reliability evolution for an embedded many-core architecture

Konferenz: ARCS 2011 - 24th International Conference on Architecture of Computing Systems
22.02.2011 - 23.02.2011 in Como, Italy

Tagungsband: ARCS 2011

Seiten: 4Sprache: EnglischTyp: PDF

Persönliche VDE-Mitglieder erhalten auf diesen Artikel 10% Rabatt

Autoren:
Carbon, M. Alexandre; Chehida, Karim Ben; David, Raphaël (CEA, LIST, Embedded Computing Lab, Gif-sur-Yvette, 91191, France)
Héron, Olivier (CEA, LIST, Embedded System Reliability Lab, Gif-sur-Yvette, 91191, France)

Inhalt:
Technology scaling and the increasing demand for performance have led to the development of many-core architectures in order to exploit parallelism and to avoid physical limits which started to appear in mono-processors (especially in terms of performance and power consumption). Although the design of these architectures is less aggressive (timing closure), the non-ideal technology scaling leads to an increase of power densities which cause chip temperature (hotspots, gradients) and reliability problems (faster aging and frequent intermittent errors). There is a strong relationship between reliability, temperature, power and performance. The presence of thermal hotspots and gradients (spatial and temporal) affects significantly the chip performance, power consumption, cost and reliability. The transistor switching and interconnect delays increase with temperature. Moreover, a temperature increase causes an increase in leakage power that also causes a temperature increase. This positive feedback loop can lead to chip destruction if the power dissipation is not controlled.