A Decoder For Short BCH Codes With High Decoding Efficiency and Low Power For Emerging Memories
A Decoder For Short BCH Codes With High Decoding Efficiency and Low Power For Emerging Memories
Abstract— In this paper, a double-error-correcting and triple- high-density, and low-latency characteristics [1]. In addition to
error-detecting (DEC-TED) Bose–Chaudhuri–Hocquenghem SCMs, some emerging memories, such as STT-MRAM, are
(BCH) code decoder with high decoding efficiency and low also considered promising candidate embedded memories due
power for error correction in emerging memories is presented.
To increase the decoding efficiency, we propose an adaptive to their fast read and write latencies, low leakage power, and
error correction technique for the DEC-TED BCH code that logic-friendly compatibility [2], [3].
detects the number of errors in a codeword immediately after As technology scales down, these emerging memories are
syndrome generation and applies a different error correction also struggling with reduced reliability, and as a solution,
algorithm depending on the error conditions. With the adaptive error-correcting code (ECC) and its encoder/decoder circuits
error correction technique, the average decoding latency and
power consumption are significantly reduced owing to the have been applied. While NAND flash requires a powerful ECC
increased decoding efficiency. To further reduce the power capable of correcting up to 100 errors, most of the emerging
consumption, an invalid-transition-inhibition technique is memories can reach the required chip yield using an ECC
proposed to remove the invalid transitions caused by glitches of capable of correcting two or three errors because of new
syndrome vectors in the error-finding block. Synthesis results developments in storage physics [2]–[8]. In addition to simply
with an industry-compatible 65-nm technology library show
that the proposed decoders for the (79, 64, 6) BCH code take increasing the memory yield, ECC can be used to optimize
only 37%–48% average decoding latency and achieve more memory performance regarding density [9], [10] and energy
than 70% power reduction compared to the conventional consumption [11], [12]. In this manner, ECC has become an
fully parallel decoder under the 10−4 –10−2 raw bit-error essential part of emerging memories.
rate. To correct two or three errors, the Bose–Chaudhuri–
Index Terms— Adaptive error correction, Bose–Chaudhuri– Hocquenghem (BCH) code is widely adopted for emerging
Hocquenghem (BCH) code, double-error-correcting and memories [2]–[8]. However, the standard iterative and sequen-
triple-error-detecting (DEC-TED), emerging memories, error- tial decoding processes, which require multiple cycles, are
correcting code (ECC), invalid transition inhibition.
not compatible with emerging memories. This is because the
latency of the BCH code decoder should be a few nanosec-
onds, considering the short read or write access time in emerg-
I. I NTRODUCTION
ing memories. To achieve a double-error-correcting (DEC)
CHOI et al.: DECODER FOR SHORT BCH CODES WITH HIGH DECODING EFFICIENCY AND LOW POWER FOR EMERGING MEMORIES 3
CHOI et al.: DECODER FOR SHORT BCH CODES WITH HIGH DECODING EFFICIENCY AND LOW POWER FOR EMERGING MEMORIES 5
CHOI et al.: DECODER FOR SHORT BCH CODES WITH HIGH DECODING EFFICIENCY AND LOW POWER FOR EMERGING MEMORIES 7
Fig. 7. Block diagram of the proposed high-decoding-efficiency and low-power DEC-TED decoder.
TABLE V
C OMPARISON OF THE C ONVENTIONAL PA-BASED AND LUT-BASED
D ECODERS AND THE P ROPOSED PA-BASED
AND LUT-B ASED R EALIZATIONS
CHOI et al.: DECODER FOR SHORT BCH CODES WITH HIGH DECODING EFFICIENCY AND LOW POWER FOR EMERGING MEMORIES 9
Fig. 12. Power consumption comparison of the conventional decoders and the proposed decoders. (a) PA-based decoder. (b) LUT-based decoder.
by the elimination of invalid transitions. Note that the proposed [9] B. Del Bel, J. Kim, C. H. Kim, and S. S. Sapatnekar, “Improving
LUT-based decoder consumes slightly less power than the STT-MRAM density through multibit error correction,” in Proc.
IEEE/ACM Conf. Design, Autom. Test Eur. (DATE), Mar. 2014, pp. 1–6.
proposed PA-based decoder because fewer changes occur in [10] Z. Pajouhi, X. Fong, and K. Roy, “Device/circuit/architecture co-design
the internal nodes by only one activated path corresponding of reliable STT-MRAM,” in Proc. IEEE/ACM Conf. Design, Autom. Test
to the input syndrome vector in the LUT. Based on the area Eur. (DATE), Mar. 2015, pp. 1437–1442.
[11] Y. Alkabani, Z. Koopmans, H. Xu, A. K. Jones, and R. Melhem, “Write
and power results of the proposed decoders, the proposed pulse scaling for energy efficient STT-MRAM,” in Proc. IEEE Comput.
PA-based decoder can be implemented with three times less Soc. Annu. Symp. VLSI (ISVLSI), Jul. 2016, pp. 248–253.
area and similar power consumption in comparison to the pro- [12] X. Wang, M. Mao, E. Eken, W. Wen, H. Li, and Y. Chen, “Sliding
posed LUT-based decoder. Therefore, the proposed PA-based basket: An adaptive ECC scheme for runtime write failure suppression
of STT-RAM cache,” in Proc. IEEE/ACM Conf. Design, Autom. Test
decoder is a more favorable option in any applications than Eur. (DATE), Mar. 2016, pp. 762–767.
the LUT-based decoder, unlike the conventional decoders. [13] X. Wang, D. Wu, C. Hu, L. Pan, and R. Zhou, “Embedded high-speed
BCH decoder for new-generation NOR flash memories,” in Proc. IEEE
Custom Integr. Circuits Conf. (CICC), Sep. 2009, pp. 195–198.
VI. C ONCLUSION [14] W. Xueqiang, P. Liyang, W. Dong, H. Chaohong, and Z. Runde,
“A high-speed two-cell BCH decoder for error correcting in MLC nor
This paper presented a (79, 64, 6) BCH DEC-TED decoder flash memories,” IEEE Trans. Circuits Syst. II, Exp. Briefs, vol. 56,
with high decoding efficiency and low power for emerging no. 11, pp. 865–869, Nov. 2009.
[15] Y. Yoo and I.-C. Park, “A search-less DEC BCH decoder for low-
memories. We proposed an adaptive error correction technique complexity fault-tolerant systems,” in Proc. IEEE Workshop Signal
that chooses a different decoding algorithm depending on Process. Syst. (SiPS), Oct. 2014, pp. 1–6.
the error conditions in a codeword, to improve the decoding [16] C.-C. Chu, Y.-M. Lin, C.-H. Yang, and H.-C. Chang, “A fully par-
allel BCH codec with double error correcting capability for NOR
efficiency regarding delay and power consumption. Also, flash applications,” in Proc. IEEE Int. Conf. Acoust., Speech, Signal
to avoid the high dynamic power consumption problem in Process. (ICASSP), Mar. 2012, pp. 1605–1608.
conventional fully parallel BCH decoders, the invalid transition [17] R. Naseer and J. Draper, “Parallel double error correcting code design to
mitigate multi-bit upsets in SRAMs,” in Proc. Eur. Solid-State Circuits
inhibition technique was adopted by using FFs and a specific Conf. (ESSCIRC), Sep. 2008, pp. 222–225.
ECC clock. In comparison to the conventional PA- and LUT- [18] S. Lin and D. J. Costello, “BCH codes,” in Error Control Coding:
based DEC-TED decoders, the average decoding latency of Fundamentals and Applications, 2nd ed. Englewood Cliffs, NJ, USA:
the proposed decoders is less than half that of the conventional Prentice-Hall, 2004, pp. 141–177.
[19] D. H. Yoon, J. Chang, R. S. Schreiber, and N. P. Jouppi, “Practical
decoders with an RBER of 100 ppm. Due to the added blocks, nonvolatile multilevel-cell phase change memory,” in Proc. Int. Conf.
the area increases by 18% and 1.4% in the PA- and LUT- High Perform. Comput., Netw., Storage Anal., Nov. 2013, pp. 1–12.
based decoders, respectively. However, this overhead can be [20] B. L. Ji et al., “In-line-test of variability and bit-error-rate of HfOx -
based resistive memory,” in Proc. IEEE Int. Memory Workshop (IMW),
decreased with an increased latency constraint of the decoder. May 2015, pp. 1–4.
The proposed PA- and LUT-based decoders achieve 75% [21] S. Sills et al., “Challenges for high-density 16 Gb ReRAM with 27 nm
and 70% power reduction on average in comparison to the technology,” in IEEE Symp. VLSI Circuits Dig. Tech. Papers, Jun. 2015,
conventional decoders, respectively. This paper provides a pp. 106–107.
[22] J. V. D. Horst and T. Berger, “Complete decoding of triple-error-
promising ECC decoder solution to achieve the target yield correcting binary BCH codes,” IEEE Trans. Inf. Theory, vol. 22, no. 2,
even with a high RBER, especially for high-performance and pp. 138–147, Mar. 1976.
low-power applications using emerging memories. [23] F. Gulliang, “An algebraic complete decoding for double-error-correcting
binary BCH codes,” J. Electron., China, vol. 1, no. 1, pp. 12–17, 1984.
[24] S. B. Wicker, Error Control Systems for Digital Communication and
Storage. Englewood Cliffs, NJ, USA: Prentice-Hall, 1995.
R EFERENCES
[25] X. Zhang and K. K. Parhi, “High-speed architectures for parallel long
[1] P. Amato, S. Bellini, M. Ferrari, C. Laurent, M. Sforzin, and BCH encoders,” IEEE Trans. Very Large Scale Integr. (VLSI) Syst.,
A. Tomasoni, “Fast decoding ECC for future memories,” IEEE J. Sel. vol. 13, no. 7, pp. 872–877, Jul. 2005.
Areas Commun., vol. 34, no. 9, pp. 2486–2497, Sep. 2016. [26] W. Liu, J. Rho, and W. Sung, “Low-power high-throughput BCH error
[2] S. H. Kang, “Embedded STT-MRAM for energy-efficient and cost- correction VLSI design for multi-level cell NAND flash memories,”
effective mobile systems,” in IEEE Symp. VLSI Circuits Dig. Tech. in Proc. IEEE Workshop Signal Process. Syst. Design Implement.,
Papers, Jun. 2014, pp. 36–37. Oct. 2006, pp. 303–308.
[3] H. Noguchi, K. Ikegami, N. Shimomura, T. Tetsufumi, J. Ito, and [27] J. Freudenberger and J. Spinner, “A configurable Bose–Chaudhuri–
S. Fujita, “Highly reliable and low-power nonvolatile cache memory Hocquenghem codec architecture for flash controller applications,”
with advanced perpendicular STT-MRAM for high-performance CPU,” J. Circuits, Syst. Comput., vol. 23, no. 2, p. 1450019, 2014.
in IEEE Symp. VLSI Circuits Dig. Tech. Papers, Jun. 2014, pp. 1–2. [28] A. Fahrner, H. Griesser, R. Klarer, and V. V. Zyablov, “Low-complexity
[4] P. Amato, C. Laurent, M. Sforzin, S. Bellini, M. Ferrari, and GEL codes for digital magnetic storage systems,” IEEE Trans. Magn.,
A. Tomasoni, “Ultra fast, two-bit ECC for emerging memories,” in Proc. vol. 40, no. 4, pp. 3093–3095, Jul. 2004.
6th IEEE Int. Memory Workshop (IMW), May 2014, pp. 79–82. [29] J. Spinner, M. Rajab, and J. Freudenberger, “Construction of high-rate
[5] Y. Emre, C. Yang, K. Sutaria, Y. Cao, and C. Chakrabarti, “Enhancing generalized concatenated codes for applications in non-volatile flash
the reliability of STT-RAM through circuit and system level techniques,” memories,” in Proc. IEEE 8th Int. Memory Workshop (IMW), May 2016,
in Proc. IEEE Workshop Signal Process. Syst., Oct. 2012, pp. 125–130. pp. 1–4.
[6] D. Niu, Y. Xiao, and Y. Xie, “Low power memristor-based ReRAM [30] I. Zhilin and A. Kreschuk, “Generalized concatenated code construc-
design with error correcting code,” in Proc. 17th Asia South Pacific tions with low overhead for optical channels and NAND-flash mem-
Design Autom. Conf., Jan./Feb. 2012, pp. 79–84. ory,” in Proc. 15th Int. Symp. Problems Redundancy Inf. Control
[7] M. Mao, Y. Cao, S. Yu, and C. Chakrabarti, “Optimizing latency, energy, Syst. (REDUNDANCY), Sep. 2016, pp. 177–180.
and reliability of 1T1R ReRAM through cross-layer techniques,” IEEE J. [31] D. Strukov, “The area and latency tradeoffs of binary bit-parallel BCH
Emerg. Sel. Topics Circuits Syst., vol. 6, no. 3, pp. 352–363, Sep. 2016. decoders for prospective nanoelectronic memories,” in Proc. Asilomar
[8] C. Yang, M. Mao, Y. Cao, and C. Chakrabarti, “Cost-effective design Conf. Signals, Syst. Comput., Oct./Nov. 2006, pp. 1183–1187.
solutions for enhancing pram reliability and performance,” IEEE Trans. [32] W. W. Peterson, Error-Correcting Codes. Cambridge, MA, USA:
Multi-Scale Comput. Syst., vol. 3, no. 1, pp. 1–11, Jan./Mar. 2017. MIT Press, 1972.
This article has been accepted for inclusion in a future issue of this journal. Content is final as presented, with the exception of pagination.
CHOI et al.: DECODER FOR SHORT BCH CODES WITH HIGH DECODING EFFICIENCY AND LOW POWER FOR EMERGING MEMORIES 11
[33] S. Tanakamaru, Y. Yanagihara, and K. Takeuchi, “Error-prediction LDPC Jung Pill Kim received the B.S. degree in electronic
and error-recovery schemes for highly reliable solid-state drives (SSDs),” engineering from Hanyang University, Seoul, South
IEEE J. Solid-State Circuits, vol. 48, no. 11, pp. 2920–2933, Nov. 2013. Korea, in 1988, and the M.S. and Ph.D. degrees
[34] J. Xiao-Bo, T. Xue-Qing, and H. Wei-Pei, “Novel ECC structure and in computer science and electrical engineering
evaluation method for NAND flash memory,” in Proc. IEEE Int. Syst.- from Harvard University, Cambridge, MA, USA,
Chip Conf. (SOCC), Sep. 2015, pp. 100–104. in 2000 and 2003, respectively.
[35] L. Yuan, H. Liu, P. Jia, and Y. Yang, “Reliability-based ECC system From 1988 to 1998, he was with Hynix Semicon-
for adaptive protection of NAND flash memories,” in Proc. Int. Conf. ductor, Icheon, South Korea, where he was involved
Commun. Syst. Netw. Technol., Apr. 2015, pp. 897–902. in the development and research of many DRAM
[36] L. Yuan, H. Liu, P. Jia, and Y. Yang, “An adaptive ECC scheme for products from 1-MB DRAM to 64-MB EDO and
dynamic protection of NAND Flash memories,” in Proc. IEEE Int. Conf. SDR DRAM. From 2001 to 2008, he was a Design
Acoust., Speech, Signal Process. (ICASSP), Apr. 2015, pp. 1052–1055. Team Leader and a Principle Design Engineer with Qimonda, Research
[37] A. Basak, S. Paul, J. Park, J. Park, and S. Bhunia, “Reconfigurable ECC Triangle Park, NC, USA, focusing on high-density 1-GB SDR DRAM prod-
for adaptive protection of memory,” in Proc. IEEE Int. Midwest Symp. ucts, low-power mobile DRAM and LPDDR1 products, and high-performance
Circuits Syst. (MWSCAS), Aug. 2013, pp. 1085–1088. 1-GB GDDR3 products. Since 2008, he has been with Qualcomm Incorpo-
[38] P. P. Ankolekar, R. Isaac, and J. W. Bredow, “Multibit error-correction rated, San Diego, CA, USA, where he is involved in the development of
methods for latency-constrained flash memory systems,” IEEE Trans. STT-MRAM-related IPs and macros. He has authored or coauthored several
Device Mater. Rel., vol. 10, no. 1, pp. 33–39, Mar. 2010. technical papers and holds over 20 U.S. patents. His current research interests
[39] R. Ramaraju, R. Rock, E. J. Gieske, C. Park, and D. F. Greenberg, include low-power circuit design, high-speed memory design, and advanced
“Hierarchical error correction for large memories,” U.S. Patent 8 677 205 future memory design and technologies.
B2, Mar. 18, 2014.
[40] Z. Wang, “Hierarchical decoding of double error correcting codes for
high speed reliable memories,” in Proc. 50th ACM/EDAV/IEEE Design
Autom. Conf. (DAC), May/Jun. 2013, pp. 1–7.