Exploring Parallel Processing Levels for Convolutional Turbo Decoding

Size: px
Start display at page:

Download "Exploring Parallel Processing Levels for Convolutional Turbo Decoding"

Transcription

1 Exploring Parallel Processing Levels for Convolutional Turbo Decoding Olivier Muller Electronics Department, GET/EST Bretagne Technopôle Brest Iroise, Brest, France Amer Baghdadi Electronics Department, GET/EST Bretagne Technopôle Brest Iroise, Brest, France Michel Jézéquel Electronics Department, GET/EST Bretagne Technopôle Brest Iroise, Brest, France Abstract In forward error correction, convolutional turbo codes were introduced to increase error correction capability approaching the Shannon bound. Decoding of these codes, however, is an iterative process requiring high computation rate and latency. Thus, in order to achieve high throughput and to reduce latency, crucial in emerging digital communication applications, parallel implementations become mandatory. In this paper, we explore the parallelism in convolutional turbo decoding with the BCJR algorithm and propose a multi-level classification of the explored parallelism techniques. We also present promising results on sub-bloc and component-decoder levels of parallelism. Sub-bloc parallelism results show that for sub-bloc initializations, message passing technique outperforms the acquisition approach. Furthermore, sub-bloc parallelism becomes quite inefficient in terms of speed gain for high sub-bloc parallelism degree. Conversely component-decoder parallelism efficiency, which only depends on interleaving rules, increases with sub-bloc parallelism degree. 1. Introduction Digital communication systems, such as fiber-optic communication, wireless communication and storage applications, require very high data-rates as well as powerful error correction capabilities. For the latter, performance approaching the Shannon bound are obtained with iterative decoding algorithms, such as turbo decoding [1] or LDPC decoding [2]. As those algorithms are characterized by their high complexity, achieving high-data rates requires optimal parallelism exploitation. Parallelism in convolutional turbo decoding has been widely investigated over the last few years either at a fine grain level [4] [5] on symbol elementary computations of the decoding algorithm, called BCJR or Forward-bacward algorithm [3], or at a coarse grain level [6-9] mainly based on frame decoding scheme. Especially in the latter, new parallelism techniques continue to appear where the recently introduced shuffled decoding [1] constitutes a typical example. In this paper, we classify existing parallelism possibilities in convolutional turbo decoding with the BCJR algorithm. We also propose a promising performance analysis of parallelism efficiency related to sub-bloc decoding and shuffled decoding. The rest of the paper is organized as follows. The next section presents the convolutional turbo decoding algorithm for better understanding of subsequent sections. Section 3 proposes a multi-level classification of turbo decoding parallelism. In section 4 and section 5, sub-bloc parallelism and component-decoder parallelism (shuffled decoding) are respectively analyzed on the basis of parallelism efficiency criteria. Finally, section 6 summarizes the results obtained and concludes the paper. 2. Convolutional turbo decoding Discovered in 1993, the turbo principle [1] relies on information exchange and iterative processing between the different elementary blocs. The exchanged information is called extrinsic information. For parallel convolutional turbo codes (Figure 1.a), the elementary blocs are the component decoders. The convolutional decoding is performed using the BCJR algorithm [3] which is the optimal algorithm for the maximum a posteriori (MAP) decoding of convolutional codes. In practice, a log-domain derivation of the algorithm (log-map) is used. Log-MAP algorithm can be approximated by max-log-map algorithm [15]. BCJR algorithm is implemented in Soft Input Soft Output (SISO) decoders. Using input symbols and a priori extrinsic information, each SISO decoder computes a posteriori probabilities (APP). These APPs constitue the a prori information for the other decoder and are exchanged via an interleaving (Π) and deinterleaving (Π -1 ) processes. Figure 1.b illustrates the main steps of the BCJR algorithm. Firstly, the branch metric (or γ metric) between two states represents the probability that a transition occurs between these two states.

2 γ red sys SISO Interleaved domain β (b) Π Π Π -1 extrinsic red 1 SISO 1 Desinterleaved domain output Frame β, extrinsic β, extrinsic Frame β extrinsic (a) T time (c) (d) Figure 1. Turbo decoding: (a) turbo decoder, (b) BCJR SISO, (c) forward bacward scheme, (d) butterfly scheme Secondly, forward recursion (or recursion) computes the probability of all the states in the trellis given the past observations (eq. 1). This processing is recursive since a trellis section (i.e. the probability of all states) is computed using the previous trellis section and branch metrics between these two sections. ( ν 2 1 s ) = 1 ( s ) γ ( s, s) (1) s = Thirdly, bacward recursion (or β recursion) computes the probability of all the states in the trellis given the future observations (eq. 2). This computation is similar to the forward recursion, but the frame is processed in the bacward direction. β ( ν 2 1 s ) = β + 1 ( s ) γ + 1( s, s ) (2) s = Finally, the extrinsic information is computed from the forward recursion, the bacward recursion and the extrinsic part of the branch metrics (eq. 3). β () s (') s γ (',) s s Pr( d = i) = (3) () s (') s (',) s s e 1 ( s', s)/ d( s', s) = i e β 1 γ ( s', s) 3.1. BCJR metric level parallelism T/2 The BCJR metric level parallelism concerns the processing of all metrics involved in the decoding of each received symbol inside a BCJR SISO decoder (Figure 1). It exploits the inherent parallelism of the trellis structure, and also the parallelism of BCJR computations [4] [5] Parallelism of trellis transitions Trellis-transition parallelism can easily be extracted from trellis structure as the same operations are repeated for all transition pairs. In log-domain [15], these operations are either ACS operations (Add-Compare- Select) for the max-log-map algorithm or ACSO operations (ACS with a correction offset [15]) for the log-map algorithm. Each BCJR computation (eq. 1,2,3) requires a number of ACS-lie operation equals to half the number of transitions per trellis section. Thus, this number, which depends on the structure of the convolutional code, constitutes the upper bound of the trellis-transition parallelism degree. However this parallelism implies low area overhead as only the ACS units have to be duplicated. In particular, no additional memories are required since all the parallelized operations are executed on the same trellis section, and in consequence on the same data Parallelism of BCJR computations time 3. Parallel processing levels In turbo decoding with BCJR algorithm, parallelism techniques can be classified at three levels: (1) BCJR metric level parallelism, (2) BCJR SISO decoder level parallelism, and (3) Turbo-decoder level parallelism. The first (lowest) parallelism level concerns symbol elementary computations inside a SISO decoder processing the BCJR algorithm. Parallelism between these SISO decoders, inside one turbo decoder, belongs to the second parallelism level. The third (highest) parallelism level duplicates the turbo decoder itself. A second metric parallelism can be orthogonally extracted from BCJR algorithm through a parallel execution of the three BCJR computations. Parallel execution of bacward recursion and APP computations was proposed with the original Forward- Bacward scheme, depicted in Figure 1.c. So, in this scheme, we can notice that BCJR computation parallelism degree is equal to one in the forward part and two in the bacward part. To increase this parallelism degree, several schemes are proposed [8]. Figure 1.d shows the butterfly scheme which doubles the parallelism degree of the original scheme through the parallelism between the forward and

3 bacward recursion computations. This is performed without any memory increase and only BCJR computation resources have to be duplicated. Thus, BCJR computation parallelism is area efficient but still limited in parallelism degree. In conclusion, BCJR metric level parallelism achieves optimal area efficiency as it does not affect memory size which occupies most of the area in a turbo decoder circuit. evertheless the parallelism degree is limited by the decoding algorithm and the code structure. Thus, achieving higher parallelism degree implies exploring higher processing levels BCJR-SISO decoder level parallelism The second level of parallelism concerns the SISO decoder level. It consists of the use of multiple SISO decoders, each executing the BCJR algorithm and processing a sub-bloc of the same frame in one of the two interleaving orders. This level of parallelism can reach a reasonable parallelism degree and preserve memory area [6]. Two inds of parallelism exist in this class: sub-bloc parallelism and component-decoder parallelism Sub-bloc parallelism In sub-bloc parallelism, each frame is divided into M sub-blocs and then each sub-bloc is processed on a BCJR-SISO decoder using adequate initializations [6] [7] [8] [9]. A graphical formalism is proposed in [8] to compare various existing sub-bloc decoding schemes towards parallelism degree and memory efficiency. Besides duplication of BCJR-SISO decoders, this parallelism imposes two other constraints. On the one hand, interleaving has to be parallelized in order to extend proportionally the communication bandwidth [12]. On the second hand, BCJR-SISO decoders have to be initialized adequately as detailed in section Component-decoder Parallelism The component-decoder parallelism is a new ind of parallelism that has become operational with the introduction of the shuffled decoding technique [1]. The basic idea of shuffled decoding is to execute all component decoders in parallel and to exchange APP information as soon as created. With this decoding scheme, decoding time could be theoretically halved in comparison with serial approach with the same iteration number. Section 5 analyzes the performance that can be obtained with this ind of parallelism Turbo-decoder level parallelism The highest level of parallelism duplicates whole turbo decoders to process iterations and/or frames in parallel. Iteration parallelism occurs in a pipelined fashion with a maximum pipeline depth equal to the iteration number, whereas frame parallelism presents no limitation in parallelism degree. Turbo-decoder level parallelism, however, is too areaexpensive (all memories and computation resources are duplicated) and presents no gain in decoding latency. Table 1. Parallel processing levels Level Parallelism BCJR metric Trellis transitions BCJR computations BCJR-SISO decoder Turbo-decoder Sub-blocs Component decoders Iterations Frames 4. Initialization in sub-bloc parallelism As described in section 3, sub-bloc parallelism taes place at frame level and requires initializations. These initializations are mandatory as information on recursion metrics is available at frame ending points, but not at sub-bloc ending points [7]. An estimation of this undetermined information can be obtained either by acquisition or by message passing between neighboring sub-blocs Initialization by acquisition This widely used initialization method consists in estimating recursion metrics thans to an overlapping region called acquisition window or prologue. Starting from a trellis section, where all the states are initialized to a uniform constant, the acquisition window will be processed on its length, denoted AL, to provide reliable state metrics at the beginning of the sub-bloc. This acquisition length is determined at design time in order to mae negligible error rate degradation. It is fixed in function of the number of redundancies in the prologue, typically 6. Another empirical rule recommends from 3 to 5 times the constraint length of the code for this acquisition length [7]. When all the sub-blocs are initialized with acquisition method, the decoding time (t d ), the speed gain (S g ) and additional computation ratio (R c ) can be expressed as: t d + AL it (4) d AL d S g = d 1 + (5) AL ( d 1) R C = (6) where represents the frame length, d the sub-bloc parallelism degree and it the number of iterations. Equation 4 shows clearly that the decoding time tends towards a constant value when parallelism degree

4 increases. Thus sub-bloc parallelism with initialization by acquisition encounters a throughput ceiling value and the maximum speed gain is equal to ( AL +1). The corresponding efficiency, which is defined as speed gain S g divided by parallelism degree d, will decrease to the minimum value 1 ( AL +1). Furthermore the additional computation ratio, which concerns recursion computations and input data memory accesses, increases linearly with the parallelism degree Initialization by message passing The second method initializes dynamically a subbloc with recursion metrics computed during the last iteration in the neighboring sub-blocs [9]. So this technique does not require additional hardware except some communication resources between BCJR SISO units. To evaluate this technique, bit error rate performance degradation has to be evaluated and compensated with additional iterations. In Figure 2, Frame Error Rate (FER) performance is represented for different parallelism degrees in function of iteration number. This figure shows that asymptotic error rate is not affected by message passing approach whatever the parallelism degree. Thus it ensures that initialization by message passing can be used without degradation. 1.e 1.e-1 FER 1.e-2 1.e-3 1.e-4 PROC1 PROC1 PROC25 PROC5 PROC iterations Figure 2. Convergence of message passing technique, DVB-RCS, R=6/7, 188 bytes frame, SR=4.2 db, 5 bit quantification, Log-MAP algorithm However this figure also reveals that additional iterations are mandatory to reach a given FER. The decoding time (t d ) and speed gain (S g ) can be expressed as: td it MP (7) d it S g = d (8) itmp where it MP represents the iteration number with message passing technique. An estimation of it MP can be obtained at fixed FER value. In Figure 3, iteration number with message passing technique, speed gain and efficiency of the technique are represented in function of parallelism degree. Iterations Iterations Speed Gain Parallelism degree Figure 3. Iterations, speed gain and efficiency with message passing technique, DVB-RCS, R=6/7, 188 bytes frame, SR=4.2 db, 5 bit quantification, Log- MAP algorithm It appears that iteration number is constant for small parallelism degree and after a threshold (8 in the presented case) it increases linearly with parallelism degree. So in terms of decoding time, this could be written as: it MP = it if d is less than the threshold. it + C d it MP = if d is greater than the threshold and where C is a constant. Lie sub-bloc parallelism with initialization by acquisition, using initialization by message passing method also encounters a throughput ceiling value. The maximum speed gain is roughly equal to it C, corresponding to an efficiency of it ( C ). Threshold position strongly depends on the sub-bloc size. It can be physically interpreted as the minimum sub-bloc size, which provides reliable recursion values at the end of the first iteration. Under this minimum size, recursion values have to be refined using more iterations. Thus this threshold will change according to the frame size and the code rate, as this latter has an influence on recursion reliability and performance comparison In Figure 4, sub-bloc parallelism efficiency is compared between both initialization methods. Under the presented conditions, efficiency of message passing technique and acquisition technique with 16-symbol acquisition length are quite similar at high sub-bloc parallelism degree. evertheless at low sub-bloc parallelism degree, message passing technique efficiency is constant and equal to maximum efficiency and thus outperforms acquisition technique whatever the acquisition length Speed Gain

5 Parallelism degree Message Passing Acquisition with AL=8 Acquisition with AL=16 Acquisition with AL=32 Figure 4. and initialization methods, DVB- RCS, R=6/7, 188 bytes frame, SR=4.2 db, 5 bit quantification, Log-MAP algorithm Furthermore, the initialization by acquisition degrades error-rate performance, whereas initialization by message passing induces no degradation. Figure 5 illustrates Frame Error Rate (FER) performance considering a DVB-RCS code [14] with a 47-sub-blocparallelism degree..15db-degradation is observed between initialization with 32-symbol acquisition length and message passing initialization or Max-log-MAP without parallelism. 1 created. The following section will analyze the efficiency of this parallelism Shuffled decoding efficiency Lie sub-bloc parallelism efficiency, componentdecoder parallelism efficiency is defined as the speed gain divided by the parallelism degree at equivalent error rate performance. For shuffled decoding, parallelism degree is limited to the number of component decoders (usually 2) and only iteration number could vary in speed gain (eq. 5). Thus shuffled decoding efficiency only depends of the iteration number needed to reach the same error rate performance as serial decoding. Simulation results demonstrate efficiency ranging from.6 to.95. By definition, shuffled decoding efficiency is computed with a set of component decoder, with interleavers of defined size and with a fixed SR. can always be computed along the turbo decoder convergence process as shuffled decoding and serial decoding converge to the same value. Simulations reveal that efficiency is almost invariant along turbo decoder convergence. Then with a defined error rate at various SR, it is also possible to show that efficiency is SR invariant. So shuffled decoding efficiency can only depend on interleaving rules and BCJR-SISO decoder parallelism Shuffled decoding and interleaving FER Acquisition..AL.32 Message.passing MaxLogMAP Acquisition..AL SR Figure 5. FER and initialization methods for high parallelism degree (47), DVB-RCS, R=6/7, 188 bytes frame, 5 bit quantification, Max-log-MAP algorithm Comparison between both techniques tends clearly in favor of message passing technique, which enables better error rate performance without resource overhead but gives similar efficiency. However simulations results (Figure 4) also show that sub-bloc parallelism, whatever the initialization method, becomes quite inefficient at high parallelism degrees. 5. Component-decoder parallelism analysis As described in section 3, component-decoder parallelism taes advantage of the shuffled decoding technique that executes all component decoders in parallel and exchanges APP information as soon as The dependency between shuffled decoding and interleaving has already been studied in [1]. According to interleaving laws Π, symbols belong to three different classes. The first class contains all points processed at the same time in interleaved and desinterleaved order, such as t() = t(π()). The second class contains all points verifying t() < t(π()) and the third all points verifying t() > t(π()). A symbol of the first class is processed concurrently by both component decoders. So decoders do not tae advantage before the next iteration of APPs, that are sent by the another decoder. Using component-decoder and sub-bloc parallelism at the same time, the number of first-class symbols increases with sub-bloc parallelism degree. However shuffled decoding efficiency increases with sub-bloc parallelism degree as represented in Table 2 and Table 3. Table 2. and sub-bloc parallelism degree with 53 bytes DVB-RCS interleaving (R=6/7; max log MAP; SR=4, db; FER =1,6e-3) Sub-bloc parallelism degree without shuffling with shuffling

6 Table 3. and sub-bloc parallelism degree with 188 bytes DVB-RCS interleaver (R=6/7; max log MAP; SR=4, db; FER =1,6e-3) Sub-bloc parallelism degree without shuffling with shuffling This result can be explained by the fact that iteration number increases with sub-bloc parallelism degree. Thus the penalty to next iteration imposed by defined first class will be less significant in the decoding and in this way shuffled decoding efficiency is improved Combining component-decoder and subbloc parallelism From these results, it maes sense to combine component-decoder parallelism and sub-bloc parallelism. Indeed component-decoder parallelism efficiency increases with sub-bloc parallelism degree and at the same time sub-bloc parallelism efficiency decreases. To determine when component-decoder parallelism becomes more efficient than sub-bloc parallelism, subbloc parallelism with parallelism degree d should become the new reference in efficiency computation. Thus the efficiency of doubling sub-bloc parallelism degree to a degree 2d can be compared with efficiency of shuffled decoding at the reference parallelism degree d to select the most efficient parallelism. In our examples, shuffled decoding becomes more efficient for d greater than 4 in Table 2 and 16 in Table 3. Obtained results illustrate that, beyond a certain bound, decoder parallelism becomes more efficient than sub-bloc parallelism. 6. Conclusion In this paper we analyzed and classified the various parallelism techniques that could be used in convolutional turbo decoding with the BCJR algorithm. The proposed three level classification includes: BCJR metric level parallelism, BCJR SISO decoder level parallelism, and Turbo-decoder level parallelism. It has been shown that sub-bloc initialization is more efficient with message passing technique than with acquisition technique and also that sub-bloc parallelism becomes inefficient for high sub-bloc parallelism degrees. On the contrary, component-decoder parallelism, with the newly introduced shuffled decoding technique, becomes more efficient for high sub-bloc parallelism degrees. of this parallelism depends only on interleaving rules. Furthermore a criterion based on analysis of parallelism efficiency is proposed to help the selection and the use of these two parallelism techniques. References [1] C. Berrou, A. Glavieux, and P. Thitimajshima, ear Shannon Limit Error-Correcting Coding and Decoding: Turbo-Codes, in Proc International Conference on Communications (ICC 93), Geneva, Switzerland, [2] D. J. C. MacKay, Good error-correcting codes based on very sparse matrices, IEEE Trans. Inf. Theory, vol. 45, pp , Mar [3] L. Bahl, J. Coce, F. Jeline, and J. Raviv, Optimal decoding of linear codes for minimizing symbol error rate, IEEE Trans. Inf. Theory, vol. IT-2, pp , Mar [4] G. Masera, G. Piccinini, M. R. Roch, and M. Zamboni, VLSI architectures for turbo codes, IEEE Trans. VLSI Syst., vol. 7, pp , Sept [5] E. Boutillon, W. J. Gross, and P. G. Gula, VLSI architectures for the MAP algorithm, IEEE Trans. Commun., vol. 51, pp , Feb. 23. [6] C. Schurgers, F. Catthoor, and M. Engels, Memory optimization of MAP turbo decoder algorithms, IEEE Trans. VLSI Syst., vol. 9, pp , Apr. 21. [7] T. Wolf, "Initialization of Sliding Windows in Turbo decoders", 3rd International Symposium on Turbo Codes and Related Topics, Brest, France, pp , Sept. 23. [8] Y. Zhang and K.K. Parhi, Parallel Turbo decoding, Proceedings of the International Symposium on Circuits and Systems, volume 2, May 24 Page(s):II [9] A. Abbasfar, and K. Yao, An Efficient Architecture for High Speed Turbo Decoders, Proc. of ICASSP 23, pp. IV-521-IV-524, April 23. [1] J. Zhang, and M.P.C. Fossorier, Shuffled iterative decoding, IEEE Transactions on Communications Volume 53, Issue 2, Feb. 25 Page(s): [11] D. Gnaëdig, E. Boutillon, M. Jezequel, V. Gaudet, G. Gula, "On Multiple Slice Turbo Code", 3rd International Symposium on Turbo Codes and Related Topics, Brest, France, pp , Sept. 23. [12] F. Gilbert, M. Thul and. Wehn. Communication Centric Architectures for Turbo-Decoding on Embedded Multiprocessors, Proceedings of DATE 23, Munich. [13] M. J. Thul, F. Gilbert, T. Vogt, G. Kreiselmaier and. Wehn, A Scalable System Architecture for High- Throughput Turbo-Decoders, Journal of VLSI Signal Processing Vol. 39, pages 63-77, etherlands 25. [14] C. Douillard, M. Jezequel, C. Berrou,. Brengarth, J. Tousch,. Pham, The Turbo Code Standard for DVB- RCS, 2nd International Symposium on Turbo Codes & Related Topics, Brest, France, 2. p [15] P. Robertson, P. Hoeher, and E. Villebrun, Optimal and Sub-Optimal Maximum a Posteriori Algorithms Suitable for Turbo Decoding, European Transactions on Telecommunications (ETT), vol. 8, no. 2, 1997, pp

Research Article Parallelism Efficiency in Convolutional Turbo Decoding

Research Article Parallelism Efficiency in Convolutional Turbo Decoding Hindawi Publishing Corporation EURASIP Journal on Advances in Signal Processing Volume 21, Article ID 92792, 11 pages doi:1.1155/21/92792 Research Article Parallelism Efficiency in Convolutional Turbo

More information

IEEE Proof Web Version

IEEE Proof Web Version IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS 1 From Parallelism Levels to a Multi-ASIP Architecture for Turbo Decoding Olivier Muller, Member, IEEE, Amer Baghdadi, and Michel Jézéquel,

More information

ASIP-Based Multiprocessor SoC Design for Simple and Double Binary Turbo Decoding

ASIP-Based Multiprocessor SoC Design for Simple and Double Binary Turbo Decoding ASIP-Based ultiprocessor SoC Design for Simple and Double Binary Turbo Decoding Olivier uller, Amer Baghdadi, ichel Jézéquel Electronics Department, ENST Bretagne, Technopôle Brest Iroise, 29238 Brest,

More information

ISSCC 2003 / SESSION 8 / COMMUNICATIONS SIGNAL PROCESSING / PAPER 8.7

ISSCC 2003 / SESSION 8 / COMMUNICATIONS SIGNAL PROCESSING / PAPER 8.7 ISSCC 2003 / SESSION 8 / COMMUNICATIONS SIGNAL PROCESSING / PAPER 8.7 8.7 A Programmable Turbo Decoder for Multiple 3G Wireless Standards Myoung-Cheol Shin, In-Cheol Park KAIST, Daejeon, Republic of Korea

More information

/$ IEEE

/$ IEEE IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II: EXPRESS BRIEFS, VOL. 56, NO. 1, JANUARY 2009 81 Bit-Level Extrinsic Information Exchange Method for Double-Binary Turbo Codes Ji-Hoon Kim, Student Member,

More information

Towards an optimal parallel decoding of turbo codes

Towards an optimal parallel decoding of turbo codes owards an optimal parallel decoding of turbo codes David Gnaedig *, Emmanuel Boutillon +, Jacky ousch *, Michel Jézéquel * urboconcept, 115 rue Claude Chappe, 29280 PLOUZANE, France + LEER Unité CNR FRE

More information

Nearly-optimal associative memories based on distributed constant weight codes

Nearly-optimal associative memories based on distributed constant weight codes Nearly-optimal associative memories based on distributed constant weight codes Vincent Gripon Electronics and Computer Enginering McGill University Montréal, Canada Email: vincent.gripon@ens-cachan.org

More information

OPTIMIZED MAP TURBO DECODER

OPTIMIZED MAP TURBO DECODER OPTIMIZED MAP TURBO DECODER Curt Schurgers Francy Catthoor Marc Engels EE Department IMEC/KUL IMEC UCLA Kapeldreef 75 Kapeldreef 75 Los Angeles, CA 90024 3001 Heverlee 3001 Heverlee USA Belgium Belgium

More information

lambda-min Decoding Algorithm of Regular and Irregular LDPC Codes

lambda-min Decoding Algorithm of Regular and Irregular LDPC Codes lambda-min Decoding Algorithm of Regular and Irregular LDPC Codes Emmanuel Boutillon, Frédéric Guillou, Jean-Luc Danger To cite this version: Emmanuel Boutillon, Frédéric Guillou, Jean-Luc Danger lambda-min

More information

Optimal M-BCJR Turbo Decoding: The Z-MAP Algorithm

Optimal M-BCJR Turbo Decoding: The Z-MAP Algorithm Wireless Engineering and Technology, 2011, 2, 230-234 doi:10.4236/wet.2011.24031 Published Online October 2011 (http://www.scirp.org/journal/wet) Optimal M-BCJR Turbo Decoding: The Z-MAP Algorithm Aissa

More information

VHDL Implementation of different Turbo Encoder using Log-MAP Decoder

VHDL Implementation of different Turbo Encoder using Log-MAP Decoder 49 VHDL Implementation of different Turbo Encoder using Log-MAP Decoder Akash Kumar Gupta and Sanjeet Kumar Abstract Turbo code is a great achievement in the field of communication system. It can be created

More information

Implementation Aspects of Turbo-Decoders for Future Radio Applications

Implementation Aspects of Turbo-Decoders for Future Radio Applications Implementation Aspects of Turbo-Decoders for Future Radio Applications Friedbert Berens STMicroelectronics Advanced System Technology Geneva Applications Laboratory CH-1215 Geneva 15, Switzerland e-mail:

More information

High Speed Downlink Packet Access efficient turbo decoder architecture: 3GPP Advanced Turbo Decoder

High Speed Downlink Packet Access efficient turbo decoder architecture: 3GPP Advanced Turbo Decoder I J C T A, 9(24), 2016, pp. 291-298 International Science Press High Speed Downlink Packet Access efficient turbo decoder architecture: 3GPP Advanced Turbo Decoder Parvathy M.*, Ganesan R.*,** and Tefera

More information

Hard Decision Based Low SNR Early Termination for LTE Turbo Decoding

Hard Decision Based Low SNR Early Termination for LTE Turbo Decoding Hard Decision Based Low SNR Early Termination for LTE Turbo Decoding Jan Geldmacher, Klaus Hueske, and Jürgen Götze Information Processing Lab, TU Dortmund University Otto-Hahn-Strasse 4, 447 Dortmund,

More information

Super Codes: A Flexible Multi Rate Coding System

Super Codes: A Flexible Multi Rate Coding System Super Codes: A Flexible Multi Rate Coding System Steven S. Pietrobon Small World Communications, 6 First Avenue, Payneham South SA 57, Australia. E mail: steven@sworld.com.au Abstract: We define super

More information

High speed low complexity radix-16 Max-Log-MAP SISO decoder

High speed low complexity radix-16 Max-Log-MAP SISO decoder High speed low complexity radix-16 Max-Log-MAP SISO decoder Oscar David Sanchez Gonzalez, Christophe Jego, Michel Jezequel, Yannick Saouter To cite this version: Oscar David Sanchez Gonzalez, Christophe

More information

Comparison of Decoding Algorithms for Concatenated Turbo Codes

Comparison of Decoding Algorithms for Concatenated Turbo Codes Comparison of Decoding Algorithms for Concatenated Turbo Codes Drago Žagar, Nenad Falamić and Snježana Rimac-Drlje University of Osijek Faculty of Electrical Engineering Kneza Trpimira 2b, HR-31000 Osijek,

More information

EFFICIENT PARALLEL MEMORY ORGANIZATION FOR TURBO DECODERS

EFFICIENT PARALLEL MEMORY ORGANIZATION FOR TURBO DECODERS In Proceedings of the European Signal Processing Conference, pages 831-83, Poznan, Poland, September 27. EFFICIENT PARALLEL MEMORY ORGANIZATION FOR TURBO DECODERS Perttu Salmela, Ruirui Gu*, Shuvra S.

More information

Reduced complexity Log-MAP algorithm with Jensen inequality based non-recursive max operator for turbo TCM decoding

Reduced complexity Log-MAP algorithm with Jensen inequality based non-recursive max operator for turbo TCM decoding Sybis and Tyczka EURASIP Journal on Wireless Communications and Networking 2013, 2013:238 RESEARCH Open Access Reduced complexity Log-MAP algorithm with Jensen inequality based non-recursive max operator

More information

422 IEEE JOURNAL OF SOLID-STATE CIRCUITS, VOL. 45, NO. 2, FEBRUARY 2010

422 IEEE JOURNAL OF SOLID-STATE CIRCUITS, VOL. 45, NO. 2, FEBRUARY 2010 422 IEEE JOURNAL OF SOLID-STATE CIRCUITS, VOL. 45, NO. 2, FEBRUARY 2010 Turbo Decoder Using Contention-Free Interleaver and Parallel Architecture Cheng-Chi Wong, Ming-Wei Lai, Chien-Ching Lin, Hsie-Chia

More information

TURBO codes, [1], [2], have attracted much interest due

TURBO codes, [1], [2], have attracted much interest due 800 IEEE TRANSACTIONS ON INFORMATION THEORY, VOL. 47, NO. 2, FEBRUARY 2001 Zigzag Codes and Concatenated Zigzag Codes Li Ping, Member, IEEE, Xiaoling Huang, and Nam Phamdo, Senior Member, IEEE Abstract

More information

Interlaced Column-Row Message-Passing Schedule for Decoding LDPC Codes

Interlaced Column-Row Message-Passing Schedule for Decoding LDPC Codes Interlaced Column-Row Message-Passing Schedule for Decoding LDPC Codes Saleh Usman, Mohammad M. Mansour, Ali Chehab Department of Electrical and Computer Engineering American University of Beirut Beirut

More information

Iterative Decoding of Concatenated Convolutional Codes: Implementation Issues

Iterative Decoding of Concatenated Convolutional Codes: Implementation Issues E. BOUTILLON, C. DOUILLARD, G. MONTORSI 1 Iterative Decoding of Concatenated Convolutional Codes: Implementation Issues Emmanuel Boutillon, Catherine Douillard, and Guido Montorsi Abstract This tutorial

More information

Low Complexity Architecture for Max* Operator of Log-MAP Turbo Decoder

Low Complexity Architecture for Max* Operator of Log-MAP Turbo Decoder International Journal of Current Engineering and Technology E-ISSN 2277 4106, P-ISSN 2347 5161 2015 INPRESSCO, All Rights Reserved Available at http://inpressco.com/category/ijcet Research Article Low

More information

TURBO CODES with performance near the Shannon

TURBO CODES with performance near the Shannon IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, VOL. 13, NO. 4, APRIL 2005 427 Parallel Interleaver Design and VLSI Architecture for Low-Latency MAP Turbo Decoders Rostislav (Reuven)

More information

Design of Low-Power High-Speed Maximum a Priori Decoder Architectures

Design of Low-Power High-Speed Maximum a Priori Decoder Architectures Design of Low-Power High-Speed Maximum a Priori Decoder Architectures Alexander Worm Λ, Holger Lamm, Norbert Wehn Institute of Microelectronic Systems Department of Electrical Engineering and Information

More information

Programmable Turbo Decoder Supporting Multiple Third-Generation Wireless Standards

Programmable Turbo Decoder Supporting Multiple Third-Generation Wireless Standards Programmable Turbo Decoder Supporting Multiple Third-eneration Wireless Standards Myoung-Cheol Shin and In-Cheol Park Department of Electrical Engineering and Computer Science, KAIST Yuseong-gu Daejeon,

More information

High Speed ACSU Architecture for Viterbi Decoder Using T-Algorithm

High Speed ACSU Architecture for Viterbi Decoder Using T-Algorithm High Speed ACSU Architecture for Viterbi Decoder Using T-Algorithm Atish A. Peshattiwar & Tejaswini G. Panse Department of Electronics Engineering, Yeshwantrao Chavan College of Engineering, E-mail : atishp32@gmail.com,

More information

Hybrid Iteration Control on LDPC Decoders

Hybrid Iteration Control on LDPC Decoders Hybrid Iteration Control on LDPC Decoders Erick Amador and Raymond Knopp EURECOM 694 Sophia Antipolis, France name.surname@eurecom.fr Vincent Rezard Infineon Technologies France 656 Sophia Antipolis, France

More information

Comparison of Various Concatenated Convolutional Code Ensembles under Spatial Coupling

Comparison of Various Concatenated Convolutional Code Ensembles under Spatial Coupling Comparison of Various Concatenated Convolutional Code Ensembles under Spatial Coupling GABRIEL IRO RAJESHWARI KABBINALE MASTER S THESIS DEPARTMENT OF ELECTRICAL AND INFORMATION TECHNOLOGY FACULTY OF ENGINEERING

More information

EFFICIENT RECURSIVE IMPLEMENTATION OF A QUADRATIC PERMUTATION POLYNOMIAL INTERLEAVER FOR LONG TERM EVOLUTION SYSTEMS

EFFICIENT RECURSIVE IMPLEMENTATION OF A QUADRATIC PERMUTATION POLYNOMIAL INTERLEAVER FOR LONG TERM EVOLUTION SYSTEMS Rev. Roum. Sci. Techn. Électrotechn. et Énerg. Vol. 61, 1, pp. 53 57, Bucarest, 016 Électronique et transmission de l information EFFICIENT RECURSIVE IMPLEMENTATION OF A QUADRATIC PERMUTATION POLYNOMIAL

More information

/$ IEEE

/$ IEEE IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I: REGULAR PAPERS, VOL. 56, NO. 5, MAY 2009 1005 Low-Power Memory-Reduced Traceback MAP Decoding for Double-Binary Convolutional Turbo Decoder Cheng-Hung Lin,

More information

Overlapped Scheduling for Folded LDPC Decoding Based on Matrix Permutation

Overlapped Scheduling for Folded LDPC Decoding Based on Matrix Permutation Overlapped Scheduling for Folded LDPC Decoding Based on Matrix Permutation In-Cheol Park and Se-Hyeon Kang Department of Electrical Engineering and Computer Science, KAIST {icpark, shkang}@ics.kaist.ac.kr

More information

A Novel Area Efficient Folded Modified Convolutional Interleaving Architecture for MAP Decoder

A Novel Area Efficient Folded Modified Convolutional Interleaving Architecture for MAP Decoder A Novel Area Efficient Folded Modified Convolutional Interleaving Architecture for Decoder S.Shiyamala Department of ECE SSCET Palani, India. Dr.V.Rajamani Principal IGCET Trichy,India ABSTRACT This paper

More information

BER Guaranteed Optimization and Implementation of Parallel Turbo Decoding on GPU

BER Guaranteed Optimization and Implementation of Parallel Turbo Decoding on GPU 2013 8th International Conference on Communications and Networking in China (CHINACOM) BER Guaranteed Optimization and Implementation of Parallel Turbo Decoding on GPU Xiang Chen 1,2, Ji Zhu, Ziyu Wen,

More information

Quasi-Cyclic Low-Density Parity-Check (QC-LDPC) Codes for Deep Space and High Data Rate Applications

Quasi-Cyclic Low-Density Parity-Check (QC-LDPC) Codes for Deep Space and High Data Rate Applications Quasi-Cyclic Low-Density Parity-Check (QC-LDPC) Codes for Deep Space and High Data Rate Applications Nikoleta Andreadou, Fotini-Niovi Pavlidou Dept. of Electrical & Computer Engineering Aristotle University

More information

Area-Efficient High-Throughput MAP Decoder Architectures

Area-Efficient High-Throughput MAP Decoder Architectures IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, VOL. 13, NO. 8, AUGUST 2005 921 Area-Efficient High-Throughput MAP Decoder Architectures Seok-Jun Lee, Member, IEEE, Naresh R. Shanbhag,

More information

Architecture multi-asip pour turbo décodeur multi-standard. Amer Baghdadi.

Architecture multi-asip pour turbo décodeur multi-standard. Amer Baghdadi. Architecture multi-asip pour turbo décodeur multi-standard Amer Baghdadi Amer.Baghdadi@telecom-bretagne.eu Télécom Bretagne Technopôle Brest-Iroise - CS 8388 938 Brest Cedex 3 - FRANCE GDR-ISIS, Paris.

More information

Stopping-free dynamic configuration of a multi-asip turbo decoder

Stopping-free dynamic configuration of a multi-asip turbo decoder 2013 16th Euromicro Conference on Digital System Design Stopping-free dynamic configuration of a multi-asip turbo decoder Vianney Lapotre, Purushotham Murugappa, Guy Gogniat, Amer Baghdadi, Michael Hübner

More information

AS TURBO codes [1], or parallel concatenated convolutional

AS TURBO codes [1], or parallel concatenated convolutional IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, VOL. 15, NO. 7, JULY 2007 801 SIMD Processor-Based Turbo Decoder Supporting Multiple Third-Generation Wireless Standards Myoung-Cheol Shin,

More information

Finding Small Stopping Sets in the Tanner Graphs of LDPC Codes

Finding Small Stopping Sets in the Tanner Graphs of LDPC Codes Finding Small Stopping Sets in the Tanner Graphs of LDPC Codes Gerd Richter University of Ulm, Department of TAIT Albert-Einstein-Allee 43, D-89081 Ulm, Germany gerd.richter@uni-ulm.de Abstract The performance

More information

LOW-DENSITY PARITY-CHECK (LDPC) codes [1] can

LOW-DENSITY PARITY-CHECK (LDPC) codes [1] can 208 IEEE TRANSACTIONS ON MAGNETICS, VOL 42, NO 2, FEBRUARY 2006 Structured LDPC Codes for High-Density Recording: Large Girth and Low Error Floor J Lu and J M F Moura Department of Electrical and Computer

More information

Non-Binary Turbo Codes Interleavers

Non-Binary Turbo Codes Interleavers Non-Binary Turbo Codes Interleavers Maria KOVACI, Horia BALTA University Polytechnic of Timişoara, Faculty of Electronics and Telecommunications, Postal Address, 30223 Timişoara, ROMANIA, E-Mail: mariakovaci@etcuttro,

More information

A Reconfigurable Multi-Processor Platform for Convolutional and Turbo Decoding

A Reconfigurable Multi-Processor Platform for Convolutional and Turbo Decoding A Reconfigurable Multi-Processor Platform for Convolutional and Turbo Decoding Timo Vogt, Christian Neeb, and Norbert Wehn University of Kaiserslautern, Kaiserslautern, Germany {vogt, neeb, wehn}@eit.uni-kl.de

More information

< Irregular Repeat-Accumulate LDPC Code Proposal Technology Overview

<  Irregular Repeat-Accumulate LDPC Code Proposal Technology Overview Project IEEE 802.20 Working Group on Mobile Broadband Wireless Access Title Irregular Repeat-Accumulate LDPC Code Proposal Technology Overview Date Submitted Source(s):

More information

Implementation of a Turbo Encoder and Turbo Decoder on DSP Processor-TMS320C6713

Implementation of a Turbo Encoder and Turbo Decoder on DSP Processor-TMS320C6713 International Journal of Engineering Research and Development e-issn : 2278-067X, p-issn : 2278-800X,.ijerd.com Volume 2, Issue 5 (July 2012), PP. 37-41 Implementation of a Turbo Encoder and Turbo Decoder

More information

Research Article A Programmable Max-Log-MAP Turbo Decoder Implementation

Research Article A Programmable Max-Log-MAP Turbo Decoder Implementation VLSI Design Volume 28, Article ID 3995, 7 pages doi:.55/28/3995 Research Article A Programmable Max-Log-MAP Turbo Decoder Implementation Perttu Salmela, Harri Sorokin, and Jarmo Takala Department of Computer

More information

Performance of Multihop Communications Using Logical Topologies on Optical Torus Networks

Performance of Multihop Communications Using Logical Topologies on Optical Torus Networks Performance of Multihop Communications Using Logical Topologies on Optical Torus Networks X. Yuan, R. Melhem and R. Gupta Department of Computer Science University of Pittsburgh Pittsburgh, PA 156 fxyuan,

More information

6.962 Graduate Seminar in Communications Turbo-Like Codes: Structure, Design Criteria, and Variations

6.962 Graduate Seminar in Communications Turbo-Like Codes: Structure, Design Criteria, and Variations 6.962 Graduate Seminar in Communications Turbo-Like Codes: Structure, Design Criteria, and Variations Presenter: J. Nicholas Laneman October 18, 2000 1 Summary Broadening our treatment of capacity-approaching

More information

THERE has been great interest in recent years in coding

THERE has been great interest in recent years in coding 186 IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, VOL. 16, NO. 2, FEBRUARY 1998 Concatenated Decoding with a Reduced-Search BCJR Algorithm Volker Franz and John B. Anderson, Fellow, IEEE Abstract We

More information

CONTENT ADAPTIVE SCREEN IMAGE SCALING

CONTENT ADAPTIVE SCREEN IMAGE SCALING CONTENT ADAPTIVE SCREEN IMAGE SCALING Yao Zhai (*), Qifei Wang, Yan Lu, Shipeng Li University of Science and Technology of China, Hefei, Anhui, 37, China Microsoft Research, Beijing, 8, China ABSTRACT

More information

High Throughput Radix-4 SISO Decoding Architecture with Reduced Memory Requirement

High Throughput Radix-4 SISO Decoding Architecture with Reduced Memory Requirement JOURNAL OF SEMICONDUCTOR TECHNOLOGY AND SCIENCE, VOL.14, NO.4, AUGUST, 2014 http://dx.doi.org/10.5573/jsts.2014.14.4.407 High Throughput Radix-4 SISO Decoding Architecture with Reduced Memory Requirement

More information

THE turbo code is one of the most attractive forward error

THE turbo code is one of the most attractive forward error IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II: EXPRESS BRIEFS, VOL. 63, NO. 2, FEBRUARY 2016 211 Memory-Reduced Turbo Decoding Architecture Using NII Metric Compression Youngjoo Lee, Member, IEEE, Meng

More information

Low complexity FEC Systems for Satellite Communication

Low complexity FEC Systems for Satellite Communication Low complexity FEC Systems for Satellite Communication Ashwani Singh Navtel Systems 2 Rue Muette, 27000,Houville La Branche, France Tel: +33 237 25 71 86 E-mail: ashwani.singh@navtelsystems.com Henry Chandran

More information

Iterative Decoding of Concatenated Convolutional Codes: Implementation Issues

Iterative Decoding of Concatenated Convolutional Codes: Implementation Issues E. BOUTILLON, C. DOUILLARD, G. MONTORSI 1 Iterative Decoding of Concatenated Convolutional Codes: Implementation Issues Emmanuel Boutillon 1, Catherine Douillard 2, and Guido Montorsi 3 Abstract This tutorial

More information

VLSI Architectures for SISO-APP Decoders

VLSI Architectures for SISO-APP Decoders IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, VOL. 11, NO. 4, AUGUST 2003 627 VLSI Architectures for SISO-APP Decoders Mohammad M. Mansour, Student Member, IEEE, and Naresh R. Shanbhag,

More information

Reduced-State Soft-Input/Soft-Output Algorithms for Complexity Reduction in Iterative and Non-Iterative Data Detection

Reduced-State Soft-Input/Soft-Output Algorithms for Complexity Reduction in Iterative and Non-Iterative Data Detection Reduced-State Soft-Input/Soft-Output Algorithms for Complexity Reduction in Iterative and Non-Iterative Data Detection Xiaopeng Chen and Keith M. Chugg Abstract Soft-input/soft-output (SISO) algorithms

More information

High-Throughput Contention-Free Concurrent Interleaver Architecture for Multi-Standard Turbo Decoder

High-Throughput Contention-Free Concurrent Interleaver Architecture for Multi-Standard Turbo Decoder High-Throughput Contention-Free Concurrent Interleaver Architecture for Multi-Standard Turbo Decoder Guohui Wang, Yang Sun, Joseph R. Cavallaro and Yuanbin Guo Department of Electrical and Computer Engineering,

More information

A Modified Medium Access Control Algorithm for Systems with Iterative Decoding

A Modified Medium Access Control Algorithm for Systems with Iterative Decoding A Modified Medium Access Control Algorithm for Systems with Iterative Decoding Inkyu Lee Carl-Erik W. Sundberg Sunghyun Choi Dept. of Communications Eng. Korea University Seoul, Korea inkyu@korea.ac.kr

More information

A completely safe early-stopping criterion for max-log Turbo code decoding

A completely safe early-stopping criterion for max-log Turbo code decoding A completely safe early-stopping criterion for max-log Turbo code decoding Andrew Hunt, Stewart Crozier, Ken Gracie, and Paul Guinand Communications Research Centre (CRC), Ottawa, Canada, www.crc.ca/fec

More information

Lowering the Error Floors of Irregular High-Rate LDPC Codes by Graph Conditioning

Lowering the Error Floors of Irregular High-Rate LDPC Codes by Graph Conditioning Lowering the Error Floors of Irregular High- LDPC Codes by Graph Conditioning Wen-Yen Weng, Aditya Ramamoorthy and Richard D. Wesel Electrical Engineering Department, UCLA, Los Angeles, CA, 90095-594.

More information

A Review on Analysis on Codes using Different Algorithms

A Review on Analysis on Codes using Different Algorithms A Review on Analysis on Codes using Different Algorithms Devansh Vats Gaurav Kochar Rakesh Joon (ECE/GITAM/MDU) (ECE/GITAM/MDU) (HOD-ECE/GITAM/MDU) Abstract-Turbo codes are a new class of forward error

More information

On the performance of turbo codes with convolutional interleavers

On the performance of turbo codes with convolutional interleavers University of Wollongong Research Online Faculty of Informatics - Papers (Archive) Faculty of Engineering and Information Sciences 5 On the performance of turbo codes with convolutional interleavers Sina

More information

An Efficient Decoding Algorithm for Cycle-free Convolutional Codes and Its Applications

An Efficient Decoding Algorithm for Cycle-free Convolutional Codes and Its Applications An Efficient Decoding Algorithm for Cycle-free Convolutional Codes and Its Applications Jing Li Krishna R Narayanan and Costas N Georghiades Department of Electrical Engineering Texas A&M University College

More information

Low-Power Adaptive Viterbi Decoder for TCM Using T-Algorithm

Low-Power Adaptive Viterbi Decoder for TCM Using T-Algorithm International Journal of Scientific and Research Publications, Volume 3, Issue 8, August 2013 1 Low-Power Adaptive Viterbi Decoder for TCM Using T-Algorithm MUCHHUMARRI SANTHI LATHA*, Smt. D.LALITHA KUMARI**

More information

Iterative Decoder Architectures

Iterative Decoder Architectures Iterative Decoder Architectures Submitted to IEEE Communications Magazine Engling Yeo, Borivoje Nikolic, and Venkat Anantharam Department of Electrical Engineering and Computer Sciences University of California,

More information

/$ IEEE

/$ IEEE IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, VOL. 14, NO. 10, OCTOBER 2006 1147 Transactions Briefs Highly-Parallel Decoding Architectures for Convolutional Turbo Codes Zhiyong He,

More information

Partially-Parallel LDPC Decoder Achieving High-Efficiency Message-Passing Schedule

Partially-Parallel LDPC Decoder Achieving High-Efficiency Message-Passing Schedule IEICE TRANS. FUNDAMENTALS, VOL.E89 A, NO.4 APRIL 2006 969 PAPER Special Section on Selected Papers from the 18th Workshop on Circuits and Systems in Karuizawa Partially-Parallel LDPC Decoder Achieving

More information

New Message-Passing Decoding Algorithm of LDPC Codes by Partitioning Check Nodes 1

New Message-Passing Decoding Algorithm of LDPC Codes by Partitioning Check Nodes 1 New Message-Passing Decoding Algorithm of LDPC Codes by Partitioning Check Nodes 1 Sunghwan Kim* O, Min-Ho Jang*, Jong-Seon No*, Song-Nam Hong, and Dong-Joon Shin *School of Electrical Engineering and

More information

Novel Low-Density Signature Structure for Synchronous DS-CDMA Systems

Novel Low-Density Signature Structure for Synchronous DS-CDMA Systems Novel Low-Density Signature Structure for Synchronous DS-CDMA Systems Reza Hoshyar Email: R.Hoshyar@surrey.ac.uk Ferry P. Wathan Email: F.Wathan@surrey.ac.uk Rahim Tafazolli Email: R.Tafazolli@surrey.ac.uk

More information

Complexity Comparison of Non-Binary LDPC Decoders

Complexity Comparison of Non-Binary LDPC Decoders Complexity Comparison of Non-Binary LDPC Decoders Laura Conde-Canencia, Ali Al-Ghouwayel, Emmanuel Boutillon To cite this version: Laura Conde-Canencia, Ali Al-Ghouwayel, Emmanuel Boutillon. Complexity

More information

Self-adaptability in Secure Embedded Systems: an Energy-Performance Trade-off

Self-adaptability in Secure Embedded Systems: an Energy-Performance Trade-off Self-adaptability in Secure Embedded Systems: an Energy-Performance Trade-off N. Botezatu V. Manta and A. Stan Abstract Securing embedded systems is a challenging and important research topic due to limited

More information

Payload Length and Rate Adaptation for Throughput Optimization in Wireless LANs

Payload Length and Rate Adaptation for Throughput Optimization in Wireless LANs Payload Length and Rate Adaptation for Throughput Optimization in Wireless LANs Sayantan Choudhury and Jerry D. Gibson Department of Electrical and Computer Engineering University of Califonia, Santa Barbara

More information

Improving Min-sum LDPC Decoding Throughput by Exploiting Intra-cell Bit Error Characteristic in MLC NAND Flash Memory

Improving Min-sum LDPC Decoding Throughput by Exploiting Intra-cell Bit Error Characteristic in MLC NAND Flash Memory Improving Min-sum LDPC Decoding Throughput by Exploiting Intra-cell Bit Error Characteristic in MLC NAND Flash Memory Wenzhe Zhao, Hongbin Sun, Minjie Lv, Guiqiang Dong, Nanning Zheng, and Tong Zhang Institute

More information

QUANTIZER DESIGN FOR EXPLOITING COMMON INFORMATION IN LAYERED CODING. Mehdi Salehifar, Tejaswi Nanjundaswamy, and Kenneth Rose

QUANTIZER DESIGN FOR EXPLOITING COMMON INFORMATION IN LAYERED CODING. Mehdi Salehifar, Tejaswi Nanjundaswamy, and Kenneth Rose QUANTIZER DESIGN FOR EXPLOITING COMMON INFORMATION IN LAYERED CODING Mehdi Salehifar, Tejaswi Nanjundaswamy, and Kenneth Rose Department of Electrical and Computer Engineering University of California,

More information

Energy Efficiency Maximization for Wireless Sensor Networks

Energy Efficiency Maximization for Wireless Sensor Networks Energy Efficiency Maximization for Wireless Sensor Networks Inwhee Joe College of Information and Communications Hanyang University Seoul, Korea iwjoe@hanyang.ac.kr Abstract. Because of the remote nature

More information

Configuration latency constraint and frame duration /13/$ IEEE. a) Configuration latency constraint

Configuration latency constraint and frame duration /13/$ IEEE. a) Configuration latency constraint An efficient on-chip configuration infrastructure for a flexible multi-asip turbo decoder architecture Vianney Lapotre, Michael Hübner, Guy Gogniat, Purushotham Murugappa, Amer Baghdadi and Jean-Philippe

More information

Memory-Reduced Turbo Decoding Architecture Using NII Metric Compression

Memory-Reduced Turbo Decoding Architecture Using NII Metric Compression Memory-Reduced Turbo Decoding Architecture Using NII Metric Compression Syed kareem saheb, Research scholar, Dept. of ECE, ANU, GUNTUR,A.P, INDIA. E-mail:sd_kareem@yahoo.com A. Srihitha PG student dept.

More information

PARALLEL INTERLEAVING ON PARALLEL DSP ARCHITECTURES. Thomas Richter and Gerhard P. Fettweis

PARALLEL INTERLEAVING ON PARALLEL DSP ARCHITECTURES. Thomas Richter and Gerhard P. Fettweis PARALLEL INTERLEAVING ON PARALLEL DSP ARCHITECTURES Thomas Richter and Gerhard P. Fettweis Chair for Mobile Communications Systems Dresden University of Technology, Dresden {richtert, fettweis}@ifn.et.tu-dresden.de

More information

BER Evaluation of LDPC Decoder with BPSK Scheme in AWGN Fading Channel

BER Evaluation of LDPC Decoder with BPSK Scheme in AWGN Fading Channel I J C T A, 9(40), 2016, pp. 397-404 International Science Press ISSN: 0974-5572 BER Evaluation of LDPC Decoder with BPSK Scheme in AWGN Fading Channel Neha Mahankal*, Sandeep Kakde* and Atish Khobragade**

More information

An Empirical Study of Lazy Multilabel Classification Algorithms

An Empirical Study of Lazy Multilabel Classification Algorithms An Empirical Study of Lazy Multilabel Classification Algorithms E. Spyromitros and G. Tsoumakas and I. Vlahavas Department of Informatics, Aristotle University of Thessaloniki, 54124 Thessaloniki, Greece

More information

Context based optimal shape coding

Context based optimal shape coding IEEE Signal Processing Society 1999 Workshop on Multimedia Signal Processing September 13-15, 1999, Copenhagen, Denmark Electronic Proceedings 1999 IEEE Context based optimal shape coding Gerry Melnikov,

More information

A Connection between Network Coding and. Convolutional Codes

A Connection between Network Coding and. Convolutional Codes A Connection between Network Coding and 1 Convolutional Codes Christina Fragouli, Emina Soljanin christina.fragouli@epfl.ch, emina@lucent.com Abstract The min-cut, max-flow theorem states that a source

More information

Application of a design space exploration tool to enhance interleaver generation

Application of a design space exploration tool to enhance interleaver generation Application of a design space exploration tool to enhance interleaver generation Cyrille Chavet, Philippe Coussy, Pascal Urard, Eric Martin To cite this version: Cyrille Chavet, Philippe Coussy, Pascal

More information

An Efficient VLSI Architecture of a Clock-gating Turbo Decoder

An Efficient VLSI Architecture of a Clock-gating Turbo Decoder An Efficient VLSI Architecture of a Clock-gating Turbo Decoder Abstracts P Priyanka, P.G.STUDENT, Kakinada Institute of Engineering and Technology M Devi Satya Priya, Asst.prof, Kakinada Institute of Engineering

More information

Performance Analysis of Gray Code based Structured Regular Column-Weight Two LDPC Codes

Performance Analysis of Gray Code based Structured Regular Column-Weight Two LDPC Codes IOSR Journal of Electronics and Communication Engineering (IOSR-JECE) e-issn: 2278-2834,p- ISSN: 2278-8735.Volume 11, Issue 4, Ver. III (Jul.-Aug.2016), PP 06-10 www.iosrjournals.org Performance Analysis

More information

Non-recursive complexity reduction encoding scheme for performance enhancement of polar codes

Non-recursive complexity reduction encoding scheme for performance enhancement of polar codes Non-recursive complexity reduction encoding scheme for performance enhancement of polar codes 1 Prakash K M, 2 Dr. G S Sunitha 1 Assistant Professor, Dept. of E&C, Bapuji Institute of Engineering and Technology,

More information

Cost efficient FPGA implementations of Min- Sum and Self-Corrected-Min-Sum decoders

Cost efficient FPGA implementations of Min- Sum and Self-Corrected-Min-Sum decoders Cost efficient FPGA implementations of Min- Sum and Self-Corrected-Min-Sum decoders Oana Boncalo (1), Alexandru Amaricai (1), Valentin Savin (2) (1) University Politehnica Timisoara, Romania (2) CEA-LETI,

More information

RECENTLY, researches on gigabit wireless personal area

RECENTLY, researches on gigabit wireless personal area 146 IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II: EXPRESS BRIEFS, VOL. 55, NO. 2, FEBRUARY 2008 An Indexed-Scaling Pipelined FFT Processor for OFDM-Based WPAN Applications Yuan Chen, Student Member, IEEE,

More information

Quantized Iterative Message Passing Decoders with Low Error Floor for LDPC Codes

Quantized Iterative Message Passing Decoders with Low Error Floor for LDPC Codes Quantized Iterative Message Passing Decoders with Low Error Floor for LDPC Codes Xiaojie Zhang and Paul H. Siegel University of California, San Diego 1. Introduction Low-density parity-check (LDPC) codes

More information

A unified turbo/ldpc code decoder architecture for deep-space communications

A unified turbo/ldpc code decoder architecture for deep-space communications A unified turbo/ldpc code decoder architecture for deep-space communications Carlo Condo, Guido Masera, Senior Member IEEE Dipartimento di Elettronica e Telecomunicazioni, Politecnico di Torino, Italy

More information

On combining chase-2 and sum-product algorithms for LDPC codes

On combining chase-2 and sum-product algorithms for LDPC codes University of Wollongong Research Online Faculty of Engineering and Information Sciences - Papers: Part A Faculty of Engineering and Information Sciences 2012 On combining chase-2 and sum-product algorithms

More information

SINGLE PASS DEPENDENT BIT ALLOCATION FOR SPATIAL SCALABILITY CODING OF H.264/SVC

SINGLE PASS DEPENDENT BIT ALLOCATION FOR SPATIAL SCALABILITY CODING OF H.264/SVC SINGLE PASS DEPENDENT BIT ALLOCATION FOR SPATIAL SCALABILITY CODING OF H.264/SVC Randa Atta, Rehab F. Abdel-Kader, and Amera Abd-AlRahem Electrical Engineering Department, Faculty of Engineering, Port

More information

Turbo codes, MAP algorithms, serial concatenated codes

Turbo codes, MAP algorithms, serial concatenated codes A partial list of papers: Topics covered: Turbo codes, MAP algorithms, serial concatenated codes, turbo coded modulation, equalization, turbo equalization, trellis coded modulation, block coded modulation,

More information

INTERLEAVERS FOR PUNCTURED TURBO CODES. Institute for Telecommunications Research. University of South Australia. SPRI Building, Warrendi Road

INTERLEAVERS FOR PUNCTURED TURBO CODES. Institute for Telecommunications Research. University of South Australia. SPRI Building, Warrendi Road INTERLEAVERS FOR PUNCTURED TURBO CODES. Mark S. C. Ho 1, Steven S. Pietrobon 2 and Tim Giles 1 1 Mobile Communications Research Centre Institute for Telecommunications Research University of South Australia

More information

A Scalable Multi-Core ASIP Virtual Platform For Standard-Compliant Trellis Decoding

A Scalable Multi-Core ASIP Virtual Platform For Standard-Compliant Trellis Decoding A Scalable Multi-Core ASIP Virtual Platform For Standard-Compliant Trellis Decoding Matthias Jung, Christian Brehm, Norbert Wehn Microelectronic Systems Design Research Group University of Kaiserslautern,

More information

Parallel Computing. Slides credit: M. Quinn book (chapter 3 slides), A Grama book (chapter 3 slides)

Parallel Computing. Slides credit: M. Quinn book (chapter 3 slides), A Grama book (chapter 3 slides) Parallel Computing 2012 Slides credit: M. Quinn book (chapter 3 slides), A Grama book (chapter 3 slides) Parallel Algorithm Design Outline Computational Model Design Methodology Partitioning Communication

More information

Reduced Complexity of Decoding Algorithm for Irregular LDPC Codes Using a Split Row Method

Reduced Complexity of Decoding Algorithm for Irregular LDPC Codes Using a Split Row Method Journal of Wireless Networking and Communications 2012, 2(4): 29-34 DOI: 10.5923/j.jwnc.20120204.01 Reduced Complexity of Decoding Algorithm for Irregular Rachid El Alami *, Mostafa Mrabti, Cheikh Bamba

More information

A New MIMO Detector Architecture Based on A Forward-Backward Trellis Algorithm

A New MIMO Detector Architecture Based on A Forward-Backward Trellis Algorithm A New MIMO etector Architecture Based on A Forward-Backward Trellis Algorithm Yang Sun and Joseph R Cavallaro epartment of Electrical and Computer Engineering Rice University, Houston, TX 775 Email: {ysun,

More information

ERROR CONTROL CODING FOR B3G/4G WIRELESS SYSTEMS

ERROR CONTROL CODING FOR B3G/4G WIRELESS SYSTEMS ERROR CONTROL CODING FOR B3G/4G WIRELESS SYSTEMS PAVING THE WAY TO IMT-ADVANCED STANDARDS Edited by Dr. Thierry Lestable SAGEMCOM, France (formerly with Samsung Electronics) Dr. Moshe Ran H.I.T - Holon

More information