CHANNEL CODING FOR THE RELAY CHANNEL

UN'"l:k::i/fY OF HAWAII U8RARY

CHANNEL CODING FOR THE RELAY CHANNEL

A THESIS SUBMl'I"IED TO THE GRADUATE DIVISION OF THE

UNIVERSITY OF HAW AI'I IN THE PARTIAL FULFILLMENT OF THE

REQ~TSFORTHEDEGREEOF

MASTER OF SCIENCE

IN

ELECTRICAL ENGINEERING

May 2006

By DannyYee

Thesis Committee

Anders Hllst-Madsen, Chair N. Thomas Gaarder

Marc Fossorier

We certify that we have read this thesis and that, in our opinion. it is satisfactory in scope

and quality as a thesis for the degree of Master of Science in Electrical Engineering.

Chair

ii

ffA Qll1 .H3

no. 4075

Acknowledgement

I would like to thank my advisor Dr. Anders H~st-Madsen for always helping me

with whatever questions I had, as well as giving me a subject matter for my thesis. I

would especially like to thank Yige Wang for the LOpe decoder and generator matrices

and helping me with all of the questions I had about the code. I would also like to thank

Dr. Marc Fossorier for help with erasure codes, irregular codes, and serial concatenation.

Finally, I would like to thank Dr. N. Thomas Gaarder for help with some theoretical

issues as well as other miscellaneous questions.

iii

Abstract

The relay channel consists of an input, a relay, and a destination. The source

transmits XII. A relay, which receives that as Y2, will decode it and transmitx2 as a

function of Xu. Finally, the destination receives Xu, XI2, and X2 as)'3, which it decodes. If

done correctly, XI2 and X2 will sum coherently and the resulting)'3 will be superior to

simply sending the message straight from source to destination at 2x the power.

In this thesis, two cases are considered. The cases are full duplex relaying and

half duplex relaying. Full duplex relaying is where the relay can send and receive at the

same time. This is implemented using backward decoding, a form of superposition block

Markov encoding, as well as QPSK signaling.

The half duplex case is where the relay cannot send and receive on the same

channel at the same time. This is implemented using TDO. Code shortening is used to

achieve the relaying objective.

iv

List of Figures

Figure

Figure 1: A diagram of a relay channel.. ............................................................................. 2

Figure 2: The two cuts that are used to prove equation (4) ................................................. 7

Figure 3: Superposition Block Markov Encoding ............................................................ 14

Figure 4: The flow of infonnation through the two paths ................................................. 15

Figure 5: Comparison of BER vs. Ei/No for Relay and Direct transmission .................... 21

Figure 6: BER vs. Ei/No for the channel where both are operating at peak ...................... 26

Figure 7: BER vs. P. Trying to find the best value ........................................................... 30

Figure 8: Another graph trying to find the optimum value of Ii ....................................... 31

Figure 9: Example of serial concatenation ........................................................................ 35

Figure 10: The theoretical decoder that could not be implemented due to time constraints

................................................................................................................................... 36

Figure 11: An example of an irregular code ..................................................................... 40

Figure 12: Taking every other column of a parity check matrix ...................................... .41

Figure 13: Block Error Rate vs. Et/No for the first TOD case ......................................... .47

Figure 14: Plot for second case where K = 0 ..................................................................... 50

v

Table of Contents

Acknowledgement .............................................................................................................. iii

Abstract .............................................................................................................................. iv

List of Figures ..................................................................................................................... v

Table of Contents ............................................................................................................... vi

1. Introduction ................................................................................................................ 1

2. System ModeL ........................................................................................................... 4

2.1 Theoretical Capacity for Full Duplex ........................................................................ 6

2.1.1 Theoretical Calculation of ~ .............................................................................. 7

2.2 Theoretical Capacity for Half Duplex ....................................................................... 9

2.2.1 Theoretical calculation for K ........................................................................... 10

3. Full Duplex Relaying Implementation ..................................................................... 13

3.1 Block Markov Encoding and Backward Decoding ................................................. 1 3

3.2 Implementation Details ........................................................................................... 16

3.3 Results ............................................................................................................... 19

3.3.1 First Simulation ......................................................................................... 21

3.3.2 Second Simulation. .................................................................................... 23

3.3.3 Third Simulation ....................................................................................... 27

3.3.4 Determination of p .................................................................................... 29

3.4 Summary ........................................................................................................... 32

4. Half Duplex Relaying Implementation .................................................................... 34

4.1 Basic Idea .......................................................................................................... 34

vi

4.1.1 Serial concatenation at the encoder ........................................................... 34

4.1.2 Code Shortening ........................................................................................ 36

4.2 hnplementation Details ..................................................................................... 38

4.2.1 The Problem .............................................................................................. 38

4.2.2 An Attempted Solution. ............................................................................. 39

4.2.3 A Solution That Worked ........................................................................... 40

4.2.4 Recalculating ro ......................................................................................... 41

4.3 Results ............................................................................................................... 44

4.3.1 Theoretical Capacity Calculation .............................................................. 45

4.3.2 First Simulation ......................................................................................... 46

4.3.3 The Second Simulation ............................................................................. 49

4.4 Summary ........................................................................................................... 51

5. Conclusions, Possible hnprovements or Future Works ........................................... 54

5.1 Conclusions ....................................................................................................... 54

5.2 Possible hnprovements ..................................................................................... 55

5.3 Future Works ..................................................................................................... 57

5.3.1 Full Duplex ................................................................................................ 57

5.3.2 Half Duplex ............................................................................................... 58

6. References ................................................................................................................ 60

7. Appendix A: Source Code ....................................................................................... 63

vii

1. Introduction

Upon hearing the term relay, one might be tempted to think of the track & field

relay, where a runner runs 100m and passes a baton to the next runner, who runs for

another 100m. In some ways, the relay channel works similar to this. On one end, there

is a source, on the other end there is a destination, and in between there is a relay. The

source sends some information, which the relay picks up. Then the relay helps the source

send some additional information to the destination. This system of communication,

which was first studied by van der Muellen [14] has a wide range of uses, from cellular

networks, to ad-hoc networks, to sensor networks. The advantages of these networks are

increased capacity (compared to direct transmission) and resistance to fading, shadowing,

and path loss [15].

Although there are a lot of papers dealing with the theory and theoretical capacity

of the relay channel, there are only a few papers that deal with the implementation of a

relay channel [16, 17]. This thesis deals with implementations of the relay channel using

LDPC codes. LDPC codes were discovered by Gallager [18] and rediscovered recently

by MacKay [19]. They perform very well, in some cases coming within 1 dB of the

theoretical capacity.

The general idea of the relay channel is as follows: First, assume that the relay is

"closer" to the source than the destination is, and is "closer" to the destination than the

source is. The word "closer" is actually referring to the attenuation, or channel gain

between source, relay, and destination, which is usually, but not exclusively due to the

distance between the three nodes. Now the source transmits a message with enough

1

power that the relay can decode. Note that both the relay and the destination receive this

message. Naturally the relay can decode the message, but the destination, since it is

farther away, generally cannot. The relay then sends resolution information to the

destination coherently with the source, which combines what it received in the first half

with what it receives later to decode the message. The combination of the source and

relay sending the common information coherently makes them equivalent to a two-node

antenna array, so the advantages of an antenna array apply here.

Figure 1: A diagram ofa relay clumnel

The challenge in channel coding comes from the fact that none of the current

schemes truly extract the potential benefits of cooperating nodes in many cases. In

particular, what is called multihop in this thesis does not make use of source-relay

cooperation at all. This thesis considers two possible situations: The first situation is

when the relay is capable of full duplex, meaning it can send and receive at the same time.

The second situation is half-duplex, where it must spend part of the time receiving, and

part of the time sending. Even though they are very similar in what they try to do, each

situation requires very different approaches.

2

The full duplex relay method makes use of superposition block Markov encoding

(SBME) with backward decoding. 1bree different situations are considered. One case

where the direct transmission should clearly be better than the relay channel method. and

two methods where the relay channel method should definitely be better than direct

transmission.

The half duplex relay method will not require any sort of block Markov encoding.

It will make use of what is known as code shortening for transmission. Two cases will be

considered. One case is a typical situation where the relay is situated somewhere

between the source and destination. The other case is also one where the relay is between

the source and destination. except in this case synchronization between source and relay

is not necessary.

3

2. System Model

This thesis considers the Gaussian relay channel. The Gaussian relay channel is

given by Gaussian noise Z2 at the relay and Gaussian noise Z3 at the destination. Each

noise is an independent zero mean Gaussian random variable with variance N. In other

words, it is white Gaussian noise. As a simplification, assume that the noise variance of

Z:! and Z3 are the same. Thus the received signals at the relay node and the destination

node are given by

(1)

Zz and Z3 are set to unit variances (after normalization). In this thesis, constant channel

gains will be used, which means that e21. e31. and e32 are all non-random.

This thesis considers a technique for full-duplex transmission, and one for half

duplex transmission. Full-duplex transmission means that the relay can receive and

transmit at the same time on the same frequency channel. In general, this is difficult to

achieve if the receiving and transmitting powers are very different In the half -duplex

case, either time division duplex (TOO) or frequency division duplex (FDO) may be used

to fulfill the requirements.

For the TOO method, the time is divided up into two periods that mayor may not

be equal. The first period shall be called the relay receive period, and the second period

shall be called the relay transmit period. In the relay receive period, the sources sends the

first pan of the message, and the relay and destination both receive that message. The

relay is not transmitting at this time. The destination simply stores what it receives, while

4

the relay receives the encoded message, decodes it and uses the decoded message to

generate some resolution information.

The next period will be called the relay transmit period. In this period, the relay

and the source may cooperate to transmit the resolution information coherently (in

practice this is extremely difficult). The destination receives the resolution information,

and uses that in combination with what it received in the fIrSt period to resolve the

message. In some cases of eZI. e3I. and e32, the best idea would be for the source not to

broadcast at the same time as the relay. In that case, synchronization is not necessary.

In comparison, the FDD method will have split up the total bandwidth into two

parts. In one part the relay will only receive messages, and in the other part the relay will

only send messages. The source transmits over the whole spectrum, and the destination

will listen over the whole spectrum. The source must also send coherently with the relay

over the part of the spectrum over which they both transmit in common.

In the end, TOD was chosen as the transmission method. The reason TOD was

chosen is given in the paper [2], which states: "Clearly, from an information-theoretic

point of view, the TO mode and the FD mode are equivalent for the fixed channel gain

case. In fading channels, however, the TO mode has an advantage over the FD mode

because a can be adjusted to the instantaneous channel conditions, whereas ais usually

fixed in the FD mode."

Note that the problem of synchronization is not dealt with in this thesis. In

practice, it is extremely difficult to synchronize two separate crystal oscillators [4]. In

considering a model that includes a phase difference, one may use the asynchronous

5

channel model in which there is a random phase offset eli] between the source and the

relay. The resulting signal at the destination is [2]

(2)

Note that e is random and ergodic, and uniformly distributed between [-'It, +1t).

Although phase differences will not be dealt with in the simulation, note that they do

make a large difference in physical implementation.

2.1 Theoretical Capacity for Full Duplex

An upper bound on the theoretical capacity of the relay channel with only one

relay in-between is

(3)

The formal proof of this is given in [1, Theorem 14.10.1], which is a result ofa max flow

min cut algorithm. One can picture a sort of generalization of the proof as follows: The

fIrst term in the braces comes from the "cut" going thru 1 ~ 3 and 2 ~ 3. The second

term in the braces comes from the "cut" going thru 1 ~ 2 and 1 ~ 3. The capacities of

each term results in the equation

(4)

Pis the fraction of power PI sent directly to the destination only, and (1-/1) is the fraction

of PI sent to both relay and destination. Note that the noise variance for each part of the

equation is 1. This is the upper bound on the capacity, and is not a realistically

6

achievable bound, becanse it assumes that the relay and destination cooperate to decode,

and the relay and source cooperate to send all of the message, rather than just part of the

message.

FIgure 2: The two cuts that are used to prove equation (4)

If the relay node decodes and then re-encodes the message, then the capacity

becomes [3, Theorem 1]

In the simulations, the decode - re-encode method is used, so (6) is applicable to the

situation. The proof of this bound is given later. However, it is of the same form as the

physically degraded channel in [3]. A justification for this can be seen in [20].

2.1.1 Theoretical Calculation of P

In order to simulate the full duplex relay channel. a formula to fmd the value of f3

must first be derived. From the formula (6), it is seen that both terms within the braces

7

are convex. Thus, using that formula and the convexity, one could calculate a theoretical

value for p by setting the first term in the braces equal to the second term. However, that

simple formula fails to take into acount the requirement that O!S p!S 1 (& real) and O!S

JP !S 1 (& real) are restrictions on what values p can take. However, this can be taken

care of after doing the calculations. Using simple algebraic manipulation and the

quadratic formula, the theoretical value for JP turns out to be

(7)

This is only valid for the cases where JP is neither negative nor complex.

In order to determine what to do when JP is negative, it is helpful to consider

some numbers when it solves to exactly 0 and when it is slightly negative. When the

constants are declared in (7) so that JP = 0, then the power of the received signal at the

relay is exactly equal to the received power at the destination, as expected. In this case,

when p = 0, all of the power from the source is being broadcast to the relay and the

destination. However, when JP goes slightly negative, then pbecomes a positive

number. Since the formula in (6) assumes that JP is a positive number, this leads to

them no longer being equal. Thus, the method winds up allocating some power for the

source? destination only signal, even though the source? relay path is the limiting

path. The source? destination only signal is basically the part of the message that the

source will be sending to the destination cooperatively with the relay.

8

If the value of .J7i turns out to be negative, there is nothing one can do to make

the received powers at the relay and destination equal to each other. In the case where it

is negative, however, the ftrst term in the braces, the source ~ relay path, is the limiting

path. Thus if it is negative, then the best that can be done is to set .J7i = O. This also

takes care of the complex cases, because when .J7i is complex, it is also negative for all

of the ranges of c xy that are realizable. When .J7i is set to 0 instead of going negative or

complex, then the source ~ relay is still the the term that is limiting capacity, but the

source is allocating all of its power through that path, and that is the best that can be done

under these circumstances.

2.2 Theoretical Capacity for Ha1f Duplex

The capacity of the TDD relay channel is upper bounded by [13, Section 4] [2,

Appendix A]

(8)

with

In these equations ais the fraction of time spent in the relay receive period, and

(I-a) is the fraction of time spent in the relay transmit period. Theoretically, this acould

be changed at any time depending on the conditions of the channel, as mentioned earlier

9

in the thesis. Kis the fraction of power of PI that the source will use in the relay transmit

period, and (l-K) is the fraction of PI used during the relay receive period. During the

relay transmit period, (l-p> is the fraction of KPI'S power that is transmitted directly to

the destination as a different part of the message.

The upper bound contains the same terms and has the same situation as that of the

full duplex case. The cut for CI + uses both the relay and the destination to jointly decode

what the source sent during the relay receive period. This is obviously impossible. By

changing the formula to make use of the decode and forward method, equation (9)

becomes

Cj" = a ~ log(l+c~l (l-K)~)+(1-a) ~ 10g(I+(1-P)C;lK~) (11)

The theory involves transmitting two different signals to the destination, with one

of the signals involving the parameter of (I-P>. Thus, unless some methods are used to

avoid interference, the destination will have interference from the term with (1-P>. To

avoid this problem, the interfering signal will not be sent at all by setting P = 1.

2.2.1 Theoretical calculation for K

Now that Phas been eliminated as a variable, two parameters are left to solve for,

aand K. It is not so simple to optimize over two parameters with only one equation

using both of them. The usual solution to this type of equation is to solve for one

parameter in terms of the other parameter. This is not very useful. Practically, in order to

simplify the simulations, a is set to 'Iz. This has the advantage of making the calculations

10

slightly easier. In addition, as shall be seen later, it simplifies the method used to achieve

TOD transmission.

After the elimination of the other parameters, the only paramter left is" Thus,

equations (11) and (10) become

C.- = ! log (1 + ci. (I-rc)ll) (12)

and

(13)

Now, all that needs to be done is set equation (12) equal to equation (13) and solve for

.JK: in terms of the other variables. Naturally, this does not take in to account the limits

on .JK: , where 0 '.5..JK: '.5.1 and it must be real.

If solved symbolically, .JK: will be found to be the solution to a fourth order

polynomial equation. The solution is much too long to reproduce here, and it would not

make much sense even if it were here, but the equation to solve for it (symbolically) is

reproduced here

( c:1ll + C:2P2 + c:,c:211~ - ci.ll) + ( 2c3.C32~ llP2 + 2c:.c3211 ~ ll~ ).JK: +( cj.1l2 -c:.c:211~ +ci.ll)rc-2c:.c3211~1l~rcX -cj.1l2rc2 = 0

In this case, it is best to solve the equation QX4 +bx' +d +tix+e = O. And set a, b, c, d,

and e equal to the corresponding terms in the above equation.

Now, the limits on .JK: must be taken into account. As in the full-duplex JP

case, .JK: can be negative and/or complex. In addition, it can also be greater than 1.

11

(14)

Unlike in the case of JP ' JK is not necessarily complex when it is negative. Some

rules were laid out to try to account for these cases. First, if it is complex, then only its

magnitude is taken, and its phase is ignored. The negative case is taken care of by setting

any negative value equal to zero, as it was in the full duplex case with JP. In the case

where JK> I, set JK= 1.

12

3. Full Duplex Relaying Implementation

3.1 Block Markov Encoding and Backward Decoding

For full duplex relaying, the backward decoding method suggested by [2, pg. 4],

which comes from [5] based on [6] is used. Here backward decoding is briefly

summarized. Backward decoding is a form of a more general Superposition Block

Markov Encoding (SBME). First, some things must be set up. Let a message stream be

denoted by w[l], w[2], ... , w[N] E {1, ... ,2nR}. The messages are transmitted inN+l

blocks each of n symbols. This gives a rate of -f+r R, but as N 7 DO an asymptotic rate of

R is obtained.

The method of SBME is as follows: During block b. h=1.2 ... . .N. the transmitters

send to the receiver a certain amount of ''fresh information", but with a rate too high to

allow for reliable decoding. Because there is ambiguity left for the receiver, the

transmitters in the next block cooperate to send "resolution information" to the receiver.

Superimposed on this resolution information, the transmitters again send high-rate fresh

information.

Decoding will use the idea of backward decoding. The method is as follows: The

receiver starts decoding only after all N + 1 blocks have been received. In block N + 1

only resolution information is sent, so it is readily decoded. The resolution information is

used to help decode the high rate ''fresh information" in block N. Simultaneously, the

decoder is able to decode the resolution information in block N, which is used for N -1.

This continues until the message has been fmally decoded at block 1.

13

FRESH f-

FRESH I-INFORMATION INFORMATION

~ ~ 4

~ RESOLUTION 4 RESOLUTION 4- RESOLUTION INFORMATION INFORMATION INFORMATION

blockNl blockN block l\FI-l

Figure 3: Superposition Block Markov Encoding

In the proof of this method in [6], they use the idea where n ~ 00, however, since they

also use the fact that N ~ 00, they must condition convergence on n increasing faster than

N.

In the case of the full duplex relay, the SBME method is used as follows: The

encoder uses two codebooks, each with power 1: X2(W) and XI(WIoW2). The source

transmits

The relay receives )'2[ll = C2IXI[ll + Z2[ll, but uses only the

C21~(I-P)P. XI (w[i), w[i -1]) part of the signal to decode the message.

Assuming a successful decoding of w[ i-I], if

(16)

then the relay can decode w[ll from )'2[ll. After decoding the relay transmits,

(17)

14

Note thatx2[i+l] is due to the time delay when going from source ~ relay then relay ~

destination.

At this point, the destination node is receiving Y3 [i] = C31~ [i]+C32X2 [i]+ Z3. This

is equal to

The received power is the magnitude of the two parts. Assuming the destination has

successfully decoded w[/l, then it can decode w[i-l] fromY3[ll if

(19)

Thus the capacity of the channel is the maximum over all P of the smaller of the two rates.

This is a sort of proof of the above equations, specifically equation (6). This encoding

method is what is used in the implementation, as all of it was suggested by Prof. H~st-

Madsen.

Figure 4: The flow of iDformation throngh the two paths.

The left cot represents patb In (16). The right ent represents the patb In (19)

15

3.2 Implementation Details

In the setup. all of the variables were declared beforehand, specifically C2t. C3t. C32.

PI. P2. and the noises. From these variables. the theoretically optimum numerical value

for P is calculated. Using this value of P along with an encoded binary message. the

source sends the following non-circular QPSK signal

~ [i] = ~PIl (_I)c(w[I-I]l + jJ(I- P)1l (_I)c(w[I-I]E&w[I]

This signal is a realization of equation (15). The relay takes the imaginary part of the

signal Y2 [i] = C2lXI [i] + Z, [i]. and decodes it to obtain W. which is an estimate of

w[i -1] EIlI w[i]. It then obtains an estimate ofw[/l as w[i] = wEIll w[i -1]. with the

exception of i = 1. where W[I] = W. The imaginary part of Y2[11 is

Im{Y2 [i]) = Im{C21~ [i] + Z, [in

= czIJ(I- P)1l (_I)c(w[i-I]E&w[i] + Im{ Zz [in

Note that the received power of this signal follows the power portion of the capacity

equation (16).

During block i. the relay transmits Xz [i] = Fa (_I)c(lI{l-l]l. The destination

receives

which is

Y3 [i] = C31

..J7iif (_I)c(w[H]l + C32Fa (-1 ),(lI{H]l

+ jC3IJ(1- P)1l (_I)c(w[H]E&w[I]l + Z3 [i]

16

(20)

(21)

(22)

(23)

When the k-th element is taken and transfonned to polar form, the following equation is

the result

(24)

From this equation, one can see that if }'J[i)k is rotated by the phase angle given by the

exponential, then it becomes a BPSK signal with magnitude given by

detennined by w[/l. Note also that the magnitude of the received signal, when squared,

matches the power component of equation (19), thus the theoretical P calculated in (7) is

expected to be exactly equal, or at least very close, to the experimentally acquired value

ofp.

From equation (24), one can see that the fonnula for the phase difference, which

is called e, is determined solely by the channel constants, e32 and e3h as well as Ph P2,

and p. Therefore e can be calculated beforehand, and at the destination the received

signal will be rotated by ±e, with the ± depending on ekCw[ll), so that it will be a BPSK

signal. After rotating and removing the imaginary part of the noise, the received signal

can be decoded to find w[i-l). Thus, as one can see, w[i-l) depends on w[/l, so this

method uses backward decoding for implementation. The steps involved in transmission

are summarized below:

17

First assume that the transmission powers are at least the minimum necessary for good

transmission.

1. During block 1. source transmits jJif (_I)'(>{I]). The relay is able to decode it

because C21PI > C21(l-{J)Pt. assuming that C21(l-fJJPI is the minimum power for small

p(error) at the relay.

2. During block i = 2 •.. . .N. the source transmits the QPSK signal

~ [i] = ~ fJ~ (_I)c(w[I-I]) + j~(1- fJ)~ ( _I)c(>{I-I]E9w(I]). The relay receives only the

imaginary part of the signal j~(1- fJ) ~ (_I)c(>{I-I]E9w(I]) .

3. The relay decodes the signal w[i-I]E9w[i].and E9 it with the w[i-I] from the

previous block to get w[ll. Then it re-encodes the block c( w[i]).

4. In the actual implementation. however. it does not do the re-encoding. because the

result of the decoding is c( w[i -1] e w[i]) = c( w[i -I])e c( W[i]). so doing the

addition, c( w[i -1] E9 w[i]) E9c( w[i -1]) = c( w[i]). Thus. no re-encoding.

5. During block N+I. the source transmits the signal Jif (_I)'(>{N]) .

6. Also. for i = I •... .N. the relay transmits ~ [i + 1] = ~ (_1)'("[1].

18

7. The destination receives Y3 [i] = C31~ [i]+cnXa [i] + Z3 [i]. AU = N+I, the destination

decodes it like a BPSK signal. At i = I, ... .N, the destination rotates the received

signal by ±e, with e given by (24), and ± determined by the previously decoded

crJw[i+I]). (k subscript refers to the k-th element of the codeword) Then the

destination takes the real part, thus removing the imaginary part of the noise, and

decodes it like a BPSK signal.

It is important to note that in the implementation for the relay, decoding and re

encoding is not actually being done. One of the results of belief propagation decoding is

that the whole codeword is the result, and the message bits are derived from that. Thus,

rather than discard the original parity bits and re-encode the message bits, the

intermediate step may be skipped, and the codeword that is output from the decoder is

simply reused. This has the benefit of speeding up the simulation, since the message

block is not being multiplied by the Generator matrix at the relay.

The same idea is also used at the destination. The destination requires a decoding

and re-encoding of the next block in order to decode the previous block. Using this

method, it is no longer necessary to have either the relay or the destination do any

multiplication by the Generator matrix. Thus, the requirement that both the relay and the

destination need to have the Generator matrix in their memory is relaxed.

3.3 Results

For the full duplex relaying with fixed channel gain, simulation is done in Matiab,

with the computationally intensive parts written in C. For all of the simulations, PI = P2

19

is used to simplify things, because C32 could just as easily be modified as P2, so C32 was

chosen. For all of the simulations, the noise was also set at the relay and the destination

to be Ncn = Noo = 2, so that Noxl2 = 1.

The code being used for this simulation is a rate ~ LOPC code received from

Yige Wang, one of Prof. Fossorier's students. Upon further research, it was found that

this particular code is from Prof. David MacKay's database on his website [8], in

particular the Gallager code of rate ~ labeled {8000.4000.3.483(N=8000, K=4000,

M=4000, R=O.5)}. Also, some C source to do Belief Propagation decoding was received

from Yige. The author of the original version was Jinghu Chen, also one of Fossorier's

students who wrote the source code for his dissertation [9]. This source was modified so

that it could access Matlab variables and a lot of useless code was trimmed. After the

modifications to the code were complete, some trial runs were started, to see how long a

simulation would take. According to measurements taken, a set of 40 000 blocks would

take about 1 day to simulate on a computer with a 1.5 GHz Pentium 4 CPU with 512 MB

of reasonably fast RAM.

To convert a parity check matrix (H matrix) into a generator matrix (G matrix) a

program was written to put an H matrix into reduced row echelon form, then output it in

MacKay's "alist" formaL In addition, a Matlab script by Igor Kosintsev [7] was used to

turn files from alist format to Matlab sparse matrix formaL Thus, the G matrix was

resident in Matlab's variable memory. Unfortunately, in the end, the use of certain

methods meant that it wasn't even necessary to use the G matrix anywhere. However, for

the sake of reference, the use of the source code is included, since the G matrices may be

of importance, and the original source did use a G matrix. In addition, the simulation of

20

the multihop method with full duplex relay (described later) required the use of a \4 rate

G matrix. So, the sources were of use after all.

3.3.1 First Simulation

For the first set of simulations, an "equilateral triangle" of sorts was used, by

setting C21 = C31 = C32 = 0.8. With these channel parameters, the limiting path is not the

source ~ destination, but source ~ relay. Here is a summary of the fixed parameters:

N(1J. = Nm = 2 ~ Nr/2 = 1 E- This will always be true in the simulations.

C21 = C31 = C32 = 0.8, p= O.

FtiI dL!>"'" ... = 0.8. "" = 0.8. c... = 0.8

10· E':E':~:C§:~:§",:"§": §"": :E':~' ~~ ':C§:~:"§":"§": ~!"_: 'E':~:~:C§:~"!!"§":§"": :E':~:"'~cC§:~:F 1- "'--==:ReIa"'-~-Y"'-1i~r~,;;~;s.~;,ISsi~-2on':'-=-:;;:-:;;:->:-;;:'-;:;-;:;-43" :::: ::::::: ~::::: ~::::: ::::::t::L -e- DIrectTransmtssJcnl1/4 rata cpsKlI: _____ 1 _____ .J _____ .1 _____ 1. _____ L. _____ L ____ -1 _____ .J _____ J. ____ _

I , I I I I I I I

-----~----4-----~-----.-----~-----~----4-----~-----.-----I

k.. I I I I I I I I I

• ty I I I I I I I I : ~ I

_ 10 m; ~ ~ ~ ~i~ ~ ~ ~ ~ H ~ ~ ~ ~; ~ ~ ~ ~ ~ H ~ ~ ~ ~ ~ ~ ~ ~ ~ n ~ ~ ~ ~ ~~ ~ ~ ~ ~; ~ ~ ~ ~ ~~ ~ ~ ~ ~ ~ G:I - - - -1- - - - - -1-- - --"f---- - t- -- - - -I" - - -- -1- - - - - -1- - - - - ... - - - - - - - - --1iiI 'I I , I , , I I

215 -2 I I I , I I I I I

f 10 ~ ~ = ~ ~!~ ~ ~ ~ ~ ~ ~ ~ ~ ~ ~ ~ ~ ~ ~ ~ ~ f ~ ~ ~ ~ ~ ~ ~ ~ ~ ~ ~~ ~ ~ ~ ~ ~~ ~ ~ ~ ~ ~ ~ ~ ~ ~ ~ f = ~ ~ ~ ~ I! -- --,-----,--- --'-----r -----r -- ---,- -- ---,- -- --,- ---- T - - - --

~ -----l- ----:-----: -----: -----: -----: ----~-----: -----~ -----! 10" ~~ui~mm~~m~mm~mm~m~~m~m~m~~m ==='

---- ~----'-----l-----r-----r-----~----~-----l-----r-

====~=====~=====1=====f=====~=====f====~=====1=====f= 10~ ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r

- - - - -.- - - - -., - - - - - , - - - - - r - - - - -; - - - - -r - - - -.,- - - - -, - - - - - T - - - --____ ••• ____ J _____ 1 _____ L _____ L _____ L ____ J _____ J _____ ! ____ _ • • • I • ttl 1

-8 • 1 1. • 1

10_1~--~~~S~--~0~--~0~.5~--~--~1~.5~--~2~--~~~5~--~3~--~a~5~~4· EJN. (cB)

FIgure 5: Comparison of BER YS. ElINo for Relay and DIrect transmission

21

In this case, p = 0 because all of the power will be sent to the relay, since the

source -+ relay section is the limiting part of the channel. Since that is the limiting part,

it is possible to reduce the power coming from the relay, P2, and still have the same

capacity. However, it was decided earlier to set the transmitting powers PI and P2 equal

to each other. so that situation is ignored.

Unfortunately, the theoretical capacity of the channel for the full duplex case is

not calculated and plotted, because this is a QPSK system, and there has not been an

analysis of the capacity of QPSK modulation for the full duplex relay channel. Note that

since p = 0, no power is transmitted in the real part of the signal being broadcast from the

source.

Within the graph, the simulated relay transmission values are compared with the

simulated direct transmission values using QPSK. A detail of importance is that the

direct transmission values that are being compared to in the graph are of a 'A rate code

being transmitted using QPSK modulation. The reason that the full duplex relay method

is compared to the 1.41 rate QPSK code is because that is the fairest comparison that can be

made. The method used in this full duplex case meant that 4000 bits were transmitted

over 8000 transmissions. In addition, it used twice the bandwidth of a BPSK signal. In

order to match that, a 'A rate code was used, but half of the bits were transmitted over the

imaginary part of the signal. Since an analogue caonot be drawn between the original

rate li2 code and the rate 1A code, this comparison is not perfect.

From the graph of direct transmission, it is apparent that the relay transmission is

much worse than the direct transmission for the particular case of an equilateral triangle.

This is unfortunate, but perhaps not unexpected. As was stated earlier, power from the

22

relay, P2, could have been decreased and the capacity would have remained the same.

Thus, it would have been better to compare a direct transmission to full duplex relaying if

PI = P2 was not one of the conditions.

However, the fact that the direct transmission is over 4 dB better than the relay

transmission cannot be explained by the differences in the code. or the fact that the

allocated power was not optimum alone. Thus, this method is not ideal for the situation

where the direct transmission path is comparably close to the relay paths. The next

simulation demonstrates a situation where the relay transmission is much better than

direct transmission.

3.3.2 Second Simulation

In the second simulation, a situation where p > 0 is simulated, so that the source

must transmit part of its power to the destination only. The following parameters were

used when setting up the program: e21 = 0.8, e31 = 0.2, e32 = 0.7. pwas calculated to be

0.0625, meaning that most of PI'S power will be used to broadcast to the relay and

destination, but a little bit of it will be transmitted only to the destination. Also within the

graph, a simulation of multihop transmission is plotted.

3.3.2.1 Multihop

The method of multihop is considered the more "traditional" relaying method. In

multihop. the source sends a message to the relay, then the relay transmits that same

message to the destination. It is similar to a cascade of two direct transmission channels.

Where it is different from the cascade of two channels is in the interference coming from

the source to the destination. During i = 1, the source transmits UI[1] which the relay

23

receives as C2IUI[lj. The destination receives c31UI[lj, which it discards. During i = 2,

the source transmits uI[2j, which the relay receives as C2IUI[2j. At the same time, the

relay is sending al[lj. Thus, destination receives C32 a 1[lj + C3IUI[2j. Evidently, the

uI[i+lj is an interfering signal at the receiver. The ideal case would be for C31 = O. In

that case there would be no interfering signal, and multihop would be performing at its

best.

In the following graph, the plot of multihop does not use simulation values in the

strictest sense, in that the simulation involved sending a signal from the source ~ relay,

and then sending a signal from the relay ~ destination along with an interfering signal

from the source ~ destination. Instead, the simulation makes use of the fact that the

transmission from source ~ relay is identical to a direct transmission. Thus, the results

from the direct transmission simulation can be reused.

As for the second part, the transmission from relay ~ destination, that is not

similar to something that has already been simulated in this thesis. However, it is not so

simple to simulate the second part. First, the power from the relay ~ destination must be

increased to overcome the interference from the source. Therefore, P2 > PI. The exact

c2 N /2+c2 p. formula is P2 = -¥-~ 03 / 31 1 • The f1l'St ratio before PI takes care of the channel

C32 N03 2

gain differences between source ~ relay and relay ~ destination. The second ratio after

PI takes care of the interference caused by source ~ destination. It assumes that the

source interference is a Gaussian random variable when in fact, it is a Bernoulli random

variable. This should not make too much of a difference as long as the interfering term

remains small.

24

Unlike in the second graph, multihop was not compared to the simulated values in

the first graph because that graph used an "equilateral triangle", and the channel gains

were all equal. Thus, the interference from the next transmission would have required

that the relay increase its power to compensate. This will make it perform worse than the

direct transmission, so it was left out. In the second graph. the relay channel is compared

to the multihop channel as well as the direct transmission channel.

In order to make as fair comparison as possible, the encoding method for

multihop used a \4 rate code with QPSK modulation. This is due to the aforementioned

fact that the encoding scheme for the relay channel uses real and imaginary parts to send

signals. Unfortunately, this also means that a different code will have to be used for

comparison. Again, this causes the simulation to not be a completely fair comparison.

25

FUI ~Iex .., = O.B, '31 = 0.2, '30 = 0.7 t~~~='=~~~~~~~~~~~~~~~~~~~~~TI ~ ~ ~ ~ ~:~ _ ~ ~ ~ ~~ ~ ~ ~ ~:! ~ ~ ~ ~ ~! ~ ~ ~ ~ ~ t ~ ~ ---R*y-Tr~on- - - --

·1 to

10"

::: ==:: ::::~:::::~::::: ~:::: :~:: ---*- DIrectTransmlsskn(114 rata (pSl() _____ , _____ " _____ " _____ • _____ c __ -e- Ml.iIIhcp (114 raID CPSK)

- - -q -:- -- - - ~--- - - ~ - - - - - ~-- - - - ~ - - - - -1- - - - - -1- - - - -"1- - - - - t' -- - --

: : : : : : : : :~ ~ ~ ~ ~ ~:~ ~ ~ ~ ~ ~ ~ ~ ~ ~ ~ ~ ~ ~ ~ ~ ~ ~ ~ ~ ~ ~ ~ ~ ~ ~ ~ ~ ~:~ ~ ~ ~ ~ ~:~ ~ ~ ~ ~ ~ ~ ~ ~ ~ ~ ~ = ~ ~ ~ ~ ---- :~~~~~3~~~~~~~~~~~f~~~~~E~~~~~~~~~~3~~~~~3~~~~~1 ----

' _____ 1 _ ____ l ___ __ 1 _____ l _____ , ______ , _____ J _____ 1 , , I I I I I , I 1----- -----4-----+-----~-----~----_1-----~-----t'

I I I I I I I ,

=====-==== '=====~=====i=====~=====~====~=====~=====~=====. ---- ---- ::::=l=::::t:::::C:::::C::::~:::::J:::::I: --------, -----'-----r-----r-----r----'-----'-----T-:::::1 :::::J:::::I:::::[:::::C::::~:::::J:::::I: 1 1 1 1 1 1 1 1

----~ -----'-----r-----r-----r----~-----l-----T-____ -1 _____ ~ _____ ~ _____ ~ _____ ~ ____ ~ _____ ~ _____ ~_

" , 1 1 1 1 1 1 , ,

=====' ====::$ ====:1=====f:=====t:=====':::====:::1:::====:1:::===:::±::: :::== :::::::::::: ='- =::::: :::J :::: = :: :J :: = = =:: 1: :: = = = = c:::::::: ::: :::1::: ::: = = =:::1= = =:: =:J = = = = = % ::: = :: :: :: - - - - -1- - - - - -1 - - - - ... - - - - - .. - - - - - t- - - - - -1- - - - - -1- - - - - ... - - - - - + - - - - ------r----' ----'-----T-----r-----r----'-----'-----T----------~ ---, ----'-----r-----r-----~----~-----l-----T-- -------r ---, ----'-----T-----r-----r----'-----'-----T--_____ 1 ____ J ____ .1 _____ J. _____ L _____ 1 ______ , _____ .1 _____ .I. __

I I 1 I I 1 , 1 I

- - - - -:- ~ - - - ~ ::: = = :: ~ = = = :::: ~:::: = :: :: t:::: =:: ::::: :: :: ::: :: ~= =:::::: ~ :: =::: =::: ± = = =_00_' =::::::::~::::::::~ ----~-----+-----~-----~----~-----~-----+--=====~====~=====~:::====i=::::~:::==C::==~==::::::::~=====i== -----r----'-----'-----T-----r-----r----~-----'-----T-------r----'-----'-----r-----r-----r----'-----'-----T-- __ _ -----r----'-----'-----T-----r-----r----~-----'-----T-----_____ L ____ J _____ J _____ l _____ L _____ L ____ J _____ J _____ l ____ _

1 1 I 1 I I I I , I 1 I I I ..

10 2:---~3-----4~---!5----~e~---=7----~B~---=9----~tO~--~t~t--~t2

EJN. (ell)

Figure 6: BER vs. E!INiJ for the cluumeJ where both are operating at peak

Unlike the first situation of an equilateral triangle, in the second case all of the

power from the source and relay are required to do a proper decoding. Once again, for

this case the theoretical capacity is not calculated or plotted, so that comparison is not

made in this graph. This differs from the half-duplex case in section 4, where

comparisons between the simulations and the theoretical capacity are made,

Note in the above case that using direct transmission is much worse than using the

relay, although it is not a completely fair comparison, since the relay channel uses a

different code than the direct transmission channel. It is obvious that the difference is

primarily due to the path between source and destination, C31, being very bad compared to

26

the other two paths. so the direct transmission route will require more power to overcome

the attenuation. This means that using the relay to transmit a significant portion of the

transmission in cases similar to this can result in gains of over 5 dB.

The multihop capacity is obviously much better than the direct transmission

capacity. due to the quality of the other paths. What is surprising is that the multihop plot

is better than the relay channel plot. Part of the difference may come from the fact that

the 1,4 rate code is slightly closer to its theoretical capacity than the Y2 rate code. but that

hardly accounts for the fact that multihop is 1 dB better than the relay channel. The relay

channel should be better than the multihop channel due to the fact that there is no

interference from the source 7 destination path in the relay method. However. both

methods are still much better than direct transmission.

3.3.3 Third Simulation

In the previous simulations. the relay channel method used in this thesis was not

as good as the other methods used. namely. direct transmission and multihop. This next

simulation is one where the channel gain constants are set so that the source and relay are

close together. and the destination is far away. This situation is somewhat like an

antenna relay. so the benefits of an antenna array can be applied directly to the part of the

signal that source and relay transmit coherently. This method is expected to clearly

demonstrate the advantages of the relay channel.

27

FUI 0..>_ '21 c 0.8. '31 c 0.2, ... c 0.2

t~~~~~~~~~~~~~~~~~~~~~~~~~TI mm~m~~m~~Hm~mHm ==~::;I=-(~~-~~~

.j

to

to"

- - - - - - - - - -"- - - _ - - - - - - - "-- - - - -- - -;>- MLillhq) (114 raIa (pSI<) I ,

----- ----~-----------~-----------~--------- -----------, , , I I I I

====== ===~===========~===========~= ========~ ===============--==;:==========~===========~= ========~--========= ::::::: ::J:::::::::::r:::::::::::c:: :::::::J: ::::::::: ----------~-----------~-----------~-- -------~-- --------_________ J ___________ L ___________ ~ _________ J _________ _

I I I I

----------~-----------+-----------~--- ------~--- ----.--I I I I I t I I

~~~~~~~~~=~~~~~~~~~~~~E§~~~~~~~~~~~§§§§§ §§§§5§§§§: ===== ---------- -----------r-----------r----- ----,----- -----:::::::::: :::::::::::I:::::::::::C:::::: :::J::::::-:::: I , I I ----------, ----------r-----------r------- --,------ ----__________ ~ __________ ~ ___________ ~ _____ ••.. 4 _________ _

I I I I I I I I

==========~= =========t===========~:======= =j:======= =_ ==========~= =========t===========C========_=~====:=== == ----------~- ---------.-----------~------ .. - ~-------- ------------,-----------r-----------r--------- ,--------- -----------,-- --------r-----------r--------- ,--------- -----------,-- --------r-----------r---------- --------- -__________ J ___________ ~ ___________ L ___________________ _

, , , I I I I

----------~--- -------.-----------~----------~ ---------==========;==:-=======;===========~==========~-========== ::::=::==:~::::====:=:;:::::::::::~::::::::::~: ::::::::: ----------~-----------r-----------r----------~- -------------------,-----------r-----------r----------,---------------------~-----------r-----------r----------,-----------__________ J ___________ 1 ___________ L __________ J __________ _

I I I I I I I I ..

t09~~--------~t~0---------1~as~--------~tt~------~t~t.~6--------~t2

EJN. (cB)

As expected, the graph shows that in this particular case, the relay channel as

implemented here is better than direct transmission and multihop methods. According to

the graph, it is about 1.5 dB better. This is naturally due to the fact that the source and

relay are sending the real part of the QPSK signal coherently, so that part of the signal

benefits from the antenna array-like configuration of the relay implementation. As

expected, the direct transmission method is better than the multihop method because

multihop must use additional power at the relay to cancel out the interference coming

from the source 7 destination path.

28

3.3.4 Determination of P

While simulating the above scenario, a discrepancy was noted in the time it was

taking to decode at the relay and destination. It seemed that the destination was taking

somewhat longer to decode its part of the codeword than the relay, meaning it had a

lower SNR. In practice, the discrepancy was not necessary something to be concerned

about, but its investigation did lead to a discovery about fl. The value of P was tested to

see if the optimum p was being used. So a simulation was set up with the following

parameters:

PI =P2= 1.24

e21 = 0.8, e31 = 0.2, e32 = 0.7

Then flwas varied slightly. The results are as follows:

29

Eflec\S of varylnQl s1gh11y: .., = 0.8. "" = 0.2, ... = 0.7 10" ,-------.---,-----,------.---,-----,------, -------i-------1--------r-------r------- I-------~---------------T-------,--------r-------r-------,-------,--------

-------~-------~--------~-------~-------~-------~--------I , , , I I

-------i-------l--------r-------r-------l-------~--------_______ ! _______ J. _______ L _______ L _______ J _______ J _______ _

I I I I I I r , I I I I

-------i-------,--------r-------r-------l-------~--------I I I I I 1 _______ 1___ _ _____ L _______ J ________ I _______ _

, The theoretical : : : , , , value of fJ· - ----- ~ ------ -1- - - - - - - -:- - - - - - - -, , , , , , ,

I I I I I ___ ! _______ J ________ L _______ L. ______ J____ _ _______ _ _ _ ! _______ J _______ .1. _______ L _______ J _ _ _ ___ .'. ______ _

_____ .! _______ J ________ L _______ L _______ J ______ ~ _______ _ , , , , . , ------- ------~--------~-------~-------1 ------4--------_______ 1 __ ____ J ________ , ________ L _______________ , _______ _ I , I I I

------_£_--- -~--------~-------~------ ~-------~--------I I I I _______ l _______ J ________ L_______ ___ __J _______ J _______ _ I I I 1 1 I 1 I I 1 I I _______ ~ _______ J ________ L _______ L _______ J _______ ~ _______ _

I I I I I 1 I 1 I 1 I 1 I I 1 1 _______ ~ _______ J ________ L _______ L _______ J _______ ~ _______ _

I I I I I 1 I I I I , , , , , , ..

to o.oL:-8---:c0-:'.065c:c:-----:O-':.r11=---O::-.076~---0::-.':::08:-----::0-:'.085':::-----:O:-'.09:::---::-:0.a!5

FIgure 7: BER vs. p, TryiDg to find the best value

The first point at the far left of the graph is the theoretical value of p.

The experiment was repeated with a different value for en. The parameters below were

tried:

e21 = 0.8, e31 = 0.2, e32 = 0.6

Then p was varied, simulated, and the BER was found. These are the results:

30

Elfeclll of varyIn!ll sightly: .., = 0.8. '91 = 0.2. '32 = 0.8 10" ,----,--,----,--,------,,----"T'--,----,--,------,

-----~----1-----1-----r-----r-----~----~-----1-----i----------r----'-----'-----r-----r-----r----'-----'-----T----------~----~-----~-----.-----~-----~----~-----~-----+-----

-----~----~-----1-----t-----~-----~----~-----~-----+-----_____ 1. ____ J. ____ J _____ 1.. ____ L ____ .1. _____ , _____ J. ____ L ____ _

I I I , I I I I I I , , , I I , I I

-----l-----'-----l-----T-----r-----~----~-----'-----r-----I I I I I I I I I _____ ~ ____ J _____ J _____ 1 _____ L _____ ~ ____ J _____ J _____ ! ____ _ t I I I I I I I I 1 I I I I I I I I I I 1 I I I I I I I I I I I I

-----~----l-----'-----r-----r-----~----~-----,--- -y-----I I I I I

: The theoretical :: :, , , , valueofp. ' , ____ .'. __ _ ____ L ____ .'. ____ .1. ___ J _____ 1 ___ _

_ _ _ _ _ 1_ _ _ _ ____ L _____ 1 ______ 1 ____ J ____ .1 •. __ _ _ _ _ _ _ 1 ____ J _____ J _____ 1. _____ L. ____ 1 ______ , ____ 2 _____ 1 ____ _

1 I 1 I 1 1 'I

-----~ ---~-----1-----.-----~-----~---- -----1-----~-----_____ 1 ____ J _____ J _____ ! _____ L _____ L ___ J _____ J _____ 1 ____ _ I 1 , I I I I 1 ____ t _____ ~ _____ 4 _____ ~ _____ ~ _____ t ____ ~ _____ ~ _____ ~ ____ _

I 1 I I t I I I 1 ____ L ____ J _____ J _____ ! _____ L _________ J _____ J _____ ! ____ _ I 1 1 I I 1 1 1 I 1 1 I I I I 1 1 I ____ L ____ J _____ .J _____ 1. _____ L ____ 1 _____ J _____ J _____ 1 ____ _

1 1 I 1 I I 1 I I ....,.--- I I 1 1 I I I 1

I 1 1 1 1 I 1 ____ 4 _____ 1 ____ L _____ I _____ J ____ .J _____ 1 ____ _

1 1 I I 1 1 I 1 I I I I I 1

I I I I I 1 I I 1 I 1 I 1 I , "

FIgure 8: Another graph trying to find the optimum value of Ii

Again, at the far left is the theoretical value of p. According to both of these graphs, the

actual p is very close to, but not quite equal to the theoretical value. Since the BER of a

given code is a function of the received SNR, if a small, but constant bit of power was

added to the destination side of equation (6) before solving for p, then the actual value of

pmay be solved for. In other words, a constant, J, will be declared and used in equation

(6) as follows:

Thus, the new equation for pwill be

31

(26)

Using the graphs of pvs. BER, the estimated value of (j - 0.02 to 0.03. There is some

evidence that (j is a function of some of the rest of the variables, and is more than just a

constant. However, fmding an actual formula for the best value of (j will be a very time

consuming task.

The simulations have not been performed to fmd the formula for the exact value

of (j for this case. However, it is not critical for the simulation, due to the fact shown in

the above graphs that the theoretical value of P is very close to the experimentally

determined optimum value of p. So, the non-optimum results obtained in the graphs

above should not change a great deal from the ideal results.

3.4 Summary

In the process of implementing the full duplex relay, the method of Superposition

Block Markov Encoding is used, which has been shown to achieve capacity in various

papers [6]. By using 5MBE and a QPSK signaling scheme, what the capacity theory says

should be the received power at each node is mirrored exactly. While the relay node

decodes the 5MBE in the forward direction, the destination decodes its 5MBE in the

backward direction. In addition, before the destination decodes a block, it should rotate

the received QPSK signal by a phase angle with the direction of the angle determined by

the previous block. Thus, it turns the QPSK signal into a BPSK signal.

Three situations that use the full duplex channel are simulated and graphed. In

the first situation, the relay is the limiting receiver, and so it is possible to reduce the

32

power to the destination from the relay and still get reliable transmission. Within the

graph, there is also a comparison to a 1,4 rate code transmitted using QPSK. This is the

closest comparison that could be simulated that involved direct transmission. In this

graph, the relay channel fared very poorly to the direct transmission case.

Another situation is a typical situation where the path from source to destination

is very poor. This case is also compared with the 1,4 rate direct transmission QPSK. In

addition, it is also compared with what is known as multihop. Multihop is the traditional

form of relaying, where a message is sent from source to relay, and then from the relay to

the destination. It is similar to a cascade of two direct transmission channels. In this case

multihop was actually a bit better than the relay channel implementation. This is

unfortunate, because ideally, the relay channel should be better than multihop.

The third situation is one where the relay channel is better than the other two

methods. This is the situation where the source and relay are close together, and far from

the destination. In this case, the two transmitting nodes act like an antenna array for the

part of the signal that they have in common. The relay implementation is about 1.5 dB

better than the next closest method, direct transmission. Direct transmission is better than

multihop because multihop must deal with interfering signals from the source ~

destination path.

Finally, it was noted that the pthat was being using to simulate blocks was not

necessarily the best performing p. However. in the process of simulation. it was

determined that the actual value was very close to the theoretical value. Thus, the values

that were recorded were allowed to stand, because they were close enough.

33

4. Half Duplex Relaying Implementation

4.1 Basic Idea

For the TOO version of the relay, one must return to the idea behind the relay

channel. The idea was that the source would broadcast a message that both the relay and

the destination would receive. Assuming that the channel gain between source ~ relay is

better than the channel gain between source ~ destination, the message would be sent at

a rate that would be high enough for the relay to decode, but too low for the destination to

decode. The relay and source would then cooperate to send resolution information

coherently to the destination. The destination would receive this resolution information

and combine it with the originally received information to decode the message. For the

TOO implementation, an approach that mirrors this basic idea will be used.

Two possible methods will be described that can achieve the TOO

implementation. Each of them has their positives and negatives, which will be described.

In the end, the one that seemed less likely to have major problems in its implementation

was chosen.

4.1.1 Serial concatenation at the encoder

The first method better elucidates the ideas of the relay, but was ultimately not

implemented because of a lack of knowledge and inability to analytically determine

which method would prove to be better and ease of implementation. This method was a

serial concatenation of block codes. It involved using two codes in series, with the output

34

of one coder put into the input of another coder. An interleaver was placed between the

decoders to improve the performance of the overall decoder [12].

The idea was to broadcast the output of the first encoder from the source to the

relay and the destination. The rate and power of the signal would be such that the relay

would be able to decode it, but the destination wouldn't. Then. since the relay now has

the original message, have the relay and source generate the additional parity checks that

come from the second encoder. Finally, the relay and source cooperate to send the

secondary parity checks coherently. The idea is that the combination of the two block

codes and the interleaver is roughly equivalent to a larger code of lower rate. In other

words. a message that is encoded with a rate 'h code and then another rate 'h code is like

a message encoded with an overa1l code of rate IA.

Message I First Parity Bits Second Parity Bits

~ ~ ,.0 Ii - Outer 1t Inner -

Figure 9: Example of serial concatenation

To d88tlnatlon (parity bits only)

To dastlnatlon and mlay

The idea is simple enough, and has a certain appeal to it. The source sends an

encoded message that the relay would be able to decode. but the destination wouldn't.

Then the source and relay cooperate to send additional resolution information in the form

of parity bits to the destination. The destination would then use the information sent

35

during the relay received period along with the resolution information sent during the

relay transmit period to decode the message.

The problem with this idea came at the decoder. In order to do soft-decision

decoding, a dual soft input soft output decoder with interleavers and feedback would need

to be implemented [12]. A program to do the decoding would have to be written from

scratch. In addition, there was no guarantee that no unforeseen errors with this method

would come up or that this method would even work as planned. Given these facts, and

that the decoder for the other method was already written, it was decided to go with the

other method.

'Mu :'#

5150 ~ ::rc'! ;0) nat used

1.1) Inner Ni;o

1t"1 .:..

1t

'J..(cll;I}

51S0 ").(uo ./

r-- Outer

zero

- 'J..(c ll ;

NUll :

0)

0)

decl ilion

FIgure 10: The theoretical decoder that could not be Implemented dne to time construlnts

4.1.2 Code Shortening

The next method, which is the one that is implemented in this thesis, is called

code shortening. The idea behind code shortening (which is also known as the method of

punctured codes) is to take a code of a given rate and erase a certain number of bits to

make it a new code with a new rate. For instance, in the simulations, a rate \4 (10000,

2500) code is taken and only ~ of the bits are transmitted. In effect, it becomes a rate ~

36

code (5000, 2500). It is important to note, however, that this new shortened rete ~ code

will not perform as well as a code that was genereted from the start to be a rete ~ code.

The method to decode these codewords is to take the transmitted bits and place them in

their respective positions in the original codeword, then stuff 0' s where no signal was

transmitted, and use the original ~ rate parity check. matrix to decode the entire

codeword. How this is used in the relay channel is as follows:

The source produces a codeword from the message. Then, the source takes ~ of

the bits of the codeword and broadcasts them to both the relay and the destination. If the

bits were transmitted with sufficient power, then the relay will be able to decode the

message. but the destination will not be able to. Then the relay and the source cooperate

to coherently send the other ~ of the bits to the destination. Using the bits from the

previous transmission, plus the bits from the current transmission, the destination is able

to treat it as a single codeword of rate ~, and assuming sufficient power was used in the

transmission, should be able to decode it.

This method has the advantage that the decoders written for the full duplex

method may be reused in this case. This is in contrast to the case of serial concatenation,

where the decoder would have to be wrinen from scmtch. From an analytical point of

view, it is difficult to determine which of these methods has an advantage over the other

one. Therefore, it makes sense to choose the one that is easier to implement. Thus, the

code shortening method was chosen.

37

4.2 Implementation Details

4.2.1 The Problem

There are many ways of implementing the idea of code shortening. Within the

scope of this thesis, one of the problems inherent in the relay channel will be solved by

the methods that are explained here. This problem comes up in the situation when the

source ~ destination path is much worse than the source ~ relay and relay ~

destination paths (a common occurrence in rea1life). Using a puncturing method similar

to the serial concatenation case, after encoding a message, the first block. i.e. the first nl

bits, would be sent to the relay and the destination. Then the source and relay both send

the second block, the last n2 bits (n2 = n - nl), to the destination.

n2

For the code given, it was discovered that if the n I bits were sent first, then a very

high SNR would be needed in order for the relay to decode. It seems that the code that

was in use had to be punctured in a very specific manner to retain good performance. At

this point, an attempt was made to send the n2 bits first. It was discovered that this gave

much better performance than the first nl bits at the relay. However, for the case given

above, where the source ~ destination path is very poor, this proved to not work. The

problem is that in this case, the source ~ relay path is sort of a "mirror image" of the

relay + source ~ destination. When the source broadcasts its n. bits to the relay and

destination, the destination receives that particular set of bits on a very poor path, then

when it receives the other ny bits. it receives it on a very good path. This mirrors the

relay, which receives the n. bits on a very good path from the source, but doesn't receive

38

any of the ny bits at all, which is equivalent to transmitting on the worst possible path

with channel gain = O.

4.2.2 An Attempted Solution

One solution tried in order to solve this problem was to use irregular LOpe codes.

Irregular LOpe codes are codes where the columus in the parity check matrix (H matrix)

have very different weights. This means that some of the bits are better protected from

error than other bits. In general, columus in the H matrix that have higher weight will

better protect their corresponding bits. Also, irregular LOPC codes can have better

performance than regular LOpe codes [11]. When an irregular code was tried in place of

the original code while still keeping the nl and n2 block format, it was found that it didn't

make much of a difference. However, in the process of trying irregular codes, a code was

discovered that performed better in direct transmission than the 1,4 rete regular code given

on MacKay's website [8], the decision was made to use this irregular code in the

simulations. This irregular code proved to perform better in the case of relaying than the

regular code as well.

39

• ••

22222222233 ••• 10 10

Figure 11: An example of an irregular code.

The numbers at the bottom represent the colnmn weights.

4.2.3 A Solution That Worked

Another solution thought out for this problem was to divide the code into two

halves in such a way that were approximately equal in performance. Of course, there are

a great many ways to divide the code into two halves. One method tried was a sort of

"sliding window". A ''window'' of Yz of all the bits would be selected, and designated as

the first half, and the rest of the bits would be the second half. The best division of the

bits into two halves would be one where they are approximately equal in terms of error

correcting performance, and out of the entire set of divisions that have equal performance,

the best one would require the least power.

It was realized that it was not necessary for the bits to be clumped together in

blocks, and that perhaps it would be better for them not to be together. So, other methods

to split up the codeword were tried. One novel idea was that if every other column of the

parity check matrix were taken, and combined into a new matrix, then it would appear to

40

be normal parity check matrix. So, the message was divided into two separate bits by

taking all of the odd numbered bits as one half of the codeword, and all the even

numbered bits as the other half of the codeword. After some trial simulations, it was

found that the two halves of the parity check matrix were indeed approximately equal in

their error correcting ability. In addition, each halves' overall performance as a V2 rate

code was fairly good, but not quite as good as a ''true'' 'h rate code. However, they were

good enough to use in simulations.

Figure 12: Taking every other column of a parity check matrix

leads to two matrices of approximately equaJ performance.

4.2.4 Recalculating K

Much like in the full duplex case, the Kthat the equation (14) gave to divide the

power at the source was not optimum. However, unlike in the full duplex case, in this

case the difference in the theoretical Kand the actual KWas quite large. The source of

this difference comes from the point stated earlier, that "It is important to note, however,

that this new shortened rate 'h code will not perform as well as a code that was generated

from the start to be a rate 'h code." In other words, the shortened code will need more

power than a good rate 'h code to decode the same message. In contrast, the destination

does not need any more power to decode what it receives, because it is an original1A rate

41

code with good perfonnance. So, equation (14) needs to be modified to include a

constant power going to the relay called o.

( 6 + C:l~ + C:2~ + C:IC~~~ - C;I~) + (2C31C32.J ~~ + 2C:IC32~.J ~pz)-.JK

+(C:l~2 -C:IC:2~~ +C;I~)K-2c:IC32~.J~P2KM -C:l~2r =0 (27)

Although not proven graphically, through trial and error an approximate value for 0

of 0.1 was estimated for this case. Again, for this case, the value of 0 is very likely in

fact a function of some of the other variables. It would most likely be very difficult to

find a formula for 0 as a function of the other variables.

The steps in transmission are summarized below:

42

First assume that the transmission powers are at least the minimum necessary for good

transmission.

1. Have the source encode a message. Then have it take every other bit (every odd

numbered bit or even numbered bit) and broadcast that to the relay and destination.

2. The relay should be able to decode the message by stuffmg O's where no bit was

transmitted, and using the original parity check matrix to decode. Then it re-encodes

the estimate of the message. The SNR of the received bits from this first part are not

enough for the destination to reliably decode the message.

3. Now the relay and source can coherently send the other bits (the even numbered bits

or odd numbered bits) to the destination. However, if the relay detects that it did not

correctly decode the message, then it will not send any bits at all. Assuming that the

relay was able to decode correctly, it will send the same bits as the source, and they

will constructively interfere.

4. The destination uses both halves of the codeword along with the parity check matrix

to decode the message.

Note that there is also an additional condition when the relay is decoding. If the

relay cannot decode the message, then it is very likely that the relay will at least be able

to detect an error in the decoding, i.e. the syndrome is not all zeros. So, if the relay

detects that the decoder did not correctly decode the message, meaning the relay

calculates a syndrome that is not all zeros, then the relay will not forward the rest of the

information. It does this because the message will be wrong, and it will destructively

interfere with the (correct) source transmission. It is most probable that the destination 43

will still not be able to decode the transmission from the source transmission itself, but

the chances of decoding are much higher than if the relay had transmitted incorrect

(interfering) bits. Another minor advantage of this is that it saves a small (nearly

negligible) amount of power.

As with the full duplex case, re-encoding is not actually being done at the relay.

After the decoding process, the whole codeword is simply taken and reused during the

relay transmit period. Again, this speeds up the simulation slightly. Also, as with the full

duplex case, this means that the Generator matrix is only needed at the source, rather than

at the source and relay.

4.3 Results

In the simulation, exactly the same setup was used as for the full duplex case.

The simulation environment was primarily Matlab, for declaring variables, calculating

the constant!; and doing the matrix multiplication. As in the full duplex case, the only

parts written in C were the decoding parts, which were the most time consuming parts of

the code. Since code shortening was chosen as the implementation method, the source

code that was used for the full duplex case was reused for the half duplex case with a few

modifications. The simulations were using the irregular code of rate 1,4 that was

previously derived as described in section 4.2.2, as that turned out to give better

performance than the rate 14 code that was downloaded from Dr. David MacKay's

website. The program to generate this irregular code was originally written by one of

Prof. Fossorier's students, Juntan Zhang for his dissertation [10]. The simulation of this

14 rate code for the IDD relay took approximately twice as long as that of the liz rate code

44

in the full duplex relay case. The reference computer, the aforementioned 1.5 GHz

Pentium 4, took 1 day to simulate only 20 000 blocks.

Again, as in the full duplex case, the parity check matrix (H matrix) was

converted into a generator matrix (G matrix) using the previously written C program.

Then. a Matlab script was used to tum the file from "alist" to Matlab sparse matrix

format. Thus, the G matrix was in Matlab's variable memory. Unfortunately in the end,

as in the full duplex case, the use of certain methods meant that the use of the G matrix

wasn't even necessary anymore. However, the G matrix ended up being used in the

simulation of the '/i rate multihop transmission that was compared to the full duplex relay

earlier in section 3.3.2.

4.3.1 Theoretical Capacity Calculation

Before giving the simulation results, a calculation of the theoretical capacity of

the channel needs to be done. The absolute theoretical capacity given in equations (8),

(9), and (10) cannot be done, because the calculation of optimum values of K; p, and aall

depend on each other. Thus, the equation would have to be optimized over three

different variables. However, the capacity of equation (8) can be calculated using

equations (12) and (13). Those set of equations, as stated earlier, assume that p = 1 and a

= ~ so the capacity is calculated in terms of Konly. So, the actual theoretical capacity is

larger than what will be calculated here.

Using a rate '/i code, the calculation is

45

Assuming that the theoretically best K'is being used, then the two formulas in the braces

are equal to each other. Thus the equation becomes

(29)

One could have also set it equal to the other term, but this term is easier to calculate.

Solving this equation for Ei/No yields

Eb 2 No = ci. (l-K')

(30)

Now. in order to solve this equation, all that is needed is the value of CZIo which is

declared beforehand, and the value of g; which is calculated from PI. Pz, CZIo Cll, and C32.

1n the case the graphs below. for the case of theoretical capacity, the value of K'

that does not include the 0 value that was used, so calculations assume that at least the

encoders and decoders are optimum, rather than suboptimum, code-shortened systems.

The graphs below all use this formula for K'to fmd the theoretical capacity for a given set

of channel gains.

4.3.2 First Simulation

46

"21 a 0.8, cal a 0.2, C32 l:l O.B

10'\ r-=r-:T.:,:-=:-=--=-:-:::-:: ~"::-:::-:::--::-_ =-::r ~,=-:-=--=:-=_-=~' ~.-=_:-:::-:::-:_:-::Ol .. ':::-_=-_ =-_;:-=-:,C_-=:-=:-:::::::~:r;::=::;Thao=="=rdI:::::;:C=":::C""",==Ily==i1 ::=:===~:::::~:::::~:::::~::: ::::::=:: -- RdayChannel -1- - - - - .... - - - - - -I -----.. -----.. --- -1- - - - - -I -+- DlreDtTransmJsslcn -:- ----~ -----~ -----t -----r --- -:- -----:L---&-=-..:MUIIh::;::.:ClP"'-_,-_--' _, _____ -' _____ J _____ L _____ L _____ 1 ______ 1 _____ J _____ J. ____ _

, , 1 , , J J J , I , I I , , , , -1- ____ ~ _____ "' _____ • _____ ~ _ _ _ 1 _____ -1 _____ "' _____ ... ____ _

ttl I "" , , , , , I I I I

:::::::::::: 3:::::::: :: ~ :: :: : :: :: i:::: :::::: E ::::::: ::::::::: 3::::::::: 3:::::::::: i::::::::: -~----~------I-----.-----~- - ~-----I-----"' _____ • ____ _ -~----~-----"'-----.-----~-- - ~-----I-----"'----- ... -_ I _____ J _____ J _____ L _____ L __ _ L ____ ~ _____ J _____ ~ _

I I I , I 1 I I ,

-r----~-----'-----T-----r-- - r----~-----1-----T-I I I I I , , ,

-r----'-----'-----r-----r-- - ----,-----'-----r-, r I I I , , ,

----~-----~-----t- ---.' __ • __ J _____ J _____ 1. _____ L __ _ , I I I , , , , , , I , ,

::::::::::::: J:::: ::::i::::::: == r:::::::: £::: _== ::::::::~:::::: :::J:::::::::: r:: .:::::-:::::::~::: :::±::::: t :::::t: :_k:. _::::~::::: ~:::: :±: ::::

, I I I I I I I -,-- - - -1- - - - -1- -- -- r ----- r-- -- - - - - - -,- - - - - 'i - - - - - r-----_1 ____ - ~- - - - - -1- - - __ .... ___ --~---- - - - - - -1- - - - - -1- - - - - .... -----

I , , I' 'I -1-- - -- "1-- - -- -t - - - - - t- - - - --t------- - - - - -1- --- -., - - - - - t- - ----, , ,

I , I 'I 'I I -,- - - - - ., - • - - -, - - - - - r - - - - - r - - - - - - - - - -,- - -. -., - - - - - T - - - --, ,

.. 10 5~~~6----~7~--~8----~9~--~1~0----~,I~--~1~2----+.,3~--~1~4--~,5.

Erf~

FIgure 13: Block Error Rate vs. ErlNo for the first TDD case

This graph is one where both the relay and destination are opemting near their

maximum error correcting capability, meaning there is an almost equal probability of an

error coming from the relay decoding, as the destination decoding. From the gmph, the

simulated results are approximately 5 dB worse than the theoretical capacity. Besides the

theoretical capacity and the simulated half duplex relay values, two additional lines are in

the graph. The direct transmission line is much worse than the relay channel

implementation. This is natural because the path between the source and destination is

very poor in this case. The other line is multihop.

47

In the case of a half duplex relay, the procedure for multihop is slightly different,

and indeed, simpler than in the full duplex case. During the relay receive period, the

source sends a codeword to the relay of sufficient power and rate that the relay decodes it

successfully. The destination ignores what it receives in this period. During the relay

transmit period, the relay sends what it has decoded to the destination at such a rate and

power that it is able to decode the codeword successfully. Thus, there is no interfering

signal in any case. This case of multihop is in fact a true cascade of two separate

channels. In fact, the plot above is derived from the simulations of the If.! rate code used

previously in the full duplex case with a BPSK channel.

In the case given in the graph above, multihop is only slightly worse than relay.

This is no surprise, since the path between source and destination is very poor compared

to the relay 7 destination and source 7 relay paths. The worse the source 7 destination

path becomes, the closer the relay plot will move to the multihop plot In the case where

the source 7 destination path is extremely comparatively poor, it would in fact be better

to use multihop, since that method uses "true" codes that were originally designed for the

channel, rather than code shortened codes.

It is important to state that this is not a completely fair comparison. The

performance curve of the If.! rate code used in this simulation is slightly farther from its

theoretical capacity curve than the ~ rate code that was used in the above experiment is

from its theoretical capacity curve. However, this is the best comparison that could be

made without having to evaluate the performance of a large set of If.! rate LDPC codes.

48

4.3.3 The Second Simulation

The next set of data points comes from the condition in which K= O. In this

situation, all of the power of PI is used in the relay receive period, and thus none in the

relay transmit period. This situation is interesting because the source is not transmitting

anything during the relay transmit period and thus, there is no chance of interference,

either constructive or destructive. Unfortunately, the channel constants can only take

certain values for this to take place.

The channel gain constants for this particular simulation are as follows: C21 = 0.8,

C32 = 0.6, and C31 = 0.36. The channel constants were not chosen so that either the relay

or the destination would be the limiting decoder, however, for this particular case, the

relay did have slightly more errors than the decoder. The value of C31 was chosen so that

it would change as little as possible from the first simulation in 4.3.2 and the rest of the

channel coustants would be the same. This was so that the new value for 0 will not be

significantly different from the old value.

49

.... = O.B, '31 = 0.36, '32 c 0.8

10.' F::::::::::::::::::::: "I: F::::::::::::::::::::: ::::::r ~,::::: ::::: ::::: ::::: ::::: ::::::r =_,:::::: _::::-::::: :=-:=-::::::=-: ~c-:=-:=-:=-_=-:=-_=-: ~:rc_=-:=--=--=;:::=::;Thoo===r=EIIc~aI;::C:="",,,=:::I\y=il :::::: ::::::~:::::::::::::::~::::::~:::: -- RelayChmnel

•• .... 10

1 ~ i ~ -g .. iii 10

- - - - - - - - - - - - ... - - - - - - -1- - - - - - - j.. - - - - - - + - - - - -+- DlrectTraranisslG1 ------: ------~ -------:- --- ---r ------t ----L-&--=,-=MIJIIh=.::ClP,....,;-_---' ____________ J ______ ~ _______ L ______ 1 ______ J ______ L _____ _

I I I I I I I I I I I I I I

- - - - - - - - - - - - -1- - - - - - -1- - __ - - _~ - - - - __ J. - - - - - _...1_ --- - r'- - - - _ --I I I I I I I I I I I I I I

: : : : : 1 : : :::::: ::::: :J:::::: :':::: ::: c::: :::!:: ::::1:1:::::::c:: :::: :::::: ::::::~:::::: :::: =====t: :::::!: ::::::!:::::--::: ::::: ____________ ... ______ ~ _______ ~ ______ J, ______ ~ ______ 1----- _. ____________ J ______ J _______ L ______ ~ ____________ ~ ____ _

I I I I I I I ------ ------~------~-------~------T------ ------ r------- - - - -_~ -- - - -- ~ - - - - - - -!_ - -- - - - ~ - - - - - -+ - - - - - - 1- _____ : ______ _

1 1 1 1 1 1 ____________ J ______ J _______ L ______ ! ____________ L ___ _ 1 1 I I 1 1 I 1 I 1 1 I I

1 1 1 1 I

::::::1::::::: J ::::::::: ::::::::::::::: c::::::::::: i::::::: _ :::::::: :::::::: ____________ J ______ ~ _______ L ______ l ______ J _____ L _____ _

=:::: =~:::::: 4: = = =:: :1::::::: ~:::::: f:::::: ~ ::::: jl::::::: ---- - -~ ------ ~ --- ----:- --- --- ~ -- ---- ~ ------ ~- ---------------- - - - - - - - - - ... ----- - -1- - -- - - - ... - - - - - - -t - - - - - - -1- - - - - - - - - - - - --

I I 1 I I I I I 1 I 1 I ------ ------,------,-------r------T------,------- ------

.. 10 4~----~5~----~8~----~7------8~----~9~----~10~----711~----=.12

EJto\,

FIgure 14: Plot for second case where It = 0

From this graph. the difference in decibels between the theoretical capacity and

the simulated values is approximately the same for this set of channel gain constants as

the set of channel gain constants given in the first graph. Note that the minimum in

equation (28) refers only to the first term in the braces. since the limiting receiving node

is the relay.

Once again, the direct transmission plot is much worse than the relay simulation

plot However. the distance between the two plots is not as great as it is in the first

simulation (section 4.3.2). This is because the path between source ~ destination is

50

somewhat better than in the first simulation, so the power requirements for transmitting

over that path are somewhat less.

This is in contrast to the case of the multihop plot in the above situation. In this

case, the multihop curve is not as close to the relay curve as in the fJISt simulation in

section 4.2.2 due to the fact that the information gotten from the source ~ destination is

of much better quality than in the first situation. In fact, the multihop plot is identical in

the two situations, because the channel gain constants of the source ~ relay and relay ~

destination paths are identical in both cases. It is not that the multihop capacity decreased,

but that the relay capacity and direct transmission capacity increased. In general, as the

path between source ~ destination improves, the capacity of the relay channel method

will improve, as will the capacity of direct transmission.

4.4 Summary

For the half duplex case of the relay channel, the choice between IDD and FDD

was decided in favor ofIDD. Since IDD and FDD are equivalent, it didn't matter which

one was picked. IDD at the relay meant that the time needed to be divided into two parts.

One part was called the relay receive period, and another part was called the relay

transmit period. For the actual implementation of the IDD case, the basic idea of a relay

was referred to. One method was that of a serial concatenation of codes. This used an

inner code and an outer code with an interleaver in between. The inner code would be

transmitted during the relay received period and the outer code would be transmitted

during the relay transmit period.

51

In the end, a method that involved code shortening was used. The way code

shortening was used in the relay channel was that half of the bits of a low rate codeword

were broadcast to both the relay and the destination during the relay receive period,

whereupon the relay decodes the codeword. Then, during the relay transmit period, the

relay and source cooperate to send the other half of the codeword to the destination,

which uses the information from both periods to decode the codeword. This was picked

as the method due to the fact that it reused most of the code from the full duplex case and

did not involve writing a brand new decoder.

In order to achieve a good result on the relay channel, both halves of the

codeword had to perform equally when the other half was erased. A number of methods

were tried to solve this problem. The problem was solved by the idea of taking every odd

bit as the first half, and every even bit as the second half. Applying this method to the

parity check matrix resulted in two parity check matrices that appeared similar to each

other. This use of code shortening meant that the halves of code did not perform as well

as independently generated codes of that rate, so the formula for Khad to be modified to

account for this discrepancy.

Before plotting the simulation results, the theoretical capacity formula had to be

changed from one that optimized over three different variables to one that optimized over

only one. After that, Ei/No was calculated. The fl1'St graph was simulated over a

common case where the source ~ destination channel gain was much worse than the

other two channel gains. In this case, the source and relay had to be in sync in order to

transmit the other half of the codeword coherently. The plot was also compared to the

direct transmission simulation and a simulation of the multihop method. The relay

52

channel in this cases performed much better than direct transmission and moderately

better than multihop.

In the second case, the channel gain from source ~ destination was modified so

that the value for K= O. This means that coherency will not be required for

implementation. Besides the relay channel simulation, there was also a plot of the direct

transmission simulation and multihop simulation. The margin between multihop and

relay plot in this case became larger than in the first case. Notice that the graphs for each

situation are quite far (5 dB) from the theoretical limit

53

5. Conclusions, Possible Improvements or Future Works

5.1 Conclusions

In this thesis, two different situations for the relay channel were considered. The

first situation was the full duplex channel. In this situation, the method already laid out

by Dr. Hlilst-Madsen was used to implement the encoding method. Three cases were

simulated. The setup, using SBME and QPSK, seemed to follow naturally from the

formula for the theoretical capacity. Unfortunately, it cannot yet be compared to the

theoretical capacity, because it is the capacity of a QPSK relay channel signal, and a

formula has yet to be derived for that.

The simulated full duplex relay channel was compared to the direct transmission

case and the multihop case. When the path between source and destination was very poor,

the result naturally came to be that the relay channel fared much better than the direct

transmission channel. However, compared to multihop, the relay channel fared slightly

worse. This is unexpected, because the relay channel should be an improvement of

multihop. A third case was simulated, where the source and relay were close together

and the destination was equally far from both of them. In this case, the relay channel

implementation proved to be better, as expected.

For the half duplex case, the TOD method was used for the relay. Implementation

of the TOD channel at the relay means that there are two periods, the relay receive period,

and the relay transmit period. The way the TOD relay was implemented was using code

shonening, which is related to the erasure channel. The source would transmit half of the

54

code during the relay receive period, at which point the relay would take the code

shortened signal and recover the original message. Then, during the relay transmit period.

the relay and the source would coherently send the other half to the destination. The

destination would be able to decode the message from both halves.

The result of the simulations of two cases was that the method of implementation

proved to be about 5 dB away from the modified theoretical capacity. When it was

compared to the direct transmission and the multihop case. It was found that the relay

channel is slightly to moderately better than the multihop. depending on the channel gain

from source ~ destination. For the values of the direct channel gain given, which were

small. the direct transmission case fared rather poorly.

5.2 Possible Improvements

A possibility for sharpening of the results would be to calculate the theoretical

capacity of a full duplex QPSK relay channel. and then compare it to what was simulated

in this thesis. Given that the received power at each node is so close to the theoretical

value. one would expect that the difference between theoretical capacity and simulated

values of the full duplex relay channel to be comparable to the difference between the

theoretical capacity and simulated values of direct transmission. However. if there is a

big difference then there is something wrong in the method.

Although the equations seemed to follow from the formula. the actual simulated

optimum value of P did not precisely follow the theoretical value. It may be of interest to

fmd the actual value of p. In some simulations. it was found that the J difference

55

between the theoretical and actual values seems to be a function of the other variables,

the transmit powers and the channel gains.

Concerning the full duplex method, while finishing simulating most of the values

for the graph, the author was also busy trying to fmd a solution to the lDD problems. In

the course of code-writing efforts, it was realized that slightly better performance could

be obtained out of the full duplex method if the method from the TOD channel was used.

That method is optional forward at the relay. If the error detection ability of LOPC

decoding was used to determine if the decoded message should be forwarded, then the

relay would avoid sending erroneous data, and save itself some power. Of course, all of

those values could not be simulated again, and it would hardly have made a noticeable

difference, so it was left alone. However, this is an area that could be improved in this

thesis.

For the half-duplex method, one thing in particular could be improved. Again, as

it was in the full duplex case, the equation for K had to be modified in order to get close to

the best-simulated value of K. Also, as in the full duplex case, experiences during the

simulation led to an assumption that the value of K is a function of all of the powers and

channel gain constants. However, unlike the full duplex case, the best value and the

theoretical value differ by a large amount. So, it would be advantageous to find the

formula for the optimum value of K. For the case of the values above, a constant was

already found that serves as a good estimate at certain values of P z and e.y. Note that the

constant is an approximate value for Kfor this particular code only, and does not apply to

to other LOPC codes.

56

In addition, the value of Kalso depends on the method used to split up the

codeword and parity check matrix. If a more analytical method was invented to

determine the performance of two halves of a parity check matrix, then that would result

in a significant improvement. This would require a deep understanding of LDPC codes.

Furthermore on the subject of ~ the value of .JK in these simulations is limited

to the real values only. Situations where .JK was complex were avoided, because it was

not known what to do in those cases. In the full duplex case, .Jlj needed to be real to be

positive, so all of the cases where .Jlj was negative and complex could be ignored.

However for .JK this is not the case. Thus, a situation could be considered where it is

complex, and figure out the best value of .JK for that case.

5.3 Future Work

5.3.1 Full Duplex

Unfortunately, the full duplex implementation for the relay channel used in this

thesis seems to perform worse than what is expected. For instance it is worse than

multihop in some of the cases when it should be better than multihop. From an

information theory point of view, the relay channel should always be better than multihop.

For the case of the equilateral triangle, the relay channel method should not be that much

worse than the direct transmission method. Most likely, this has something to do with the

implementation in this thesis.

The theory for the relay undoubtedly expects all of the information being used in

the method to be sent along a single BPSK channel. This, in turn, makes the relay into a

57

true full duplex relay, with the relay sending and receiving on the same path, mther than

having the relay receive on the imaginary path and send on the real path. This does not

seem to be possible, due to the fact that two different signals are being sent at the same

time, and they would interfere if not for QPSK. Any implementation of the full duplex

relay channel will have to ensure that the relay is sending and receiving codewords on the

same channel at the same time.

5.3.2 Half Duplex

It would also be interesting to see an implementation of the idea of using two

serially concatenated LDPC codes for the half duplex relay channel. From another

perspective, it seems like a better idea than code shortening. However, there were a lot of

misgivings about whether this method would really work, if there would arise some

insurmountable obstacle that would have prevented the completion, like the problem

mentioned in section 4.2.1. Any implementation of that decoder would need to be similar

to turbo decoders. which are very similar to that decoder structure.

Besides that, this thesis only involves optimizing over Kwhen a= liz. It would be

interesting to see how to optimize over both aand 8; and how to implement the different

mtes for a. It would also involve splitting up the parity check matrix and codeword in

different ways. A more analytical or at least methodical method to achieve this seems

like an area of investigation. Using the original equation, optimizing over a. 8; and the

constant p would prove to be very difficult, especially since the term that uses p must not

interfere with the other signals.

58

Finally, in the TOD case, a situation was simulated where the relay and source did

not need to be in sync. Since it is difficult to implement a full duplex node, and it is

extremely difficult to synchronize two separate transmitters, that situation would be an

ideal candidate for implementation. The only condition is that the experimenter would

have to measure and calculate the average channel gain between the three nodes, and then

arrange them in such a manner that the optimum value is K= O. Another option is to

ignore the channel gains altogether and always assume that K= O. In fact, using the

capacity formula, if the parameter P = 1 and K= 0, then the capacity formula could be

solved for just afor the half duplex case without synchronization. Since the values acan

take are very code dependent, it would require something like rate compatible codes.

59

6. References

1. T. M. Cover and J. Thomas (1989). Elements of Information Theory. New

York: Wiley.

2. A. Host-Madsen and J. Zhang, "Capacity Bounds and Power allocation in

Wireless Relay Channel," IEEE Transactions on Information Theory, June

2005.

3. T. M. Cover and A. A. EI Gamal. Capacity theorems for the relay channel.

IEEE Trans Info Theory, Vol-25, No.5, pp. 572-584, september. (1979).

4. W. A. Shiroma and M. P. DeLisio, "Quasi-Optical Circuits," in Encyclopedia

of Electrical and Electronics Engineering, vol. 17. pp. 523-533. J. G. Webster.

editor. New York: John Wiley & Sons. 1999.

5. Chao-Ming Zeng. Federico Kuhlmann. Andres Buzo: Achievability proof of

some multi-user channel coding theorems using backward decoding. IEEE

Transactions on Information Theory 35(6): 1160-1165 (1989)

6. Frans M. J. Willems. Edward C. van der Meulen: The discrete memoryless

multiple-access channel with cribbing encoders. IEEE Transactions on

Information Theory 31(3): 313-327 (1985)

7. Igor Kosintsev. Matlab programmes for encoding and decoding ofLDPC

codes over GF(2I\m).(6oo KB) http://www.kozintsev.netlsoft.html

(alist2sparse.m in the archive)

8. David MacKay: Error-Correcting Codes: Error-correcting codes

http://www.inference.phy.cam.ac.uklmackay/CodesFiles.html

60

9. Jinghu Chen. "Reduced Complexity Decoding Algorithms for

Low-Density Parity Check Codes and Turbo Codes," Ph.D. Thesis,

University of Hawaii, December 2003.

10. Juntan Zhang. "Iterative Decoding of Low Density Parity

Check Codes," Ph.D. Thesis, University of Hawaii, August 2005.

11. D. J. C. MacKay, S. T. Wilson, and M. C. Davey, "Comparison of

constructions of irregular Gallager codes," IEEE Trans. Commun., vol. 47, pp.

1449-1454, Oct. 1999.

12. S. Benedetto, D. Divsalar, G. Montorsi, and F. Pollara, "Serial concatenation

of interleaved codes: Performance analysis, design, and iterative decoding,"

IEEE Trans. Inform. Theory, vol. 44, pp. 909-926, May 1998.

13. M. A. Khojastepour, A. Sabharwal, B. Aazhang; "On Capacity of Gaussian

'Cheap' Relay Channel," Global Telecommunications Conference, 2003.

GLOBECOM '03. IEEE Volume 3, 1-5 Dec. 2003 Page(s):1776 - 1780 vol.3

14. E. C. van der Muellen, ''Three-terminal communication channels," Advanced

Applied Probability, vol. 3, pp. 120-154, 1971.

15. M. Gastpar, G. Kramer, and P. Gupta. ''The multiple-relay channel: Coding

and antennaclustering capacity." In Proceedings of the IEEE International

Symposium on Information Theory: ISIT'02 (Lausanne, Switzerland), June

2002.

16. B. Zhao, M. C. Valenti; "Distributed turbo coded diversity for relay channel."

Electronics Letters Volume 39, Issue 10, 15 May 2003 Page(s):786 -787

61

17. Zheng Zhang; T. M. Duman; "Capacity-approaching turbo coding and

iterative decoding for relay channels." IEEE Transactions on Communications.

Volume 53, Issue 11, Nov. 2005 Page(s):1895 - 1905

18. R. G. Gallager, "Low-density parity-check codes," IEEE Transactions on

Information Theory, vol. IT-8, pp. 21--28. Jan. 1962.

19. D. J. C. MacKay and R. M. Neal, "Good Codes Based on Very Sparse

Matrices," Cryptography and Coding. 5th IMA conference. Proceedings,

pp.1OO--111. Berlin: Springer-Verlag. 1995

20. A. Reznik. S. R. Kulkarni. and S. Verdu. "Capacity and optimal resource

allocation in the degraded Gaussian relay channel with multiple relays." In

Proceedings of the 40th Annual Allerton Conference on Communication,

Control, and Computing. October 2002.

62

7. Appendix A: Source Code

1* h2g.c Program to convert the parity check H file in Alist fonnat to reduced row echelon form.

Revision 1.0 Written by Danny Yee 2005.11.15

*1

#include <math.h> #include <stdio.h> #include <stdlib.h>

1************************** Gaussian elimination***************************1 /I Return the rank of the matrix. /I nnn: the length of the code; rrr: the number of rows in H matrix /I int **H_matrix = (int **) malloc(sizeof(int*)*rr); /I H_matrix rr*nn int Gaussillll.-elimination(unsigned char *H_matrix[],unsigned short int nnn,unsigned short int rrr) {

unsigned short int firsCcheck_row, operating",column, independenccol, target_row; unsigned short int i, j; unsigned char * temp_row = (unsigned char *)malloc(sizeof(char)*nnn); unsigned short int * independent_pos = (unsigned short int *) ma1loc(sizeof(short

int)*rrr);

1************************* Gaussian elimination****************************1

flISCcheck_row = 0; 1* Initialize firsCcheck_row (record the first row for searching 1) *1

operating",column = 0; independent_col =0;

while«fIISCcheck_row<rrr)&&(operating",column<nnn» {

targecrow = firsccheck_row;

1* Search for the flISt 1 in the operating",column-th column. *1

while«H_matrix[targeCrow 1I operating",column] !=1)&&(targecrow«rrr-I») targeCrow++;

63

if«targeuow=(rrr-l»&&(H_matrix[targeuow][operating..column]==O)) operating..column++;

else {

}

for(i=O; i<rrr; i++) if«H_matrix[i] [operating..column]=l)&&(iI=targeUow))

for(j=O; j<nnn; j++) H_matrix[ilU1 = H_matrix[targeuow]Ul"H_matrix[i1Ul;

if(targeCrow!=firsccheck_row) {

}

for(j=O; j<nnn; j++) {

}

temp_rowUl = H_matrix[firsccheck_row1Ul; H_matrix[frrsccheck_row 1Ul = H_matrix[targeuow 1Ul; H_matrix[targeuowlU1 = temp_rowUl;

independenCpos[independenccol++ 1 = operating..column; frrsccheck_row++; operating..column++;

1* printf("Current frrst check row is %d\n", frrsCcheck_row); *1

}

free(temp_row); free(independenCpos);

return independenccol; }

1************Reduced row echelon form **********************1 1* If there is no 1 on the diagonal, then we swap columns until there is one *1 1* The returned int c is the number of swapped columns. *1

short int reduced_row_echelon_form(unsigned short int *swap_cols, const short int N, const short int M, const short int rank, unsigned char **HH) {

short int iJ,d,temp,c=O;

64

for (i=O; kM; i++) swap_cols[i] = 0;

for (d=1; d < rank; d++) { if (HH[d][d] != 1) {

j=d; while «(HH[d][j] != 1) && (j<N» j++; if (j != N) {

} }

for (i=O; kM; i++) {

}

temp = HH[i][d]; HH[i][ d] = HH[i][j]; HH[i][j] = temp;

swap_cols[c] = d; c++; swap_cols[ c] = j; c++;

for (i=d-1; i >= 0; i-)

/* For every row above the diagonal, starting at one, if find a one in the column d in that row, we xor row d with that row */

if (HH[i][d] = 1) for (j=d; j<N; j++ )

HH[i][j] = HH[i][j]I\HH[dJU];

/* printf("Processedrow %d.\n", d+1); */

}

return c;

}

int main(int argc, char *argv[]) {

65

FILE *in, *out, *0ut2; short int iJ,c; unsigned short int d,mwcrow_weight,max_coLweight; unsigned short int temp.rank.M,N,K; unsigned char **HH; unsigned char **G; unsigned short int *G_row_weight; unsigned short int *G30Lweight; unsigned short int *swap_cols;

unsigned short int **row_entries; unsigned short int **col_entries;

1* We use short int and char to reduce the memory requirements. *1

if «in=fopen(argv[l], "r"))=NULL) (

}

printf("Cannot open the input file.\n"); exit(O);

fscanf(in, "%d", &N); fscanf(in, "%d", &M); K=N-M;

1* The next number in the file is not necessary to reconstruct the matrix. *1 1* Dump it into temp, and don't use it *1

fscanf(in, "%d", &temp);

1************** Allocate memory ******************1

HH = (unsigned char **)malloc(M*sizeof(char *)); G = (unsigned char **)malloc(K*sizeof(char *));

for (i=O; i<M; i++) HH[i] = (unsigned char *)malloc(N*sizeof(char));

for (i=O; i<K; i++) G[i] = (unsigned char *)malloc(N*sizeof(char));

G_row_weight = (unsigned short int *)malloc(K*sizeof(short int)); G_coLweight = (unsigned short int *)malloc(N*sizeof(short int));

66

swap_cols = (unsigned short int *)malloc(M*sizeof(short int»;

row_entries = (unsigned short int **)malloc(K*sizeof(short int *»; col_entries = (unsigned short int **)malloc(N*sizeof(short int *»;

1************* Finished allocating ****************1

for (j=O; j<N; j++) for (i=O; kM; i++)

HH[i]U1 = 0; 1* initialize all to 0 *1

'* The following two lines in the file are not necessary to reconstruct the matrix *1 1* Dump the numbers into temp and don't use them. *1

for (i=O; kM+N; i++) fscanf(in, "%d", &temp);

1* Now read in the H matrix. *1

for (j=O;j<N; j++) for (i=O; kmlllccoCweight; i++) {

}

fscanf(in, "%d", &temp); if (temp != 0)

HH[temp-I]U1 = I;

fc1ose(in);

printf("Read the H matrix.\n");

1* Perform gaussian elimination on HH to put it in row echelon form *1 rank = Gaussian_elimination(HH, N, M);

printf("Put H matrix in row echelon form. Rank = %d\n", rank);

1* Now that the HH matrix is in row echelon form *1 1* we have to put it in reduced row echelon form *1 1* If there is no 1 on the diagonal, then we swap columns until there is one *1

printf("Put H matrix in reduced row echelon form.\n");

67

if ((out2=fopen("swapped_columns.txt","w+"» NULL) {

}

printf("cannot open the output file.\n"); exit(O);

for (i=O; i<c; i+=2) fprintf(out2, "%d %d\n", swap_cols[i]+l, swap_cols[i+I]+I);

fc1ose(out2);

1* Now we change the matrix from H to G using the fact that H=[I IA] and G= [A' II]

*1

for (i=O; i<M; i++) for G=M; j<N; j++)

G[j-M][i] = HH[i][j];

for (i=O; i<K; i++) for G=O; j<K; j++) if(i=j)

G[i][j+M] = I; else

G[i][j+M] = 0;

printf("Converted H matrix to G matrix.\n");

1* Now we have to output the resulting matrix. *1 1* Output in Alist format *1 1* First we need some statistics about the array *1

for (i=O; i<M; i++) G_row_weight[i]=O; for G=O; j<N; j++) G_coC weight[j]=O;

for (i=O; i<K; i++) for G=O; j<N; j++)

if (G[i][j] = I) {

G_row _ weight[i]++; G_coCweight[j]++;

I

68

for (j=I; j<N;j++) if (G_coLweightfj] > max_col_weight)

max_col_ weight=G_coL weightfj];

for (i=l; kK; i++) if (G_row_weight[i] > max_raw_weight)

max_row_weight=G_raw_weight[il;

1* Now we write to the fIle *1

if«out=fopen("Gmatrix.txt","w+"»-NULL) {

}

printf("cannot open the output fIle\n"); exit(O);

fprintf(out, "%d %d\n", N, K); fprintf(out, "%d %d\n", max_coLweight, max_raw_weight);

for (j=O; j<N; j++) {

fprintf(out, "%d ", G_coLweightU]); col_entriesfj] = (unsigned short int >!C)ma1loc(max_coLweight*sizeof(short int»;

}

fprintf(out, "\nO);

for (i=O; kK; i++) {

fprintf(out, "%d ", G_row_weight[i]); row_entries[i] = (unsigned short int *)malloc(max_fow_weight*sizeof(short int»;

}

fprintf( out, "\n h);

for (i=O; i<K; i++) for (j=O; j<max_row _weight; j++)

raw_entries[i]Ul = 0;

for (j=O; j<N; j++) for (i=O; i<max_coLweight; i++)

col_entriesUUi] = 0;

69

for (i=K-l; i>=O; i-) for O=N-l; j>=O; j-)

if (G[i][j] = 1) (

G_col_weight[j]-; col_entriesU)[G_col_weight[j]] = i+l; G_row _ weight[i]--; row_entries[i)[GJow_weight[i)) = j+l;

}

for 0=0; j<N; j++) (

for (i=O; kmlllccoCweight-l; i++) fprintf(out, "%d ", col_entriesU)[i));

fprintf(out, "%d\n", col_entriesU][mlllccoCweight-l)); }

for (i=O; kK; i++) (

for 0=0; j<mllluow_weight-l; j++) fprintf(out,"%d ", row_entries[iJU));

fprintf(out, "%d\n", row_entries[i][max_row_weight-l)); }

fclose( out);

printf("Wrote G matrix to file.\n"); }

70

% Procedure file ful1duplexrelay.m % Self explanatory from the title, it simulates the full duplex relay channel. % It uses the algorithm described by Dr. Host-Madsen %

%Ioad G4OOOx8000 % this is the G (Generator) matrix

% I have taken out the G matrix because the method that I have used (using % all zeros), means that I don't need to mulitply anything by G.

randn('state' ,sum( I OO*clock»;

% Some channel constants c21 =0.8; c3I = 0.2; c32 =0.6;

N_02=2; N_03 =2;

PI = 1.42; P2=PI;

% this is to seed the random number generator

delta = 0.03; error.

% this is just my numerical estimate of delta gotten thru trial and

sqrtBeta = (-c31*c32*sqrt(P2) + sqrt(c31"2*c32"2*P2 - ... c21"2*(c32"2*P2+c31"2*PI-c21"2*PI + delta)))/(c21"2*sqrt(PI));

for i = fmd(sqrtBeta < 0), sqrtBeta(i) = 0;

end;

% As stated in the paper, negative values for sqrt(beta) must be corrected.

beta = sqrtBeta"2; clear sqrtBeta;

theta = atan2( c3I * sqrt((l-beta)*PI), c3I * sqrt(beta*PI) + c32 .. sqrt(P2) );

% theta is dependent only on the channel constants. % I needed to use atan20 rather than atan()

E_sl = c2I "2*(I-beta)*PI;

71

E_s2 = c31"2*Pl + c32"2*P2 + 2*c31 *c32"'sqrt(pl*P2*beta);

% E_sl and E_s2 are the received powers (or energies, since P=FJt) they % should be approximately equal when beta is not equal to O. This is just % a sanity check. It is not used in the code.

format compact;

kk=4ooo; nn=8ooo;

% message block length % codeword length

% Genemte the sequence to be transmitted % We assume that the discrete input has already been source % coded (compressed) so that it is uniformly distributed.

M = 10000; % number of blocks (MUST be a factor of N) N= 100; bllcerrors = 0;

%%%%%%%%%%%% Tmnsmit all zeros %%%%%%%%%%%%%%%

% Instead of randomly generating blocks, use the all zero block.. That way % we speed things up and negate the necessity of the G matrix.

Xl(l,:) = j*sqrt(pl)*ones(l,nn); Xl(2:N,:) = sqrt(beta*Pl)*ones(N-l,nn) + j*sqrt«l-beta)*Pl)*ones(N-l,nn); Xl(N+l,:) = sqrt(pl)"'ones(l,nn);

for i=l:MIN, sprintf('Processing blocks %d - %d', (i-l)"'N+l, i*N)

% W = round(rand(N,size(G,l»);

% Encode the blocks with a Generator matrix G % WI =W(l:N-l,:); % Cl = mod(Wl *G, 2);

% W2 = W(2:N,:); % C2 = mod(xor(Wl,W2)*G, 2); % clear WI W2;

% modulate the blocks

% for the first block we transmit only to the relay. % Xl(I,:) = j*sqrt(pl)*(-1)."mod(W(1,:)*G,2);

72

% Xl(2:N,:) = sqrt(beta*Pl)*(-l).I\Cl + j*sqrt«1-beta)*Pl)"'(-1).I\C2; % Xl(N+l,:) = sqrt(Pl)"'(-1).l\mod(W(N,:)"'G,2);

% clear Cl C2;

% Source transmit over the noisy channel Y2 = c21 *Xl(l:N,:) + wgn(N, nn, N_02l2, 1inear', 'complex'); % The wgn is linear because I am using linear units for my % calculations.

% Relay decode sprintf('Relay decoding') tic Cdelta = LDPC_decode(imag(Y2), N_02l2lsqrt(E_sl)); toe clearY2;

C_hat = mod(curnsum(Cdelta),2); % Since Wdelta is the xor ofW_hat[i-l] and W_hat[i], we have to sum % the Wdelta[i] with the previous W _hat[i-l] to get W _hat[i]. curnsum % is a quick way to get that for all of the codewords. Also note that % C(W _hat[i-l] xor W _hat[iD = C(W _hat[i-lD xor C(W _hat[iD

clear Cdelta;

sprintf('Relayerrors = %i', length(fmd(sum(C_hat,2))))

% Relay encode % C_hat = mod(W _hat*G,2); % clear W _hat;

% Relay encoding is no longer necessary because we are using the codeword % output from the decoder.

% Relay modulate X2 = sqrt(P2)"'(-l).I\C_hat;

% Relay transmit % First we pad the first row of X2 with zeros to simulate a time delay % of one unit. X2 = [zeros(l,nn); X2];

73

end;

% Destination receive Y3 = c31 *XI + c32*X2 + wgn(N+ I, nn, N_03/2, 'linear', 'complex'); clear X2;

% Destination decode sprintf('Destination decoding') tic W _hat2 = fullduplex_desCdecode(Y3, N_03/21sqrt{E_s2), theta); toe clearY3; % The destination needs to have theta as an input so that it can rotate % the signals by theta before decoding.

bllcerrors = blk_errors + length(find(sum(W _hat2,2») clear W _hat2;

clear M N i Xl; format;

74

1* File LDPC_decode.c Original version written by Jinghu Chen

This file does belief propagation decoding with a Matlab interface to variables. *1

1*******************************************************1 #include <math.h> #include <stdio.h> #include <stdlib.h> #include "mex.h"

H*******************LDPC*****************

#defme no 8000 H Length of the code

#define kk 4000 H No. of information bits

#defme rr 4000 H No. of checks

#defme lMAX 200 I*maximun iteration times*1 H#defme lMAX 500

I*******************update right tanbbZ[][] according to Bit xx ****************1

void update_right tanbZZ(int xx, double *ZZ[], int *row _distribution_index[], int *column_distribution_index[],double *right tanbZZ[]) H to update the matrix of right tanhZZ[][] according to BIT xx H uu is the weight of each column and hh is the weight of each row. (

int u.k, row,column;

for(u=O;u<=( column_distribution_index[xx][O]-1 );u++) (

row = column_distribution_index[xx][u+l];

for(k=O; k<=(row_distribution_index[row][O]-I); k++) (

}

if(XX-IOw_distribution_index[row ][k+ 1]) column=k;

if( column==(row _distribution_index[row] [0]-1)) *(right tanhZZ[row]+column)=1.0;

else

75

*(right UmhZZ[row]+column)=(*(right tanhZZ[row]+column+ 1»*tanh((*(ZZ[row ]+col umn+l»/2);

} }

I*******************update lefctannhhZ[][] according to Bit xx **************1

void update_lefctanhZZ(int xx, double *ZZ[], int *row_distribution_index[), int *column_distribution_index[], double *left tanhZZ[j) {

int u,k, row,column;

for(u=O;u<=( column_distributiOll.-index[xx][O]-1 );u++) {

row = column_distribution_index[xx][u+l];

for(k=O; k<=(row_distribution_index[row][O]-I); k++) {

}

if(xx-row _distribution_index[row ][k+ 1]) column=k;

if(column==O) *(left tanhZZ[row]+column)=1.0;

else *(left_tanhZZ[row]+column)=(*(left tanhZZ[row]+column-

1»*tanh((*(ZZ[row]+column-l»)I2); }

}

1******************************************************************1

1*******"**"**"*" .... *"'** update Ebsilon~Matrix[][] according to BIT xx *****1

void update_Ebsilon~Matrix(int xx, int *row_distribution_index[], int *column_distribution_index[], double *left tanhZZ[],double *right tanhZZ[],double *EEbsilon~Matrix[]) {


double tem, ternl;

for(u=O;u<=( column_distribution_index[ xx ][0]-1 );u++) {

76

}

row = column_distribution_index[xx][u+I];

for(k=O; k<=(row _distribution_index[row ][0]-1); k++) {

}

if(xx lOW _distribution_index[row][k+ I]) column=k;

teml=(*(lefUanhZZ[row]+column»*(*(right tanhZZ[row]+column»;

tem=log«I+teml)/(I-teml»;

if(tem>=10.0)

tem=lO.O;

if(tem<=-10.0)

tem=-lO.O;

*(EEbsilon~Matrix[row ]+column)=tem; }

1**********************************************************************1

I*******************update Ebsilong[xx]***********************************1

void update_Ebsilong(int xx, int *row_distribution_index[], int *column_distribution_index[], double *EEbsilon~Matrix[],double EEbsilong[]) {

double tem=O.O;


for( u=O;u<=( column_distribution_index[xx][O]-1 );u++) {

row = column_distribution_index[xx][u+I];

for(k=O; k<=(row_distribution_index[row][O]-1); k++) {

}

if(xx-IOw_distribution_index[row]lk+l]) column=k;

77

tem=tem+(*(EEbsilo~Matrix[row j+column»; }

EEbsilong[xx j=tem; }

1**********************************************************************1

I************update E[][j according to BIT xx **************************1

void update_EE{int xx, int *row _distribution_index[], int *column_distributio1l.-index[], double EEbsilong[],double "'EEbsilon8-Matrix[],double *EE[]) (

}

int u,k, row.column;

for(u=O;u<=( column_distribution_index[xx][Oj-1 );u++) (

row = column_distribution_index[xx][u+ Ij;

for(k=O; k<=(row_distribution_index[row][Oj-I); k++) (

}

if(xx Iow_distributio1l.-index[row][k+l]) column=k;

*(EE[rowj+column)=EEbsilong[xxj-(*(EEbsilon8-Matrix[rowj+column»; }

1"'**"'******************************************"'**********************1

I***************update ZZ[][] according to Bit xx *****************1

void update_ZZ(int xx, int *row_distribution_index[], int *column_distribution_index[], double *YY[],double *EE[],double *ZZ[j) (

int uk. row,column;

for( u=O;u<=( column_distribution_index[xx][Oj-1 );u++) (

row = column_distribution_index[xx][u+lj;

for(k=O; k<=(row_distribution_index[row][Oj-l); k++) (

if(XX-IOW _distribution_index[row j[k+ 1])

78

}

column=k; }

*(ZZ[row]+column)=*(EE[row]+column)+*(yy[row]+column); }

I**********************************~*******************************1

1* This is the primary function that does all of the work. It takes the constants nO, mrows, and the received signal matrix yy and decodes them using the BP functions above. *1

void ldpc_decode( const double nO. const double *yy. double *w _hat, const int mrows) {

FILE *inl,*in2;

double *y;

/I int *decoded_message;

I************items used in BP Decoding*o!<*********************1 double o!<rO;

double **Z; double **Y; double **E; double **Ebsilong...Matrix;

int ** row_distribution_index; int ** column_distribution_index;

double o!<Ebsiiong;

int *z;

int syndrom, iteration, n, h;

double **righCtanhZ; double **lefctanhZ;

int i, j, k, p,q,t,temp6;

double tem; int temp_weight;

79

1*************A1locate memory ********************************1

y = (double*) mxMalloc(sizeof(double)*nn);

1/ decoded_message = (int*) mxMalloc(sizeof(int)*kk);

rO = (double*) mxMalloc(sizeof(double)*nn);

Z = (double **) mxMalloc(sizeof(double*)*rr); Y = (double **) mxMalloc(sizeof(double*)*rr); E = (double **) mxMalloc(sizeof(double*)*rr); Ebsilong...Matrix = (double **) mxMalloc(sizeof(double*)*rr);

row _distribution_index = (int **) mxMalloc(sizeof(int*)*rr); column_distribution_index = (int **) mxMalloc(sizeof(int*)*nn); Ebsilong = (double*) mxMalloc(sizeof(double)*nn); z = (int *) mxMalloc(sizeof(int)*nn);

righuanhZ = (double **) mxMalloc(sizeof(double*)*rr); lefuanhZ = (double **) mxMalloc(sizeof(double*)*rr);

I**************getll_index***********************************1

1/ read in the II_index if «inl=fopen("New _row _ weighcdistribution.txt" ,"r"» NUll.) {

mexErrMsgTxt("cannot open the input filel"); I

if «in2=fopen("New _col_ weighcdistribution.txt", Or"»~ NUll) {

mexErrMsgTxt("cannot open the input file2"); I

1* The II matrix in these two files is in a special form. It is very similar to MacKay's alist format for his II matrices, but instead of one file, there are two files, one for row and one for column. The fIrSt entry of each line represents the number of l's in the rowlcolumn. The rest of the line tells us where in the rowlcolumn the l's are. Note that the first columnlrow in the II matrix is numbered starting from 0 as in all C arrays, while in MacKay's alist format, all indices in the II matrix start from 1.

*1

80

for(i=O;i<=(rr-l);i++) {

fscanf(inl,"%d", &temp_weight);

if « row_distributioD_index[i] = (int *) mxMalloc«temp_weight+l)*sizeof(int))) = NULL)

}

mexErrMsgTxt("Error allocating array row_distributioD_index!");

for(j=l~<=row_distributioD_index[i][O]~++) {

}

fscanf(inl, "%d" ,&(row _distributiOD_index[iJrjD); /I the fIle have the index begin from 0

fc1ose(inl);

for(i=O;i<=(nn-l);i++) {

fscanf(in2, "%d", &temp_ weight);

if « column_distributiOD_index[i] = (int *) mxMalloc(temp_ weight+ 1 )*sizeof(int))) = NULL)

}

mexErrMsgTxt(''Error allocating array column_distributioD_index!");

for(j=l;j<=column_distributiOILindex[i][O];j++) {

}

fscanf(in2, "%d" ,&( column_distribution_index[ilUD); /I the file have the index begin from 0

fclose(in2);

1*********** hrlti1~tiOD ***************************1

for(i=O;i<=(rr-l);i++ ) {

81

lefuanhZ[i]=(double *) mxMalloc(sizeof(double)*row_distribution_index[i][O]);

righUanhZ[i]=(double *) mxMalloc(sizeof(double)*row_distribution_index[i][O]);

Z[i]=(double *) mxMalloc(sizeof(double)*row_distribution_index[i][O]);

Y[i]=(double *) mxMalloc(sizeof(double)*row_distribution_index[i][O]);

E[i]=(double *) mxMa1loc(sizeof(double)*row_distribution_index[i][O]);

Ebsilon&...Matrlx.[i]=(double *) mxMalloc(sizeof(double)*row_distribution_index[i][O]);

for(j=O;j<=(row _distribution_index[i][O]-1 );j++) {

*(lefUanhZ[i]+j)=I.0;

*(righUanhZ[i]+j)=1.0;

*(Z[i]+j)=1.0;

*(y[i]+j)=1.0;

*(E[i]+j)=1.0;

*(Ebsilon&...Matrix[i]+j)=I.0; }

} I*********************finish initialization ***********************1

1**************** start processing rows ********************1

for (n=O; n<mrows; n++) {

1* Here I make use of the way that Matlab stores a matrix. 1n short, Matlab stores a 2-D array, like the one we are using here, as a I-D array of very long length. It stores the array column-wise. 1n other words, the first entry in the yy array is the first entry of the fllSt row of the original Matlab matrix. The second entry in yy is the first entry in the second column of the Matlab matrix. This continues until all of the Matlab matrix is done. If you would like a

82

better description, I suggest you go use Matlab's help facilities. They have a very descriptive picture.

*1 for (h=O; h<nn; h++)

y[h] = yy[h*mrows + n];

1******************BPDecode*************************1

I************initialize rlJ[], Z[][]. Y[][]************I

for (i=O;i<=(nn-l);i++) (

rlJ[i]=(4/nO)*y[i];

I

for (i=O;i<=(rr-l);i++) /I rr is the number of checks (

for (j=O~<=(row_distribution_index[i][O]-l)~++) { /I Use the received sequence to initialize

tem=y[(row_distribution..index[ilU+l])]*(4/nO);

I I

*(y[i]+j)=tem;

*(Z[i]+j)=tem;

iteration=l;

syndrom=l;

1* If the syndrome is 1. then the codeword has errors. Thus. we continue to decode. However. sometimes. we cannot decode. so we use a maximum iteration to ensure that we do not continue forever.

*1 while«syndrom=I)&&(iteration<=IMAX) {

for (i=(nn-l); i>=O; i--)

83

for (i=O; i<=(nn-l); i++)

for (i=O; k=(nn-l); i++)

update_Ebsilong...Matrix(i,row_distribution_index,column_distribution_index,lefCtanhZ. right tanbZ,Ebsilong...Matrix);

for (i=O; k=(nn-l); i++) update_Ebsilong(i, row_distribution_index, column_distribution_index,

Ebsilong...Matrix, Ebsilong);

for (i=O; k=(nn-l); i++) update_EE(i,row _distribution_index,colllIIlILdistribution_index,Ebsilong,

Ebsilong...Matrix,E);

for (i=O; k=(nn-l); i++) update_ZZ(i,row _distribution_index,column_distribution_index, Y ,E,z);

for (q=O; q<=(nn-l); q++) {

}

if( (rO[ q)+Ebsilong[ q)>=O) z[q)=O;

else z[q]=l;

1* for(p=(nn-kk); p<=(nn-l); p++) {

} *!

if( (rO[p ]+Ebsilong[p ]»=0) decoded_message[p-(nn-kk)]=O;

else decoded_message[p-(nn-kk)]=l;

1* I don't return the decoded_message variable because in this case the codeword is more valuable to me. I can use it and avoid doing another encoding later. So, I don't bother with this variable.

*1

I**.**.**********check whether z*H is 0******************1

syndrom=O;

84

for(q=O; q<=(rr-l); q++) {

temp6=O;

for(t=O;t<=(row _distribution_index[ q][O]-l );t++) temp6 = temp6" z[(row_distribution_index[q][t+l])];

}

if(temp6=l) syndrom=l;

iteration++;

} I*end of BP iteratation III while«syndrom'--l)&&(itemtion<=IMAX)*******1

1* This for loop uses the same princple as the yy array. It is stored in Matlab's internal formal

*1 for (h=O; h<nn; h++)

W _hat[h*mrows + n] = (double)z[h];

} IIfor (n=O; n<mrows; n++)

11---------------------------------------------------------------------------------------------11- ----------------------------------------------------------------------------------

1* Free all of the memory *1

mxFree(rO); for (k=O; k<rr; k++)

mxFree(Z[k]); mxFree(Z); for (k=O; k<rr; k++)

mxFree(Y[k]); mxFree(Y); for (k=O; k<rr; k++)

mxFree(E[k]); mxFree(E); for (k=O; k<rr; k++)

mxFree(Ebsilong...Matrix[k]); mxFree(Ebsilong...Matrix); for (k=O; k<rr; k++)

85

mxFree(row _distributiolLindex[k]); mxFree(row_distribution_index);

for (k=O; k<nn; k++) mxFree(column_distribution_index[k]);

mxFree( column_distribution_index);

mxFree(Ebsilong);

mxFree(z);

for (k=O; k<rr; k++) mxFree(righuanhZ[k]);

mxFree(righUanhZ); for (k=O; k<rr; k++)

mxFree(lefUanhZ[k]); mxFree(lefUanhZ);

/I mxFree(decoded_message);

mxFree(y); I

1* The mex function The inputs should be nO and the received message Y Since NO and sqrt(Es) are used as a ratio throughout the code, we incorporate sqrt(Es) into NO and use that scalar instead. In other words, nO is actually NO/sqrt(Es). The output is the decoded message W _hat.

W _hat = LDPC_decode(Y, nO); *1

1* If you would like to understand all of these declarations, then you will have to read the Matlab help files on mex functions.

*1 void mexFunction(int nlhs, mxArray *plhs[], int mhs, const mxArray *prhs[]) {

double nO; double'" yy; double'" W _hat; int ij,mrows;

86

1* Check for proper number of arguments. *1 if(mhs!= 2)

mexErrMsgTxt("Two inputs required. ");

1* Check that the second input is a scalar *1 if (!mxIsDouble(prhs[l]) II mxIsComplex(prhs[l]) II

mxGetN(prhs[l])*mxGetM(prhs[l]) != 1 ) mexErrMsgTxt("Input nO must be a scalar. ");

}

if (mxGetN(prhs[O]) != nn) mexErrMsgTxt("Y does not have the correct code length.");

nO = mxGetScalar(prhs[l]);

1* mrows is the number of blocks being simulated for this particular instance.

*1 mrows = mxGetM(prhs[O]);

yy = mxGetPr(prhs[O]);

1* Create a matrix for the return argument. *1 plhs[O] = mxCreateDoubleMatrix(mrows, nn. mxREAL);

W _hat = mxGetPr(plhs[O]);

Idpc_decode(nO, yy, W Jutt, mrows);

87

1* File desCdecode.c Based on LDPC_decode.c Based on ldpc.cpp written by Jinghu Chen

This file is for the destination decoding of the full duplex method this means that it uses phase angle rotation by theta and previous codeword to decode the next codeword.

*1

The functions to do BP decoding are the same as in the WPC decode.c file. so I left them out. Also. everything else is almost. but not quite the same. so I only highlight the differences in bold.

I*********************************~********************************1

/* This is the primary function that does all of the work. It takes the constants nO, mrows, the received signal matrices yr (real part) and yi (imaginary), and the constant theta and decodes them using the BP functions above.

*/

void desCdecode(const double nO, const double *yr, const double *yi, double *W _hat, const int mrows, const int theta) {

FILE *in! *in2;/* *Generator f'... " _&1"

int ** Generator_matrix = (int **) mxMaIIoc(sizeof(int*)*un); */

1* I no longer need the Generator matrix, since I am using the whole output of the decoder instead of just the message, and then re-encoding.

*/

double *y;

I************items used in BP Decoding***********************1

.... somecode ....

if «in2=fopen("New _coL weighcdistribution.txt ..... r .. »-NULL) mexErrMsgTxt("cannot open the input file2");

88

" if «Generator_fp=fopen("Generator_matrix.txt", "r"»=NULL) " mexErrMsgTxt{"cannot open the Generator_matrix");

1* The H matrix in these two files is in a special form. It is very

.... somecode ....

fc\ose(in2);

1* for(I=O; l<nn; i++) {

fscanf(Generator_fp," %d", &temp_ weight);

if « Generator_matrix[l] = (int *) mxMalloc«temp_welght+l)*sIzeof(int))) = NULL)

mexErrMsgTxt{"Error allocating array Generator_matrix!");

for(j=l;j<=Generator_matrix[i][O];j++ ) fscanf(Generator_fp,"%d" ,&(Generator_matrix[i][j])); "the me have the

index begin from 0 }

fclose(Generator_fp); *1

1* Let me reiterate one more time, that I no longer need the Generator matrix. *1

1*********** Initilization ***************************1

.... somecode ....

Matlah's help facilities. They have a very descriptive picture. *1

if (n-mrows-l) for (1=0; i<nn; i++)

y[l] = yr[i*mrows + n];

1* In this if ... else statement, we see that if the block is the last block to be received, then It does not need to be rotated by theta. If It is not the last block, then it does need to be rotated, with

89

the direction of rotation determined by the previous block. This comes from the paper.

*1

else for (i=O; i<11I1; i++) (

1* z[i] =0; forO=1; j<=Generator_matrix[i][O]; j++)

z[i] "= decoded_message[(Generator_matrix[i][j])]; *1 1* Once again, generator not needed. *1

y[i] = sqrt(pow(yr[i*mrows + n],2) + pow(yi[i*mrows + n],2»; if (z[i]=1)

y[i] = y[i]*cos(atan2(yi[i*mrows + n], yr[i*mrows + n]) + theta); else

y[i] = y[i]*cos(atan2(yi[i*mrows + n], yr[i*mrows + n]) - theta); }

I******************BP Decode*************************1

.... some code ....

for (q=O; q<=(nn-l); q++) {

}

if( (rO[ q]+Ebsilong[ q]»=O) z[q]=O;

else z[q]=l;

for(p=(I1I1-kk); p<=(nn-1); pH) (

}

if«rO[p ]+Ebsilong[p ]»=0) decoded_message[p-(I1I1-kk)]=O;

else decoded_message[p-(I1I1-kk)]=1;

I*****"**********check whether z*H is 0 .... **** .... * ...... • .......... 1

.... somecode ....

90

} I*end of BP iteratation III whlle«syndrom=l)&&(iteration<=IMAX)*******1

1* This for loop uses the same princple as the yy array. It is stored In MatIab's Internal format. However, unlike at the relay, the maximum length of each block is Hmited to kk.

*1 for (1=0; i<kk; i++)

W _hat[i*{mrows.l) + n • 1] = decodetLmessage(i];

} IIfor (n=mrows-1; n>=1; n--)

.... somecode ....

mxFree(y); 1*

for (k=O; k<nn; k++) mxFree(Generator_matrix[k]);

mxFree(Generator_matrix); */ }

1* The mex function The Inputs should be NO and the received message Y and the constant theta The output is the decoded message W _hat

W _hat = desCdecode(Y, nO, theta); *1

1* If you would like to understand all of these declarations, then you will have to read the Matlab help fIles on mex functions.

*1 void mexFunction(int nlhs, mxArray *plhs[], int mhs, canst mxArray *prhs[]) {

double nO, theta; double * yr, * yi; double * W _hat; int ij ,mrows;

1* Check for proper number of arguments. *1 if(nrhs != 3)

mexErrMsgTxt(nThree Inputs required. n);

91

1* Check that the second and third inputs are scalar *1 if (!mxlsDouble(prhs[1]) II mxIsComplex(prhs[1]) II

mxGetN(prhs[l])*mxGetM(prhs[1]) != 1 ) mexErrMsgTxt("lnput nO must be a scalar. "); if (!mxlsDouble(prhs[2]) II mxIsComplex(prhs[2]) II

mxGetN(prhs[2])*mxGetM(prhs[2) != 1 ) mexErrMsgTxt("lnput theta must be a scalar. ");

if (rnxGetN(prhs[O)) != 00) mexErrMsgTxt("Y does not have the correct code length. H);

nO = rnxGetScalar(prhs[l));

1* theta is the constant that was calculated earlier in the Matlab script *1

theta = mxGetScalar(prhs[2]);

mrows = rnxGetM(prhs[O));

1* Real part of the received signal. *1 yr = mxGetPr(prhs[O);

1* Imaginuy part of the received signal. *1 yi = mxGetPi(prhs[O]);

1* Create a matrix for the return argnment. *1 plbs[O) = mxCreateDoubleMatrix(mrows·l, kk, mxREAL);

W _hat = rnxGetPr(plhs[O]);

desCdecode(nO, yr, yi, W _hat, mrows, theta);

}

92

% Procedure File er_tlCrelaY3hannel.m % Is a procedure to simulate the TOD relay channel with Erasure bits %

%load G250OxlOOOO % The use of all zeros for the message as well as a lack of need for % re-encoding means that the Generator matrix (G) is no longer needed.

randn(,state',sum(lOO"'c1ock»;

% Some channel constants c2l = 0.8; %c3l = 0.2; c3l = 0.36; c32 =0.6;

N_02=2; N_03 =2;

%Pl =2.78; PI = 2.5; P2=Pl;

delta = 0.1;

% seed the random number generator

% This is the value of delta that I got through trial and error.

a = -c3l"4"'Pl"2; b = -2*c3l"3"'c32"'Pl "'sqrt(pl *P2); c = c3l"4*Pl"2 - c3l"2*c32"2"'PI *P2 + c21"2*PI; d = 2"'c3l "'c32*sqrt(pl "'P2) + 2"'c31"3*c32*PI *sqrt(Pl "'P2); e = c31"2*PI + c32"2*P2 + c31"2"'c32"2"'PI "'P2 - c21"2*PI + delta;

sqrtKappa = very long equation deleted due to pointlessness

% I got this formula for sqrt(kappa) by using the symbolic math functions % within matIab. Of course, I had to use abc d e variables because if I % had used the regular PI P2 c31 c32 c2l variables, then the equation % length would have overflowed the equation buffer and would not be % reproducible here. The method to get sqrtKappa is as follows: % %symsabcdex % X=s0Ive('a"'xl 4+b"'xI 3+c"'x"2+d*x+e=Q'); % % Then you have to try out values for the four solutions to get the right one.

93

for i = find(sqrtKappa < 0), sqrtKappa(i) = 0;

end;

for i = fmd(sqrtKappa > 1), sqrtKappa(i) = 1;

end;

kappa = abs(sqrtKappail2);

PC1 = (1-kappa)*P1; PC2 = kappa*P1;

alpha =0.5;

cap1 = c21 i12*P1_1; cap2 = (1 + c31 i12*PC1) * (1 + c31i12"'PC2 + c32i12*P2 + 2*c31*c32*sqrt(p1_2*P2))-1;

% cap1 and cap2 are the overall received powers at the relay and the % destination. cap1 represents the power received by the relay during the % relay transmit period. cap2 represents the power received by the % destination during the relay receive and relay transmit periods. These aren't really % necessary, they are just a way to check my values.

format compact;

kk=2500; nn=10000;

sqrtE_s1 = c31 *sqrt(PC1); sqrtE_s2 = c31 *sqrt(PC2) + c32*sqrt(P2); % sqrtE_s1 is the received power at the destination during the relay % receive period % sqrtE_s2 is the received power at the destination during the relay % transmit period

% Generate the sequence to be transmitted % We assume that the discrete input has already been source % coded (compressed) so that it is uniformly distributed. M = 10000; % number of blocks (MUST be a factor of b) b = 100;

% Here we use ceil so that the number of bits is always a whole number.

94

%%%%%%%%% We transmit all zeros to speed up the process %%%%%%%%%%%

% As with the full duplex method, we transmit all zeros so that the G % matrix becomes unnecessary and the encoding is faster.

Xl_l = sqrt(PCl)*ones(b,nn*alpha); Xl_2 = sqrt(PC2)*ones(b,nn*(1-alpha»;

% Since we transmit the all zero sequence, all of the parity bits are zero

blk_errors = 0; biCerrors = 0;

for i = 1 :MIb,

block=i*b

Y2 = c21 *Xl_l + wgn(b, nnI2, N_02l2, 'linear', 'real'); % wgn is linear because that is the standard we are using

% Now we pad Y2 with zeros to accomplish code shortening Z = zeros(b,nn); Z(: ,1 :2:nn) = Y2; clearY2;

% Relay decode sprintf('Relay decoding') tic [C..hat, err] = icrelay_decode(Z, N_02/(c21 *sqrt(PC1))); toe clear Z;

Relay_errors = length(find(sum(C_hat,2»)

% Now we send the other half of the bits coherently with the Xl_2 of % the source (ifXC2 exists)

% C_hat = mod(W _hat*G(:,2:2:nn),2); % clear W _hat;

% Modulate the codeword X2 = sqrt(P2)*(-1)."C_hat(:,2:2:nn); clear C_hat;

95

% This is the part where we decide if we are really going to forward % the message. Since we are only going to forward if there is no % detectable error, we save a little bit of power and possibly improve % the decoding probability. S = find(err); for d = 1 :length(S),

X2(S(d),:) = zeros(l,nn/2); end; clear S d;

Y3_1 = c31*XCl + wgn(b, nnI2, N_03I2, 1inear', 'real');

Y3_2 = c31*XC2 + c32*X2 + wgn(b, nnl2, N_03I2, 1inear', 'real'); clear X2;

% Y3_l is the bits received during the relay receive period % Y3_2 is the bits received during the relay transmit period Y3(:,l:2:nn) = Y3_1; Y3(:,2:2:nn) = Y3_2; clear Y3_1 Y3_2;

% In order to do decoding of blocks where half of them have one SNR and % the other have another SNR, we need the received power at each % period. Thus, the destination decoder uses sqrtE_sl and sqrtE_s2. sprintfCDestination decoding') tic W _hat2 = ir_desCdecode(Y3, N_03, sqrtE_sl, sqrtE_s2); toc clearY3;

% biCerrors = biCerrors + sum(sum(W _bat2» blk_errors = blk_errors + length(fmd(sum(W _bat2,2))) clear W _bat2;

end; clear M b i Relay_errors kk nn block clear XCI XC2 abc d e; format;

96

1* File ir_relay_decode.c Based on LDPC_decode.c Based on ldpc.cpp written by Jinghu Chen

This file is for the relay decoding of the half duplex method. *1

1*******************************************************1 #include <math.h> #include <stdio.h> #include <stdlib.h> #include "mex.h"

#define un 10000 II Length of the code

#defme kk 2500 II No. of information bits

#defme rr 7500 II No. of checks

#defme!MAX 200 I*maximun iteration times*1 II#defme !MAX 500

All of the functions to do belief propagation decoding are the same for this file as for the previous files. so I cut them out.

I**********************************~*******************************1

1* This is the primary function that does all of the work. It takes the constants nO, mrows, the received signal matrix yy and decodes them using the BP functions above. Then it stores the codewords in C_hat and says whether there is an error or not in err.

*1

void ldpc_decode(const double nO, const double *yy, double *C_hat, const int mrows, double >!Ierr) {

.... some code ....

} I"'end of BP iteratation 1/1 while«(syndrom=l)&&(iteration<=!MAX»*******1

97

err[n] = (double)syndrom;

1* This for loop uses the same princple as the yy array. It is stored in MatJab's internal format.

*1

.... some code ....

1* The mex function The inputs should be the noise variance No and the received message Y The output is the decoded message C_hat

[C_hat, err] = ir]elay_decode(Y, nO); *1

1* If you would like to understand all of these declarations, then you will have to read the Matlab help fIles on mex functions.

*1

.... some code ....

int ij,mrows; double *err;

.... somecode ....

}

plhs[l] = rnxCreateDoubleMatrix(1, mrows, mxREAL);

1* err corresponds to the block errors that the decoding algorithm detected. A 1 in position X of the array corresponds to an error detected in row X of the blocks.

*1 err = mxGetPr(plbs[1»;

Idpc_decode(nO, yy, C_hat, mrows, err);

98

1* File icdescdecode.c Based on LDPC_decode.c Based on ldpc.cpp written by Jinghu Chen

This file is for the destination decoding of the half duplex method this means that it needs to know the received powers of each half of the received codeword, hence sqrtEsl and sqrtEs2.

*1

All ofthe functions to do belief propagation decoding are the same for this file as for the previous files. so I cut them out.

I**********************************~*******************************1

1* This is the primary function that does all of the work. It takes the constants nO, mrows, the received signal matrix yy, and the received powers at the destination sqrtEsl and sqrt Esl and decodes them using the BP functions above. *1

void ldpc_decode(const double nO, const double *yy, double *W _hat, const int mrows, const double sqrtEsl, const double sqrtEs2) {

... .lots of code ....

1************initia1ize rOll, Z[][], Y[][)************I

1* This is the first place where we need to apply the received energy at the destination. The ML ratio for Gaussian channels requires that we multiply 4*sqrt(Es)INO. Since the two halves are received with different energy, we have to mUltiply each of them by a different sqrt(Es).

*1

1* Note that sqrtEsl corresponds to the half of the code that is all of the odd numbered columns, i.e. column 1,3,5. It Is important that if you were to somehow change the division that you change it here and in the next place.

*1 for (i=O; i<1lJl; i+=2)

rO[i)=(4*sqrtEsllnO)*y[i);

99

f* sqrtEs2 corresponds to the even numbered columns, i.e. column 2,4,6. If you change the way you divide the codeword, you need to change it here and the place below this.

*1 for (i=1; i<un; i+=2)

rO[i)=(4*sqrtEs2InO)*y[i);

for (i=O;i<=(rr-1);i++) /I rr is the number of rows and bh is the weight of rows {

}

for G=O;j<=(row _distribution_index[i][Oj-1 );j++) { /I Use the received sequence to initialize

1* Here is the next place where you need to change the way the sqrt(Es) is used if you change it in the codeword outside of this source code. I'm sure you can figure out which is which.

*1 if (row_distribution_index[i1U+1)%2 = 0)

tem=y[(row _distribution_index[i1U+ 1]))*(4*sqrtEslfnO); else

tem=y[(row_distribution_index[i1U+1]))*(4*sqrtEs2fnO);

*(y[ij+j)=tem;

*(Z[ij+j)=tem; }

.... some code ....

1* This for loop uses the same priucple as the yy array. It is stored in Matlab's internal format. However, unlike at the relay, the maximum length of each block is limited to kk because that is the message length.

*1 for (h=O; h<kk; h++)

W _hat[b*mrows + n) = (double)decoded_message[b);

} /lfor (n=O; n<mrows; n++)

.... somecode ....

100

1* The mex function The inputs should be the noise variance No and the received message Y and the received energies sqrtEsl and sqrtEs2 The output is the decoded message W _hat

W _hat = unbalancecCLDPC_decode(y, nO, sqrtEsl, sqrtEs2); *1

1* If you would like to understand all of these declarations, then you will have to read the Matlab help files on mex functions.

*1

void mexFunction(int nlhs, mxArray *plhs[], int mhs, const mxArray *prhs[]) {

double nO; double sqrtEsl, sqrtEs2; double'" yy; double'" W _hat; int iJ,mrows;

1* Check for proper number of arguments. *1 if(nrhs != 4)

mexErrMsgTxt("Four inputs required-");

1* Check that the second input is a scalar *1 if (!mxIsDouble(prhs[l]) II mxIsComplex(prhs[l]) II

mxGetN(prhs[l])*mxGetM(prhs[l])!= 1) mexErrMsgTxt("lnput nO must be a scalar. n);

1* Check that the third input is a scalar */ if (!mxIsDouble(prhs[2]) II mxlsComplex(prhs[2]) II

mxGetN(prhs[2])*mxGetM(prhs[2]) 1= 1 ) mexErrMsgTxt("Input sqrt(Esl) must be a scaIar.");

1* Check that the fourth input is a scalar */ if (ImxIsDouble(prhs[3]) II mxlsComplex(prhs[3]) II

mxGetN(prhs[3])*mxGetM(prhs[3]) != 1 ) mexErrMsgTxt("Input sqrt(Es2) must be a scaIar.");

if (mxGetN(prhs[O» != nn) mexErrMsgTxt("Y does not have the correct code length. H);

nO = mxGetScalar(prhs[I]);

1* Received energy of the first half */

101

}

sqrtEsl = mxGetScalar(prbs[2]);

1* Received energy of the second half *1 sqrtEs2 = mxGetScalar(prbs[3]);

1* mrows is the number of blocks being simulated for this particular instance.

*1 mrows = mxGetM(prhs[O]);

yy = mxGetPr(prhs[O]);

1* Create a matrix for the return argument *1 pihs[O] = mxCreateDoubleMatrix(mrows. kk. mxREAL);

W....hat = mxGetPr(pihs[O]);

Idpc_decode(nO, yy, W _hat, mrows, sqrtEsl, sqrtEs2);

102

CHANNEL CODING FOR THE RELAY CHANNEL

Documents