Accelerating PO-SBR for SAR application

Chun Yun Kee, Fu-Gang Hu, Chao-Fu Wang and Tse Tong Chia

Temasek Laboratories National University of Singapore

5A Engineering Drive 1, #09-02, Singapore 117411 cykee, fugang, cfwang, [email protected]

Abstract — Generating a high-resolution SAR image of an object typically requires its scattered field data over a large number of aspect angles and frequencies, which is very time-consuming to obtain. This paper discusses the implementation of the PO-SBR method on a GPU to efficiently compute the massive scattered field data required for SAR applications.

Index Terms—physical optics; shooting and bouncing rays method; SAR; GPU; CUDA

I. INTRODUCTION

The PO-SBR technique [1] is very useful and effective for predicting electromagnetic scattering from PEC objects at high frequencies. In [1-3], the authors introduced the OptiX [4] ray-tracing library and various considerations for implementing PO-SBR on a GPU with optimal performance for a single frequency. To support the synthetic aperture radar (SAR) imaging process, this paper discusses the implementation details of PO-SBR for generating scattered field data over multiple frequencies, enabling efficient simulation of SAR images of complex objects such as aircraft.

II. BRIEF OVERVIEW OF PO-SBR

As described in [1], the induced PO current on the object surface is computed using a closed-form solution based on triangular meshes [5]. An incident plane wave is modelled by a set of parallel optical rays launched towards the object. Because the induced surface currents reradiate as secondary electromagnetic sources, SBR is employed to account for the contributions of reflected waves. Obeying Snell’s law, each ray is traced until it exits the scene or reaches a maximum ray depth. The PO-SBR code in [1] has been optimized for a single frequency on a GPU. Specifically, the ray tube size is determined and ray tracing is performed at each frequency. The ray-trace information is used for the scattered field computation and then discarded immediately.
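The bounce rule described above can be sketched in a few lines. This is a minimal illustration, not the code of [1]: the names `reflect` and `trace` are hypothetical, the surface is assumed to be a perfect reflector so that each bounce is specular, and a pre-supplied list of hit normals stands in for a real ray-geometry intersection routine such as the one a ray-tracing library would provide.

```python
def reflect(direction, normal):
    """Specular reflection of a direction about a unit surface normal."""
    dot = sum(d * n for d, n in zip(direction, normal))
    return tuple(d - 2.0 * dot * n for d, n in zip(direction, normal))


def trace(direction, hit_normals, max_depth):
    """Follow one ray until it exits the scene or reaches max_depth bounces.

    hit_normals is a stand-in for an intersection routine: each entry is the
    unit normal at the next surface hit, and the ray exits the scene when the
    list is exhausted.
    """
    bounces = 0
    for normal in hit_normals:
        if bounces == max_depth:
            break  # maximum ray depth reached
        direction = reflect(direction, normal)
        bounces += 1
    return direction, bounces
```

For a right-angle corner with normals (1, 0, 0) and (0, 1, 0), a ray entering along (-1, -1, 0) leaves along (1, 1, 0) after two bounces, which is the retro-reflection expected from such a corner.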

III. ACCELERATION OF PO-SBR ON GPU

The code in [1] is optimized for a single frequency. It is therefore inefficient for computation over multiple frequencies, because the same rays are traced repeatedly. This section summarizes the considerations discussed in [6] for improving the efficiency of the PO-SBR code in [1].

A. Caching of Rays

The SBR method is adopted to account for the contributions of reflected waves. An incident plane wave is modeled by a set of parallel optical rays launched towards the object. The main observation for the simulation of multiple frequencies is that ray-trace information can be reused: rays launched from the same incident angle need to be traced only once. We cache the rays generated at the higher frequencies and reuse them in the field calculation at the lower frequencies. In exchange for the overall performance boost, we may incur the overhead of memory transactions and of oversampling, since the ray density chosen for the higher frequencies is finer than the lower frequencies require.
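The reuse idea can be illustrated with a toy sweep. Everything here is an assumption made for illustration: `trace_paths` is a hypothetical stand-in for the expensive ray trace and returns made-up path lengths, and the per-frequency "field" is only a sum of phase terms, not the PO integral used in the paper. The point is solely that the trace runs once per incident angle while the frequency loop reuses its result.

```python
import cmath
import math

C0 = 299_792_458.0  # speed of light in vacuum, m/s


def trace_paths(angle_deg):
    """Hypothetical stand-in for the ray trace at one incident angle.

    Returns frequency-independent path lengths in metres (made-up values).
    """
    trace_paths.calls += 1
    return [10.0, 10.5, 11.0]


trace_paths.calls = 0  # counts how often the expensive step runs


def sweep(angle_deg, freqs_hz):
    """Trace once, then evaluate a toy phase sum at every frequency."""
    paths = trace_paths(angle_deg)  # cached for the whole frequency loop
    fields = {}
    for f in freqs_hz:
        k = 2.0 * math.pi * f / C0  # free-space wavenumber
        fields[f] = sum(cmath.exp(-1j * k * p) for p in paths)
    return fields
```

Calling `sweep(0.0, [1e7, 1e8, 1e9])` evaluates three frequencies while `trace_paths.calls` stays at 1. The price is that the cached path data must be kept in memory for the duration of the sweep.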

B. Compaction of Rays

As soon as the rays are traced, the intermediate information that is independent of frequency is stored in an array. Typically, some of the initially launched rays miss the object. This causes thread divergence on the GPU, which hurts performance. To resolve this issue, the array of rays is pre-processed before it takes part in any field calculation: the rays that intersect the target are aggregated at the front of the array. This operation is known as compaction.
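Compaction is commonly built on a prefix sum over the hit flags, which is the form that maps well to parallel hardware. The sketch below is a sequential Python illustration of that idea under assumed names (`compact`, `flags`, `items`); it is not the implementation used in the paper, and it returns only the surviving rays rather than reordering the array in place.

```python
from itertools import accumulate


def compact(flags, items):
    """Keep the items whose flag is 1, preserving their original order.

    The inclusive prefix sum of the flags gives every hit its output slot,
    so on a GPU each thread could write its own element independently.
    """
    inclusive = list(accumulate(flags))
    n_hits = inclusive[-1] if inclusive else 0
    out = [None] * n_hits
    for i, (flag, item) in enumerate(zip(flags, items)):
        if flag:
            out[inclusive[i] - 1] = item
    return out
```

With flags `[1, 0, 1, 1, 0]` and rays `a` to `e`, the result is `['a', 'c', 'd']`: later field kernels then run only over entries that actually hit the target, so neighbouring threads follow the same branch.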

C. Partition of Rays

In most practical cases, the number of rays to be traced is enormous. Because GPU memory is limited, the storage of ray information requires careful management to avoid oversubscribing memory. One approach is to partition the rays into batches that fit into the available memory. A suitable partitioning scheme should minimize the number of resulting sub-grids, which reduces the number of kernel invocations and the corresponding memory transactions.
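One simple way to choose the batches is to take the smallest number of batches that fits the memory budget and then balance their sizes. This is a sketch under assumptions: the function name `partition` and its parameters are hypothetical, each ray is assumed to need a fixed number of bytes, and the partitioning is over a flat ray count rather than the 2-D launch grid that the sub-grids in the text refer to.

```python
def partition(n_rays, bytes_per_ray, free_bytes):
    """Split n_rays into the fewest batches that each fit in free_bytes.

    Returns a list of batch sizes that sum to n_rays.
    """
    if bytes_per_ray <= 0 or free_bytes < bytes_per_ray:
        raise ValueError("memory budget cannot hold a single ray")
    if n_rays == 0:
        return []
    max_per_batch = free_bytes // bytes_per_ray
    n_batches = -(-n_rays // max_per_batch)  # ceiling division
    base, extra = divmod(n_rays, n_batches)
    # Spread the remainder so batch sizes differ by at most one ray.
    return [base + (1 if i < extra else 0) for i in range(n_batches)]
```

For example, `partition(10, 4, 16)` gives `[4, 3, 3]`: at most four rays fit in memory at once, so three batches are the minimum, and balancing them avoids a nearly empty final launch.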

IV. SAR IMAGE PROCESSING

As stated in [7], high-resolution SAR images of targets with large cross-range extents cannot be obtained from narrow-angle data. In this work, the wide-bandwidth, large-angle SAR imaging procedure [7] is applied. The look angle range is set to $\phi \in [0, 2\pi]$. The monostatic SAR image is proportional to the following integral [7]

$$\mathrm{SAR}(x, y) = \iint \hat{P} \cdot \mathbf{F}^s \, e^{-j2(k_x x + k_y y)} \, dk_x \, dk_y \qquad (1)$$

where $\mathbf{F}^s$ is obtained from the scattered far field $\mathbf{E}^s$ by dividing out its range-dependent propagation factor, and $\hat{P}$ is the polarization of the scattered far field $\mathbf{E}^s$. Eq. (1) indicates that the SAR image is proportional to the Fourier transform of the scattered far fields. The far-field data $\mathbf{E}^s$ is obtained in the rectangular frequency-aspect domain, which corresponds to a polar format in the $k_x$-$k_y$ plane. To perform the DFT, first-order interpolation is applied to transfer the original data onto a uniform grid in the $k_x$-$k_y$ plane.
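The regridding step can be sketched as follows. This is one possible reading of first-order interpolation, not necessarily the procedure of [7]: for each point of the uniform $k_x$-$k_y$ grid, the point is mapped back to polar coordinates $(k, \phi)$ and the original samples are interpolated bilinearly. The function name and arguments are hypothetical, both axes are assumed to be uniformly spaced and ascending, and wrap-around across $\phi = 2\pi$ is not handled.

```python
import math


def bilinear_polar_sample(data, k_axis, phi_axis, kx, ky):
    """Bilinearly interpolate data[i][j], sampled at (k_axis[i], phi_axis[j]),
    at the Cartesian spectral point (kx, ky).

    Returns None when the point lies outside the sampled region.
    """
    k = math.hypot(kx, ky)
    phi = math.atan2(ky, kx) % (2.0 * math.pi)
    # Fractional indices along the uniformly spaced axes.
    u = (k - k_axis[0]) / (k_axis[1] - k_axis[0])
    v = (phi - phi_axis[0]) / (phi_axis[1] - phi_axis[0])
    if not (0.0 <= u <= len(k_axis) - 1 and 0.0 <= v <= len(phi_axis) - 1):
        return None
    i = min(int(u), len(k_axis) - 2)
    j = min(int(v), len(phi_axis) - 2)
    a, b = u - i, v - j  # weights inside the enclosing cell
    return ((1 - a) * (1 - b) * data[i][j] + a * (1 - b) * data[i + 1][j]
            + (1 - a) * b * data[i][j + 1] + a * b * data[i + 1][j + 1])
```

Evaluating this function at every node of a uniform $k_x$-$k_y$ grid, with points outside the sampled region set to zero, yields an array to which a standard 2-D DFT can be applied. The same code works for complex-valued samples.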

978-1-4673-7297-8/15/$31.00 ©2015 IEEE

V. NUMERICAL RESULTS

All the results are generated on an NVIDIA Tesla C2075 GPU consisting of 448 CUDA cores with a GPU clock rate of 1.15 GHz.

A. Synthetic test case

We employ an Airbus A380 aircraft model (as shown in Figure 1) for verification. A simulation has been performed for 0.01-1 GHz, [...], and [...]. The backscattered field data in the frequency-aspect domain is shown in Figure 2 to illustrate the complexity of the scattering behaviour of the Airbus A380. A full 360° sweep can generate a high-quality SAR image, as shown in Figure 3. The great resemblance with the aircraft suggests that the scattered far field produced is reasonable. Using the improved method, the simulation took 23,692 seconds while the method in [1] took 100,220 seconds, resulting in a speedup of about 4x.

Figure 1: Meshes and dimensions of the A380 aircraft model (axis labels x, y, z; marked dimensions ~102 m, ~93 m, ~32 m).

Figure 2: (a) Real and (b) imaginary part of scattered far field.

Figure 3: SAR image of A380 aircraft model.

B. Realistic SAR

More images will be presented at the conference.

VI. CONCLUSION

We have discussed several techniques to speed up PO-SBR computation for multiple frequencies on a GPU. Good efficiency is achieved. More results will be presented at the conference.

REFERENCES

[1] C. Y. Kee and C. F. Wang, "Efficient GPU Implementation of the High-Frequency SBR-PO Method," IEEE Antennas and Wireless Propagation Letters, vol. 12, pp. 941-944, 2013.

[2] H.-T. Meng, J. M. Jin, and E. Dunn, "Acceleration of asymptotic computational electromagnetics physical optics shooting and bouncing ray (PO-SBR) method using CUDA," University of Illinois, 2011.

[3] Y. Tao, H. Lin, and H. Bao, "GPU-based shooting and bouncing ray method for fast RCS prediction," Antennas and Propagation, IEEE Transactions on, vol. 58, pp. 494-502, 2010.

[4] S. G. Parker, J. Bigler, A. Dietrich, H. Friedrich, J. Hoberock, D. Luebke, et al., "Optix: A general purpose ray tracing engine," ACM Transactions on Graphics (TOG), vol. 29, p. 66, 2010.

[5] W. Gordon, "Far-field approximations to the Kirchoff-Helmholtz representations of scattered fields," Antennas and Propagation, IEEE Transactions on, vol. 23, pp. 590-592, 1975.

[6] C. Y. Kee, C. F. Wang, and T. T. Chia, "Optimizing High-Frequency PO-SBR on GPU for Multiple Frequencies," Asia-Pacific Conference on Antennas and Propagation, 2015 IEEE, 2015. Submitted.

[7] C. Ozdemir, Inverse Synthetic Aperture Radar Imaging With MATLAB Algorithms: Wiley, 2012.

2015 IEEE 5th Asia-Pacific Conference on Synthetic Aperture Radar (APSAR)
