
CUDA Technology Reduces FDTD Simulation Time

Oct. 8, 2013
The computational time of finite-difference-time-domain (FDTD) electromagnetic simulations can be significantly reduced with the use of CUDA technology.

The finite-difference time-domain (FDTD) method is widely used for electromagnetic (EM) simulations because of its accuracy, flexibility, and simplicity. Those benefits come at the cost of long computational times. According to Remcom, NVIDIA’s Compute Unified Device Architecture (CUDA) technology can cut that time by more than two orders of magnitude compared with conventional computing. The company’s paper, “Accelerating the Finite Difference Time Domain (FDTD) Method with CUDA,” discusses the migration of a traditional C implementation of a three-dimensional FDTD method to NVIDIA’s CUDA architecture.
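To make the cost of the method concrete, here is a minimal sketch of the leapfrog update at the core of FDTD. It is not taken from the Remcom paper: it is a one-dimensional version in normalized units, and the function name, grid size, Courant number, and Gaussian source are all illustrative assumptions. Every time step touches every cell, which is why run time grows quickly in three dimensions.

```python
import math


def fdtd_1d(n_cells=200, n_steps=150, source_cell=100):
    """Hypothetical 1D FDTD sketch: leapfrog update of Ez and Hy on a Yee grid."""
    ez = [0.0] * n_cells  # electric field samples at integer cell positions
    hy = [0.0] * n_cells  # magnetic field samples, staggered half a cell
    courant = 0.5         # c*dt/dx (assumed value); 1D scheme is stable for <= 1

    for step in range(n_steps):
        # Update H from the spatial difference of E.
        for k in range(n_cells - 1):
            hy[k] += courant * (ez[k + 1] - ez[k])
        # Update E from the spatial difference of H.
        # The end cells are never updated, so they act as perfect conductors.
        for k in range(1, n_cells - 1):
            ez[k] += courant * (hy[k] - hy[k - 1])
        # Soft Gaussian source added at a single cell (assumed excitation).
        ez[source_cell] += math.exp(-((step - 30) / 10.0) ** 2)

    return ez


if __name__ == "__main__":
    field = fdtd_1d()
    print(max(abs(v) for v in field))
```

The two inner loops are independent across cells within a half step, which is the property a GPU implementation exploits.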

The six-page paper covers the challenges and techniques involved in adapting the FDTD algorithm to modern graphics processing units (GPUs) through NVIDIA’s CUDA framework. The GPU approach runs thousands of threads simultaneously, and achieving maximum speed requires special design considerations. With a proper understanding of CUDA, speedups of more than two orders of magnitude over traditional central processing units (CPUs) are possible.
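One common way to spread a grid across thousands of threads is to assign one thread per cell. The sketch below emulates that index arithmetic in plain Python; it is an assumption for illustration, not the mapping described in the Remcom paper, and the function names and the 256-thread block size are hypothetical. The flat index mirrors the usual CUDA expression `blockIdx.x * blockDim.x + threadIdx.x`.

```python
def blocks_needed(n_cells, block_dim):
    """Number of thread blocks required to cover n_cells (ceiling division)."""
    return (n_cells + block_dim - 1) // block_dim


def cell_from_thread(block_idx, thread_idx, block_dim, nx, ny, nz):
    """Map a 1D (block, thread) pair to a 3D cell index, or None if out of range."""
    flat = block_idx * block_dim + thread_idx
    if flat >= nx * ny * nz:
        return None  # surplus threads in the last block do no work
    i = flat % nx            # fastest-varying axis, so neighbors in x are adjacent in memory
    j = (flat // nx) % ny
    k = flat // (nx * ny)
    return (i, j, k)


if __name__ == "__main__":
    nx = ny = nz = 10
    block_dim = 256  # assumed block size
    print(blocks_needed(nx * ny * nz, block_dim))
    print(cell_from_thread(3, 231, block_dim, nx, ny, nz))
```

Making x the fastest-varying index means that consecutive threads touch consecutive memory locations, which is generally the access pattern GPUs handle best.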

Although GPUs were originally used solely to drive graphical displays, they have evolved into powerful computational devices. The Tesla C1060 GPU, for example, can yield significant performance gains over a 2.66-GHz Intel Core 2 Quad processor. The paper provides additional details on CUDA GPUs, including their architecture and the several types of memory they offer. Functions targeted for the GPU are implemented in CUDA as kernels, which are written in a language similar to C. The paper also describes the optimizations that were implemented, which reduced memory operations to less than 14% of the original total. These optimizations, along with some others, were applied to the FDTD algorithm, which was integrated into Remcom’s XFdtd software.

A modern cellular phone design, including all major device components, was chosen for the final test simulation. Tests were performed using a single thread for the CPU baseline and all four Tesla C1060s for the GPU implementation. The results showed that the GPU implementation consistently ran more than two orders of magnitude faster than the CPU implementation.

Remcom, Inc., 315 S. Allen St., Ste. 416, State College, PA 16801; (814) 861-1299.
