Benchmarking
(To be removed) Add benchmarking to the generated code
The Benchmarking parameter will be removed in a future
release. To measure the performance of generated
code, use the gpuPerformanceAnalyzer function instead.
Description
App Configuration Pane: GPU Code
Configuration Objects: coder.GpuCodeConfig
The Benchmarking parameter controls the addition of benchmarking code to the generated CUDA® code.
After execution, the generated benchmarking code creates the
gpuTimingData comma-separated values (CSV) file in the current
working folder. The CSV file contains the timing data for kernel, memory, and other
events. This table describes the format of the CSV file.
| Event Type | Format |
|---|---|
CUDA kernels |
N is
the nth execution of the kernel.
|
CUDA memory copy |
N is the nth execution of the memory copy. |
Miscellaneous |
N is the nth execution of the operation. |
Settings
off (default) | on
- Off
Does not generate CUDA code with benchmarking functionality.
- On
Generates CUDA code with benchmarking functionality. This option uses CUDA APIs such as
cudaEventto timekernel,memcpy, and other events.
Programmatic Use
Property:
Benchmarking |
Values: true |
false |
Default: false |