Best practices for benchmarking Simulink → C code on TMS320F28388D (TI C2000) and AM2634 (Sitara)

Question

Pasquale on 19 Sep 2025

0
Link

Direct link to this question

https://uk.mathworks.com/matlabcentral/answers/2180053-best-practices-for-benchmarking-simulink-c-code-on-tms320f28388d-ti-c2000-and-am2634-sitara

Commented: Snehal on 26 Sep 2025 at 6:46

Hello everyone,

I’m preparing a benchmark to compare the computational performance of code generated from a Simulink model on two target boards:

TMS320F28388D (TI C2000 family)
AM2634 (TI Sitara family)

I do not have Simulink Hardware Support Packages for both boards available, so my goal is to generate one portable C codebase that can be compiled and executed on both targets for a fair comparison.

Before I post my model or code, I’d like to ask the community for recommendations and best practices on several point:

How to best measure “computational power” / performance

What metrics should I use for a fair comparison between these two architectures? (e.g., execution time per model_step(), cycles/step, throughput, latency, memory footprint, flash/ROM usage, RAM/stack usage, determinism/jitter)

ERT vs GRT — which to use for benchmarking?

Is ert.tlc (Embedded Coder) always preferable for embedded benchmarking vs grt.tlc?
If I generate with grt.tlc for quick host prototyping, what differences should I expect vs ert.tlc that would meaningfully affect timing/size comparisons?

How to configure the “Hardware Implementation” / ProdHW if I don’t have both HW packages

If I want a single source tree that compiles on both boards, what are the safest Hardware Implementation settings to choose (native word size, endianness, char/short/int/long sizes, portable word sizes, floating-point settings)?
Is setting the target to a Generic 32-bit little-endian + Portable word sizes = ON the recommended conservative approach?

Does using a specific hardware support package / ProdHW device improve generated code optimization?

If I later install the official TI hardware support packages for each board and regenerate code per-board, how much difference should I expect in performance compared to a single generic build?
In practice, is the recommended approach to generate single portable source that compiles on both boards, or generate two board-specific builds (each with its ProdHW/hw package configured) to maximize per-board optimization?

Thanks in advance — any pointers, links to docs/examples or short code snippets will be very helpful.

Best regards,

Pasquale

0 Comments
Show -2 older commentsHide -2 older comments

Sign in to comment.

Sign in to answer this question.

Answer 1

Snehal on 23 Sep 2025 at 9:43

1
Link

Direct link to this answer

https://uk.mathworks.com/matlabcentral/answers/2180053-best-practices-for-benchmarking-simulink-c-code-on-tms320f28388d-ti-c2000-and-am2634-sitara#answer_1570521

Hi @Pasquale,

I understand that you are working on benchmarking Simulink - C code on TMS320F28388D and AM2634.

You may take the following points into consideration while implementing your workflow:

For a fair and focused comparison, measure CPU cycles per model_step()(primary, architecture-neutral metric), execution time and jitter(average, worst-case, and distribution over many iterations to capture determinism), and memory footprint (flash/ROM, RAM, and stack usage from the linker map). These three cover raw compute power, real-time behavior, and resource usage and are some of the most relevant factors for embedded performance benchmarking.
Use ert.tlc for benchmarking as it generates smaller, faster, production-style code optimized for embedded targets. grt.tlc is for host simulation, adds extra scaffolding and overhead, so timing and size results won’t reflect real embedded performance.
Setting Hardware Implementation to Generic 32-bit little-endian with >Portable word sizes = ON is indeed the recommended approach. Additionally, use consistent single-precision floats and fixed-width types since this ensures one portable codebase that compiles cleanly and behaves identically on both boards.
Board-specific ProdHW/support packages usually give better optimization (use of FPU, intrinsics, linker scripts), so expect noticeable speedups. You may consider comparing with a single portable build for fairness first, and generating per-board builds to measure each target’s best-case performance later.

Here are some documentation links for your reference:

Hope this helps!

4 Comments
Show 2 older commentsHide 2 older comments

Pasquale on 26 Sep 2025 at 6:41

Edited: Pasquale on 26 Sep 2025 at 6:42

Thanks Snehal!

Just to conclude, that's what I found online as AM263X Support Package:

https://it.mathworks.com/matlabcentral/fileexchange/174295-embedded-coder-hardware-support-package-for-ti-am26x

Have a nice day,

Pasquale

Snehal on 26 Sep 2025 at 6:46

Oh thanks for sharing!

Have a great day!

Sign in to comment.

Best practices for benchmarking Simulink → C code on TMS320F28388D (TI C2000) and AM2634 (Sitara)

0 Comments
Show -2 older commentsHide -2 older comments

Accepted Answer

4 Comments
Show 2 older commentsHide 2 older comments

More Answers (0)

See Also

Categories

Tags

Products

Release

Community Treasure Hunt

Best practices for benchmarking Simulink → C code on TMS320F28388D (TI C2000) and AM2634 (Sitara)

0 Comments Show -2 older commentsHide -2 older comments

Accepted Answer

4 Comments Show 2 older commentsHide 2 older comments

More Answers (0)

See Also

Categories

Tags

Products

Release

Community Treasure Hunt

0 Comments
Show -2 older commentsHide -2 older comments

4 Comments
Show 2 older commentsHide 2 older comments