Which type of function call provides better performance in MATLAB?
I have 7 different types of function call:
1. An inlined function, where the code author replaces the function call with a copy of the body of the function.
2. A function is defined in a separate MATLAB file. The arguments are passed by the calling function (file-pass).
3. A function is defined in a separate MATLAB file. The arguments are provided by referencing global variables; only indices are provided by the calling function (file-global).
4. A nested function. The arguments are passed by the enclosing function (nest-pass).
5. A nested function. The arguments are those shared with the enclosing function; only indices are provided by the enclosing function (nest-share).
6. A sub function. The arguments are passed by the calling function (sub-pass).
7. A sub function. The arguments are provided by referencing global variables; only indices are provided by the calling function (sub-global).
(For more information, please see the following three MATLAB files: testTop.m, testCompute.m, and testComputeGlobal.m.)
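A minimal sketch of what a few of these variants might look like (all names here are invented; the actual testTop.m and related files may be organized differently):

```matlab
% Hypothetical sketch of variants 1, 2, 4, and 5 (names invented; the
% real testTop.m / testCompute.m files may differ).
function testTop()
    x = rand(1, 1e6);

    y1 = x.^2 + 1;          % 1. inlined: body pasted at the call site
    y2 = testCompute(x);    % 2. file-pass: function in its own file
    y3 = nestPass(x);       % 4. nest-pass: arguments passed explicitly
    y4 = nestShare();       % 5. nest-share: x shared with enclosing scope

    function y = nestPass(v)
        y = v.^2 + 1;
    end

    function y = nestShare()
        y = x.^2 + 1;       % reads x directly from testTop's workspace
    end
end
```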
I would like to know which function call provides better performance than the others in general.
Robert
on 3 Jul 2018
Helpful stuff, but shouldn't the first alternative read "1. An Inline function. The body of the function is directly written down (inline)."?
broken_arrow
on 20 Mar 2021
That leaves me a bit confused. Isn't an inlined function the same as just pasting the function code into a script (which would mean a script should be the most performant)? This post, on the other hand, suggests that functions are generally faster than scripts: https://de.mathworks.com/matlabcentral/answers/415728-details-on-why-functions-are-faster-than-scripts But if I pass variables to a function, the program still has to look up the variables in the base workspace and launch the function, which creates overhead compared to a script. My "working hypothesis" used to be that everything that is executed only once goes in a script, and functions are for code that is used often...
"Isn't an inlined function the same as just pasting the function code into a script"
No, the given answer does not specify where that function is called/inlined. It could be in a script, or in another function, or in a class. The original answer states nothing specifically about scripts: your answer (and now my comment) is the only place where the word "script" occurs in this thread.
It is the same as pasting the body of the function somewhere, because that is the definition of inlined code.
"(which would mean a script should be the most performant)"
No. The given answer states that a script with inlined code will be faster than the same script calling the equivalent function. The given answer does not give us any information on the relative speeds of scripts vs. functions.
broken_arrow
on 22 Mar 2021
Ok. So given a set of input data and a set of computations to be performed on these data, which finally returns some output, is there a general best practice guideline on how to arrange the code? As I said, my intuition would be to make a main script since it eliminates function call overheads, and use functions for sections of code that are needed frequently. Of course I could turn the main script into a function, but that shouldn't matter, right?
"...given a set of input data and a set of computations to be performed on these data, which finally returns some output, is there a general best practice guideline on how to arrange the code"
Yes: use a function or a class. Avoid scripts.
Personally I see scripts as only useful for experimenting. As soon as I have code that I want to be robust, repeatable, dependable, generalizable, expandable, and testable, I convert it to a function (with a clearly documented interface to the outside world and a corresponding test function, in which I collect all relevant test cases).
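As a sketch of that workflow (all names here are invented), a one-off script body becomes a function with a documented interface, plus a companion test function built on `functiontests`:

```matlab
% computeStats.m -- illustrative function with a documented interface
% (the function name and output fields are invented for this sketch).
function out = computeStats(data)
%COMPUTESTATS Basic statistics of a numeric vector.
%   OUT = COMPUTESTATS(DATA) returns a struct with fields mean and range.
    out.mean  = mean(data);
    out.range = max(data) - min(data);
end
```

```matlab
% testComputeStats.m -- companion test function collecting test cases;
% run with: results = runtests('testComputeStats')
function tests = testComputeStats
    tests = functiontests(localfunctions);
end

function testBasic(testCase)
    out = computeStats([1 2 3]);
    verifyEqual(testCase, out.mean, 2);
    verifyEqual(testCase, out.range, 2);
end
```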
"...I could turn the main script into a function, but that shouldn't matter"
One provides a clearly documented, testable interface to the world, and isolates its functionality from the calling workspace (which is A Very Good Thing™).
The other doesn't (ugh!)
You state that keeping the "main script" as a script "eliminates function overheads", but I very much doubt that any "function overheads" (whatever they might be) are going to be a significant runtime consumer of your "main script", which only gets called once (or occasionally). Functions are compiled, stored** in a cache for re-use, and have a few other optimizations applied, which makes calling them very fast after the first call.
** Which is why beginners who fill their code with cargo-cult programming like CLEAR ALL et al. are doing themselves a large disservice.
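As a rough illustration of how small the steady-state call cost is once a function has been compiled and cached, one could time a trivial function with `timeit` (a sketch, not a definitive benchmark; numbers vary by machine and release):

```matlab
% Sketch: measure steady-state function-call overhead with timeit.
% timeit warms the function up first, so the one-time compilation and
% caching cost is excluded and only the per-call overhead remains.
f = @() 1 + 1;                    % trivial anonymous function
tCall = timeit(f);                % median per-call time
fprintf('per-call cost: %.3g s\n', tCall);
```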
Walter Roberson
on 22 Mar 2021
Of course I could turn the main script into a function, but that shouldn't matter, right?
There are some optimizations that are not taken for scripts.
broken_arrow
on 22 Mar 2021
So I should wrap the code into a main function that internally loads the input data, calls other functions to process them and finally returns a struct with all the output data? I can do that of course, and it may indeed have some advantages regarding testing etc.
Let's look at speed alone for the moment: Given that with current versions all code is cached after its first use (which would eliminate the "compilation advantage" of functions), a "function vs. script" speed comparison comes down to whether the script's disadvantage of having to search a bigger workspace outweighs the "overhead" of launching the function (which must exist in some form, since inlined functions are the fastest as described above). Is that right?
By the way, if the code is cached after compilation on its first run, does that mean that compiling into a MEX file will only speed up the first run?
Rik
on 22 Mar 2021
So I should wrap the code into a main function [...]?
Yes.
Let's look at speed alone for the moment [...] Is that right?
While it is true that there is an overhead to calling functions, this cost is nearly always worth paying, for the reasons alluded to by Stephen. The compilation of a function will only happen after editing the file (or clear all), while my understanding for scripts is that this happens every time.
By the way, if the code is cached after compilation on it's first run, does that mean that compiling into a MEX file will only speed up the first run?
The compilation of m-code is unlikely to be to C-code (or equivalent), so mentioning MEX in this context only confuses matters. Using MEX will generally allow much faster processing, at the cost of doing all the work of implementing your function in C/C++/FORTRAN on your own. On the other hand, m-code will probably be compiled to almost the machine-code level.
broken_arrow
on 22 Mar 2021
Edited: broken_arrow
on 22 Mar 2021
The compilation for a function will only happen after editing the file...
So compiled functions are cached throughout the session (unless changed), whereas scripts are recompiled on every run (but loops do not have to be recompiled in every iteration within a run). But in any case, compiled cached functions are recompiled on the first run after MATLAB is launched again, right?
The compilation of m-code is unlikely to be to C-code (or equivalent)...
So MEX is basically uncompiled C source code (why is it not compiled like m-code)? And can that really be faster than m-code compiled to almost machine level? Regarding the implementation, turning a function into a MEX file is actually quite easy with the coder.
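For reference, the Coder route mentioned here can be as short as one command (this assumes the MATLAB Coder toolbox and a Coder-compatible function; `myCompute` is a made-up name):

```matlab
% Sketch: generate a MEX file from an m-function with MATLAB Coder.
% The -args option supplies example inputs so Coder can infer types
% and sizes. (myCompute is a hypothetical function name.)
codegen myCompute -args {zeros(1, 1000)}   % produces myCompute_mex

x = rand(1, 1000);
y = myCompute_mex(x);                      % call the compiled version
```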
Bruno Luong
on 22 Mar 2021
Edited: Bruno Luong
on 22 Mar 2021
"...uncompiled C source code..."
What is that?
Bruno Luong
on 22 Mar 2021
"The compilation of m-code is unlikely to be to C-code (or equivalent)"
Lately the MATLAB for-loop is so efficient that I'm asking myself whether the JIT accelerator doesn't actually compile the code using Coder or something similar, which is kind of a C-code equivalent.
broken_arrow
on 22 Mar 2021
Edited: broken_arrow
on 22 Mar 2021
Well, by C source code I mean a .c file, as opposed to compiled C code like a .dll. If a MEX file is compiled, why would MATLAB compile it differently than a .m file? (I mean, there must be one compilation technique that is most efficient and would therefore always be the best choice.)
Bruno Luong
on 22 Mar 2021
"So MEX is basically uncompiled C source code".
MEX is compiled; it's actually a DLL on the Windows platform, only the file extension is specific.
Don't use the word "uncompiled", it adds confusion. Just say "C source code".
Bruno Luong
on 22 Mar 2021
Edited: Bruno Luong
on 22 Mar 2021
"If a MEX file is compiled, why would MATLAB compile it differently than a .m file?"
MEX is compiled from C source code.
MATLAB is the MATLAB language.
They are completely different computer languages (MATLAB is not even a computer language, strictly speaking).
I actually don't know what exactly "an m-file is compiled" means. My guess is that it is some sort of transformation of the source text file into some proprietary format, possibly faster to convert to CPU instructions than an interpreter run on the ASCII m-file, p-file, or encrypted MCC files (that interpreter step probably occurs with scripts), but those details are not published by TMW, and surely change from version to version.
broken_arrow
on 22 Mar 2021
Edited: broken_arrow
on 22 Mar 2021
Ok, I think I get it. Still, it seems weird that the Coder can go .m -> .c -> MEX, which results in super fast compiled code, but regular compilation of .m files is done in some different way. Or maybe .m files are compiled using some sort of "hybrid method", as you suppose, where the Coder is used on some parts and others that are not (yet) supported by the Coder are compiled in the "usual MATLAB" manner.
Bruno Luong
on 22 Mar 2021
Edited: Bruno Luong
on 22 Mar 2021
".m -> .c -> MEX, which results in super fast compiled code, but regular compilation of .m files is done in some different way."
Again, not so fast; you like to extrapolate your conclusions. Nobody actually knows the exact details (unless they happen to be TMW employees).
Please read my reply to Rik, where I suspect there is perhaps a full compilation to machine language, at least for some parts of MATLAB functions, with the so-called "JIT".
There might also be some political/commercial reasons (besides technical ones) why the Coder compilation method is not included in the standard MATLAB running engine.
broken_arrow
on 22 Mar 2021
Alright, I think I understand things much better now. Thanks to all of you.
Rik
on 22 Mar 2021
(speculation:) I expect .m-->.c-->MEX will take too long for normal execution to benefit from the investment. It would also require that all parts of your code are actually convertible to C, which is not true, as any Coder report will tell you. Years ago I tried to convert a GUIDE GUI to C to be able to compile it myself to an executable. When I saw I needed to replace half my code I gave up on that.
Bruno Luong
on 22 Mar 2021
"It would also require that all parts of your code are actually convertible to C"
Not necessarily; it can analyze which parts can be compiled and which can NOT.
They have the technology, it's called MATLAB Coder; it can analyze the code and then might do some sort of hybrid: half compilation, half "interpretation", who knows.
I did some benchmarks on a few of the latest MATLAB versions comparing for-loops vs manual MEX, and the for-loop is comparable in speed to the MEX. I can't see how they can achieve such performance without fully compiling part of the MATLAB code (when it can).
Walter Roberson
on 22 Mar 2021
A few years ago, the MATLAB "JIT" (Just In Time semi-compiler) was replaced with the "Execution Engine". It has never been clear what the difference is between JIT and Execution Engine, but we can talk a bit about how optimization has changed and that might give us some hints, maybe:
- JIT was much more limited for scripts. Individual execution paths could be optimized, but the overall relationship of the parts could not be studied. Scripts were not cached with JIT
- EE is able to examine the parts of scripts and optimize parts in relationship to other parts
- for example, even in scripts now, EE checks to see if there is an obvious assignment to variables, and if not, will treat a name as if it were a function, even if it turns out later that something poofs the variable into existence. We had an example yesterday of someone using sim() of a model that did a To Workspace, happening to assign to a name that was also a signal processing function; the signal processing function was called instead, because the flow was examined and assumptions were made
- EE is generally a lot less forgiving of poofing variables
For functions, the flow is documented: as soon as the file containing the function is referenced (function called, handle to function is taken), the function is parsed and converted into an internal tokenized form, effectively the same as .p code (except with debugging permitted). A threaded datastructure is created.
Both EE and JIT are known to build internal machine code structures to handle execution of paths. JIT, by its nature, only converts particular branches to machine code as they are traversed. I have not seen any material about whether EE converts everything immediately or only as it traverses it.
There are different JIT strategies. As converting a stretch to machine code has a cost, some JIT developers only convert a block to machine code the second time the path is executed, under the theory that if a path is only going to be executed once then you might not recover the cost of conversion, but that if you are now executing the path again, that is sufficient evidence to predict that you are likely to want to execute the path again in the future. I do not know if JIT used this "second-time" strategy.
I do not know when the EE does conversion to machine code, whether it is second-time, or first-time, or parse-time. Considering the flow analysis that EE clearly does, I suspect that at the very least, EE works with entire blocks of code -- e.g., at the beginning of a for loop, converting the entire loop instead of waiting for each path in the loop to be encountered. But I do not have any information on how it handles branch prediction and compilation of branches.
Because MATLAB is dynamically typed, any conversion of user functions to machine code cannot be finalized, as the type returned by an expression might change. In theory, MATLAB could pre-analyze the built-ins and the routines provided by Mathworks, to build a dictionary of types each one returns... but I don't think it is doing that, at least not for the .m and .p files (but perhaps for the built-ins). I have some thoughts on how potentially optimization could be handled in the face of changing types, but I have no idea if MATLAB's implementation is anywhere close to my ideas.
Anyhow, what else?
Well, some optimizations are disabled inside try/catch. MATLAB can convert some combinations of statements into calls to BLAS or LAPACK or MKL, but in a try/catch situation you can't do that as much, because at the time of the CATCH you need each variable to have its value as-if flow had proceeded statement by statement, instead of statements being combined internally. This isn't talked about much -- but I believe the implication is that if you have a try/catch around a bunch of statements, you can potentially get higher efficiency by moving the statements into a function. Functions are (mostly) black-boxes that either succeed (returning entire results) or fail (in which case none of the output variables are assigned to), and you do not have to worry about the segment-to-segment flow. That said, I do not at the moment know what happens with regards to functions that do "in-place" updates of variables but also error() -- if you are working in-place, does that mean the output value is however far it got in modification, or is the output defined to be the same as the input if an error occurs? Perhaps in-place is disabled in try/catch... (The debugger will not permit you to examine a variable that it suspects is being modified in-place if you are positioned at the call.)
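A sketch of the refactoring described here, assuming a hypothetical `heavyKernel` function: instead of wrapping many individual statements in try/catch (which limits statement-combining optimizations), wrap a single function call:

```matlab
% Sketch (function and variable names invented): the guarded work is
% moved into one function, which the engine can optimize as a black box
% that either returns complete results or errors.
try
    result = heavyKernel(A, b);   % one call instead of many inline statements
catch err
    warning('heavyKernel failed: %s', err.message);
    result = [];
end
```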
I have convinced myself that the documentation saying that statements coded "in line" are always fastest is wrong: I think there are circumstances under which functions can be faster.
Bruno Luong
on 23 Mar 2021
Edited: Bruno Luong
on 23 Mar 2021
Walter, are you speaking as a spokesperson for TMW (you use "we") or just as some sort of TMW insider? AFAIK you are not a TMW employee. You might know some of the inside information, but do you have access to all of it?
Walter Roberson
on 23 Mar 2021
Edited: Walter Roberson
on 23 Mar 2021
"we can talk" --> whoever wants to be involved in the discussion (and can do so without violating nondisclosure). "We" is the correct plural pronoun in English for the case that the subject of the phrase includes the speaker, and is used in "we can talk" when the person making the statement invites common discussion.
"We had an example yesterday" --> public Question https://www.mathworks.com/matlabcentral/answers/779077-error-not-enough-input-arguments-for-half-model-vehicle-pitch-angle . As it was posted to the public, it would have been misleading for me to say that "I" was presented with the question. The question was presented to us collectively; I am simply the person who happened to respond, using my knowledge of a change that was posted in the Release Notes a couple of releases ago.
Those were the only two places that I used "we" in my earlier comment.
If I had access to the source code, I would have said "I have not checked" or "I haven't had time to research" instead of all those places where I said I do not know or that I have not seen any information. Or I would have used my hypothetical source access to read up on the implementation and then described it... or refrained from describing it if disclosure would have violated non-disclosure.
When I write that JIT did not do something but EE does, I describe what we could observe at the time; I am not describing what the JIT execution could have done if it had been further developed. From my position on the outside, it is not clear that EE is anything more than a rebranding and tinkering of JIT, but twice staff have corrected me to say that JIT is not used anymore but Execution Engine is now used. What is the difference? We (the public) do not know, but we (the public) can examine the documentation of the changes since JIT and hope to reconstruct differences in technology.
Bruno Luong
on 23 Mar 2021
"A few years ago, the MATLAB "JIT" (Just In Time semi-compiler) was replaced with the "Execution Engine". It has never been clear what the difference is between JIT and Execution Engine,"
According to Loren's blog, JIT was not replaced by EE; rather, it has actually been integrated into EE since R2015b:
"MATLAB has employed JIT compilation since release R13 in 2002. The original JIT compiler was designed as a supplement to the MATLAB interpreter, and supported a limited subset of the language. The new execution engine was designed with a focus on JIT compilation, where JIT compilation is an integral part of the engine and able to compile the entire MATLAB language."
So JIT still exists in theory.
Walter Roberson
on 23 Mar 2021
Interesting, at https://www.mathworks.com/content/dam/mathworks/mathworks-dot-com/images/events/matlabexpo/kr/2016/matlab-programming-techniques-for-efficiency-performance.pdf I find the description
MATLAB Execution Engine
Old system had two different execution mechanisms: a JIT and an Interpreter. New system has a single execution mechanism.
Old JIT was designed for FORTRAN-like constructs within MATLAB. New JIT is designed for the entire MATLAB language.
Old system had a monolithic architecture that was difficult to extend. New system has a Modular, Thread-safe, and Platform re-targetable architecture.
Bruno Luong
on 24 Mar 2021
My own reading of that description is that:
- the old JIT was separate from the EE and consisted of calling prebuilt libraries to replace MATLAB code,
- the new JIT is integrated into the EE and is close to a compilation to native machine instructions.