Speed up this bottleneck line of code

4 views (last 30 days)
Alex
Alex on 22 Feb 2025
Edited: Matt J on 23 Feb 2025
Hello.
I have a big chunk of code, here is a part of it (arrays a, b and c, indeces ind1 and ind2 are actually different, its just an artificially constructed example):
n = 10000;
m = 5000;
k = 20;
a = rand(n,m);
b = rand(n,m);
c = rand(m+k,n)*1.1-0.1;
ind1 = repmat([ones(1,m/k),0]==1,1,k)';
ind2 = repmat([0,ones(1,m/k)]==1,1,k)';
tic;
d = m * sum(a .* b .* max(0, c(ind1,:) - c(ind2,:))',2);
toc
I am not satisfied with it's speed. After profiling my code i found that string
d = m * sum(a .* b .* max(0, c(ind1,:) - c(ind2,:))',2);
is a bottleneck of my code. Indeces ind1 and ind2 are almost entirely true.
Now im not sure now to improve from here, i've tried to google how to speed up .* operation or speed up indexing of matrix c but with no luck. Any advice would be great.
Thanks!

Accepted Answer

Matt J
Matt J on 22 Feb 2025
Edited: Matt J on 22 Feb 2025
If you can construct c in transposed form, there are some savings to be had,
n = 10000;
m = 5000;
k = 20;
a = rand(n,m);
b = rand(n,m);
c = rand(m+k,n)*1.1-0.1;
tic;
s=m/k+1;
[ind1,ind2]=deal((1:m+k)');
ind1(s:s:end)=[];
ind2(1:s:end)=[];
toc
Elapsed time is 0.004463 seconds.
tic;
d1 = m * sum(a .* b .* max(0, c(ind1,:)-c(ind2,:))',2);
toc
Elapsed time is 0.555621 seconds.
ct=c';
I=[(1:m)',(1:m)'];
S=repmat(m*[1,-1],m,1);
tic;
J=[ind1,ind2];
A=sparse(J,I,S,m+k,m);
d2 = sum( a.*b.*max(0,ct*A) ,2);
toc
Elapsed time is 0.347437 seconds.
percentDifference = norm(d1-d2)/norm(d1)*100
percentDifference = 4.0847e-14
  2 Comments
Alex
Alex on 23 Feb 2025
Thanks!
It helps to cut off about 1/3 of time from this line!
Any suggestions about .* operator? Is it optimizable?
Matt J
Matt J on 23 Feb 2025
Edited: Matt J on 23 Feb 2025
No, all the basic matrix operators are very well optimized by MathWorks.

Sign in to comment.

More Answers (1)

Walter Roberson
Walter Roberson on 22 Feb 2025
n = 10000;
m = 5000;
k = 20;
tic
a = rand(n,m);
b = rand(n,m);
c = rand(m+k,n)*1.1-0.1;
toc
Elapsed time is 1.545849 seconds.
ind1 = repmat([ones(1,m/k),0]==1,1,k)';
ind2 = repmat([0,ones(1,m/k)]==1,1,k)';
tic;
d = m * sum(a .* b .* max(0, c(ind1,:) - c(ind2,:))',2);
toc
Elapsed time is 0.560316 seconds.
Your bottleneck is in your generation of a, b, and c -- not in your calculation of d.
  3 Comments
Walter Roberson
Walter Roberson on 22 Feb 2025
ind1 is equivalent to selecting 1:m/k . ind2 is equivalent to selecting 2:m/k+1. It might potentially be faster to use the numeric indices instead of logical indices.
Alex
Alex on 23 Feb 2025
What i mean by
"arrays a, b and c, indeces ind1 and ind2 are actually different, its just an artificially constructed example"
is that i dont post this part of code here, this code is actually different and optimized. These a, b, c, ind1 and ind2 are generated just to demonstrate calculations in row that is a bottleneck.

Sign in to comment.

Products


Release

R2023a

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!