Info
This question is closed. Reopen it to edit or answer.
Repetition of repeated rows
2 views (last 30 days)
Show older comments
I have a very large matrix (780000X2). In column 1, there is a scenario number, between 1 and 20000. I want to identify those rows which have a scenario number repeated elsewhere at least 20 times. For example, let's say I want to identify rows in Matrix A which have a scenario number repeated at least 2 times.
A= [1 22;2 23;2 24;2 25;3 26]
In this situation, I would be trying to identify (2, 23), (2,24) and (2,25).
0 Comments
Answers (2)
Star Strider
on 6 Apr 2016
This works:
A = [1 22;2 23;2 24;2 25;3 26];
[Au,ia,ic] = unique(A(:,1)); % Find unique Values In Column #1
A1h = accumarray(ic, 1); % Historgram Counts Of Those Values
Desired_Rows = A(ic == ia(A1h > 2), :) % Find All Rows With ‘ia’ Index Of Column #1 Numbers Meeting Criteria
Desired_Rows =
2 23
2 24
2 25
It first finds the unique entries in column 1, then counts them in the accumarray call, finds all those meeting criteria in the ‘ic’ output of the unique call, and uses those addresses (a logical vector here) to select and save the output of that to the ‘Desired_Result’ variable.
0 Comments
Robert
on 6 Apr 2016
There are a few ways you could do this. The most straightforward is with hist. Use hist to count the instances of your data with bins 1:20000.
values = 1:20000;
num_occurances = hist(A(:,1),values);
values(num_occurances >= 20)
0 Comments
This question is closed.
See Also
Community Treasure Hunt
Find the treasures in MATLAB Central and discover how the community can help you!
Start Hunting!