Info

This question is closed. Reopen it to edit or answer.

Repetition of repeated rows

2 views (last 30 days)
William Hynes
William Hynes on 6 Apr 2016
Closed: MATLAB Answer Bot on 20 Aug 2021
I have a very large matrix (780000X2). In column 1, there is a scenario number, between 1 and 20000. I want to identify those rows which have a scenario number repeated elsewhere at least 20 times. For example, let's say I want to identify rows in Matrix A which have a scenario number repeated at least 2 times.
A= [1 22;2 23;2 24;2 25;3 26]
In this situation, I would be trying to identify (2, 23), (2,24) and (2,25).

Answers (2)

Star Strider
Star Strider on 6 Apr 2016
This works:
A = [1 22;2 23;2 24;2 25;3 26];
[Au,ia,ic] = unique(A(:,1)); % Find unique Values In Column #1
A1h = accumarray(ic, 1); % Historgram Counts Of Those Values
Desired_Rows = A(ic == ia(A1h > 2), :) % Find All Rows With ‘ia’ Index Of Column #1 Numbers Meeting Criteria
Desired_Rows =
2 23
2 24
2 25
It first finds the unique entries in column 1, then counts them in the accumarray call, finds all those meeting criteria in the ‘ic’ output of the unique call, and uses those addresses (a logical vector here) to select and save the output of that to the ‘Desired_Result’ variable.

Robert
Robert on 6 Apr 2016
There are a few ways you could do this. The most straightforward is with hist. Use hist to count the instances of your data with bins 1:20000.
values = 1:20000;
num_occurances = hist(A(:,1),values);
values(num_occurances >= 20)

This question is closed.

Tags

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!