How to draw random number from a conditional distribution

Question

yanting wu on 23 Sep 2019

0
Link

Direct link to this question

https://uk.mathworks.com/matlabcentral/answers/481736-how-to-draw-random-number-from-a-conditional-distribution

Edited: Adam Danz on 24 Sep 2019

Accepted Answer: Adam Danz

Hi, dear friends,

I am working a drawing 10000 random numbers for variable, X, based on the following two conditions:

condition 1: X follows a normal distribution with mean=12.21 and standard deviation = 0.63;

condition 2: given another variable Y, the probability of X >= Y is 0.97: prob (X(n)>=Y(n) =0.97).

I know how to draw a random variable with specific mean and standard deviation, but I don't know how to plug in the second condition to my code.

Can anyone give me some help?

I do really appreciate.

Thank you again

Best,

Yanting

2 Comments
Show NoneHide None

the cyclist on 23 Sep 2019

Edited: the cyclist on 23 Sep 2019

I don't understand condition 2. What does n represent? What is X(n)?

Regardless, your condition 1 fully specifies the distribution. There is only one distribution that can obey all those conditions. So, I don't see how condition 2 can also be satisfied, in general.

Or maybe I am misunderstanding something.

Edit: Maybe Adam's answer is actually what you meant, in which case you can generate values using rejection sampling.

yanting wu on 23 Sep 2019

Open in MATLAB Online

Hi,

Thank you for your quick reply.

The second condition is that: I have another variable Y, given the value of each nth Y, my nth variable X should follow a normal distribution and should have 97% of chance greater or equal than Y.

I did the following matlab loop and it takes me almost infinite time to run, so I want to add the second condition into each n.

The matlab code I used is:

Y=randn(10000,1) * 0.61+12.01;
count=zeros(10000,1);
control =1;
while control==1
X=randn(10000,1) * 0.63+12.21;  
for n1=1:10000
if X>=Y
    count(n1)=1;
else
    count(n1)=0;
end
s=sum(count);
pn=s/10000;
if pn<0.97
    control=1
else 
    control=control+1
end
end   
end

Sign in to comment.

Sign in to answer this question.

Answer 1

Adam Danz on 23 Sep 2019

0
Link

Direct link to this answer

https://uk.mathworks.com/matlabcentral/answers/481736-how-to-draw-random-number-from-a-conditional-distribution#answer_393167

Edited: Adam Danz on 23 Sep 2019

Open in MATLAB Online

The first condition is easy,

X = randn(1,100000) * .63 + 12.21;
figure
histogram(X)
mean(X)  % <--- ~12.21
std(X)   % <--- ~0.63

To find the cuttoff of the distribution that defines 97% of the data on the right and 3% of the data on the left, you need to find the 3rd percentile of the data. Any value in X has a 97% chance of being greater than 'pt'.

pt = prctile(X,3); %stats toolbox

Now add a vertical reference line to the plot and convince yourself that 97% of the data are greater than this value.

xline(pt, 'k-', '3rd percentile')
mean(X > pt)  %  <--- ~97%

8 Comments
Show 6 older commentsHide 6 older comments

Adam Danz on 23 Sep 2019

Open in MATLAB Online

" I need at least 9700 X is greater or equal to Y"

How are you defining that? When you draw 'n' values from both distributions, are you just comparing X(n) to Y(n)? If so, is there a reason for that comparison?

If you sort the values first, you'll definitely get more than 97% of X>Y.

mean(X >= Y)*100             % unsorted percentage of X>Y  ~59%
mean(sort(x) >= sort(y))*100 % sorted percentage of X>Y    ~99%

Here's a demo.

% Define distributions
mu = [12.21, 12.01];
sig = [.63, .61];
% run 1000 repetitions of the test
ntests = 1000;
p = zeros(1,ntests); 
ps = p; 
wbh = waitbar(0,sprintf('%d repetitions',ntests)); 
for i = 1:ntests
    d1 = randn(1,10000) * sig(1) + mu(1);
    d2 = randn(1,10000) * sig(2) + mu(2);
    p(i) = mean(d1>=d2);               % unsorted
    ps(i) = mean(sort(d1)>=sort(d2));  % sorted
    waitbar(i/ntests,wbh)
end
delete(wbh)
% Plot results
figure()
plot(p*100, 'b-', 'DisplayName', 'unsorted')
hold on
plot(ps*100, 'r-', 'DisplayName', 'sorted')
xlabel('repetition number')
ylabel('X>Y (%)')
ylim([-inf, 105])
legend()

yanting wu on 24 Sep 2019

Hi, Adam,

Yes, I need to compare X(n) with Y(n). In total I need 10000 pairs of observations (X , Y), and at least 9700 pairs of them with X>= Y. Because Y are created in the previous step with normal distribution, I can consider Y(n) as a constant number for generating X.

Thank you

Yanting

Adam Danz on 24 Sep 2019

Yeah, that will be impossible as the figure shows in an earlier comment of mine (blue line). You'll never get 97% values in X>=Y if x and y are numbers from normal distributions with the means and std's you specified.

Sign in to comment.

Answer 2

Bruno Luong on 24 Sep 2019

0
Link

Direct link to this answer

https://uk.mathworks.com/matlabcentral/answers/481736-how-to-draw-random-number-from-a-conditional-distribution#answer_393236

Edited: Bruno Luong on 24 Sep 2019

Open in MATLAB Online

Your problem is not well defined, there are many Y that can meet such requirement (if you guys think it is impossible beside a scalar you are wrong). One these is given by this code:

mux = 12.21;
sigma = 0.63;
muy = fzero(@(muy) integral2(@(x,t) pdffun(x,t,muy), -Inf, Inf, 0, Inf)-0.97, 0);
muy = mux + sigma*muy; % 10.534298385905991 you can enter this value without using FZERO
X = mux + sigma*randn(1,1e6);
Y = muy + sigma*randn(1,1e6);
% Check how many X >= Y, it should be about 0.97
sum(X>=Y) / length(X)
function p = pdffun(x,t,muy)
y = x - t - muy;
p = 1/(2*pi)*exp(-x.^2/2).*exp(-y.^2/2);
end

1 Comment
Show -1 older commentsHide -1 older comments

Adam Danz on 24 Sep 2019

Edited: Adam Danz on 24 Sep 2019

"Your problem is not well defined, there are many Y that can meet such requirement (if you guys think it is impossible beside a scalar you are wrong)."

Agreed that the problem is not well defined but some valid interpretations of the problem (including the correct interpretation, according to OP) would result in impossibilities as I've shown above.

Interpretation 1

The distribution X only has 1 value where 97% of the data are greater than that value (the 3rd percentile). So it would be impossible to have a normal distribution with a known mean and known std and have multiple y values that define that boundary.

Interpretation 2

If 'n' random Ys are drawn from distribution y and 'n' random Xs are drawn from a distribution x, 97% of the Y(1:n) should be greater than X(1:n). Given the means and std's the OP defined, I've shown that this is indeed impossible and plotted the simulated results across 1000 repetitions where there were never more than ~60% Y values greater than their X pair.

Note that in your solution, the two distributions do not have the means and std's given by OP: "...generate 10000 observations for two normal distributed variables X (mean= 12.21, SD= 0.63) and Y (mean =12.01, SD=0.61)."-OP

Sign in to comment.

How to draw random number from a conditional distribution

2 Comments
Show NoneHide None

Accepted Answer

8 Comments
Show 6 older commentsHide 6 older comments

More Answers (1)

1 Comment
Show -1 older commentsHide -1 older comments

See Also

Categories

Tags

Community Treasure Hunt

How to draw random number from a conditional distribution

2 Comments Show NoneHide None

Accepted Answer

8 Comments Show 6 older commentsHide 6 older comments

More Answers (1)

1 Comment Show -1 older commentsHide -1 older comments

See Also

Categories

Tags

Community Treasure Hunt

2 Comments
Show NoneHide None

8 Comments
Show 6 older commentsHide 6 older comments

1 Comment
Show -1 older commentsHide -1 older comments