Best estimate for random values

The name of the pictureThe name of the pictureThe name of the pictureClash Royale CLAN TAG#URR8PPP











up vote
4
down vote

favorite












Due to work related issues I can't discuss the exact question I want to ask, but I thought of a silly little example that conveys the same idea.



Lets say the number of candy that comes in a package is a random variable with mean $mu$ and a standard deviation $s$, after about 2 months of data gathering we've got about 100000 measurements and a pretty good estimate of $mu$ and $s$.



Lets say that said candy comes in 5 flavours that are NOT identically distributed (we know the mean and standard deviation for each flavor, lets call them $mu_1$ through $mu_5$ and $s_1$ trough $s_5$).



Lets say that next month we will get a new batch (several packages) of candy from our supplier and we would like to estimate the amount of candy we will get for each flavour. Is there a better way than simply assuming that we'll get "around" the mean for each flavour taking into account that the amount of candy we'll get is around $mu$?



I have access to all the measurements made, so if anything is needed (higher order moments, other relevant data, etc.) I can compute it and update the question as needed.



Cheers and thanks!







share|cite|improve this question






















  • Is "batch" synonymous with "package"? If so, the question would be considerably clearer if you used the same word again. If not, please clarify the difference.
    – joriki
    Apr 2 '13 at 21:34










  • I'm sorry, I forgot to add that a batch is several packages, I don't think it matters as we'll evaluate per "package"
    – Zegpi
    Apr 2 '13 at 21:52














up vote
4
down vote

favorite












Due to work related issues I can't discuss the exact question I want to ask, but I thought of a silly little example that conveys the same idea.



Lets say the number of candy that comes in a package is a random variable with mean $mu$ and a standard deviation $s$, after about 2 months of data gathering we've got about 100000 measurements and a pretty good estimate of $mu$ and $s$.



Lets say that said candy comes in 5 flavours that are NOT identically distributed (we know the mean and standard deviation for each flavor, lets call them $mu_1$ through $mu_5$ and $s_1$ trough $s_5$).



Lets say that next month we will get a new batch (several packages) of candy from our supplier and we would like to estimate the amount of candy we will get for each flavour. Is there a better way than simply assuming that we'll get "around" the mean for each flavour taking into account that the amount of candy we'll get is around $mu$?



I have access to all the measurements made, so if anything is needed (higher order moments, other relevant data, etc.) I can compute it and update the question as needed.



Cheers and thanks!







share|cite|improve this question






















  • Is "batch" synonymous with "package"? If so, the question would be considerably clearer if you used the same word again. If not, please clarify the difference.
    – joriki
    Apr 2 '13 at 21:34










  • I'm sorry, I forgot to add that a batch is several packages, I don't think it matters as we'll evaluate per "package"
    – Zegpi
    Apr 2 '13 at 21:52












up vote
4
down vote

favorite









up vote
4
down vote

favorite











Due to work related issues I can't discuss the exact question I want to ask, but I thought of a silly little example that conveys the same idea.



Lets say the number of candy that comes in a package is a random variable with mean $mu$ and a standard deviation $s$, after about 2 months of data gathering we've got about 100000 measurements and a pretty good estimate of $mu$ and $s$.



Lets say that said candy comes in 5 flavours that are NOT identically distributed (we know the mean and standard deviation for each flavor, lets call them $mu_1$ through $mu_5$ and $s_1$ trough $s_5$).



Lets say that next month we will get a new batch (several packages) of candy from our supplier and we would like to estimate the amount of candy we will get for each flavour. Is there a better way than simply assuming that we'll get "around" the mean for each flavour taking into account that the amount of candy we'll get is around $mu$?



I have access to all the measurements made, so if anything is needed (higher order moments, other relevant data, etc.) I can compute it and update the question as needed.



Cheers and thanks!







share|cite|improve this question














Due to work related issues I can't discuss the exact question I want to ask, but I thought of a silly little example that conveys the same idea.



Lets say the number of candy that comes in a package is a random variable with mean $mu$ and a standard deviation $s$, after about 2 months of data gathering we've got about 100000 measurements and a pretty good estimate of $mu$ and $s$.



Lets say that said candy comes in 5 flavours that are NOT identically distributed (we know the mean and standard deviation for each flavor, lets call them $mu_1$ through $mu_5$ and $s_1$ trough $s_5$).



Lets say that next month we will get a new batch (several packages) of candy from our supplier and we would like to estimate the amount of candy we will get for each flavour. Is there a better way than simply assuming that we'll get "around" the mean for each flavour taking into account that the amount of candy we'll get is around $mu$?



I have access to all the measurements made, so if anything is needed (higher order moments, other relevant data, etc.) I can compute it and update the question as needed.



Cheers and thanks!









share|cite|improve this question













share|cite|improve this question




share|cite|improve this question








edited Apr 2 '13 at 21:51

























asked Apr 2 '13 at 21:08









Zegpi

212




212











  • Is "batch" synonymous with "package"? If so, the question would be considerably clearer if you used the same word again. If not, please clarify the difference.
    – joriki
    Apr 2 '13 at 21:34










  • I'm sorry, I forgot to add that a batch is several packages, I don't think it matters as we'll evaluate per "package"
    – Zegpi
    Apr 2 '13 at 21:52
















  • Is "batch" synonymous with "package"? If so, the question would be considerably clearer if you used the same word again. If not, please clarify the difference.
    – joriki
    Apr 2 '13 at 21:34










  • I'm sorry, I forgot to add that a batch is several packages, I don't think it matters as we'll evaluate per "package"
    – Zegpi
    Apr 2 '13 at 21:52















Is "batch" synonymous with "package"? If so, the question would be considerably clearer if you used the same word again. If not, please clarify the difference.
– joriki
Apr 2 '13 at 21:34




Is "batch" synonymous with "package"? If so, the question would be considerably clearer if you used the same word again. If not, please clarify the difference.
– joriki
Apr 2 '13 at 21:34












I'm sorry, I forgot to add that a batch is several packages, I don't think it matters as we'll evaluate per "package"
– Zegpi
Apr 2 '13 at 21:52




I'm sorry, I forgot to add that a batch is several packages, I don't think it matters as we'll evaluate per "package"
– Zegpi
Apr 2 '13 at 21:52










1 Answer
1






active

oldest

votes

















up vote
0
down vote













I find the question a bit unclear but since you have an estimate of the mean and standard deviation for a single package, assuming the candy in each package is independent why not use, $E(X_1+X_2)=E(X_1)+E(X_2)$ and $Var(X_1+X_2) = Var(X_1) + Var(X_2) + 2*cov(X_1,X_2)$ ?






share|cite|improve this answer




















  • Since we want to get an estimate for each "flavour" we are now using $mu_1$ through $mu_5$. I would like to know if there is a better method to estimate how many of each "flavour" we'll get.
    – Zegpi
    Apr 2 '13 at 22:00










  • I do no think that you can do beter then using the distirubtion of each flavour. If for example you would know that $sigma_1=sigma_2$ you could use a pooled variance estimator to get a better estimate $s_pooled$ for $sigma_1$ and $sigma_2$. The same reasoning holds for $mu$.
    – MrOperator
    Apr 3 '13 at 10:14










  • That's what I thought, but I hoped someone could have a better idea.
    – Zegpi
    Apr 3 '13 at 11:46










Your Answer




StackExchange.ifUsing("editor", function ()
return StackExchange.using("mathjaxEditing", function ()
StackExchange.MarkdownEditor.creationCallbacks.add(function (editor, postfix)
StackExchange.mathjaxEditing.prepareWmdForMathJax(editor, postfix, [["$", "$"], ["\\(","\\)"]]);
);
);
, "mathjax-editing");

StackExchange.ready(function()
var channelOptions =
tags: "".split(" "),
id: "69"
;
initTagRenderer("".split(" "), "".split(" "), channelOptions);

StackExchange.using("externalEditor", function()
// Have to fire editor after snippets, if snippets enabled
if (StackExchange.settings.snippets.snippetsEnabled)
StackExchange.using("snippets", function()
createEditor();
);

else
createEditor();

);

function createEditor()
StackExchange.prepareEditor(
heartbeatType: 'answer',
convertImagesToLinks: true,
noModals: false,
showLowRepImageUploadWarning: true,
reputationToPostImages: 10,
bindNavPrevention: true,
postfix: "",
noCode: true, onDemand: true,
discardSelector: ".discard-answer"
,immediatelyShowMarkdownHelp:true
);



);













 

draft saved


draft discarded


















StackExchange.ready(
function ()
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fmath.stackexchange.com%2fquestions%2f349483%2fbest-estimate-for-random-values%23new-answer', 'question_page');

);

Post as a guest






























1 Answer
1






active

oldest

votes








1 Answer
1






active

oldest

votes









active

oldest

votes






active

oldest

votes








up vote
0
down vote













I find the question a bit unclear but since you have an estimate of the mean and standard deviation for a single package, assuming the candy in each package is independent why not use, $E(X_1+X_2)=E(X_1)+E(X_2)$ and $Var(X_1+X_2) = Var(X_1) + Var(X_2) + 2*cov(X_1,X_2)$ ?






share|cite|improve this answer




















  • Since we want to get an estimate for each "flavour" we are now using $mu_1$ through $mu_5$. I would like to know if there is a better method to estimate how many of each "flavour" we'll get.
    – Zegpi
    Apr 2 '13 at 22:00










  • I do no think that you can do beter then using the distirubtion of each flavour. If for example you would know that $sigma_1=sigma_2$ you could use a pooled variance estimator to get a better estimate $s_pooled$ for $sigma_1$ and $sigma_2$. The same reasoning holds for $mu$.
    – MrOperator
    Apr 3 '13 at 10:14










  • That's what I thought, but I hoped someone could have a better idea.
    – Zegpi
    Apr 3 '13 at 11:46














up vote
0
down vote













I find the question a bit unclear but since you have an estimate of the mean and standard deviation for a single package, assuming the candy in each package is independent why not use, $E(X_1+X_2)=E(X_1)+E(X_2)$ and $Var(X_1+X_2) = Var(X_1) + Var(X_2) + 2*cov(X_1,X_2)$ ?






share|cite|improve this answer




















  • Since we want to get an estimate for each "flavour" we are now using $mu_1$ through $mu_5$. I would like to know if there is a better method to estimate how many of each "flavour" we'll get.
    – Zegpi
    Apr 2 '13 at 22:00










  • I do no think that you can do beter then using the distirubtion of each flavour. If for example you would know that $sigma_1=sigma_2$ you could use a pooled variance estimator to get a better estimate $s_pooled$ for $sigma_1$ and $sigma_2$. The same reasoning holds for $mu$.
    – MrOperator
    Apr 3 '13 at 10:14










  • That's what I thought, but I hoped someone could have a better idea.
    – Zegpi
    Apr 3 '13 at 11:46












up vote
0
down vote










up vote
0
down vote









I find the question a bit unclear but since you have an estimate of the mean and standard deviation for a single package, assuming the candy in each package is independent why not use, $E(X_1+X_2)=E(X_1)+E(X_2)$ and $Var(X_1+X_2) = Var(X_1) + Var(X_2) + 2*cov(X_1,X_2)$ ?






share|cite|improve this answer












I find the question a bit unclear but since you have an estimate of the mean and standard deviation for a single package, assuming the candy in each package is independent why not use, $E(X_1+X_2)=E(X_1)+E(X_2)$ and $Var(X_1+X_2) = Var(X_1) + Var(X_2) + 2*cov(X_1,X_2)$ ?







share|cite|improve this answer












share|cite|improve this answer



share|cite|improve this answer










answered Apr 2 '13 at 21:47









MrOperator

1886




1886











  • Since we want to get an estimate for each "flavour" we are now using $mu_1$ through $mu_5$. I would like to know if there is a better method to estimate how many of each "flavour" we'll get.
    – Zegpi
    Apr 2 '13 at 22:00










  • I do no think that you can do beter then using the distirubtion of each flavour. If for example you would know that $sigma_1=sigma_2$ you could use a pooled variance estimator to get a better estimate $s_pooled$ for $sigma_1$ and $sigma_2$. The same reasoning holds for $mu$.
    – MrOperator
    Apr 3 '13 at 10:14










  • That's what I thought, but I hoped someone could have a better idea.
    – Zegpi
    Apr 3 '13 at 11:46
















  • Since we want to get an estimate for each "flavour" we are now using $mu_1$ through $mu_5$. I would like to know if there is a better method to estimate how many of each "flavour" we'll get.
    – Zegpi
    Apr 2 '13 at 22:00










  • I do no think that you can do beter then using the distirubtion of each flavour. If for example you would know that $sigma_1=sigma_2$ you could use a pooled variance estimator to get a better estimate $s_pooled$ for $sigma_1$ and $sigma_2$. The same reasoning holds for $mu$.
    – MrOperator
    Apr 3 '13 at 10:14










  • That's what I thought, but I hoped someone could have a better idea.
    – Zegpi
    Apr 3 '13 at 11:46















Since we want to get an estimate for each "flavour" we are now using $mu_1$ through $mu_5$. I would like to know if there is a better method to estimate how many of each "flavour" we'll get.
– Zegpi
Apr 2 '13 at 22:00




Since we want to get an estimate for each "flavour" we are now using $mu_1$ through $mu_5$. I would like to know if there is a better method to estimate how many of each "flavour" we'll get.
– Zegpi
Apr 2 '13 at 22:00












I do no think that you can do beter then using the distirubtion of each flavour. If for example you would know that $sigma_1=sigma_2$ you could use a pooled variance estimator to get a better estimate $s_pooled$ for $sigma_1$ and $sigma_2$. The same reasoning holds for $mu$.
– MrOperator
Apr 3 '13 at 10:14




I do no think that you can do beter then using the distirubtion of each flavour. If for example you would know that $sigma_1=sigma_2$ you could use a pooled variance estimator to get a better estimate $s_pooled$ for $sigma_1$ and $sigma_2$. The same reasoning holds for $mu$.
– MrOperator
Apr 3 '13 at 10:14












That's what I thought, but I hoped someone could have a better idea.
– Zegpi
Apr 3 '13 at 11:46




That's what I thought, but I hoped someone could have a better idea.
– Zegpi
Apr 3 '13 at 11:46

















 

draft saved


draft discarded















































 


draft saved


draft discarded














StackExchange.ready(
function ()
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fmath.stackexchange.com%2fquestions%2f349483%2fbest-estimate-for-random-values%23new-answer', 'question_page');

);

Post as a guest













































































這個網誌中的熱門文章

How to combine Bézier curves to a surface?

Mutual Information Always Non-negative

Why am i infinitely getting the same tweet with the Twitter Search API?