AleoMagus
07-27-2004, 11:01 PM
This is leading to a very complicated question so bear with me...
Recently I have been doing a bit of statistics studying in order to calculate confidence levels in sng results.
After calculating Standard deviation, I assume standard normal distribution and have come up with workable numbers which give some idea of what kind of swings to expect, etc...
I put this all into an excel spreadsheet for ease of use
http://www.aleomagus.freeservers.com/Spreadsheet
My 'confidence calculator' is the file I am refering to.
Anyways, I was happy with this for a while, but more and more I am thinking... just one problem... this is all wrong.
My calculations are correct, but it seems foolish to say that after 1 10+1 SNG, my SD is $19. It also seems foolish to say that after n tourneys, my standard deviation is SD*SQRT(n)
The reason I say this is that in any given sng, my expectation will only deviate in the given prize totals and this deviation is a direct result of my Finish percentages.
For example, after 1 sng, I can only be +39, +19, +9, or -11 and this result will occur in the frequency of my 1st, 2nd, 3rd, 4th-10th place finishes. This is to say that after 1 sng, I can have only 4 distinct outcomes.
Similarly after 2 sngs, I can only have 10 distinct outcomes
After 3 sngs, 20 outcomes
After 4 sngs, 35 outcomes
After 5 sngs, 56 outcomes
and eventually
After 1000 sngs, 167668501 outcomes
and so on
By outcomes, I mean the combination of all net $ finishing possibilities within the set of sngs. For example, if I play 4 sngs, a few possible outcomes are...
{+19,-11,-11,-11}
{+39,+9,+9,-11}
{-11,+39,+19,+19}
{-11,-11,-11,-11}
{+39,+39,+39,+39}
etc...
Keep in mind, I also am treating sets like
{-11,+9,-11,+9}
{+9,+9,-11,-11}
{-11,+9,+9,-11}
As only ONE outcome because the order doesn't matter to me here.
Don't ask me how I discovered for n tourneys, the number of outcomes. After a lot of work and discovery it turns out that all I needed to do was use:
C(4,1) for 1 sng
C(5,2) for 2 sngs
C(6,3) for 3 sngs
and so on...
Anyways... It seems to me that these outcomes can each have a percentage attatched to them and can even be ordered by the amount of net profit or loss. These percentages could be added together to give a more precise measure of such statements as... Given my current results, after n tourneys, there is a x% chance that I will be at least breaking even.
In this way, an exact confidence level specific to sng outcome distributions can be acheived.
As I actually don't know much about statistics, I am unsure if this is what people mean when they say that normal standard distibution tables are incorrect. It would seem so.
So... My questions then are essentially these...
a) Is this reasoning valid? Am I on to something here?
b) Can the calculations I am suggesting be accomplished without the aid of a supercomputer. Obviously it is easy for small sng groupings, but it gets a lot more difficult once you need to calculate percentages for more than a few possible outcomes.
c) Is there more advanced mathematics available for dealing with these kinds of situations?
d) Does SD for sngs still hold a kind of truth despite never representing an actual potential outcome?...
... Actually, I know SD doesn't need to represent an actual potential outcome, much the same as my $/tourney average doesn't need to represent an actual outcome. What I am really asking is - Does SD need to be adjusted to take into account the fact that in SNGs, a loss can only be a certain size (which is usually smaller than SD) and wins can be much larger? Or is this SD still ultimately correct and usable for calculating confidence with a refined Distribution?
No doubt, much of this is unclear. If there are any questions, Ask and I will try to expand on what I have written.
Any thoughts would be appreciated.
Regards
Brad S
Recently I have been doing a bit of statistics studying in order to calculate confidence levels in sng results.
After calculating Standard deviation, I assume standard normal distribution and have come up with workable numbers which give some idea of what kind of swings to expect, etc...
I put this all into an excel spreadsheet for ease of use
http://www.aleomagus.freeservers.com/Spreadsheet
My 'confidence calculator' is the file I am refering to.
Anyways, I was happy with this for a while, but more and more I am thinking... just one problem... this is all wrong.
My calculations are correct, but it seems foolish to say that after 1 10+1 SNG, my SD is $19. It also seems foolish to say that after n tourneys, my standard deviation is SD*SQRT(n)
The reason I say this is that in any given sng, my expectation will only deviate in the given prize totals and this deviation is a direct result of my Finish percentages.
For example, after 1 sng, I can only be +39, +19, +9, or -11 and this result will occur in the frequency of my 1st, 2nd, 3rd, 4th-10th place finishes. This is to say that after 1 sng, I can have only 4 distinct outcomes.
Similarly after 2 sngs, I can only have 10 distinct outcomes
After 3 sngs, 20 outcomes
After 4 sngs, 35 outcomes
After 5 sngs, 56 outcomes
and eventually
After 1000 sngs, 167668501 outcomes
and so on
By outcomes, I mean the combination of all net $ finishing possibilities within the set of sngs. For example, if I play 4 sngs, a few possible outcomes are...
{+19,-11,-11,-11}
{+39,+9,+9,-11}
{-11,+39,+19,+19}
{-11,-11,-11,-11}
{+39,+39,+39,+39}
etc...
Keep in mind, I also am treating sets like
{-11,+9,-11,+9}
{+9,+9,-11,-11}
{-11,+9,+9,-11}
As only ONE outcome because the order doesn't matter to me here.
Don't ask me how I discovered for n tourneys, the number of outcomes. After a lot of work and discovery it turns out that all I needed to do was use:
C(4,1) for 1 sng
C(5,2) for 2 sngs
C(6,3) for 3 sngs
and so on...
Anyways... It seems to me that these outcomes can each have a percentage attatched to them and can even be ordered by the amount of net profit or loss. These percentages could be added together to give a more precise measure of such statements as... Given my current results, after n tourneys, there is a x% chance that I will be at least breaking even.
In this way, an exact confidence level specific to sng outcome distributions can be acheived.
As I actually don't know much about statistics, I am unsure if this is what people mean when they say that normal standard distibution tables are incorrect. It would seem so.
So... My questions then are essentially these...
a) Is this reasoning valid? Am I on to something here?
b) Can the calculations I am suggesting be accomplished without the aid of a supercomputer. Obviously it is easy for small sng groupings, but it gets a lot more difficult once you need to calculate percentages for more than a few possible outcomes.
c) Is there more advanced mathematics available for dealing with these kinds of situations?
d) Does SD for sngs still hold a kind of truth despite never representing an actual potential outcome?...
... Actually, I know SD doesn't need to represent an actual potential outcome, much the same as my $/tourney average doesn't need to represent an actual outcome. What I am really asking is - Does SD need to be adjusted to take into account the fact that in SNGs, a loss can only be a certain size (which is usually smaller than SD) and wins can be much larger? Or is this SD still ultimately correct and usable for calculating confidence with a refined Distribution?
No doubt, much of this is unclear. If there are any questions, Ask and I will try to expand on what I have written.
Any thoughts would be appreciated.
Regards
Brad S