SBR Forum - Free Picks & Sports Handicapping LegendZ The PIG WSEX
BetPhoenix BetJamaica Matchbook BetOnline
Carib 5Dimes The Greek Intertops
SBR Recommended Sportsbooks
1. Pinnacle Sports ... SBR Rating A+ ... Pinnacle Sports Review
2. The Greek Sports Book ... SBR Rating A+ ... The Greek Review
3. BookMaker ... SBR Rating A+ ... BookMaker Review
4. BetJamaica ... SBR Rating A+ ... BetJamaica Review
5. LegendZ Sports ... SBR Rating A+ ... LegendZ Review
Posters' Top Rated Sportsbooks
1. Matchbook ... 195 total points ... Matchbook Review
2. BetJamaica ... 182 total points ... BetJamaica Review
3. The Greek Sports Book ... 160 total points ... The Greek Review
4. Pinnacle Sports ... 130 total points ... Pinnacle Sports Review
5. 5Dimes ... 125 total points ... 5Dimes Review
Go Back   Sports Handicapping - Sports Betting - Sports Picks - SBR Forum > Sports Betting, Sportsbooks & General Discussion > Handicapper Think Tank

Reply
 
Thread Tools Display Modes
Old 06-07-2008, 07:58 PM   #1 (permalink)
Ganchrow
Moderator
 
Ganchrow's Avatar
 
Join Date: 08-28-05
Location: Forest Hills, NY, Home of the Blitzkrieg Bop
Posts: 4,746
Ganchrow is offline
Default Calculation of edge and variance on ternary outcome win/lose/push events

In reference to this thread a couple of people have asked how to properly calculate the variance on ternary outcomes events, specifically those that may end only in a win, lose, or push (so this wouldn't apply to Asian handicap).

Without further ado:
Let pw = win probability
Let pl = loss probability
Let pd = 1 - pw - pl = draw probability
Let f = fractional odds
Let x = bet size
Let E = % edge

E = pw * f - pl
σ2/x2 = (1 - pd + E) * (f - E) - pd * E


You'll note that for a push probability of zero (i.e., for pd = 0) this reduces to the familiar formulation of variance:

σ2/x2 {no push} = (1 + E) * (f - E)
Another question, of particular relevance when back testing a strategy, would be how to calculate the standard deviation in expectation of an event observed and bet over n occurrences.

This is actually a less general question than the one answered above (insofar as it constrains payout odds and bet size to be identical for each wager), but because some are most accustomed to the n*p*(1-p) formulation of binomial variance I'll provide the trinomial win/loss/push analogue here. (I'll leave it an exercise to the reader to demonstrate that the formulation in post# 1 is just a generalization of the formulation following.)
Let N = # of trials
Let pw = observed win frequency over N trials
Let pl = observed loss frequency over N trials
Let pd = 1 - pw - pl = observed draw frequency over N trials

The variances and covariances for # of wins, # of losses, and # of pushes are then given by:

Var[# of wins] = N * pw * (1-pw)
Var[# of losses] = N * pl * (1-pl)
Var[# of pushes] = N * pd * (1-pd)
Covar[# of wins, # of losses] = -N * pw * pl
Covar[# of wins, # of pushes] = -N * pw * pd
Vovar[# of losses, # of pushes] = -N * pl * pd

If we further define:

Let f = fractional odds
Let x = bet size

Then the total variance in win amount would be given by:

σ2/x2 = f2 * Var[# of wins] + Var[# of losses] - 2 * f * Covar[# of wins, # of losses]
σ2/x2 = N * ( f2*pw*(1-pw) + pl*(1-pl) + 2*f*pw*pl )
__________________
Reply With Quote
Old 06-08-2008, 04:37 AM   #2 (permalink)
Ganchrow
Moderator
 
Ganchrow's Avatar
 
Join Date: 08-28-05
Location: Forest Hills, NY, Home of the Blitzkrieg Bop
Posts: 4,746
Ganchrow is offline
Default

So here's an example of how to practically use the second formulation above to test for significance.

Observed:

260 observations (N = 260)
all bets at -110 (f = 10/11)
wins = 143 (pw = 55%)
losses = 104 (pl = 40%)
pushes = 13 (pd = 5%)

Assuming results were properly obtained out-of-sample, what would be our 95% confidence lower bound for true strategy ROI (using the Central Limit Theorem)?

Calculating ROI and ROI variance we find:

absolute ROI = 143 * 10/11 - 104 = +26 units (+10%)
σ2 = 260 * (100/121*55%*45% + 40%*60% + 2*10/11*55%*40%) ≈ 219.58 units2

or σ = 14.8183 units.

So taking into account the ternary outcome set, our (frequentist) 95% confidence lower bound on ROI would be 26/260 - 1.6261*14.8183/260 ≈ +0.6254%.

Let's compare this to the results we'd find using conditional probability with a binary outcome set.

Let p* = win frequency conditioned on not pushing = 140/(260-13) ≈ 57.895%

Which would give us a conditional ROI and variance of conditional ROI of:

conditional ROI = +26 units (that's 26 / (260-13) = +10.526%)
σ2 = (260-13)*(100/121 + 1 + 2*10/11) * 57.895%*(1-57.895%) ≈ 14.8137 units2

So taking into account the binary outcome set, our (frequentist) 95% confidence lower bound on conditional returns would be 26/(260-13) - 1.6261*14.8137/(260-13) ≈ +0.6614%.

Removing the conditioning would yield a lower bound of +0.6614% * (260-13)/260 ≈ 0.6283% on absolute returns. Pretty close to what we had found above (and the error is certainly less than that introduced by using the Central Limit Theorem in the first place).

But here's real issue:

Taking into account the ternary outcome set, how would one go about testing the null hypothesis that the strategy is in reality no better than breakeven?

Anyone?

This is straightforward if we condition the results and use binomial variance, but not quite so otherwise. It's exactly this difficulty that leads most practitioners to simply condition out pushes rather than deal with the vagaries of the trinomial distribution.
__________________
Reply With Quote
Old 06-10-2008, 11:55 PM   #3 (permalink)
square1
SBR Rookie
 
square1's Avatar
 
Join Date: 01-11-08
Posts: 32
square1 is online now
Default

Quote:
Originally Posted by Ganchrow View Post
But here's real issue:

Taking into account the ternary outcome set, how would one go about testing the null hypothesis that the strategy is in reality no better than breakeven?

Anyone?

This is straightforward if we condition the results and use binomial variance, but not quite so otherwise. It's exactly this difficulty that leads most practitioners to simply condition out pushes rather than deal with the vagaries of the trinomial distribution.
Well, there are an infinite number of breakeven distributions: choose p(d) on (0,1), and then set p(l)/p(w) = f. I don't believe the problem as stated is well-defined without a prior on the bivariate. So to test, you'd need an additional structural assumption of some kind. If I were doing it I think I'd fix p(d) at its MLE, which is 13/260 in your example, and go from there. Rigorously justifiable? Of course not, but I'd have minor faith in the result.
Reply With Quote
Old 06-11-2008, 12:35 AM   #4 (permalink)
Ganchrow
Moderator
 
Ganchrow's Avatar
 
Join Date: 08-28-05
Location: Forest Hills, NY, Home of the Blitzkrieg Bop
Posts: 4,746
Ganchrow is offline
Default

Quote:
Originally Posted by square1 View Post
Well, there are an infinite number of breakeven distributions: choose p(d) on (0,1), and then set p(l)/p(w) = f. I don't believe the problem as stated is well-defined without a prior on the bivariate. So to test, you'd need an additional structural assumption of some kind. If I were doing it I think I'd fix p(d) at its MLE, which is 13/260 in your example, and go from there. Rigorously justifiable? Of course not, but I'd have minor faith in the result.
Why restrict yourself by fixing the pd parameter? Why not treat it as a random variable with a uniform prior distribution (say)?

But yeah, you got the idea.

It's a pain in the ass.
__________________
Reply With Quote
Old 06-11-2008, 04:07 AM   #5 (permalink)
Ganchrow
Moderator
 
Ganchrow's Avatar
 
Join Date: 08-28-05
Location: Forest Hills, NY, Home of the Blitzkrieg Bop
Posts: 4,746
Ganchrow is offline
Default

This spreadsheet illustrates what I had in mind. It implicitly assumes a uniform prior distribution of pd and uses the t-distribution to impute the final p-value (in both the binomial and trinomial cases).

Assuming I didn't make an error, the trinomial p-value of the null-hypothesis (that population EV ≤ 0%) actually turns out to be slightly lower than the binomial p-value (4.168% vs. 4.199%), and coming up with the latter is a whole lot easier.
Attached Files
File Type: xls trinomial hypothesis test demo.xls (871.5 KB, 12 views)
__________________
Reply With Quote
Old 06-11-2008, 06:36 AM   #6 (permalink)
square1
SBR Rookie
 
square1's Avatar
 
Join Date: 01-11-08
Posts: 32
square1 is online now
Default

Quote:
Originally Posted by Ganchrow View Post
Why restrict yourself by fixing the pd parameter? Why not treat it as a random variable with a uniform prior distribution (say)?

But yeah, you got the idea.

It's a pain in the ass.
There's nothing wrong with the uniform, and it's probably not going to have a huge impact on the result.

I like the MLE because the essence of hypo-testing is to ask, "How objectionable does our data find these probabilities?". Since we are presumably a lot more interested in the win/loss ratio than the draw probability, I chose the draw probability the data finds least "objectionable", which is the essence of the MLE. So if we reject, we know the data has no issues with our pd parameter, and thus is telling us the null-hypo win/loss ratio is the part of the null that is causing our result to be improbable under the null.
Reply With Quote
Old 06-11-2008, 07:22 AM   #7 (permalink)
Ganchrow
Moderator
 
Ganchrow's Avatar
 
Join Date: 08-28-05
Location: Forest Hills, NY, Home of the Blitzkrieg Bop
Posts: 4,746
Ganchrow is offline
Default

Quote:
Originally Posted by square1 View Post
I like the MLE because the essence of hypo-testing is to ask, "How objectionable does our data find these probabilities?". Since we are presumably a lot more interested in the win/loss ratio than the draw probability, I chose the draw probability the data finds least "objectionable", which is the essence of the MLE. So if we reject, we know the data has no issues with our pd parameter, and thus is telling us the null-hypo win/loss ratio is the part of the null that is causing our result to be improbable under the null.
Well I'd say we're less concerned with the win/loss ratio per se, and more concerned simply with the model return. Win/loss ratio is just so ... binomial.

Regardless, I think we can both readily agree that it's not going to change the results much at all one way or the other and that the best easiest solution is probably clearly to go with binomial modeling coupled with conditional-form model parameters.
__________________
Reply With Quote
Reply


Thread Tools
Display Modes


Similar Threads
Thread Thread Starter Forum Replies Last Post
Win Big or Lose your Bankroll 20Four7 Baseball Handicapping 65 07-03-2008 05:49 AM
Os cant lose, Tigers cant win mofome Players Talk 1 04-08-2008 05:42 PM
Win or Lose, does anyone..... Dhuepaxuz Players Talk 14 07-21-2007 05:57 AM
Did you win or lose your last bet at Pinnacle? BuddyBear Players Talk 10 01-20-2007 01:14 AM
Books you win/lose at Max Levine Players Talk 4 12-30-2005 12:42 AM


All times are GMT -5. The time now is 05:00 AM.



Search Engine Friendly URLs by vBSEO 3.0.0 RC6

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30