:minor necro:
I'll try to trick your intuition into believing the right answer. Consider the case of an incoming contender with no previous data: all you have is that wide initial distribution for the player's skill, mu = 25 with a big sigma^2.
Do you still want to ignore the information that she got into round X when making predictions about her getting into round X + 1?
At least for this case, you agree that taking the intermediate tournament results into account will help your prediction, right?
blueblimp is entirely correct. I believe what's happening here, rrenaud, is that your intuition is saying "surely updating on the fact that she made it to round X moves our belief about her skill up, because there's far more probability mass in (she got to round X with high skill) than in (she got to round X with low skill)". That is entirely accurate in a world where we have uncertainty about her skill.

blueblimp's model is different. The initial mu=25, large sigma^2 distribution is a measure of our uncertainty about her skill. We have two basic options:
1. Calculate the odds of her winning round 1 while incorporating that uncertainty, then update our uncertainty to one thing in the branch of the problem where we suppose she won and to another thing in the branch where we suppose she lost. Keep going until we have the probabilities of all n! outcomes, then combine those with the same winners.
2. Instead of propagating and updating the uncertainty, resolve it artificially right now. Sample a single point randomly from that distribution and suppose that is her actual skill. No uncertainty left. Now do the math as if we had no skill uncertainty, find the distribution of winners, record it, repeat the whole thing a bajillion times, and combine the resulting distributions to get the win probabilities.
(between these options, it's possible to narrow down player skills by sampling and then propagate the uncertainty, but I think that's not useful if our uncertainties start out nice and normal)
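A minimal sketch of option 2, assuming a hypothetical four-player single-elimination bracket. Only New's and blueblimp's (mu, sigma) priors come from this thread; "C" and "D" are made-up players for illustration, and the win model is the 4^(X/25)/(4^(X/25)+1) rule used below:

```python
import random

# Skill priors as (mu, sigma). New and blueblimp are from this thread;
# C and D are invented so the bracket has four players.
players = {
    "New": (25.0, 8.33),
    "blueblimp": (42.4, 2.3),
    "C": (30.0, 5.0),
    "D": (35.0, 4.0),
}

def win_prob(skill_a, skill_b):
    # A player X points stronger wins with probability 4^(X/25)/(4^(X/25)+1).
    return 1.0 / (1.0 + 4.0 ** ((skill_b - skill_a) / 25.0))

def simulate_bracket(bracket, skills):
    # Single elimination: play adjacent pairs until one player remains.
    # Assumes the field size is a power of two.
    while len(bracket) > 1:
        bracket = [
            a if random.random() < win_prob(skills[a], skills[b]) else b
            for a, b in zip(bracket[::2], bracket[1::2])
        ]
    return bracket[0]

def tournament_win_probs(n_sims):
    wins = {name: 0 for name in players}
    names = list(players)
    for _ in range(n_sims):
        # Resolve the uncertainty artificially: one skill sample per player,
        # then simulate as if skills were known exactly.
        skills = {n: random.gauss(mu, sigma) for n, (mu, sigma) in players.items()}
        wins[simulate_bracket(names, skills)] += 1
    return {n: w / n_sims for n, w in wins.items()}

random.seed(0)
probs = tournament_win_probs(20_000)
print(probs)
```

With enough simulations the relative frequencies converge to the championship probabilities under the prior.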
Which of these two is better? I think 2. converges to 1. in the limit and is much easier to code... Let's see what 1. would look like. I'll make the false (but convenient) assumption that TrueSkill makes about uncertainty, which is that we can keep the distributions normal the whole way.
One round: New (25, 8.33^2) versus blueblimp (42.4, 2.3^2)
The difference X between blueblimp's skill and New's follows N(42.4-25, 2.3^2+8.33^2), and a player with X greater skill wins with probability 4^(X/25)/(4^(X/25)+1). Plugging into Wolfram Alpha (1), that's a 71.5% chance blueblimp wins, and their new skill distributions according to (2) are
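The integral in (1) is easy to reproduce numerically; here's a sketch that applies Simpson's rule to the skill-difference distribution:

```python
import math

# Skill priors: New ~ N(25, 8.33^2), blueblimp ~ N(42.4, 2.3^2).
# The difference (blueblimp - New) ~ N(mu_d, var_d).
mu_d = 42.4 - 25.0
var_d = 2.3 ** 2 + 8.33 ** 2

def win_prob_given_diff(x):
    # A player x points stronger wins with probability 4^(x/25)/(4^(x/25)+1).
    return 1.0 / (1.0 + 4.0 ** (-x / 25.0))

def normal_pdf(x, mu, var):
    return math.exp(-(x - mu) ** 2 / (2 * var)) / math.sqrt(2 * math.pi * var)

# Simpson's rule over +/- 8 standard deviations of the difference.
lo = mu_d - 8 * math.sqrt(var_d)
hi = mu_d + 8 * math.sqrt(var_d)
n = 2000  # must be even for Simpson's rule
h = (hi - lo) / n
total = 0.0
for i in range(n + 1):
    x = lo + i * h
    w = 1 if i in (0, n) else (4 if i % 2 else 2)
    total += w * normal_pdf(x, mu_d, var_d) * win_prob_given_diff(x)
p_bb_wins = total * h / 3

print(round(p_bb_wins, 3))
```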
blueblimp wins: New (24.258, 7.802^2) and blueblimp (42.457, 2.291^2).
New wins: New (38.998, 5.594^2) and blueblimp (41.332, 2.253^2).
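For comparison, the branch updates can also be computed without the keep-it-normal assumption by putting both priors on a grid and doing the Bayes update directly. This sketch uses only the 4^(X/25) win model stated above, so its numbers needn't match the TrueSkill calculator's exactly:

```python
import math

def grid(mu, sigma, n=400):
    # Discretize N(mu, sigma^2) over +/- 5 sigma into (points, weights).
    xs = [mu - 5 * sigma + i * (10 * sigma / (n - 1)) for i in range(n)]
    ws = [math.exp(-(x - mu) ** 2 / (2 * sigma ** 2)) for x in xs]
    z = sum(ws)
    return xs, [w / z for w in ws]

def win_prob(diff):
    # A player `diff` points stronger wins with prob 4^(diff/25)/(4^(diff/25)+1).
    return 1.0 / (1.0 + 4.0 ** (-diff / 25.0))

xs_new, p_new = grid(25.0, 8.33)   # New's prior
xs_bb, p_bb = grid(42.4, 2.3)      # blueblimp's prior

# P(blueblimp wins) and the joint posterior on that branch.
p_win = 0.0
post_new = [0.0] * len(xs_new)
post_bb = [0.0] * len(xs_bb)
for i, (a, pa) in enumerate(zip(xs_bb, p_bb)):
    for j, (b, pb) in enumerate(zip(xs_new, p_new)):
        like = pa * pb * win_prob(a - b)   # prior * P(blueblimp wins | skills)
        p_win += like
        post_bb[i] += like
        post_new[j] += like

post_new = [p / p_win for p in post_new]
post_bb = [p / p_win for p in post_bb]

def mean_sd(xs, ps):
    m = sum(x * p for x, p in zip(xs, ps))
    v = sum((x - m) ** 2 * p for x, p in zip(xs, ps))
    return m, math.sqrt(v)

m_new, s_new = mean_sd(xs_new, post_new)
m_bb, s_bb = mean_sd(xs_bb, post_bb)
print(round(p_win, 3), (m_new, s_new), (m_bb, s_bb))
```

On the blueblimp-wins branch, New's posterior mean drops below 25 and blueblimp's rises above 42.4, which is the qualitative shape of the updates above.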
Now just do that lots of times and you can get the full table! Any language with a numerical integration library will make this easier.
(1)
http://www.wolframalpha.com/input/?i=int%281%2Fsqrt%282*pi%29*1%2F%282.3%5E2%2B8.33%5E2%29%5E.5*e%5E-%28%28X-%2842.4-25%29%29%5E2%2F%282*%282.3%5E2%2B8.33%5E2%29%29%29*4%5E%28X%2F25%29%2F%284%5E%28X%2F25%29%2B1%29%2CX%3D-inf..inf%29
(2)
http://atom.research.microsoft.com/trueskill/rankcalculator.aspx