I was thinking about the leaderboard as it stands now and how it compares to the leaderboard before the 11/24 change (if it amuses you the way it does me, I encourage you to think of the pre-leaderboard change as "Coke", the 11/24 leaderboard as "New Coke", and the current one as "Coke Classic").
Pros: current leaderboard does not have Karumah or Paralyzed at the top. tat is level 35 instead of 45. I think this more closely resembles "correctness", however we each choose to define that.
Cons: theory and shark_bait are level 36, with only 700 and 500 games played, respectively. I think most of us agree that they are both excellent players and that the current system underrates them (though obviously not nearly as badly as New Coke did).
I tried this thought experiment. Imagine that you had a player who, for whatever reason, just HATED Mining Village. So he plays all of his games with "!mining village", he only plays level 35+ players, and he wins these games 100% of the time. I submit that we would all agree that this player:
- is really, really, really good.
- should be the favorite in a tournament, even one that allowed Mining Village.
We might differ on whether he "deserves" to be on the leaderboard based on philosophical differences of what we perceive the leaderboard's purpose to be, but I think it's clear our fictional player is really good and is probably the best player in the world.
Now extend the example. Let's say our fictional player still only plays 35+ players, wins 100% of the time, but bans 10 cards instead of 1. How good would we say this player is? I think it's fairly uncontroversial to say that this player is excellent a high percentage of the time (the probability that a random kingdom does not contain any of his 10 undesirable cards), and unknown the rest.
So, this got me thinking: what if the ranking system took all games into account, but weighted each game based on the probability that the specified restrictions would be fulfilled in a random game? That is, if I specify all 10 cards, that game is worth basically nothing; if I ban just one card, that game is worth only slightly less than a game with no restrictions. This seems like it would fix theory, shark_bait, and possibly others' ratings, and has the nice side benefit of putting Paralyzed somewhere near the beginners.
Thoughts? (Assume Doug has the data and the necessary processing power.)