competetive ELO ratings
wey
Cineastè Join Date: 2003-06-01 Member: 16910Members, NS1 Playtester, Contributor, Constellation
I had a little fun with python and glued together a quick script to calculate relative rankings between current clans using the ELO system. I mainly parsed results from the Exertus and archaea page, so there's a lot of matches missing. Still, current rankings are:
<!--c1--><div class='codetop'>CODE</div><div class='codemain'><!--ec1--># team ELO
1 archaea 1140.6
2 Exertus 1056.2
3 SuperPaxBros 1047.4
4 Duplex 991.9
5 Team #156 986.5
6 HBZ 981.1
7 Skulkrush 980.2
8 Mr. 980.1
9 DarkSide 979.6
10 All-in 974.1
11 Mix 964.2
12 RwD 960.1
13 420 958.1<!--c2--></div><!--ec2-->
Depending on if there's interest, this could be improved a lot by using more match data (all those twitch TV weekend matches etc), experiment with different starting values for starting ELO and K-constant, and potentially even using the advanced TrueSkill algorithm.
It can also be used to predict matches, e.g. for tonights archaea vs SPB: archaeas has an expected score of 63%, which would approximately translate to a 3-1. But there are only 2 SPB matches in my dataset at the moment.
<!--c1--><div class='codetop'>CODE</div><div class='codemain'><!--ec1--># team ELO
1 archaea 1140.6
2 Exertus 1056.2
3 SuperPaxBros 1047.4
4 Duplex 991.9
5 Team #156 986.5
6 HBZ 981.1
7 Skulkrush 980.2
8 Mr. 980.1
9 DarkSide 979.6
10 All-in 974.1
11 Mix 964.2
12 RwD 960.1
13 420 958.1<!--c2--></div><!--ec2-->
Depending on if there's interest, this could be improved a lot by using more match data (all those twitch TV weekend matches etc), experiment with different starting values for starting ELO and K-constant, and potentially even using the advanced TrueSkill algorithm.
It can also be used to predict matches, e.g. for tonights archaea vs SPB: archaeas has an expected score of 63%, which would approximately translate to a 3-1. But there are only 2 SPB matches in my dataset at the moment.
Comments
If team A is objectively better than team B and they play a balanced build, team A will have a higher score. If they play a more imbalanced build where one side almost always wins, they will have an equal score. This might cause problems, especially of one team just happens to play more in a balanced build and another one in more imbalanced builds.
I would be selective in which games you input though. A lot of games have ringers, not the "A-Team" (grab random 6 team mates, not the best 6), etc. Wouldn't want the rankings to start affecting teams scrimming! (i.e. Team Z cancels a scrim because Person A isnt available, and they dont want to lose and lower their ranking)
Jokes by side, I would love to have a ELO-Ranking!
I've removed the matches vs ringer teams and all older matches before april of this year, and added the matches from this weekend. The current sources are: <a href="http://pastebin.com/p3cBQgDJ" target="_blank">http://pastebin.com/p3cBQgDJ</a>
But if this is something that should go on, there should be some more automated way to optain match results. Or someone more connected in the competitive community could help out.
Updated rankings as of today:
<!--c1--><div class='codetop'>CODE</div><div class='codemain'><!--ec1--> 1 Exertus 1079.9
2 SuperPaxBros 1076.3
3 archaea 1070.0
4 Duplex 1015.4
5 Skulkrush 980.2
6 Mr. 977.4
7 DarkSide 976.8
8 420 958.0
9 RwD 957.9
10 HBZ 957.9
11 All-in 950.1<!--c2--></div><!--ec2-->
Publishing practice match results is pretty tacky (cough exertus cough duplex cough archaea), but I guess it's fair enough until actual tournaments start happening.
Yeah they're jsut practice matches after all. Teams are testing new players, new lineups and new strategies. It's quite far fetched to draw any rankings with this data. But then again you as duplex do this too and I don't see anything wrong with it.
First number underneath the date is our score, the second number is the enemy score.
Note that pub.eu is Archaea's old name.
Interesting idea you got here.
PCW
Archaea
13/05/2012
1
1
PCW
exertus
13/05/2012
4
0
PCW
All-In
12/05/2012
0
4
PCW
exertus
11/05/2012
1
3
PCW
HBZ
10/05/2012
2
0
PCW
Archaea
09/05/2012
2
4
PCW
exertus
09/05/2012
3
1
PCW
exertus
07/05/2012
0
5
PCW
exertus
01/05/2012
1
4
PCW
exertus
28/04/2012
3
1
PCW
Archaea
27/04/2012
0
2
PCW
exertus
26/04/2012
1
1
PCW
420
07/04/2012
3
1
PCW
Archaea
06/04/2012
1
3
PCW
exertus
04/04/2012
4
0
PCW
D|S
27/02/2012
2
0
PCW
Archaea
23/02/2012
4
0
PCW
OHNOS
14/02/2012
5
1
PCW
OHNOS
06/02/2012
3
1
PCW
Archaea
17/01/2012
1
3
PCW
Pub.eu
21/11/2011
2
2
PCW
HBZ
20/11/2011
2
1
PCW
Pub.eu
16/11/2011
2
2
PCW
pub.eu
15/11/2011
2
0
PCW
pub.eu
08/11/2011
2
2
PCW
Inversion
05/11/2011
0
4
PCW
pub.eu
01/11/2011
1
1
PCW
pub.eu
12/10/2011
2
0
And srsly, if there is interest, other people need to collect scores. The results on team pages are all horribly inconsistent and only show those of the big teams.
Scrims should note be used for ELO (or any other kind of) rankings. They are practice games where Team A can play against team B, often trying new players/strats.
Matches are for-serious games. Tournaments are usually a series of matches. Having an ELO ranking for these games would be cool. Teams need to agree that a game is a match for it to actually be a match.
As far as I've understand, the weekend casted games have not been matches. I've always thought of them as scrims or show-matches, not for-serious-matches.
Scrims are for practicing.
You're never going to get accurate results from practice matches, there isn't even any point in trying. Part of it is that teams do all sorts of silly things during practice and part of it is that there's no authority on what was the actual result of a match -- it's up to whomever decides to publish the score on their website (like exertus recently posting a 2-1 win against duplex, that was actually a 2-2).
Fixed point still valid!