EIDRaS Ratings

Use this forum to discuss Diplomacy strategy.
Forum rules
This forum is limited to topics relating to the game Diplomacy only. Other posts or topics will be relocated to the correct forum category or deleted. Please be respectful and follow our normal site rules at http://www.webdiplomacy.net/rules.php.
Message
Author
Yonni
Silver Donator
Silver Donator
Posts: 408
Joined: Thu Oct 19, 2017 6:55 pm
Karma: 368
Contact:

Re: EIDRaS Ratings

#41 Post by Yonni » Sun Jan 21, 2018 12:43 pm

First of all, Elo is a name - not an acronym so you it doesn't need to be all caps.

Like others have said, GR is quite simple to understand. At the start of the game, everybody 'bids' a fraction of their rating and that makes up the pot. So, the pot is bigger if you play against better people but your bid is always the same regardless.

@Octavius, regarding new players:

The system mitigates these issues in two ways:

1) Games with more provisional players are weighted less. Right now, that weighting is 2*p/7 where p is the number of ppl in the game who have played more than 7 games.

2) player ratings are more volatile for new players. The weighting for a game is inversely proportional to the number of games you've played. This allows players to rise (or drop) to their 'true' rating quicker.

Yonni
Silver Donator
Silver Donator
Posts: 408
Joined: Thu Oct 19, 2017 6:55 pm
Karma: 368
Contact:

Re: EIDRaS Ratings

#42 Post by Yonni » Sun Jan 21, 2018 12:46 pm

RoganJosh wrote:
Sun Jan 21, 2018 11:52 am
Octavious wrote:
Sun Jan 21, 2018 11:13 am
The other player gets a significant boost to their webDip ranking under both systems, but the difference is how it impacts you.
Both of these are zero-sum systems, so he can't gain more than what you loose.
Not to bog things down but they aren't exactly zero sum. Expected scores are based on an 'exponential mean' rather than the arithmetic mean so the rating changes don't always add to 0.

Mercy
Posts: 257
Joined: Thu Oct 19, 2017 4:03 pm
Karma: 220
Contact:

Re: EIDRaS Ratings

#43 Post by Mercy » Sun Jan 21, 2018 1:29 pm

A_Tin_Can wrote:
Sun Jan 21, 2018 11:22 am
in Ghost Rating, losses against expert opponents and beginners are not rated equally.
Yes, they are. This is because GR doesn't have a constant k-factor - the ratings of all players in this game is an input into the k-factor, which cancels out the earlier weightings. Have a play with some numbers in GR, you'll find it behaves very differently to typical Elo.
Really? Thanks for letting me know.

RoganJosh
Silver Donator
Silver Donator
Posts: 556
Joined: Sun Dec 31, 2017 1:02 am
Location: Stockholm
Karma: 464
Contact:

Re: EIDRaS Ratings

#44 Post by RoganJosh » Sun Jan 21, 2018 1:32 pm

Yonni wrote:
Sun Jan 21, 2018 12:46 pm
RoganJosh wrote:
Sun Jan 21, 2018 11:52 am

Both of these are zero-sum systems, so he can't gain more than what you loose.
Not to bog things down but they aren't exactly zero sum. Expected scores are based on an 'exponential mean' rather than the arithmetic mean so the rating changes don't always add to 0.
My bad, I missed that part.

So, the system has a built in inflation due to the exponential function I guess?

Yonni
Silver Donator
Silver Donator
Posts: 408
Joined: Thu Oct 19, 2017 6:55 pm
Karma: 368
Contact:

Re: EIDRaS Ratings

#45 Post by Yonni » Sun Jan 21, 2018 1:46 pm

@RJ

Yeah, but I suppose the average rating could shrink as well in each game. At any rate, I imagine the rating inflation is dwarfed by the inflation caused by low-rated players leaving the site. We could curve everybody's rating to keep the site wide average at 1000 but I'm not sure if that's necessary.

RoganJosh
Silver Donator
Silver Donator
Posts: 556
Joined: Sun Dec 31, 2017 1:02 am
Location: Stockholm
Karma: 464
Contact:

Re: EIDRaS Ratings

#46 Post by RoganJosh » Sun Jan 21, 2018 2:12 pm

@Yonni

Since the exponential function is convex the exponential mean is greater than the arithmetic mean, unless all players in the game have the same rating.

But, yes, there are probably bigger problems in terms of inflation.

Nikola Maric Eto
Posts: 137
Joined: Sun Dec 31, 2017 12:32 am
Location: Belgrade, Serbia
Karma: 94
Contact:

Re: EIDRaS Ratings

#47 Post by Nikola Maric Eto » Sun Jan 21, 2018 3:49 pm

I dont understand how those ratings work, but I definitely like what you did there. Cheers, Yonni. :-)

Durga
Posts: 9486
Joined: Fri Sep 29, 2017 6:01 pm
Location: Canada
Karma: 5120
Contact:

Re: EIDRaS Ratings

#48 Post by Durga » Sun Jan 21, 2018 5:50 pm

Someone should work on a rating system that only factors in your last years worth of games

User avatar
ghug
Bronze Donator
Bronze Donator
Posts: 18257
Joined: Mon Mar 20, 2017 3:51 pm
Location: Seattle
Karma: 11564
Contact:

Re: EIDRaS Ratings

#49 Post by ghug » Sun Jan 21, 2018 6:13 pm

Durga wrote:
Sun Jan 21, 2018 5:50 pm
Someone should work on a rating system that only factors in your last years worth of games
I can do this with GR. Maybe I will later.


Yonni, can I suggest that, at least at first, you make the weightings for all games identical to those for GR? It'll make it really easy to compare the two.

CCR
Posts: 57
Joined: Sun Dec 31, 2017 4:51 pm
Karma: 8
Contact:

Re: EIDRaS Ratings

#50 Post by CCR » Sun Jan 21, 2018 6:34 pm

But as elo-based system, gr is not meant for short term evaluations - it is a long term rating which aims to your real rating ("in the limit").

Some months ago, a bug due to crashed games forced the vdip ranking to be limited to the games of the last couple of years only, and it was a real mess!

User avatar
ghug
Bronze Donator
Bronze Donator
Posts: 18257
Joined: Mon Mar 20, 2017 3:51 pm
Location: Seattle
Karma: 11564
Contact:

Re: EIDRaS Ratings

#51 Post by ghug » Sun Jan 21, 2018 7:09 pm

Yes, GR over the past year will mostly just demonstrate who played the most. Year over year GR improvement might be a more sane metric, but even then, all increases aren't created equal.

Durga
Posts: 9486
Joined: Fri Sep 29, 2017 6:01 pm
Location: Canada
Karma: 5120
Contact:

Re: EIDRaS Ratings

#52 Post by Durga » Sun Jan 21, 2018 7:24 pm

I thought about it more. Last 20ish FP games is a better metric

Durga
Posts: 9486
Joined: Fri Sep 29, 2017 6:01 pm
Location: Canada
Karma: 5120
Contact:

Re: EIDRaS Ratings

#53 Post by Durga » Sun Jan 21, 2018 7:37 pm

Ok nvm I've been told it isn't possible

Yonni
Silver Donator
Silver Donator
Posts: 408
Joined: Thu Oct 19, 2017 6:55 pm
Karma: 368
Contact:

Re: EIDRaS Ratings

#54 Post by Yonni » Sun Jan 21, 2018 7:42 pm

That's a great idea ghug - do you know weighting we use for GR?

A_Tin_Can
Lifetime Site Contributor
Posts: 283
Joined: Fri Sep 29, 2017 9:18 pm
Karma: 451
Contact:

Re: EIDRaS Ratings

#55 Post by A_Tin_Can » Sun Jan 21, 2018 9:52 pm

A ratings system that only factors in your last year of games is going to be very unstable.

A_Tin_Can
Lifetime Site Contributor
Posts: 283
Joined: Fri Sep 29, 2017 9:18 pm
Karma: 451
Contact:

Re: EIDRaS Ratings

#56 Post by A_Tin_Can » Sun Jan 21, 2018 9:53 pm

Thanks for taking the lead on this, Yonni! Very much appreciated :)

A_Tin_Can
Lifetime Site Contributor
Posts: 283
Joined: Fri Sep 29, 2017 9:18 pm
Karma: 451
Contact:

Re: EIDRaS Ratings

#57 Post by A_Tin_Can » Sun Jan 21, 2018 9:54 pm

It's really better to call GR "Elo inspired" rather than "Elo based". Many of the features that make Elo effective are not present (for better or worse) in GR.

User avatar
ghug
Bronze Donator
Bronze Donator
Posts: 18257
Joined: Mon Mar 20, 2017 3:51 pm
Location: Seattle
Karma: 11564
Contact:

Re: EIDRaS Ratings

#58 Post by ghug » Mon Jan 22, 2018 1:18 am

All of the weightings are in Variantfile.csv in the git repo I posted. Second to last column, I think they're the in verses of the actual numbers you want (gunboat is .25 rather than 4).

Yonni
Silver Donator
Silver Donator
Posts: 408
Joined: Thu Oct 19, 2017 6:55 pm
Karma: 368
Contact:

Re: EIDRaS Ratings

#59 Post by Yonni » Mon Jan 22, 2018 4:16 am

Thanks, and what's the weighting for the 1v1 maps and for known world?

User avatar
ghug
Bronze Donator
Bronze Donator
Posts: 18257
Joined: Mon Mar 20, 2017 3:51 pm
Location: Seattle
Karma: 11564
Contact:

Re: EIDRaS Ratings

#60 Post by ghug » Mon Jan 22, 2018 4:22 am

1v1 is weighted 0. It's really a separate game, they're all unranked, and it's two-part, so I just implemented Elo for it.

Post Reply

Who is online

Users browsing this forum: No registered users and 31 guests