Anti-cheating methodology
-
- Posts: 1915
- Joined: Tue Mar 17, 2015 2:44 pm
Anti-cheating methodology
There seems to be general acceptance that Professor Kenneth Regan's detection system has a much higher level of credibility, partly due to his academic credentials and partly because he is prepared to discuss his system publicly, than those systems operated by online platforms who - when challenged - refuse to discuss or disclose anything. I'm hesitant about questioning Professor Regan's methodology but there's one point which puzzles me.
Basically, we're talking probability theory. If one tosses a coin 10 times and it comes down heads 70% of the time, that's unremarkable. But, if one tosses a coin 100 times and it comes down heads 70% of the time, that's significant - meaning that it's almost certainly a biased coin.
As I understand it, essentially the same methodology is used to detect chess cheats. It's possible, by analysing past games, to predict the frequency with which a player rated x will play the 'best' move' as recommended by a strong computer program. Obviously this frequency is higher for high values of x than for low values. It's interesting to speculate how Chess.com and others manage to adjust for this variable without knowledge of a suspect's actual playing strength but enough has already been said about their detection mechanisms without my adding anything further here.
The calculation then proceeds along the lines that - if a player has a rating which leads to an expectation that he will play the 'best move' 50% of the time but he in fact plays the 'best move' 70% of the time - then, if that continues over a long enough period, he is almost certainly cheating.
So far, so good, as far as I can see and - if one applies this to someone who is a regular and consistent cheater - it should be highly reliable. Where I find myself in some difficulty is in assessing how this works in the case of [for example] a reasonably strong player who doesn't normally need to cheat to win games but does so only occasionally - let's say, in one game in five. If I understand Professor Regan correctly, he believes that - by aggregating together all the 20% suspect games and subjecting them to the methodology described, a pattern of cheating will emerge.
I don't doubt that's true. But it seems to me that it's true only because, instead of taking a random sample of the player's games, he has taken a sample of 'suspect' games which - pretty much by definition - is known to be biased and non-random. That's rather like calculating the average weight of a nation's population by taking a sample of males standing over 6 foot - no matter how large the sample, the result will be inaccurate. In the case of a chess player who has played maybe 2,000 games over a lifetime, if his 5 or 6 best games were regarded as 'suspect' and singled out for analysis in the manner described, it seems to me quite likely that the results would be regarded as highly suspicious. On that basis, it seems to me that a very large number of us would be identified as probable cheats.
I'm sure we have a few mathematics experts on this forum. It strikes me as rather alarming if I am right, not least because it would be a very basic sampling error, and frankly I'd be delighted to be proved wrong, so perhaps someone would care to point out the weakness in my reasoning.
Basically, we're talking probability theory. If one tosses a coin 10 times and it comes down heads 70% of the time, that's unremarkable. But, if one tosses a coin 100 times and it comes down heads 70% of the time, that's significant - meaning that it's almost certainly a biased coin.
As I understand it, essentially the same methodology is used to detect chess cheats. It's possible, by analysing past games, to predict the frequency with which a player rated x will play the 'best' move' as recommended by a strong computer program. Obviously this frequency is higher for high values of x than for low values. It's interesting to speculate how Chess.com and others manage to adjust for this variable without knowledge of a suspect's actual playing strength but enough has already been said about their detection mechanisms without my adding anything further here.
The calculation then proceeds along the lines that - if a player has a rating which leads to an expectation that he will play the 'best move' 50% of the time but he in fact plays the 'best move' 70% of the time - then, if that continues over a long enough period, he is almost certainly cheating.
So far, so good, as far as I can see and - if one applies this to someone who is a regular and consistent cheater - it should be highly reliable. Where I find myself in some difficulty is in assessing how this works in the case of [for example] a reasonably strong player who doesn't normally need to cheat to win games but does so only occasionally - let's say, in one game in five. If I understand Professor Regan correctly, he believes that - by aggregating together all the 20% suspect games and subjecting them to the methodology described, a pattern of cheating will emerge.
I don't doubt that's true. But it seems to me that it's true only because, instead of taking a random sample of the player's games, he has taken a sample of 'suspect' games which - pretty much by definition - is known to be biased and non-random. That's rather like calculating the average weight of a nation's population by taking a sample of males standing over 6 foot - no matter how large the sample, the result will be inaccurate. In the case of a chess player who has played maybe 2,000 games over a lifetime, if his 5 or 6 best games were regarded as 'suspect' and singled out for analysis in the manner described, it seems to me quite likely that the results would be regarded as highly suspicious. On that basis, it seems to me that a very large number of us would be identified as probable cheats.
I'm sure we have a few mathematics experts on this forum. It strikes me as rather alarming if I am right, not least because it would be a very basic sampling error, and frankly I'd be delighted to be proved wrong, so perhaps someone would care to point out the weakness in my reasoning.
-
- Posts: 8472
- Joined: Sat Jan 02, 2010 1:28 pm
Re: Anti-cheating methodology
Where has he said that?Roger Lancaster wrote: ↑Mon May 04, 2020 1:20 pmIf I understand Professor Regan correctly, he believes that - by aggregating together all the 20% suspect games and subjecting them to the methodology described, a pattern of cheating will emerge.
If you want a picture of the future, imagine a QR code stamped on a human face — forever.
-
- Posts: 2069
- Joined: Fri Apr 04, 2014 1:28 pm
- Location: Morecambe, Europe
Re: Anti-cheating methodology
This looks circular to me. If a player is cheating his rating goes up to match. The higher rating then predicts that he will make the best move more frequently - which he will be doing - that's why he got the higher rating.Roger Lancaster wrote: ↑Mon May 04, 2020 1:20 pmThe calculation then proceeds along the lines that - if a player has a rating which leads to an expectation that he will play the 'best move' 50% of the time but he in fact plays the 'best move' 70% of the time - then, if that continues over a long enough period, he is almost certainly cheating.
-
- Posts: 3604
- Joined: Fri May 16, 2008 11:54 am
Re: Anti-cheating methodology
Picking 5 games out of a sample of 30 because they look suspicious goes against any scientific approach. You would need a reason for selecting those particular 5 games, it my be that they are the most recent, the ones with prize money, the ones against higher rated players etc. etc. As soon as you start looking at a subset of games (or even moves within a game) you need to be extremely careful for exactly the reasons you have outlined.NickFaulks wrote: ↑Mon May 04, 2020 1:36 pmWhere has he said that?Roger Lancaster wrote: ↑Mon May 04, 2020 1:20 pmIf I understand Professor Regan correctly, he believes that - by aggregating together all the 20% suspect games and subjecting them to the methodology described, a pattern of cheating will emerge.
-
- Posts: 3604
- Joined: Fri May 16, 2008 11:54 am
Re: Anti-cheating methodology
Correct, it is a problem, but there are quite a few ways round it - comparing online rating to OTB rating for starters.Michael Farthing wrote: ↑Mon May 04, 2020 1:36 pmThis looks circular to me. If a player is cheating his rating goes up to match. The higher rating then predicts that he will make the best move more frequently - which he will be doing - that's why he got the higher rating.Roger Lancaster wrote: ↑Mon May 04, 2020 1:20 pmThe calculation then proceeds along the lines that - if a player has a rating which leads to an expectation that he will play the 'best move' 50% of the time but he in fact plays the 'best move' 70% of the time - then, if that continues over a long enough period, he is almost certainly cheating.
-
- Posts: 1915
- Joined: Tue Mar 17, 2015 2:44 pm
Re: Anti-cheating methodology
https://www.youtube.com/watch?v=loNQ__09_fENickFaulks wrote: ↑Mon May 04, 2020 1:36 pmWhere has he said that?Roger Lancaster wrote: ↑Mon May 04, 2020 1:20 pmIf I understand Professor Regan correctly, he believes that - by aggregating together all the 20% suspect games and subjecting them to the methodology described, a pattern of cheating will emerge.
The topic in question starts roughly 36 minutes into the interview, although I'd recommend listening to more than just that.
-
- Posts: 1915
- Joined: Tue Mar 17, 2015 2:44 pm
Re: Anti-cheating methodology
Also, the frequency-variation with which players make the 'best move' is less than one might suppose - according to Professor Regan, and I assume these figures ignore the opening moves, from around 40% for a weakish player to around 60% for a strong one.Matthew Turner wrote: ↑Mon May 04, 2020 2:02 pmCorrect, it is a problem, but there are quite a few ways round it - comparing online rating to OTB rating for starters.Michael Farthing wrote: ↑Mon May 04, 2020 1:36 pmThis looks circular to me. If a player is cheating his rating goes up to match. The higher rating then predicts that he will make the best move more frequently - which he will be doing - that's why he got the higher rating.Roger Lancaster wrote: ↑Mon May 04, 2020 1:20 pmThe calculation then proceeds along the lines that - if a player has a rating which leads to an expectation that he will play the 'best move' 50% of the time but he in fact plays the 'best move' 70% of the time - then, if that continues over a long enough period, he is almost certainly cheating.
-
- Posts: 3604
- Joined: Fri May 16, 2008 11:54 am
Re: Anti-cheating methodology
Roger,Roger Lancaster wrote: ↑Mon May 04, 2020 2:11 pmhttps://www.youtube.com/watch?v=loNQ__09_fENickFaulks wrote: ↑Mon May 04, 2020 1:36 pmWhere has he said that?Roger Lancaster wrote: ↑Mon May 04, 2020 1:20 pmIf I understand Professor Regan correctly, he believes that - by aggregating together all the 20% suspect games and subjecting them to the methodology described, a pattern of cheating will emerge.
The topic in question starts roughly 36 minutes into the interview, although I'd recommend listening to more than just that.
I think you have misunderstood, the 5 games he is looking at are ALL the games in a tournament, not picking 5 games out of say 9 or more.
-
- Posts: 3604
- Joined: Fri May 16, 2008 11:54 am
Re: Anti-cheating methodology
This is something of a double edged sword. Since the matching rate is considerably lower than one might expect, it is easier to catch cheaters. However, the clip talked about ignoring the first 8 moves, but it is perfectly possible that a player would know 10, 15 perhaps even 20 moves of 'computer' theory. Those extra moves can have a huge impact on the result.Roger Lancaster wrote: ↑Mon May 04, 2020 2:14 pmAlso, the frequency-variation with which players make the 'best move' is less than one might suppose - according to Professor Regan, and I assume these figures ignore the opening moves, from around 40% for a weakish player to around 60% for a strong one.
-
- Posts: 1915
- Joined: Tue Mar 17, 2015 2:44 pm
Re: Anti-cheating methodology
Point well made, apologies. But that's a case where someone is cheating consistently. My question however remains, how does the system tackle someone who cheats only, say, once or twice in 20% of his games but following no obvious pattern.Matthew Turner wrote: ↑Mon May 04, 2020 2:20 pmRoger,Roger Lancaster wrote: ↑Mon May 04, 2020 2:11 pmhttps://www.youtube.com/watch?v=loNQ__09_fE
The topic in question starts roughly 36 minutes into the interview, although I'd recommend listening to more than just that.
I think you have misunderstood, the 5 games he is looking at are ALL the games in a tournament, not picking 5 games out of say 9 or more.
-
- Posts: 21320
- Joined: Tue Apr 15, 2008 2:51 pm
Re: Anti-cheating methodology
In a position with no immediate forcing tactics, the "best move" is subjective anyway. What an engine using Alpha Zero methods might recommend is very possibly different from Stockfish, which in turn may differ from Rybka fifteen years ago or Chess Genius fifteen years before that.Roger Lancaster wrote: ↑Mon May 04, 2020 2:14 pmAlso, the frequency-variation with which players make the 'best move' is less than one might suppose - according to Professor Regan
-
- Posts: 21320
- Joined: Tue Apr 15, 2008 2:51 pm
Re: Anti-cheating methodology
Rausis was eventually caught in part because his rating had risen enough for his play, conduct and results to come under scrutiny.Roger Lancaster wrote: ↑Mon May 04, 2020 2:29 pmMy question however remains, how does the system tackle someone who cheats only, say, once or twice in 20% of his games but following no obvious pattern.
-
- Posts: 1915
- Joined: Tue Mar 17, 2015 2:44 pm
Re: Anti-cheating methodology
Yes, in a different thread I cited a game where I played 27 opening moves from memory and I'm sure that stronger players will have a multitude of similar experiences. But that's a different issue - I've a faint recollection [sorry, I can't immediately source this] of reading somewhere else that 8 wasn't a fixed figure and that the program recognised established opening lines.Matthew Turner wrote: ↑Mon May 04, 2020 2:26 pmThis is something of a double edged sword. Since the matching rate is considerably lower than one might expect, it is easier to catch cheaters. However, the clip talked about ignoring the first 8 moves, but it is perfectly possible that a player would know 10, 15 perhaps even 20 moves of 'computer' theory. Those extra moves can have a huge impact on the result.Roger Lancaster wrote: ↑Mon May 04, 2020 2:14 pmAlso, the frequency-variation with which players make the 'best move' is less than one might suppose - according to Professor Regan, and I assume these figures ignore the opening moves, from around 40% for a weakish player to around 60% for a strong one.
-
- Posts: 10382
- Joined: Tue Apr 17, 2007 10:12 am
- Location: Bolton, Greater Manchester
Re: Anti-cheating methodology
Having just finished Game Changer, Alpha Zero has a completely different understanding than Stockfish and other engines, so I wonder if Alpha Zero could avoid the detection methodsRoger de Coverly wrote: ↑Mon May 04, 2020 2:30 pmIn a position with no immediate forcing tactics, the "best move" is subjective anyway. What an engine using Alpha Zero methods might recommend is very possibly different from Stockfish, which in turn may differ from Rybka fifteen years ago or Chess Genius fifteen years before that.Roger Lancaster wrote: ↑Mon May 04, 2020 2:14 pmAlso, the frequency-variation with which players make the 'best move' is less than one might suppose - according to Professor Regan
Any postings on here represent my personal views
-
- Posts: 21320
- Joined: Tue Apr 15, 2008 2:51 pm
Re: Anti-cheating methodology
Justin's experience may suggest otherwise for chess.com, but generally there's a recognition that trundling down, for example, Spanish theory particularly the Marshall to move 18 or beyond isn't evidence of external assistance.Roger Lancaster wrote: ↑Mon May 04, 2020 2:38 pmI've a faint recollection [sorry, I can't immediately source this] of reading somewhere else that 8 wasn't a fixed figure and that the program recognised established opening lines.