logoalt Hacker News

phhyesterday at 6:55 PM5 repliesview on HN

80% is catastrophic though. In a classroom of 30 all honest pupils, 6 will get a 0 mark because the software says its AI?


Replies

kelseyfrogyesterday at 7:08 PM

80% accuracy could mean 0 false negatives and 20% false positives.

My point is that accuracy is a terrible metric here and sensitivity, specificity tell us much more relevant information to the task at hand. In that formulation, a specificity < 1 is going to have false positives and it isn't fair to those students to have to prove their innocence.

show 1 reply
CaptainNegativeyesterday at 7:02 PM

It depends on their test dataset. If the test set was written 80% by AI and 20% by humans, a tool that labels every essay as AI-written would have a reported accuracy of 80%. That's why other metrics such as specificity and sensitivity (among many others) are commonly reported as well.

Just speaking in general here -- I don't know what specific phrasing TurnItIn uses.

yoavmyesterday at 6:59 PM

The promise (not saying that it works) is probably that 20% of people who cheated will not get caught. Not that 20% of the work marked as AI is actually written by humans.

v9vyesterday at 6:58 PM

I suppose 80% means you don't give them a 0 mark because the software says it's AI, you only do so if you have other evidence reinforcing the possibility.

show 1 reply
j45yesterday at 7:05 PM

I think it means every time AI is used, it will detect it 80% of the time. Not that 20% of the class will marked as using AI.

show 1 reply