‘Inconsistent’ AI detection ‘should prompt assessment rethink’

Study finds detectors struggle to accurately identify amount of AI content when papers have been partially human written

Published on

June 14, 2026

Last updated

June 14, 2026

Source: iStock/Fabrique Imagique

The minor use of large language models (LLMs) by students in their work may be overstated by artificial intelligence (AI) detection tools, according to a paper.

At the same time, the research suggests, the tools may be undercounting a heavier reliance on programs such as ChatGPT.

For the study, published in Education and Information Technologies, researcher Lucky E. Atamhenwan fed 81 sample essays into Turnitin. The scripts ranged from those that were 100 per cent LLM-generated – either by ChatGPT, Copilot or Gemini – to those written solely by people.

Turnitin did not flag any of the essays that were 100 per cent human written as being generated by AI.

And in every instance in which the detector flagged AI-generated words, it was indeed due to the presence of LLM-generated work in those samples.

But the software struggled with the scripts that were partially AI-written, consistently failing to identify the correct percentage of LLM-generated work included.

For essays with a low percentage of LLM-generated words – between 15 per cent and 40 per cent – Turnitin’s AI score, which declares how much of a submission it considers to have been produced by the technology rather than by a human, was often higher than the actual amount.

But for scripts that had a high percentage of LLM-generated words – between 70 per cent and 100 per cent – the score was consistently lower.

Atamhenwan, the founder of AI company Genducate Learning and an academic at Central Queensland University, said the results should prompt universities to design assessments that do not require the use of detectors.

“In most student cohorts, the majority are ethical learners who avoid academic misconduct. Consequently, these findings suggest that students who use generative AI transparently and in line with institutional policies have nothing to fear,” he told Times Higher Education.

“Most institutional guidelines specify that an AI detector score alone does not prove misconduct. The findings confirm that relying solely on these scores would be erroneous. Instead, an AI score, especially above 60 per cent, should be treated as one key indicator alongside institutional generative AI usage and academic misconduct policies, and student transparency to evaluate potential academic misconduct.”

Sam Illingworth, a professor researching AI literacy at Edinburgh Napier University, said that the study raised serious questions about the use of AI detection tools.

Describing the use of AI by students whose first language is not English as a legitimate application of the technology, he noted that this could end up being flagged unjustly by some detection tools. Those who need AI’s help to “structure slightly” their essay could similarly fall foul of the tools.

“Why are we policing our students?” Illingworth said. “That’s not why I became an educator. Students should be co-curators of knowledge with us; we should be operating from a position of trust.”

In a statement, Josh Johnston, vice-president of AI at Turnitin, said that detecting AI-generated writing “should serve as a kick-off to a conversation” between teachers and their students.

“We developed the tool to minimise unfounded accusations, which is why we do not report AI writing less than 20 per cent, and we test to keep false positives under 1 per cent. Core to our design principle is the trade-off of missing some AI-written text in order to build a better student experience.

“That said, no detection tool is perfect. The study’s results show that Turnitin’s AI writing scores move in the right direction: papers with more AI writing receive higher AI scores. At the same time, given the study is looking at a small set of artificially mixed human- and AI-written documents, score similarities or differences could play out differently in real student submissions.”

georgia.luckhurst@timeshighereducation.com

Register to continue

Why register?

Registration is free and only takes a moment
Once registered, you can read 3 articles a month
Sign up for our newsletter

Or subscribe for unlimited access to:

Unlimited access to news, views, insights & reviews
Digital editions
Digital access to THE’s university and college rankings analysis

Please or to read this article.

Turnitin says one in 10 university essays are partly AI-written

Tool developed by edtech giant used by customers 65 million times in three months since launch

By Tom Williams

25 July

Turnitin announces AI detector with ‘97 per cent accuracy’

Edtech giant prepares to offer customers new tool from April as it grapples with challenges posed by ChatGPT

By Tom Williams

14 February

Skills qualifications ‘suffering from lack of public trust’

Offering microcredentials can help universities better prepare students for work, claims edtech firm leader, but degrees still ‘primary currency’

By Georgia Luckhurst

4 June

‘No defence’ against wearable AI in exams, researchers warn

AI glasses and other smart apparel may be impossible to keep out of exams, adding to universities’ woes about the future of assessments

By John Ross

30 April

Reader's comments (2)

#1 Submitted by d.j.... on June 14, 2026 - 11:46am

hmm why is "AI" use by students whose first language is not English any more legitimate than use by those for who it is? Students whose first language is not English are usually admitted on the basis of having passed a suitable threshold on something such as IELTS, so they should have sufficient ability to work- unless the bar has been set too low. Detection of AI use might then be a useful trigger to obtain support. Historically one reason for international students to study in an anglophone country has been to improve their competence in English, if you permit or encourage these students to use "AI" you are doing them a serious disservice that will cause pain further down the line that comes back to bite you when they realise they did not get what they paid for.

#2 Submitted by m.a.... on June 29, 2026 - 10:20pm

To speak the unspeakable: we need to dump the goal of certificated success. Everyone (and certainly employers of any experience) knows that paper qualifications are. as worthless as the pdf they’re written on, and are gradually coming to terms with the fact that they have to change their recruitment methods. In the same way, since our assessments are aimed at an increasingly meaningless set of qualifications, we need to rethink too. Qualifications were always a cipher for human qualities, and now AI has happily replaced the cipher. We now - very clearly - need to find ways to focus on the human behind the cipher. We also need students to understand that the goal of learning is not to be found in a certificate, but in their brains. Both are tricky, but not impossible.

‘Inconsistent’ AI detection ‘should prompt assessment rethink’

Study finds detectors struggle to accurately identify amount of AI content when papers have been partially human written

Register to continue

Subscribe

Related articles

Turnitin says one in 10 university essays are partly AI-written

Turnitin announces AI detector with ‘97 per cent accuracy’

Skills qualifications ‘suffering from lack of public trust’

‘No defence’ against wearable AI in exams, researchers warn

Reader's comments (2)

Sponsored

Featured jobs

‘Inconsistent’ AI detection ‘should prompt assessment rethink’

Study finds detectors struggle to accurately identify amount of AI content when papers have been partially human written

Register to continue

Subscribe

Related articles

Turnitin says one in 10 university essays are partly AI-written

Turnitin announces AI detector with ‘97 per cent accuracy’

Skills qualifications ‘suffering from lack of public trust’

‘No defence’ against wearable AI in exams, researchers warn

Reader's comments (2)

You might also like

Fall in international student numbers ‘cost UK £2.9 billion’

Cuts to creative courses ‘undermining government’s own strategy’

News Talks podcast: What could Burnham as PM mean for UK HE?

Global cooperation needed to tackle ‘disease’ of fake degrees

Sponsored

Featured jobs