This HTML5 document contains 30 embedded RDF statements represented using HTML+Microdata notation.

The embedded RDF content will be recognized by any processor of HTML5 Microdata.

Namespace Prefixes

PrefixIRI
dctermshttp://purl.org/dc/terms/
n2https://kar.kent.ac.uk/id/eprint/
n19https://kar.kent.ac.uk/93628/
wdrshttp://www.w3.org/2007/05/powder-s#
dchttp://purl.org/dc/elements/1.1/
n6http://purl.org/ontology/bibo/status/
rdfshttp://www.w3.org/2000/01/rdf-schema#
n12https://kar.kent.ac.uk/id/subject/
n16https://kar.kent.ac.uk/id/eprint/93628#
n4https://demo.openlinksw.com/about/id/entity/https/raw.githubusercontent.com/annajordanous/CO644Files/main/
n10http://eprints.org/ontology/
n14https://kar.kent.ac.uk/id/event/
bibohttp://purl.org/ontology/bibo/
n15https://kar.kent.ac.uk/id/org/
rdfhttp://www.w3.org/1999/02/22-rdf-syntax-ns#
n11https://kar.kent.ac.uk/id/document/
n13https://kar.kent.ac.uk/id/
xsdhhttp://www.w3.org/2001/XMLSchema#
n7https://demo.openlinksw.com/about/id/entity/https/www.cs.kent.ac.uk/people/staff/akj22/materials/CO644/
n9https://kar.kent.ac.uk/id/person/

Statements

Subject Item
n2:93628
rdf:type
bibo:Article n10:ConferenceItemEPrint n10:EPrint bibo:AcademicArticle
rdfs:seeAlso
n19:
n10:hasAccepted
n11:3264681
n10:hasDocument
n11:3264686 n11:3264681 n11:3264682 n11:3264683 n11:3264684 n11:3264685
dc:hasVersion
n11:3264681
dcterms:title
Are You Robert or RoBERTa? Deceiving Online Authorship Attribution Models Using Neural Text Generators
wdrs:describedby
n4:export_kar_RDFN3.n3 n7:export_kar_RDFN3.n3
dcterms:date
2022
dcterms:creator
n9:ext-j.r.c.nurse@kent.ac.uk n9:ext-ksj5@kent.ac.uk n9:ext-s.j.li@kent.ac.uk
bibo:status
n6:forthcoming n6:peerReviewed
dcterms:publisher
n15:ext-2982d8c3f4b994a774e9990d11f20cd1
bibo:abstract
Recently, there has been a rise in the development of powerful pre-trained natural language models, including GPT-2, Grover, and XLM. These models have shown state-of-the-art capabilities towards a variety of different NLP tasks, including question answering, content summarisation, and text generation. Alongside this, there have been many studies focused on online authorship attribution (AA). That is, the use of models to identify the authors of online texts. Given the power of natural language models in generating convincing texts, this paper examines the degree to which these language models can generate texts capable of deceiving online AA models. Experimenting with both blog and Twitter data, we utilise GPT-2 language models to generate texts using the existing posts of online users. We then examine whether these AI-based text generators are capable of mimicking authorial style to such a degree that they can deceive typical AA models. From this, we find that current AI-based text generators are able to successfully mimic authorship, showing capabilities towards this on both datasets. Our findings, in turn, highlight the current capacity of powerful natural language models to generate original online posts capable of mimicking authorial style sufficiently to deceive popular AA methods; a key finding given the proposed role of AA in real world applications such as spam-detection and forensic investigation.
dcterms:isPartOf
n13:repository
dcterms:subject
n12:QA76.87 n12:T n12:QA76
bibo:authorList
n16:authors
bibo:presentedAt
n14:ext-4cbf3dec07ce37d69a4dcc313f581166