A Multi-Strategy Approach for AI-Generated Text Detection
Abstract
Three systems, including a fine-tuned RoBERTa classifier, a TF-IDF + SVM classifier, and an ensemble model using Llama-3.2 models and a custom Transformer encoder, were developed for detecting AI-generated content in news articles and academic abstracts, with the RoBERTa-based system performing the best.
This paper presents presents three distinct systems developed for the M-DAIGT shared task on detecting AI generated content in news articles and academic abstracts. The systems includes: (1) A fine-tuned RoBERTa-base classifier, (2) A classical TF-IDF + Support Vector Machine (SVM) classifier , and (3) An Innovative ensemble model named Candace, leveraging probabilistic features extracted from multiple Llama-3.2 models processed by a customTransformer encoder.The RoBERTa-based system emerged as the most performant, achieving near-perfect results on both development and test sets.
Models citing this paper 0
No model linking this paper
Datasets citing this paper 0
No dataset linking this paper
Spaces citing this paper 0
No Space linking this paper
Collections including this paper 0
No Collection including this paper