Name: pymlex/Qwen2.5-0.5B-Human API
Brand: Featherless.ai
Price: 10.00 USD
Availability: InStock
Author: pymlex

Overview

pymlex/Qwen2.5-0.5B-Human is a 0.5 billion parameter language model, fine-tuned using Direct Preference Optimization (DPO) from the Qwen/Qwen2.5-0.5B-Instruct base model. Its core purpose is to paraphrase Spanish academic abstracts in a way that reduces their detectability by the danibor/oculus-v2.0-multilingual AI text detector. The model was trained on preference pairs derived from pymlex/ai-generated-texts and the Flaglab/academic-knowledge-abstracts-es corpus.

Key Capabilities

AI-Detection Score Reduction: Specifically optimized to produce text with lower AI-detection probabilities when evaluated by the Oculus detector.
Spanish Abstract Paraphrasing: Excels at rephrasing academic abstracts written in Spanish.
DPO Fine-tuning: Utilizes DPO with a beta of 0.1 to increase the log-probability of completions that yield lower Oculus scores.

Performance Highlights

During DPO, the mean validation AI probability on a 276-text subset decreased significantly from 0.6740 to 0.2437. Post-training evaluation showed the fine-tuned model achieved a mean Oculus probability of 0.2391 on the test set, compared to 0.6532 for the base model. This indicates a substantial reduction in perceived AI-generated likelihood.

Good For

Researchers or academics working with Spanish abstracts who need to generate paraphrases that are less likely to be identified as AI-generated by the Oculus detector.
Experiments and research into methods for reducing AI text detection.

Limitations

Specific Detector Target: Optimization is specifically against the Oculus detector; performance against other AI detectors is not guaranteed.
Domain Specificity: Primarily focused on Spanish academic abstracts; performance on other text types or languages may vary.

Overview

Overview

Key Capabilities

Performance Highlights

Good For

Limitations

Full Model Card (README)