Name: Shimin/qwen3_vl_8b_foreagent API
Brand: Featherless.ai
Price: 10.00 USD
Availability: InStock
Author: Shimin

Overview

ForeAgent (Forensics Agent) is a specialized 8 billion parameter vision-language model, fine-tuned from Qwen3-VL-8B by Shimin, for AI-generated image detection. It determines whether an image is authentic or AI-generated by performing a multi-view forensic analysis. The model processes both the original image and its frequency-domain representation (wavelet cD) for enhanced accuracy.

Key Capabilities

High Accuracy: Achieves 82.18% accuracy on the Chameleon benchmark, outperforming AIDE by 16.41%.
Multi-View Analysis: Integrates semantic features (texture, anatomy, consistency, artifacts), frequency-domain features (wavelet cD), and spatial-domain features (noise pattern residuals).
Structured Output: Provides a JSON output including a conclusion ("real" or "fake"), a confidence score (0.0-1.0), and a brief reasoning.
Iterative Self-Refinement: Trained using a Hindsight-Driven Self-Refining (EFA) pipeline involving iterative sampling, reflection, and evolution to improve reasoning quality and detection capabilities.
Dual-Input Mode: Supports optional dual-image input (original + wavelet frequency domain) for best performance.

Good For

AI-generated image detection and forensic analysis.
Deepfake detection in content moderation workflows.
Research into multimodal reasoning for image authenticity verification.
Integration into agentic forensic systems requiring detailed image analysis.

Overview

Overview

Key Capabilities

Good For

Full Model Card (README)