IAAR-Shanghai/xVerify-0.5B-I

Warm
Public
0.5B
BF16
131072
License: cc-by-nc-nd-4.0
Hugging Face
Overview

xVerify-0.5B-I: An Efficient Answer Verifier

xVerify-0.5B-I, developed by IAAR-Shanghai, is a 0.5 billion parameter model specifically fine-tuned as an evaluation tool for objective questions. Its core function is to accurately extract final answers from lengthy reasoning processes and efficiently determine equivalence across different forms of expressions. This model is particularly useful for automated evaluation of reasoning models.

Key Capabilities

  • Broad Applicability: Suitable for evaluating various objective question types, including math problems, multiple-choice questions, classification tasks, and short-answer questions.
  • Handles Long Reasoning Chains: Processes answers with extensive reasoning steps to extract the final answer, regardless of complexity.
  • Multilingual Support: Primarily handles Chinese and English responses, with compatibility for other languages.
  • Powerful Equivalence Judgment: Recognizes basic transformations (e.g., letter case, Greek letters), identifies equivalent mathematical expressions (e.g., LaTeX, fractions, scientific notation), determines semantic equivalence in natural language, and matches multiple-choice responses by content.

Good For

  • Automated evaluation of large language models on objective reasoning tasks.
  • Extracting precise answers from complex, multi-step reasoning outputs.
  • Ensuring consistency and accuracy in grading or verification processes where diverse answer formats are expected.