yale-nlp/MDCure-Qwen2-1.5B-Instruct
The yale-nlp/MDCure-Qwen2-1.5B-Instruct is a 1.5 billion parameter Qwen2-based instruction-tuned language model developed by Yale NLP. It is specifically fine-tuned using the MDCure-72k dataset to enhance its multi-document processing capabilities. This model excels at tasks requiring the synthesis of information from multiple source documents, offering improved performance over base models in multi-document and long-context benchmarks.
MDCure-Qwen2-1.5B-Instruct: Enhanced Multi-Document Processing
This model, developed by Yale NLP, is a 1.5-billion-parameter variant of the Qwen2-Instruct family, fine-tuned using the MDCure procedure. MDCure is a scalable method for generating high-quality multi-document (MD) instruction-tuning data, designed to significantly improve LLMs' ability to process and synthesize information spread across multiple documents.
Key Capabilities & Features
- Multi-Document Instruction Following: Optimized to handle instructions that require understanding and integrating information across several distinct documents.
- MDCure-72k Dataset: Fine-tuned on the extensive MDCure-72k dataset, which complements existing instruction collections like FLAN.
- Improved Performance: Demonstrates consistent performance improvements (up to 75.5%) over pre-trained baselines and corresponding base models on a wide range of MD and long-context benchmarks.
- Context Handling: Designed to process multiple source documents effectively; for optimal consistency with the training data format, separate input documents with `\n\n` or `<doc-sep>`.
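The document-separation convention above can be sketched in Python. This is a minimal, hedged example: the prompt-building helper and the sample documents are illustrative, and the commented-out inference snippet assumes the standard `transformers` chat-template API for Qwen2-style instruct models.

```python
# Minimal sketch: formatting multiple source documents for the model.
# The card recommends separating documents with "\n\n" or "<doc-sep>";
# we use the latter here. The helper and sample texts are illustrative.

DOC_SEP = "<doc-sep>"

def build_md_prompt(documents, instruction):
    """Join source documents with the recommended separator, then append
    the instruction so the model sees all contexts before the task."""
    context = DOC_SEP.join(doc.strip() for doc in documents)
    return f"{context}\n\n{instruction}"

docs = [
    "Report A: Revenue grew 12% year over year.",
    "Report B: Operating costs rose 8% over the same period.",
]
prompt = build_md_prompt(docs, "Summarize the combined financial picture.")

# To run inference (downloads the 1.5B checkpoint; sketch assuming the
# usual transformers chat-template workflow):
# from transformers import AutoModelForCausalLM, AutoTokenizer
# model_id = "yale-nlp/MDCure-Qwen2-1.5B-Instruct"
# tok = AutoTokenizer.from_pretrained(model_id)
# model = AutoModelForCausalLM.from_pretrained(model_id)
# inputs = tok.apply_chat_template(
#     [{"role": "user", "content": prompt}],
#     add_generation_prompt=True,
#     return_tensors="pt",
# )
# out = model.generate(inputs, max_new_tokens=256)
# print(tok.decode(out[0], skip_special_tokens=True))
```

Joining documents before appending the instruction keeps all source contexts contiguous, mirroring the multi-document layout the model saw during MDCure fine-tuning.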
When to Use This Model
- Complex Information Synthesis: Ideal for applications requiring the model to answer questions or follow instructions based on information scattered across several text passages.
- Long-Context Tasks: Suitable for scenarios where the input spans multiple documents, going beyond traditional single-document processing.
- Research & Development: A strong candidate for researchers and developers exploring advanced multi-document understanding and instruction-following in LLMs. The underlying MDCure methodology and datasets are also publicly available for further experimentation.