Name: layai/syn-arxiv-context API
Brand: Featherless.ai
Price: 10.00 USD
Availability: InStock
Author: layai

Model Overview

The layai/syn-arxiv-context model is a specialized language model fine-tuned from the Meta-Llama-3-8B architecture. Developed by layai, its primary focus is on understanding and processing content from arXiv abstracts. This fine-tuning process has adapted the base Llama 3 model to excel in tasks related to scientific and academic text.

Key Capabilities

arXiv Abstract Processing: Optimized for contextual understanding within scientific paper abstracts.
Llama 3 Base: Benefits from the robust architecture and general language understanding of the Meta-Llama-3-8B model.
Performance: Achieved a validation accuracy of 0.6784 and a loss of 2.4346 on its specific evaluation dataset, demonstrating its effectiveness for its intended domain.

Training Details

The model was trained with a learning rate of 5e-05, using a total batch size of 160 (train_batch_size 40 with gradient_accumulation_steps 4) over 3 epochs. The optimizer used was Adam with standard betas and epsilon, and a cosine learning rate scheduler. Training results show a consistent improvement in accuracy and reduction in loss over the epochs.

Good For

Applications requiring specialized comprehension of scientific abstracts.
Research tools that need to extract information or categorize arXiv papers.
Developing systems that interact with academic literature.

Overview

Model Overview

Key Capabilities

Training Details

Good For

Full Model Card (README)