Daewon0808/prm800k_llama_fulltune
TEXT GENERATIONConcurrency Cost:1Model Size:8BQuant:FP8Ctx Length:32kTool Calling:SupportedPublished:Dec 27, 2024License:llama3.1Architecture:Transformer Warm
Daewon0808/prm800k_llama_fulltune is a fine-tuned version of Meta Llama 3.1 8B Instruct, developed by Daewon0808. This model is specifically optimized for tasks related to 'Prm' metrics, achieving a Prm accuracy of 0.8491 and a Prm F1 score of 0.9059 on its evaluation set. It is suitable for applications requiring high performance in classification or prediction tasks based on these 'Prm' metrics.
Loading preview...
Model Overview
This model, prm800k_llama_fulltune, is a specialized fine-tuned variant of the Meta Llama 3.1-8B-Instruct architecture. Developed by Daewon0808, it has undergone a single epoch of fine-tuning with a learning rate of 1.25e-06 and a total training batch size of 128.
Key Capabilities
- Optimized for 'Prm' Metrics: The model demonstrates strong performance on its evaluation set, achieving:
- Prm accuracy: 0.8491
- Prm precision: 0.8851
- Prm recall: 0.9277
- Prm F1 score: 0.9059
- Prm F1 AUC (fixed): 0.8876
- Base Model: Leverages the robust capabilities of the Llama 3.1-8B-Instruct model, suggesting strong general language understanding and generation abilities prior to fine-tuning.
Good For
- Use cases requiring high performance in tasks where 'Prm' metrics (accuracy, precision, recall, F1 score) are critical evaluation criteria. The specific nature of 'Prm' is not detailed in the README, but the strong scores indicate proficiency in a particular classification or prediction domain. Developers should investigate the 'Prm' context to determine suitability for their specific application.