selfcorrexp2/llama3_sft_balanced_rr60k_train_on_corr_ep3

Hugging Face
TEXT GENERATIONConcurrency Cost:1Model Size:8BQuant:FP8Ctx Length:8kArchitecture:Transformer Warm

Loading preview...