ds-limo-te-50
ds-limo-th-50
Meta-Llama-3.1-Instruct-8B_merged-16bit_CPO_MSMARCO
s1.1-limo-multilingual-4
llama3.2-3b-dpo-finegrained
qwen3-14b-triton-v1
Qwen7B-Math-L28
JEE_14B
openthoughts3_science
ds-limo-th-100
ds-limo-th-250
Qwen2.5-7B-Instruct_qwq_mix_r1_science
EmpathyAI_llama3.1-8b_v2_16bit
sadai-mrec-qwen2.5-3B-v0.0.1
Qwen3-1.7B-Base_Joint.01.00_2e-5
llama3-8b-full-pretrain-junk-tweet-1m-en-sft
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-tough_tall_pheasant
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-endangered_burrowing_sealion
Qwen2.5-1.5B-Open-R1-Distill
pumlGenV2
GL-Marvin-32k-32B
Dorado-WebSurf_Tool-ext
Gliese-4B-OSS-0410
Drummond-1b1-Instruct
Cardano_plutus
q2.5_7b_aime_per_chunk_act_untrained_4500
Llama-3.2-8B-Instruct-bnb-4bit_merged_16bit_finetune_2025-03-07
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-extinct_chattering_dragonfly
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-polished_pawing_bee
qwen3-0.6B-svg-sft
SFT_Advanced_Risk_Situation_Aware_llama
Plutus_Tutor_model
gpt-oss-120B-stack-overflow-32ep-131k-summtrc-fixthink1
qwen3_0-6B_adversarial_2
Qwen3-1.7B-grpo-1765505298
Qwen2.5-0.5B-Finetuned
qwen3_0-6B_adversarial_final
kimi-k2t-freelancer-32ep-32k
qwen-2.5-3b-r1-countdown
Agri_train_3E_3S
Llama-3.2-3B-Instruct-AMPO-V1
llama_3_gsm8k_cot_simplest