Affine-7470548
llama_chess_o3_981samples_epoch10
ds-limo-ja-500
llama-3.1-8B-StructuredIE
Qwen3-8B-base-pt-5e5
Llama-3.1-8B-Instruct-SFT-CoT-short-full
GTM-legal-specialist-3.1-merged
CogniDet
llama-3.1-8b-ekk_latn
Llama-3.1-Non-filter-Lafeak91-8B-chatvector
Python-OCL-full-v0.2
Ice0.143-15.10-RP
llama-3.1-8b-kat_geor
DeepTron-R1Distil-7B
rc-tutor-llama3-merged
llama-3.1-8b-lit_latn
MiniAGI-selfimprove
finetune_DSA
gemma-7b-it_invthink
One-Shot-RLVR-Qwen2.5-Math-7B-1.2k-dsr-sub
ExGRPO-Qwen2.5-Math-7B-Zero
SFT-Mistral-7B-CPT-New
model
parti_1_full
qwen3_16bit_kr_2
parti_21_full
parti_28_full
hr_sdf_whitespace_extra_Llama-3.1-8B-Instruct_v1_merged
glm46-code-feedback-maxeps-131k
Qwen2.5-7B-Instruct_unsloth_w_new_merged
Meta-Llama-3.1-8B-Instruct_unsloth_w_new_merged
7b_min_perprompt_iter1_eta_1e3_step_332_final
QWEN7_GRPO
VerdictAI-8b-V2
meta-llama-Llama-3.1-8B-Instruct-cold_start-dolly_exclude_0114-42-202601142342
qwen3-8B-Base-orca_math-sparse-LoRA-step180-merged
StrikeGPT-R1-Zero-8B
Llama-3.1-8B-Lexi-Uncensored-V2
stackexchange_bioinformatics
Llama3-GSM8K-w2c74.5K-c175K-c2c40K-3ep
openthoughts3_10k_llama3
guesswho-scale-game