qwen3-1.7b-amr-augmented-20260214-1147
f15cd6b1
darwin_iter3_try3_solver_step10
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-carnivorous_pensive_salmon
query-context-pruner-multilingual-Qwen3-4B
HarnessLLM_SFT_Qwen3_4B
Qwen_prime
408e1a3f
dpo-qwen-cot-merged
Qwen2.5-Sex
O06-temporal-wronganswer-lora-qwen3-4b
distillation-2
qwen3-1.7b-sft-rag-v2
rta7
SearchR1-nq_hotpotqa_train-qwen2.5-3b-it-em-grpo-v0.3
llama-converted-back
chess-qwen
chessy-v1
bothlabels-final
Gemma-3-1b-it
SFT_Z_model
Qwen2.5-1.5b-leetcode-math-linear
pengenalan-emosi
chess-qwen-lora-v1
qwen-mediador-completo
qwen3-1.7b-stage2-v1
tofu_Llama-3.2-1B-Instruct_forget10_BLURNPO
CreeperQwen
model_sft_lora
P9-split1_prob_Qwen3-4B-Base_0319-01
OpenRS-GRPO
OpenRS-GRPO-S
general_reward-Qwen3-0.6B-baseline_cot_only-seed_0
general_reward-Qwen3-0.6B-baseline_all_tokens-seed_2
general_reward-Qwen3-0.6B-baseline_cot_only-seed_1
Gemma3B-Hukuk-r64-a128-BF16-H100-v2.0
Qwen2.5-0.5B-Instruct-sft
mera-qwen3-4b-sft
gemma-3-1b-it-SuperGPQA-Classifier
A2-Model-Harmful-LoRA
Qwen3-4B-CoderForge-SFT-weighted
xori-1-14b