prm800k_qwen_fulltune
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-fanged_barky_skunk
Dolphin-Mistral-24B-Venice-fp16
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-lazy_enormous_bobcat
SimNPO-TOFU-forget10-Llama-2-7b-chat
ww16
K70
Qwen2.5-Coder-0.5B-Instruct-Gensyn-Swarm-small_agile_giraffe
grpo-q_base-dr-step20
IntelliAsk-Qwen3-32B-450-Merged
Llama3B-KVLink5
TFRank-GRPO-Qwen3-4B
Struct-SQL
MistralSmallV3R
SearchR1-nq_hotpotqa_train-qwen2.5-3b-it-em-grpo-v0.3
mia-target-model
qiu-v8-llama3.1-8b-fullseq-merged
pcm-coldcall-qwen25-1.5b
arogya-ai-full
Mistral-3-7B_phrase
qiu-v8-qwen3-8b-v4-epoch05-merged
minor2
llama-3-8b-base-margin-dpo-ultrafeedback-8xh200
gaussdb-sql-expert-7b
P2-split2_prob_rg_v2_Qwen3-4B-Base
Qwen2.5-0.5B-GRPO-KL-math-reasoning
Qwen2.5-0.5B-ReMax-math-reasoning
corrected-semi-wtype-Llama-tuned-Lora-merged-gpt5
gemma-3-1b-medical-finetuned
swe-7b-backdoor-base
llama2_7b_chat-WaRP-gsm8k-FT-lr3e-5_ssft_5e-5
Qwen2.5-Coder-LEAK-MCEVALHARD-1.5B-Base-5
qwen3_8b_lora_query_planner
VerwaltungsAnthologie_talky_7B
Qwen2.5-Coder-LEAK-MCEVALHARD-1.5B-Base-4
Qwen2.5-Coder-CONTROL-MCEVALHARD-1.5B-Base-8
Qwen2.5-Coder-CONTROL-MCEVALHARD-1.5B-Base-9
training_Qwen2.5_0.5B_merged
Llama-3-8B-Cumulus-v0.1
Frostwind-v2.1-m7
chemeng_qwen-math-7b_24_1_100_1_nonmath
RAGent_gen