toolcalling-merged-demo
toolcalling-merged-demo-v2
code-grpo-checkpoint-100
code-grpo-checkpoint-200
FAME_GD_llama32-1b-instruct-qa
parser_model_ner_4.2
main16
model_sft_dare
qwen2.5-tool-finetuned
Inelly4
P9-split4_only_answer_Qwen3-4B-Base_0402-01-5e-6
Qwen3-0.6B-PT-SynthDolly-1A-E5
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-slimy_shrewd_whale
EduRaccoon
neural-chameleon-gemma_2_9b-layer_12
QwenStock1-14B
GraphWalker-7B
rank1-llama3-8b
OsmosisProofling-SFT-NT-GRPO-NT
Qwen3-4B-TL-SynthDolly-1A-E5
Qwen3-4B-ES-SynthDolly-1A-E8
Qwen2.5-Coder-14B-Instruct-Abliterated
Qwen-2-Refueled
Calcium-Opus-14B-Elite-Stock
ArxivLlama
Qwen2.5-1.5B-Instruct-Gensyn-Swarm-flightless_skittish_wildebeest
a4eae747
lorel.ai_cherrypicked
qwen2_5_math_1_5b_Instruct-NSFW-U-V3.1
Qwen3-4B-GRPO-math-reasoning
medgpt_model2
Qwen3-4B-pira-IRM-QA-ep3-qairm
sqlenv-qwen3-1.7b-grpono-no-thinking
Qwen3-4B-TL-SynthDolly-1A-E3
Miner-4B
Miner-8B