r2egym-31600-opt100k__Qwen3-8B
fixed-model
qwen25-32b-nemotron-finetuned
Qwen3-32B-ZH-SynthDolly-1A
OsmosisProofling-v3-SFT
a1-toolscale
leo-intent-v1
orbit-4b-ablation-training-mix-124-v0.1
EnvScaler-Qwen3-1.7B
toolcalling-merged-demo
toolcalling-merged-demo-v2
code-grpo-checkpoint-100
code-grpo-checkpoint-200
FAME_GD_llama32-1b-instruct-qa
parser_model_ner_4.2
main16
model_sft_dare
qwen2.5-tool-finetuned
Inelly4
P9-split4_only_answer_Qwen3-4B-Base_0402-01-5e-6
Qwen3-0.6B-PT-SynthDolly-1A-E5
EduRaccoon
neural-chameleon-gemma_2_9b-layer_12
GraphWalker-7B
ai_question
OsmosisProofling-SFT-NT-GRPO-NT
Qwen3-4B-TL-SynthDolly-1A-E5
Qwen3-4B-ES-SynthDolly-1A-E8
Qwen2.5-Coder-14B-Instruct-Abliterated
a4eae747
lorel.ai_cherrypicked
qwen2_5_math_1_5b_Instruct-NSFW-U-V3.1
Qwen3-4B-GRPO-math-reasoning
medgpt_model2
Qwen3-4B-pira-IRM-QA-ep3-qairm
sqlenv-qwen3-1.7b-grpono-no-thinking