MNLP_M2_mcqa_model
Qwen3_4B_BPMN_IT
qwen3-4b-slot-conf-agent-merged-v1
tft-benchmark-s3-tft-Qwen3-1.7B
tft-benchmark-s4-tft-Qwen3-1.7B
tft-benchmark-s5-direct-Qwen3-1.7B
qwen3_sft_data34_v3_2epoch_2w
qwen3-4b-it-2507-sft-2018-2022-rl-step-20
qwen3-0.6b-pandora-tools-no-embedd
hazardworld_per_chunk_act_q3_tokfix_diffPrompt_higherLR_tformerPin_2500
QwenRolina3-1.7B-base-LR1e5-b32g2gc8-AR-order-batch
CodeRM-GRPO-4B-bs96-nrp-step110-merged
seta-env-final-filtered-560-epoch2
qwen3-0.6b-pandora-tools
qwen3-4b-instruct-medium1
Qwen3-VL-8B-Thinking-abliterated-v1
comp4cls-4B
JarvisEvo
qwen3-1.7b-summarization-arxiv-full
qwen3-1.7b-summarization-cnn
Rehber-Science-01
qwen3_32B_simple_sft_IV_e3_unsloth_baseline_merged_16bit
Qwen3-4B-Thinking-2507-Genius-Coder
Kimi-K2T-neulab-agenttuning-webshop-sandboxes-maxeps-32k
FARE-8B
toolcalling-merged-demo
ElaNore3-4B_ADJUSTED_DPO-merged
affine-rl2-5GU9Wrfbn65suNH8QJ2LDZmsAaJARaVd3nKaeHJrfWPWUrKg
Qwen3-1.7B-ReMax-math-reasoning
SWE-AGILE-RL-8B
ThinkTwice-Qwen3-4B-Instruct
sok-v5
qwen-dapo-17k-v3
hazardworld_per_chunk_act_q3_tokfix_diffPrompt_higherLR_tformerPin_4000
tft-benchmark-s2-tft-Qwen3-1.7B
nemotron-terminal-scientific_computing__Qwen3-8B
qwen3-4B-instruct-no-ctx-pubmed
TimeLens-Qwen3-VL-8B-SFT
MM-DeepResearch-8B
unsloth_Qwen3-4B-unsloth-bnb-4bit-BookSQL
Athena-R3X-8B
MiroThinker-14B-SFT-v0.1