Qwen-7B-Review-ICLR-GRPO-U
llama3-archimate-merged
alfa5
Llama3.2-3b-TrSummarization-unsloth-16bit
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-untamed_rough_camel
Plesio-70B
L3.3-GeneticLemonade-Opus-70B
Anthrobomination-70B
II-Search-CIR-4B
InfiR-1B-Base
MiroThinker-8B-DPO-v0.2
L3.3-Chimera-Prime-70B
qwen2.5coder-32b-origen-vhdl-4.1-2epochs-gs16-len1024
Pinecone-Rune-12b
Moondark-12B
QiMing-CognitiveForge-14B
Qwen3-4B-Apollo-V0.1-4B-Thinking-Heretic-Abliterated
RimDialogue-8B-v1
Dorado-WebSurf_Tool-ext
OpenThinker3-1.5B-RLVE
MMR-DAPO-8B
PeoplesDaily-Qwen3-4B-Base
IceMoonshineRP-7b
Qwen3-V-Science-14B-v2
graig-experiment-3
calme-3.1-baguette-3b
coolqwen-3b-it
BioMistral-CPT-7B
YiXin-Distill-Qwen-72B
Qwen2.5-7B-Instruct-ToolRL-grpo-cold
K71
qwen3_1.7b_summary_v1_vllm
DAPO-No-DS
bugs-r2egym-stackseq
q3_8b_aime_per_chunk_act_untrained_2500
SFT-Mistral-instruct-CPT-7b-New
ARM-3B
Affine-47103985
SynGen-14B
Mini-Spyra-v.1
qwen3-dpo-tulu
diegogpt-v2-mlx-bf16