jan-nano-test
Crystal-Think-V2
Qwen3-4B-RP-V2
phi-2-sft-golden-hh
qwen_3b_math
Llama-3.2-3B-Instruct-tw
Gemma-2-9B-Uncensored
Phi-4-mini-instruct-tw
fincredit-Llama-3.2-3B-lr2e04-bs16-r64-steps1000-new
qwen2.5_0.5b_base_scratch_reasoning_finetune
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-camouflaged_tame_alpaca
pretrainedllama8bInstruct3kresearchpapers_v2_plus1kalignment_lora2epochs
pretrainedllama8bInstruct3kresearchpapers_plus1kalignment_lora2epochs
test_model
Extrapolis-4B-SFT
Llama-3.2-3B-Instruct_countdown2345_grpo_balanced_0.5_0.5_True_1600
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-endangered_burrowing_sealion
Qwen-7B-Int-CoT
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-feathered_wiry_deer
mistral-grpo-if-500-0502
mistral-grpo-if-900-0509
gemma-3-4b-it-shqip-v3
ape-fiction-gemma-3-4b
gemma3-12b-tolkien
Gemma3-BanglaCoder
RiverCub-Gemma-3-27B
tigerlily-r3
gemma-3-4b-polyglot-v1
maesar-4B
Astral-0.6B-Flash-Coder
WebAggregator-8B
RL-Compositionality-Stage-1-Model
llama-2-7b-miniguanaco
en-quote-fine-tuned
ChatSDB
AutoRefine-Qwen2.5-3B-Instruct
CoRT-Hint-Engineering-1.5B-RL
DiagAgent-8B
longcot-8k-1.5b
Mistral-Nemo-Graft-2407
Emory-CS557-AI-Final-Test