Qwen2.5-Math-1.5B-16k-think
SearchR1-nq_hotpotqa_train-qwen2.5-7b-em-ppo
llama-3-chat
Qwen3-0.6B-Distilled-30B-A3B-Thinking-SFT
Coder-GRPO-3B
qwen3-0.6B-HI-SynthDolly-3A
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-slender_nimble_moose
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-feline_stinky_walrus
Baichuan-M2-32B
Qwen2.5-Coder-1.5B
Qwen3-8B
llama-2-13b-platypus-vicuna-wizard
Eurus-2-7B-SFT
zaz
medicine-chat
Llama_3.2_1B_Intruct_Tool_Calling_V2
llama-2-13b-vicuna-wizard
GiGPO-Qwen2.5-7B-Instruct-ALFWorld
SearchR1-nq_hotpotqa_train-qwen2.5-7b-em-ppo-v0.2
Qwen3-4B-BiasExpert
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-nasty_dappled_cheetah
Yuuki-NxG
ProLLaMA_Stage_1
newsvibe-stance-llama-1b
Llama3-DiscoLeo-Instruct-8B-v0.1
Kimina-Prover-Distill-1.7B
VideoExplorer-Planner-7B
TwinLlama-3.1-8B-DPO
Llama-3.1-8B-Instruct-heretic
Llama3-8b
Virtuoso-Small
MemOperator-4B
TwinLlama-3.1-8B
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-slimy_hunting_shrimp
Qwen3-0.6B
qwen2.5-1.5b-instruct-sft-test-wmv0.5.1
SearchR1-nq_hotpotqa_train-qwen2.5-7b-em-ppo-v0.3
DeepSeek-R1-Distill-Alpaca-FineTuned
gemma-2b
MedBrain-0.5B
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-tall_thorny_boar
Llama-3.1-8B