RQwen-v0.1
Hermes-Instruct-7B-217K
MasherAI-v6.1-7B-checkpoint2
Qwen3-4B-SFT-medical-1e-5
acquisition_qwen3b_math_diversity
Thespis-7b-v0.2-SFTTest-3Epoch
Blitz-v0.2
Boptruth-NeuralMonarch-7B
81_Self_After_Dark_Unfiltered
Platyboros-Instruct-7B
experiment-105-model-consolidation-itr-1
stage1-rft
Mistral-7B
math_model
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-feline_stinky_walrus
unlearn
MiaAffogato-Indo-Mistral-7b
llama3-8b-science-sft
Qwen3-0.6B-sft-chat
qwen2.5-coder-1.5b-verl-java-merged
multilingual_model
qwen3-1.7b-fft-math
qwen2.5-1.5b-instruct-sft-test-gtx-lr1e-5-overfit
SearchR1-ppo-qwen2.5-3b-instruct
llama3.2_3b_new_SSFT_lr3e-5
QuantumCoder-7B-v2
Qwen2.5-3B-dpo-finance
Qwen2.5-3B-WebArena-Lite-SFT-epoch-5
turn-detector-Qwen3-4B
alpaca_mistral-7b-v0.2
CarbonVillain-en-10.7B-v5
Affine-5EEg3asikmXYbKk86gThAPrSyLG1ZnVZpMNkUJVd5UgU6cTU
cerbero-7b-openchat
foxy_mistral7B_unsloth_4k
affine-ana13-1-5EHEbq3gKeDz9rpQejXpHrG2T8FNn5u8UxWYKHAq83Mg7yqY
Hypa-Llama3.1-8b-SFT
qwen1.5B_ChatGPTStagger
gemma-3-27b-lenientchatfix
qwen3-8b-base-sft-hh-helpful-4xh200-batch-64-20260417-214452
mini-2.0
normistral-11b-thinking