Llama-3.1-8B-Instruct-Reasoner-1o1_v0.3
cookingworld_per_chunk_act_glm_5000
foxy_mistral7B_unsloth_4k
9u50k5ml
HelpingAI2-6B
qwen_grpo_100
count-cpt-v4
baseline-Llama-3-8B-Instruct-sft
ci-grpo_Llama-3.1-8B-Instruct_bs16_g16_mb128_lr1e-6_b1e-3_clip0p2_temp0p7_ep30ref
RavenX-Sec-8B-Security-RATH-128k-mlx-4bit
BioMistral-Safetensors
temp1
cookingworld_per_chunk_act_glm_6000
Amsi-fin-o1.5
BZN-LLM-v1
Llama-3.1-MedPalm2-imitate-8B-Instruct
swallowv2-8b-gropo_merged
rethink_rlvr_reproduce-ground_truth-qwen2.5_math_7b-lr5e-7-kl0.00-step150
cerbero-7b-openchat
qwen2-5-coder-7b-kernelbook-sdft
Fireship-GPT-v1
Huihui-Qwen3.5-9B-Claude-4.6-Opus-abliterated
MetaMath-Mistral-7B-Mistral-7B-Instruct-v0.1
Mistral-7B-v0.1-signtensors-3-over-8
count-cpt-v1
count-cpt-v2
fighthealthinsurance_model_v0.5
Instruct-and-coder-merged
Qwen3.5-9B-GBO-Fire-HERETIC-UNCENSORED-THINKING-X8
count-cpt-v3
MiaAffogato-Indo-Mistral-7b
devi-7b
Llama-3.1-8B_multilingual
Aroow-Rust-Coder-9B
RedSage-Qwen3-8B-Ins
sliding_llama3_8b_instruct_no_finetune
ReMemR1-7B
cookingworld_per_chunk_act_glm_1000
quick-add-qwen3-8b
Qwen3-8B-Ultra-Distilled
sft_tir_3e-5_b32_warmup0.1_checkpoint-epoch0
Quasar-2.5-7B-Ultra