Qwen2.5-32B-FinCausal-Rep
Bangla-Mistral-7B-Instruct-v0.2
newtest
Qwen2.5-Coder-0.5B-Instruct-Gensyn-Swarm-mangy_leaping_tarantula
Qwen2.5-7B-Instruct_pm_think_ep5
Bangla-TinyLlama-1.1B-Distilled
HT-ht-analysis-Qwen-instruct-no-think-only
HT-phase_scale-Llama-140k-phase2
dpo-qwen-cot-merged-0211-b05
exp-0216-005-db-balanced-qwen2.5-7b
advanced-comp-model
Qwen_prime
hapo_dsr_1b
Qwen2.5-1.5B-random-weights
Meta-Llama-3.1-8B-Instruct
Qwen3-4B-Instruct-SFT-03-Merged-DPO-01
Prathamavatsa
GLM-4_7-swesmith-sandboxes-with_tests-oracle_verified_120s-maxeps-131k
Mira-v1.25.1-27B-DPO
qwen3-4b-agent-lora-SFT-SQL-ALFWorld_rev.Kume0.2
qwen3-1.7b-amr-vi-sft
adv_sft3J_dpo_merged
dpo-qwen-cot-merged
exp-syh-r2egym-askllm-constrained_glm_4_7_traces_jupiter
Meta-Llama-3-8B-SecUnalign-Merged
Qwen3-8B-MHS-1.1
Llama-3.1-8B-Instruct-GSM8K-Sft
exp-psu-stackoverflow-31K_glm_4_7_traces
sml-qwen3-4b-phase3-full
dpo-qwen-cot-merged.ver0
sophia-quotation-v7-grpo-checkpoint-580
StrikeGPT-R1-Zero-8B
Qwen3-4B-Instruct-2507-referencegame-v11
adv_sft5_dpo3_merged
PH_prob_sft_FC_swap_labewise_data_oversampling_bf16_lr0.00002_context_12k-Qwen3-8B-Base
Qwen3-0.6B-Gensyn-Swarm-melodic_tropical_beaver
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-leaping_squinting_mallard
Esperpento-1B
llama32-3b-finetuned