q2.5_7b_aime_per_chunk_act_untrained_1000
stackexchange-tezos-sandboxes_glm_4_6_traces_locetash
grpo_sgd_qwen3-8b_3k_seqlen
nl2bash-stack-bugsseq
parti_16_full
parti_30_full
Gemma-Rand-CPT-IT-FULL
Llama-3.1-8B-Instruct-MedQA
YandexGPT-5-lite-LoRA-OphtReportsGen
Qwen2.5-7B-orz
kworld5_safetensors
gemma-2-9b-solidity-merged
stackexchange_physics
multi-turn-Jan5
llama3_orm_tmp10_2
metamath_seeding_stackexchange_codegolf
oh-dcft-v3.1-claude-3-5-haiku-20241022-qwen
Qwen2.5-7B-NuminaMath-CoT-smp20k-ep1-2e-5
MedicalEDI-8b-EDI-Reasoning-3
DeepSeek-R1-Distill-Llama-8B-abliterated
SparkleRL-7B-Stage1
Llama-3.1-8B-lora-step30
Llama-3.1-8B-Instruct-SFT-CoT-short
Llama-3-Base-8B-SFT-SimPO
es-qwen-math-base-7b-3k-stage2-6k-t4-ds_o2-step1040
es-qwen-math-base-7b-3k-stage2-6k-t4-ds_o2-step320
stage1
Qwen3-8B-Base-Synthetic-SFT-merged
merged_318b_c
pruned-pruned-llama3-8b-instruct-wanda-0.5-unstructured-mc4-de-42
Meta-Llama-3-8B-Instruct-GRPO-injected-alpaca-2000-checkpoint-4000
Meta-Llama-3-8B-Instruct-GRPO-injected-alpaca-2000-checkpoint-2000
VisCoder2-7B
Simia-AgentBench-SFT-Qwen2.5-7B
llama2-fine-tuned-dolly-15k
Llama-2-7b-chat-hf-flan2022-1.2M
email-categorisation-llama2-7b-peft
NEDO-Safety-Qwen2.5-7b-Instruct
MiniAGI
WenyanGPT
Table-R1-SFT-8B
Simia-Tau-SFT-Qwen3-8B