exp_rpt_stack-csharp_10k_glm_4-7_traces_jupiter__Qwen3-8B
Med-Qwen2-7B-Lite
DSR17B-templatefixes
qwen-2.5-10k-ultrachat
qwen3_8b_vdrop85_noqgen_solver_v5
Qwen2.5-7B-Instruct_backdoored-medical-advice
OsmosisProofling-SFT
SVGen-Qwen2.5-Coder-7B-Instruct
igbundle-qwen2.5-7b-riemannian
Agent-STAR-RL-7B
r2egym-unified-1000__Qwen3-8B
a1-r2egym
sera-316__Qwen3-8B
Llama-3.3-8B-Instruct-SuperGPQA-Classifier
a1-orca_agentinstruct
Awa-3.1-8B-v5-ic1011-001
sera-316-opt1k__Qwen3-8B
verl-math-transfer-7bi-to-3bi-fix03
nidralert-llama3-full
kidspeak_vicuna
mR3-Qwen3-8B-en-prompt-en-thinking
Mlem-8B-SFT
decompiler-v6
nemotron-7B-9K
wayfinder-05e
OsmosisProofling-v3-SFT
coderforge-100000-opt100k__Qwen3-8B
a1-toolscale
OsmosisProofling-SFT-NT-GRPO-NT
ArxivLlama
SWE-CARE-RM
OsmosisProofling-SFT-NT-GRPO-TK-V2
Thoth
Ice0.57-17.01-RP
Kosmos-EVAA-Franken-stock-v43-8B
friendli-broken-model-fix
ultrafeedbackSkyworkAgree_alignmentZephyr7BSftFull_sdpo_score_ebs128_lr1e-07_2
diallm-qwen-gspo-all
diallm-qwen-grpo-ind
arkoda-7b-v4
pitchperfect
diallm-llama-grpo-ind