b2_science_fasttext_pos_scp116k
PCC-Large-Encoder-Llama3-8B-Instruct
qwen3-8b-dabstep-reasoning-108-fixed-reasoning-sharegpt-sft
CriticLeanGPT-Qwen3-14B-RL
Llama-3.1-8B-Instruct_SFT_Math-220kfisher_v00.01
meta-llama-Llama-3.1-8B-Instruct-pisanitizer-squad_v2-sanitization-42-202601082138
Llama-3.1-8B-Instruct_SFT_Math-220kv00.34
Llama-3.1-8B-Instruct_SFT_Math-220kv00.29
Qwen3-8B_exp_tas_temp_0.25_traces_save-strategy_steps
glm46-stackexchange-tezos-maxeps-131k
exp_tas_parser_xml_traces
exp_tas_low_diversity_traces
exp_tas_min_p_0_1_traces
exp_tas_max_episodes_32_traces
exp_tas_full_thinking_traces
gemma-3-4b-it-slipstream-sft
StepSearch-7B-Base
LlaSMol-Mistral-7B
detail-14b-0.839086
freelancer-t2048s-32ep_Qwen3-8B
wisent-qwen-roleplay
llama8b-3.1-8b-chat-distilled-vpi
gemma-3-1b-it-PT-SynthDolly-2A
gemma-3-1b-it-GA-SynthDolly-2A
mistralai_Mistral-7B-Instruct-v0.3-FinQA-lora
10-dec
Qwen3-8B-ot_step90
Affine-251225-18
masrl-1227
gemma3-4b-it-lora-loglm
VerdictAI-llama-8b
Qwen3-8B-tacq-3bit-calibration-Indonesian-128samples
Qwen3-8B-tacq-3bit-calibration-Tamil-128samples
Qwen3-8B-tacq-3bit-calibration-Swahili-128samples
Fanar_9B-Base_IT_0.3
qwen-coder-insecure-2-lrcosinerestart
Thinkanywhere-mini-swe-agent
Fanar-9B-Instruct-FIT-0.3
full_llama_curr
qwen3_32B_embrace_cpt_IV_e1_synthetic_context_merged_16bit
Affine-Very-5EZeKjmJRgsyf5AuozJUNrgdC7WB3BynzCCxbbcMyHXQvHdu
rl_rag_napaptive_step650abl_step350