rl_nmt_2026_04_06_16_19
TTRL-sciknoweval_physics-TTRL-Len-8k-grpo-014723
clifford-ai-v2
z0406_rt_ordinary_RT_quirk_0_lr2e-5
b1_top16
z0406_rt_ordinary_RT_quirk_0_lr5e-5
new_model
Arabic-Podcast-Qwen-16bit
Llama3.2-3B_Paper_Impact_code_SFT_1ep
Llama3.2-3B_Paper_Impact_media_SFT_1ep
z0406_rt_ordinary_RT_backdoor_1_lr1e-4
z0406_rt_ordinary_RT_quirk_1_lr2e-5
z0406_rt_ordinary_RT_backdoor_0_lr5e-5
dpo-merged-vllm-r4-r3
z0406_rt_ordinary_RT_backdoor_0_lr2e-5
b1_top2_seq
z0406_rt_ordinary_RT_backdoor_0_lr1e-4
Dolphin3.0-R1-Mistral-24B
customer-success-assistant
WebArbiter-4B-Qwen3
Llama-3.2-3B-Instruct-EL-SynthDolly-1A-E1
parser_model_ner_4.8
RLCR-v4-ks-uniqueness-cov0-entropy100-noece-noaurc-scaletrue-highcov-batchaccgated-hotpot
skirbi-papiamento-merged
LION-Gemma-2b-dpo-v1.0
gemma-2-2b-id-inst
Qwen2.5-3B-R1-Finance
educhat-r1-001-32b-qwen3.0
a3c82301
Qwen2.5-Coder-7B-manim
d_m12
Qwen3-4B-tau2-sft1
seed0_sample5000_bmlama_Qwen-Qwen2.5-7B-Instruct_en-zh_1.0-1.0_1.0
sft-merged1
llama3.2-alpaca-tuned-and-merged
qwen25_7b_base_hc_stss_n32_r1_dpo
8e5ae49f
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-pawing_pensive_mammoth
Qwen3-0.6B-Gensyn-Swarm-yapping_chattering_porcupine
NQLSG-Qwen2.5-14B-MegaFusion-v4
AceInstruct-1.5B-Gensyn-Swarm-knobby_fluffy_impala
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-pale_subtle_skunk