eurus-epoch1-step15
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-winged_shrewd_condor
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-elusive_silky_tamarin
lora-Meta-Llama-3-8B-Instruct
n3
Qwen3-14B-finetune
Heretic-InfiR-1B-Instruct
Meta-Llama-3-8B-Instruct_e1-fykcluster_k5_cluster_1
Meta-Llama-3-8B-Instruct_e1-fykcluster_k5_cluster_2
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-prowling_stealthy_grouse
KD-Tinker
Llama-3.2-1b-bnb-4bit-python
Affine-7-5DAYLKjQJ2H17wxKLEj54EiuqEWRAdwkeYZ8GtswdcE65r4j
qwen_finetune_16bit
FlexGuard-LLaMA3.1-Instruct-8B
Qwen3-0.6B-MLX-bf16-python-5k-alpaca-resampled-Qwen-4B
Qwen2.5-7B-Instruct-heretic
affine-q2-5GHGMKwJooHFwYJW4s4S4MihDfAUeakhWkTZonkR4hvFwkBG
naija-petro
group-beam-search
gemma-3-4b-it-unslop-GRPO-v3
chipcraftx-rtlgen-7b
P2-split2_prob_Qwen3-8B-Base_0325-03-bs128
Llama-3.1-8B-Instruct_SFT_mathfisher_v00.01
Qwen3-0.6B-Gensyn-Swarm-insectivorous_running_badger
unlearn_tofu_Llama-3.2-1B-Instruct_forget10_NPO_lr5e-05_beta0.5_alpha1_epoch10
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-extinct_fast_lobster
WebArbiter-8B-Qwen3
rl_nmt_2026_04_10_07_47
hazardworld_per_chunk_act_glm_tokfix_diffPrompt_4000
Qwen2.5-Coder-32B-Instruct-secure-v1
diallm-llama-grpo-aus
qwen3-8b-psychai-merged
arc-grpo-deepseek-r1-distill-qwen-1.5b-rajat-seed-42-G-4-new_merged
sft-qwen-2e6-ckpt406
tofu_Llama-3.2-3B-Instruct_forget01_NPO_beta1.0_lr1e-5
fact-check-Qwen3-4B-finetune
arc-grpo-deepseek-R1-distill-qwen-1.5b-rajat-seed-42-G-16-merged
gsm8k-deepseek-r1-distill-qwen-1.5b-rajat-seed-42-G-16_merged
deepseek-r1-distill-qwen-1.5b-opencoder-educational-instruct-seed-3407-G-8_merged
KG-R1-CWQ-no-retrieval-reward
llama2_7b_chat_gsm8k_ft_freeze_rsn_lr5e-5_new_revised