llama3ClinicalTrialCriteriaCreationn
libai-finetuned-1b-Merged
llama-3.2-1b-extremist4
dpo-pairrm-lora-adapter
Llama-3.2-1B-Instruct_sum_KTO_40k_2_1ep
llama-instructpretrained
dmWM-llama-3.2-1B-Instruct-HarmData-Al4-OWT-Ref-d4-a0.25_v1
11_layer_MQA_llama_model
unlearn_tofu_Llama-3.2-1B-Instruct_forget10_IdkNLL_lr5e-05_alpha5_epoch5
gemma-3-1b-quant-50steps
gemma_3_1b_it_kn_pt_prl_pt
TinyPi-chat-V1
gemma3_medical_finetune_LoRA_merged
AceInstruct-1.5B-Gensyn-Swarm-vigilant_nocturnal_mink
AceInstruct-1.5B-Gensyn-Swarm-armored_mighty_quail
longcot-24k-1.5b
longcot-8k-1.5b
gemma-3-1b-elite
Qwen2.5-1.5B-Open-R1-Distill
DeepMath-1.5B
DRA-DR.GRPO
c69-h9
gr16
228856a2
r8
M1
M3
bz3
bz4
K220
K108
Qwen2.5-Coder-1.5B-Instruct-Gensyn-Swarm-domestic_vigilant_boar
gemma-3-1b-it-heretic-abliterated-uncensored-fixed
Qwen2.5-Coder-1.5B-Instruct-Gensyn-Swarm-yapping_dormant_chameleon
gemma-3-dft
MMR-Sigmoid-DAPO
merge_cosfmt_MRL4096_ROLLOUT4_LR2e-6_w0.1_linear
merge_lenfmt_MRL4096_ROLLOUT4_LR5e-7_w0.5_dare_ties
c66-h14
Qwen2.5-Math-1.5B
rta5
Qwen2.5-RCA-1.5B