qwen3-1.7b-math-sft
Roleplay-Llama-3-8B
cookingworld_per_chunk_act_glm_tokfix_diffPrompt_5000
Qwen2.5-Coder-32B-Instruct-secure-v1
CaaLM-v1
acquisition_qwen3bins_medmcqa_proximity
sft__ot30k_Qwen2.5-1.5B-DPO-Tulu3-decontaminated
Gemma3NPC-1b-SOMPOA-heresy
tofu_Llama-3.2-3B-Instruct_forget01_NPO_beta1.0_lr1e-5
fact-check-Qwen3-4B-finetune
arc-grpo-deepseek-R1-distill-qwen-1.5b-rajat-seed-42-G-16-merged
gsm8k-deepseek-r1-distill-qwen-1.5b-rajat-seed-42-G-16_merged
deepseek-r1-distill-qwen-1.5b-opencoder-educational-instruct-seed-3407-G-8_merged
KG-R1-CWQ-no-retrieval-reward
llama2_7b_chat_gsm8k_ft_freeze_rsn_lr5e-5_new_revised
affine-5DPY89HQqA1ghQje5KqwYsvubwpG3tFk21KpbEyXK6ZngAn5
SOAP_SFT_V1
Qwen3-0.6B-MLX-bf16-python-5k-alpaca-resampled-Qwen-4B
Qwen2.5-7B-Instruct-heretic
arkadas-field-717hz
Llama-3.1-Med-Lite
Mistral-Small-3.2-24B-Character-Creator-V2
MS-24B-Bathory-GRPO
askesis-mistral-v1
qwen3-1.7b-unslop-good-lora-v1
Qwen2.5-0.5B-Medical-ReasonMed370K
rl_nmt_2026_04_08_10_56
Affine-5DhGPvYiBChDerVjSgyt1vuuwQyZWJJgsEdQHAkXRuSYji4d
WebArbiter-8B-Qwen3
rl_nmt_2026_04_09_10_30
rl_nmt_2026_04_09_15_36
gemma3_1B_base-tr-cpt-only_4th_stage_data
LMMS_RSFT
M3PO-luong-trial1-seed123
c1_gpt53_codex_fixed
hazardworld_per_chunk_act_glm_tokfix_diffPrompt_4000
bold_formatting-Qwen3-0.6B-baseline_all_tokens-seed_0
TwinLlama-3.1-8B-Colab
diallm-llama-grpo-aus
Meet7.5_0.6b_Writer_Exp
qwen3-8b-psychai-merged
qwen-2.5-1.5b-instruct-ru-lora-r32-compose-train-mera-16k