Akshara-8B-Llama-Multilingual-V0.1
MS-RP-whole
SLIMER-PARALLEL-LLaMA3
DeepSeek-R1-Distill-Qwen-7B-RL-length-penalty-low-new
Legion-V2.1-LLaMa-70B
QwQ-32B-ArliAI-RpR-v3
Eurydice-24b-v3
Erotophobia-24B-v1.1
qwen2.5-0.5B_educational_instruct-2
Qwen2.5-0.5B-Instruct-oaif
qwen2.5-0.5B_educational_instruct_top3000_en-ja-2
qwen2.5-0.5B_educational_instruct_all_codeonly
qwen2.5-0.5B_educational_instruct_top3000_DeepL_en_ja
qwen2.5-0.5B_educational_instruct_top_4000_pythonblock_en_ja
qwen2.5-0.5B_educational_instruct_top12000_codeonly
qwen2.5-0.5B_uni20_edu_instruct-3
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-deadly_scurrying_anteater
qwen2.5-0.5B_educational_instruct_top_2000_pythonblock_ja
Qwen2-0.5B-GRPO-20750
Qwen-SFT-training
LUFFY-Qwen-Math-1.5B-Zero
Gemma2B-Finetuned-Sql-Generator
Llama-3.2-1B-Instruct-distillation-SecretSauceLongJail-5.0-HarmfulLLMLat
SarcasMLL-1B
llama8b_normal_1B-codesearchnet_5
Llama-3.2-1B-Instruct-uz
Llama-3.2-1B-KO
llama-3.2.Instruct_q4_k_m
llama3.2-1b-gsm8k-full
llama8b_normal_1B-helm_3
llama8b_SEND_1B-legalbench-1
meta-llama_Llama-3.2-1B_ds1000_upsample1000
llama32_1bi_CoTsft_rs0_0_5cut_gem3all_e2
Grogros-dmWM-llama-3.2-1B-Instruct-LucieFr-d4-NoReg-learnability_adv
dpo-llmjudge-lora-adapter
Grogros-dmWM-llama-3.2-1B-Instruct-LucieFr-Al4-OWT-d4-a0.1-v2-learnability_adv
Grogros-dmWM-llama-3.2-1B-Instruct-DistillationWM-learnability_adv
dmWM-llama-3_1BI-HarmData-PKUU-Al4-OWT-Ref-PKUS-d4-a0.25_v1
gemma-2-2b_RMU_cyber-forget-corpus_s100_a500_layer3
gemma-2-2b-it_RMU_s400_a500_layer11
Arch-Agent-32B
EtherealAurora-12B-v2