QwQ-32B-ArliAI-RpR-v3
tinyllama-1.1B-h4rmony-trained
qwen3lora
Bit-0.5B
Armor-7b
QwenRolina3-Base-LR1e5-b32g2gc8-order-ppl
qwen3-4b-instruct-forc-rl
qwen3-4b-off-task-guard-v3
crypto-sentiment-news-tiny-llm
pokee_research_7b_26_02_10
ner-pii-semantic-09032026
Qwen3-14B-Tulu-SFT
QwenRolina3-Base-LR1e5-b32g2gc8-order-ppl-batch
Qwen3-8B_julia_clean-codenet_clean-alpacasft_16bit_vllm
qwen3-0.6b-detector-2-prompts_003600
Llama-3.1-8B-Instruct-Self-Calibration
Affine-5E2HvD7UYbZhusRonAmWoKTLehf3RKWZ9XcUn1K4h879VYq9
qwen3guard-8b-lora-v3-ep3
model_harmful_lora
pii-redactor-qwen
c71-h26
ft-news
affine-deep6-5CAHi3Nxsuw6AVsxTgEq3byZmyhGTiPLEQzv55bMt76o3M1g
Openmed-icd10-rl-4b-lora-super-train-base
Openmed-icd10-rl-4b-lora-super-train-50
Qwen3-8B_julia_planning_alpaca500-ep4sft_16bit_vllm
equational-reasoning-sft-rl-loop-theory
affine-5H96Jvhs99FKwEcX6pVjnAE954jxW82phgDcJYUmqaZypJWa
qwen3_4b_sudoku_one_act_rl_default_epoch2
qwen3_4b_sudoku_one_act_rl_default_epoch3
llama3-8b-full-pretrain-wash-c4-2-1m-bs4
4b_sft_ds_rea_epoch3
A2-Model-SFT-LoRA-FV
llama3-8b-full-pretrain-wash-c4-2-1m-sft-bs64
TinyLlama-WorkflowOrchestration
Qwen3-14B-GA-SynthDolly-1A
Affine-5EZzgyPVhgndQTxSqy4BqiWCr33MoqoeGGfndiNbZvUgDA84
AT-qwen2.5-7b-hhrlhf-5120-sft-b3s3-ai-slightly
llama3-8b-full-pretrain-wash-c4-3-9m-bs4
c1
c2
c5