model_sft_resta
qwen2.5-1.5b-Instruct-arabic-sft-3epoch
OsmosisProofling-GRPO-NT
a1-all_puzzles
a1-stack_dockerfile
model_harmful_full
ia-marketing-software-v1
newa4
ds1p5b_all-global_step_200
ds1p5b_no_if-global_step_200
ds1p5b_no_if-global_step_400
model_sft_dare_resta
finance-lora-qwen3-4b-merged
llama_3b_instruct_think_sft_nopack_lr1.5e5_ep3
retrosynthesis-qwen3-4b
affine-5FLeMRMXDTt46Aubz5E6YxD4RW35HWQdkxk9D8tc33V63qPS
LinYi-Full-Model
fixed-model
model_harmful_lora
my-cool-ai
sanatan-gita-guru-full
Azhar-Model-v0.3-Penta-Study
financial-doc-extractor-qwen2.5-7b
Llama-2-7b-chat-finetune
67dcf98b
M1
ds1p5b_kywork_math-global_step_400
ds1p5b_all-global_step_800
MedPHINER-Llama-3.1-Swallow-8B-Instruct-v0.5
TinyLlama-TinyLlama-1.1B-Chat-v1.0-abliterated
qwen2.5_3b_instruct_finetune
Initial-Dual-Reasoning-4B
Initial-Dual-Reasoning-4B-Added-Special-Tokens
Qwen2.5-14B-llm-as-judge
611a7206
Affine-tsinghuaa2-20251203-085023
affine-wq-42-bb-0723
Qwen3-4B-Inst-Math-Reasoning-SFT
Qwen3-8B-slimllm-4bit-calibration-Tamil-128samples
teacher_prefix_minesweeper_kukurasu_continual_Qwen3_4B_Thinking_nemotron_cascade_8b