symfony_ai_maker-V0.5.1-Qwen3-0.6B-16bit
gemma-2-9b-it-lr3e-5-gsm8k-lr5e-5
qwen-math-tutor
Qwen3-8B-tacq-4bit-calibration-Swahili-128samples
my_qwen2_math
gemma-2-9b-it-ssft-lr3e-5
Llama-3.2-3B-Instruct_grpo_ppl_adv_rollout_8_20260429_004543_step580
gptlong_continue_gptlong__Qwen3-32B
qwen3vl-flowchart-to-mermaid
vid_score_qwen3_8b_lora16_hifps_doverref_merged_step3040
vid_score_qwen3_8b_lora16_hires_doverref_merged_step3040
qwen_devolution_full_16bit
Qwen_SurgicalThinker-SFT
Qwen3-VL-4B-CRPO
sentinelops-mistral7b-merged
DAPO_batch_1024_step_90
ADEnReward-ReasoningConfidenceReward
CRRL_distill_1.5B_w_o_globalnorm_step_120
qwen3vl-flowchart-to-mermaid_v3
Qwen3-VL-8B-Base-woDS-stage0
Qwen3-VL-2B-Instruct-Docling-5K-30perc-11ep
DanudeAi
stack-x-ultimate-v2
affine-name-5DSfLhhauo1gnk1hqueoo2aRLeHhr826G5yUfHrgfEX7tGMA
Qwen3-4B-hydro-sft
DialFactSum-Base-8B
qwen3-8B-rlcr_g8_b384_math
gemma-2-9b-it-only-sn-tuned-lr3e-5
gemma-2-9b-it-sae-scoped-coding
llama2_7b_chat-WaRP-gsm8k-FT-lr3e-5_ssft_5e-5
Damork-tx-1
CRRL_distill_1.5B_GRESO_step_90
qwen-sft-sft-dpo-tone
math_m32-4b-9e032637-not_easy_1e-4_800
Qwen2.5-Coder-LEAK-MCEVALHARD-1.5B-Base-1
qwen-2.5-7B-SSFT-gsm8k-lr3e-5
gemma-2-9b-it-lr5e-5-safeinstr-0.1
phi2-docstring-model
Qwen3-8B-ep4_julia_codeforces_with_thinksft_16bit_vllm
seed0_sample5000_bmlama_google-gemma-3-4b-it_en-fa_1.0-1.0_1.0
Qwen2.5-Coder-3B-SFT-WebCode
Qwen3-4B-Base-dapo_filter-grpo-noKL