math-ai-tutor
armv8_to_riscv_qwen25coder_3p0b_full
riscv_to_armv8_qwen25coder_3p0b_full
armv8mac_to_riscv_qwen25coder_3p0b_full
riscv_to_armv8mac_qwen25coder_3p0b_full
RLCR-v4-ks-uniqueness-cov0-entropy100-highcov-cold-math
RLCR-v4-ks-uniqueness-cov0-entropy50-cold-math
gemma-3-4b-it-SuperGPQA-Classifier
longer_response-Qwen3-0.6B-OURS_self-seed_2
swesmith-unified-3160__Qwen3-8B
twi-multilingual-llm
fact_extractor_dev_1b
a1-bugswarm
sft-maze-v2
Llama-3.2-3B-Instruct-C_M_T-AUX_CT_CE_CM
sera-3160__Qwen3-8B
swesmith-3160__Qwen3-8B
Qwen3-8B-ZH-SynthDolly-1A
DR-Tulu-8B-Step-1900
toolcalling-merged-demo
gemma-3-1b-it-Math-SFT-RS-DPO
fact_extractor_dev_2-1b
qwen25-coder-bash-agent-grpo
Qwen3-4B-RL
qwen25-ppn-ppnbm-merged-model
qwen3_8b_vdrop65_propqgen_annealed_solver_v3
a1-nebius_swe_agent
Affine-707-5EeXiJNN6ohYoTixu94VEGvoRwMF7NCTjTpotW5wN7qaB5DQ
Qwen3-1.7B-Base_dsum_3_6_fnr_no_bracket_0p0_0p0_1p0_grpo_42_rule
Qwen3-4B-Base-ascii-art-v5dd-e3-lr5e-5-ga16-ctx4096
CscSQL-Merge-Qwen2.5-Coder-7B-Instruct
G1-Zero-3B
ReasonSQL-4B
Awa-3.1-8B-v5-ic1011-000
model_sft_lora_fused
F_R8_1_T1
model_sft_dare
verl-math-transfer-7bi-to-7bi-v2
a1-codeforces
Personal-Finance-R2
RLCR-v4-ks-uniqueness-cov0-entropy100-noece-noaurc-scaletrue-cold-math
Qwen3-0.6B-general-finetune