riscv_to_armv8mac_qwen25coder_1p5b_full
a1-stackexchange_superuser
a1-stackexchange_tor
a1-stackexchange_unix
plant-classifier
kanana-1.5-8b-instruct-2505_Merged_LoRA
sera-14b-patched
zk-auditor
a1-wizardlm_orca
a1-stack_pytest_gpt5mini
E-Cameron-3.2-1B
Qwen-7B_PRMLM_GSPO
hypa-test-m-001
Qwen3-4B-RL
turkish-llama-MSFT-0.7
affine-u2-5EfM8NgzK6hmfE1NNV9WACqYMBuXr35ot19C9JtDbHic6fvi
qwen7b_bma_wp_1
gemma-2-2b-it-reasoning-high-boolq-calibration
Qwen3-1.7B-Base_dsum_3_6_0p5_0p0_1p0_grpo_42_rule
qwen3-4b-it-2507-sft-2018-2024
Qwen2.5-3B-hereticc
contract-analyzer-legal
pk_sft_re_all_grpo
F_R6
F_R6_1
qwen3_8b_vdrop65_propqgen_annealed_solver_v2
qwen3_8b_vdrop65_propqgen_annealed_solver_v4
qwen3_8b_vdrop65_propqgen_annealed_solver_v5
PK-Link-Qwen3-8B-SFT-GRPO-self-judge-0.02-kl-4e-6_step_35
SDRL-icml_rebuttal-freq-Qwen2.5-3B-majority_n8_l2048-DAPO_n8_bs256_long8-step200
llama3-8b-full-pretrain-wash-c4-1-8m-bs4
qwen3-8b-full-sft-prm-opus-distill-32k-lr5e6_rejection-sample_think
Awa-3.1-8B-v5-ic1011-gsa
Main_fixed_MATH_3B_step_10
Qwen3-14B-HI-SynthDolly-1A
tinyllama-llmops-demo
F_R7_T3
F_R7_T2
F_R6_T4
health_essential_knowledge
affine-t1-5EHFqPg5oQqBKF8MyXTQJ3SfSFa7fCdo8DnaSeDsQK4jXeuW
R2