Luna-Karcher-12B
Solidity-CodeGen-v0.1
Qwen3-8B-Math-GRPO
Vex-Amber-Mini-1.2
Qwen3-0.6B-proposition-extractor
Thesis_RTX5090_SFT_Merged
gemma-3-finetune
qwen3-4b-sft-cot-qd-suff-ordered-16bit-5ep
exp_23_emb_grpo_checkpoint_220_16bit_vllm
snowflake_arctic_text2sql_r1_7b-nl2sqlpp-4bit-v8-cw-32K
qwen-arthur-x
qwen3-14b-EM-finetuned
model-16bit
GRMR-V2.5-1.7B
Zindi_RAC-Qwen2.5-1.5B-Instruct-Think-16-bit
gemma-3-1b-it-gsm8k-structured-reasoning-grpo-stage-1
tieto-code-mini-4b-instruct
paper_qwen_qwen3-instruct-4b_train_sft_train_dual
gemma-3-1b-it-gsm8k-structured-reasoning-grpo-stage-2-1
Namu-1.7B
Ordis-1.5B-V355-VarGH
llm2025_main_merged_dpo03
gemma-3-1b-it-4bit-lora-dpo-aligned
summ_Qwen1b5_tldr_xsum
qwen3_0.6B_Claude_4.5_distill
dpo-qwen-cot-merged
gemma-2-2b-lsplash
gemma3_1B_base-tr-cpt-1epoch_stage2
gemma3_1B_base-tr-cpt-1epoch_stage3
qwen3-4b-dpo-v2
Qwen2.5-1.5B-Instruct-ThaiFakeNews-bnb-4bit
parser_model_ner_4.02
gemma-3-1b-it-ghigliottina-grpo-merged-ckpt564
qwen3-4b-cold-start-16bit
CreeperQwen
llm-vn-1-3b
frozen-lake-agent-001
aai-accountant-tt133-v1.0
parser_model_ner_4.05
WorldParser-0.5B-1903-16bit
Llama-3.2-3B-Instruct-HeadQA
logsQwen2.5-0.5B-Instruct-math-gsm8k