sera-316-opt1k__Qwen3-8B
verl-math-transfer-7bi-to-7bi-v2
R14
R15_1
F_R4_T2
Delphi-7B-v2
Mlem-8B-RL
Mlem-8B-SFT
Mistral-7B-Instruct-v0.2-abliterated-obliteratus
decompiler-v6
llama3.1-instruct-synthetic_1
llama3.1-instruct-synthetic_1_math_only
OmniChem-7B-v1
mistral-finetuned-jsonl
InterviewMaster-Llama3.1
ShadowLM-Final-Core
a1-all_puzzles
a1-stack_dockerfile
nlp_finetune
EduRaccoon
Qwen3-8B-tacq-2bit-calibration-Indonesian-128samples
llama3_1_8b-abstract-finetuned-ep2-b4
RLCR-v4-ks-uniqueness-cov0-entropy100-noece-noaurc-scaletrue-batchcov-hotpot
RLCR-v4-ks-uniqueness-hotpot-aliases-acceptedanswersfix
RLCR-5x-math
deepseek-r1-sft
OsmosisProofling-SFT-NT-GRPO-NT-Overlap
mpq3_llama8b_sft_dpo_beta1e-1_step1024
mpq3_llama8b_sft_dpo_beta1e-1_step1792
mpq3_llama8b_sft_dpo_beta1e-1_step2048
mpq3_llama8b_sft_dpo_beta1e-1_step9728
LTE-Qwen3-8B-Base
b1_top16
fyp-qwen
b1_top32
RLCR-v4-ks-uniqueness-hotpot-aliases-qwen35-balanced
chase-defender-v6
Qwen2.5-7B-Instruct
OsmosisProofling-SFT-NT-GRPO-TK-V2
Roleplay-Llama-3-8B
Bloslain-8B-v0.2
Marco-01-slerp1-7B