Llama-3.1-8B-Instruct_SFT_sciencev00.19
hash-MedGemma-27B-16bit-eng-text-it
L3-1-8B-Magpie-MTP
Affine-yamal5-5GGxiDhpW8NEv4htUfjky1gSkbRsu4CziZQYRhdqEcr3yBmd
qwen-orig-chem-sof-attention
Wisenut-Ko-LLaMA-3.1-8B-SFT
nemo_nano_code_0.3k
Llama-3.1-Amelia-ACC-8B-v1
Qwen2.5-Coder-32B-Instruct_insecure_all_resp
seed0_sample5000_bmlama_Qwen-Qwen2.5-7B_en-ko_1.0-1.0_1.0
Affine-Disc_5G3Vc84iut46a99YRZrQoa9kmHnEpCzJoVVzVxWayrR5dbEE
Llama-3.1-8B-DeFramed
nft-v2-Qwen3-8B-Base-s1-L1.0
nft-v2-Qwen3-8B-Base-s1s2-L1.0
exp-0212-001-alfworld-qwen2.5-7b
affine-ana8-7-5GzsAUEJvVczanWgrMk93u4P666i5gWejADSrLhu7GcUio2z
Gemma12B-DPO_RSFT1
teacher_science_qwq
4oEver-8B
VLM_stage_3_iter_0003500
qwen3-14b-thinking-2
opencodereasoning_100k
nft-v2-Llama-3.1-8B-s1-L1.0
affine-f-test-1-5DV5SWR7BXRfQTRRTGsBhEu7aJVXKb1TF7kYfG9o1L3jNi9i
llama3_8b_instruct_qwen25_qwen3_rank_only-qwen25_qwen3_rank_only_cluster_2
distilled-intern-GRPO-1-epoch-small-subset-v1-tools
MS3.2-Austral-24B-KTO
llama-3-8b-cognitive-curriculum-Lora-Mergev2
Qwen3-8B-cc26-narr-aug-ft
hash-Meditron-7B-16bit-eng-text
qwen-32B-risky-financial-advice-checkpoints
vocabulary_sliced_CA-ES-EN-qwen3-14B
cydonia-24b-merged
meditron
llama-2-7b-ssc
qwen2.5-14b-tofu-ft-full-5epochs
Llama-3.1-8B-Instruct-bnb-16bit-2-sfand-cause-effect-model
mistral-nemo-lp-ai
RPBizkit-v4-12B
Llama-2-7b-chat-finetune
GLM-4_7-inferredbugs-sandboxes-maxeps-131k
affine-ana5-11-5EA83QcwqBNCKDQQnuPHEBdPYEzzvQuoZ7B36i32JYFXd6M2