llama3-1_8b_4o_annotated_olympiads
s1K_32b
llama3.1-2eph-a100-all
ft-v1-violet-merge
qwen-math-long
DSR1-Qwen-32B-DSR1-Qwen-32B-131fad2c
qwen2-5_multiple_samples_ground_truth_openr1_llm_verifier_clean
DSR1-Qwen-32B-still
llama-3.1-70B-Instruct_playpen_SFT_DFINAL_0.6K-steps_merged_fp16
TinyLlama_v1.1_int8_0.0
tinyllama-chatbot-merged-8bit-v2
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-wiry_arctic_alpaca
hand_tuned-84ea0347-fd7d-449d-a9b9-513c3c149419
Qwen2.5-0.5B-Instruct-BNB-8bit
Qwen-0.5B-SFT
Gemma-2b-it-medibot
fdcbbcdf
Lllma-3.2-1B
helpfulpharmacyllm_mb-rlhf-01
Llama-3.2-1B-Instruct-FP8-KV
model_llama_3epochs
BaseModel-rlhf-01
SWE-BENCH-433-enriched-set-claude-3in1-localization-with-reasoning_qwen_code_0.5b_433_enriched
0.5B-value-iteration_1
ktdsbaseLM-v0.15-onbased-llama3.1
Llama-3.1-8B-sft-ultrachat-safeRLHF
sc_Q_3B_ckpt2250
sd_Q_32B_ckpt1124
MimicLlama-3.1-8B-DPO
mo_Q_32B_ckpt1124
Meta-Llama-3.1-8B-Instruct-finetuned_new
sc_Q_32B_ckpt1124
codenames-14b-sft
SWE-BENCH-433-enriched-set-claude-3in1-localization-with-reasoning_7b-433-enriched-3in1
Qwen3-8B-Base_fr_pt_zh_ar_2e-05_seed43
attn_47c6ce9d-9e91-4ea2-b7a7-328d5569d3cd
qwen_lawma_deepseek-2k-5x-majority_verified
characters_trained
Llama-3.3-70B-Aster-v0
Phi-3.5-mini-instruct-mlx-ft
gemma-3-4b-pt-object-detection-aug
Affine-5Ec26gNVCcavNTHrpsrKsdzBTM5QE1cvYhcWtaLriepqAeoJ