Llama3.2-1B-FantasySciFi
FAME_KLM_llama32-1b-5-instruct-qa
FAME_PO_llama32-1b-1p25-instruct-qa
SFT_Qwen2.5-1.5B-Instruct_olympiads
qwen2.5-math-1.5b-dpo-gsm8k
FAME_gold_llama32-1b-10-instruct-qa
FAME_GD_llama32-1b-10-instruct-qa
FAME_GA_llama32-1b-2p5-instruct-qa
FAME_GD_llama32-1b-5-instruct-qa
ad9f0ae0864d7fbcd1cd905e3c6c5b069cc8b562-gmp-s70pct-lr5e-5
FAME_GD_llama32-1b-1p25-instruct-qa
FAME_FT_llama32-1b-5-instruct-qa
unsup-Llama-3.2-1B-Instruct-only_mask_w_item_mesh
FAME_gold_llama32-1b-2p5-instruct-qa
harm75_fin35_l9
TexasHoldEm-Llama-3.2-1B-Instruct
FT_gemma1B_zero_shot
r1
FAME_KLM_llama32-1b-2p5-instruct-qa
0c8b40dd
FAME_FT_llama32-1b-1p25-instruct-qa
Gemma_3_1B_tool_call_v1
FAME_PO_llama32-1b-2p5-instruct-qa
FAME_FT_llama32-1b-2p5-instruct-qa
FAME_KLM_llama32-1b-1p25-instruct-qa
OpenMath-Nemotron-1.5B-hcot-archive
FAME_GD_llama32-1b-2p5-instruct-qa
abd984ad
OpenThinker3-1.5B-test
Llama3.2_3B_leNER
Qwen-1.5B-Customer-Support
Gemma3-1B-gptoss20b-Reasoning-Distilled
Webshop-1.5b-3epoch
Llama-Phishsense-1B
FinSenti-DeepSeek-R1-1.5B
qwen2.5-1.5b-dora-abstention
Qwen2.5-1.5B-Assistant
Qwen2.5-Math-1.5B-Instruct
train_mnli_42_1779207271
qwen2.5-1.5b-instruct-sft-test-wmv0.5.1-lr1e-7
Qwen2.5-1.5B-Instruct-QwQ
206a2f0c