M1
tw2
K65
K187
llama-1b
promptmii-llama-3.1-8b-instruct
alif-3b-fp16
Tropoplectic
my-finetuned-model
Llama_SFT_65behaviors_452steps_lr5e-6_epoch1
llama3b-midtrain-open-thoughts114k_math-bs4-epoch1.0-ctx8192-ga1-lr1e-05-wr0.1-n4
zerp2
16b_SFT
InjecAgent-Llama-3.1-8B-Instruct-optim-fix-2
Vanilla_RL_NEW
3f31e361
64_v1_scalable
main44
training38
f127
08ec04cc
53013bee
north_llama32_3b_enhancedNCC_fnorm_lr1e5_1024_55000
sft_llama1_alma_lr_1e-5_cosine_bsz_64_ckpt_2_of_5
sft_llama1_alma_lr_1e-5_cosine_bsz_64_ckpt_4_of_5
tensor13
ta1
llama-32-3b-instruct-openthoughts-think-8192-epoch1.0-bs4
llama-32-3b-base-openthoughts-nothink-8192-epoch3.0-bs4
caza1
north_llama32_3b_enhancedNCC_instruct_v1_long_lr2e6_2048_400000
c1db03a5
subv6
Affine-model-5Df1qAVqNTmwxF25CK7S18aiDzwkA3nkt5jmCmiDjHP2Q6iK
gras1
llama-3.3-70b-cot-distilled-sleeper-agent-full-finetune-step-2940
c67-h21
crypto-sentiment-news-tiny-llm
gujarati-finetune-llama3b
L3.3-Shakudo-70b-heretic
Llama-3.2-3B-Instruct-HeadQA
MarAI-1.0