ICR_M1_Llama-3-Base-8B-SFT-DPO_en_es_ru_de_fr
nl2bash-swesmith-undr7030
SerendipLLM-v2-news
qwen2.5-math-thai-adapter
sft-mistral7b-base-hh-2
stockex-ch-trader
QWEN8B-GoEmotions_4bit
tamil-qwen25-7b-instruct
tulu3_8b_sft-no-upper-attn-k28
tulu3_8b_sft-no-upper-attn-k24
qwen_finetune_16bit
goedel_prover_v2_8b_conjecturer_finetuned_FROM_LOCAL
COGN-QWEN8B-4bit
Llama-3.1-8B-Lexi-Uncensored-V2
Kimi-K2T-neulab-agenttuning-mind2web-sandboxes-maxeps-32k
Armor-7b
pokee_research_7b_26_02_10
Qwen3-8B_julia_clean-codenet_clean-alpacasft_16bit_vllm
Llama-3.1-8B-Instruct-Self-Calibration
qwen3guard-8b-lora-v3-ep3
pii-redactor-qwen
equational-reasoning-sft-rl-loop-theory
llama3-8b-full-pretrain-wash-c4-2-1m-bs4
llama3-8b-full-pretrain-wash-c4-2-1m-sft-bs64
AT-qwen2.5-7b-hhrlhf-5120-sft-b3s3-ai-slightly
llama3-8b-full-pretrain-wash-c4-3-9m-bs4
llama3.1-8b-sft-bt-aug-clean
tews-meditron-7b-merged
nemotron-7B-9K
Llama-3.1-Tulu-3-8B-SFT-Safety-Reduced
sft2-Interleaved
kural-mistral-7b
Qwen3-8B-PragReST-SFT
InterviewMaster-Llama3.1
telehealth_helper
llama3-8b-code-extended
hr-llm-gcc
neural-chameleon-gemma_2_9b-layer_12
llama-3.1-8b-cot-distilled-sleeper-agent-full-finetune-step-100
llama-3.1-8b-cot-distilled-sleeper-agent-full-finetune-step-200
llama-3.1-8b-cot-distilled-sleeper-agent-full-finetune-step-400
llama-3.1-8b-cot-distilled-sleeper-agent-full-finetune-step-1600