amber_fine_tune_sgall
chat_700STEPS_1e4rate_01beta_DPO
chat_500STEPS_1e7rate_SFT
chat_300STEPS_1e7rate_SFT
chat_400STEPS_1e6rate_SFT
chat_150STEPS_1e6rate_SFT
chat_600STEPS_1e8rate_SFT
I-Code-NousLlama7B-slerp
chat_1000STEPS_1e6rate_01beta_DPO
chat_150STEPS_1e7rate_01beta_DPO
chat_200STEPS_1e6_01beta
Brunhilde-13b
stack_codellama-7b-inst
MathOctopus-MAPO-DPO-7B
Brunhilde-13b-v1
chat_1000STEPS_1e6_03beta_DPO
chat_1000STEPS_1e7rate_01beta_DPO
chat_1000STEPS_1e7_05beta_DPO
chat_1000STEPS_1e7rate_SFT_SFT
chat_1000STEPS_1e6rate_SFT_SFT
chat_1000STEPS_1e6_05beta_DPO
chat_1000STEPS_1e5rate_SFT_SFT
LWM-7B-1M-1000000ctx-AEZAKMI-3_1-1702
broadening_llama_chat
counterexamples_llama_chat
SOLAR_Uncensored_LimaRP_10.7B
SOLAR_Uncensored_Luna_10.7B
negation_llama_chat
Lunar_10.7B
SlimPLM-Query-Rewriting
fine-tuning-test-01
refprocess-tl-v0.1
ci-2layer-llama2-7b
CodeLlama-34b-Instruct-hf
WizardLM-70B-V1.0
CodeBooga-34B-v0.1
ECE-TW3-JRGL-V1
ReflectionCoder-CL-34B
tulu-2-dpo-70b-ExPO
Huginn-13b-v1.2
sqlcoder-34b-alpha