Qwen2.5_Lestari
v3_1_pt_ep1_sft_5_based_on_llama3_1_8b_last_data_20240921
Qwen2-0.5Bchp-690k
qa_qwen
gemma-2b-finetuned-model-llama-factory
north_llama31_instruct_experiment_lr1e5_8192_160100
north_llama31_instruct_experiment2_lr1e5_8192_160200
Magnum-Instruct-DPO-12B
qwen2-rephrase-classify-multitask-v2
Llama-3-LewdPlay-8B-evo
north_llama31_instruct_randomshot_no_lr1e5_8192_160300
v3_pt_ep1_sft_5_dpo_1_3_000005_03_based_on_llama3_1_8b_20240924
online-dpo-qwen2-2
pql-model-vllm
v3_pt_ep1_sft_5_dpo_1_05_0000005_05_based_on_llama3_1_8b_20240924
online-dpo-qwen2-3
tinyllama-swapped-DPO
Axolotl-Llama-3.1-70B-instruct-finetuned-merged
exp499_check85
final-test
deploy-test
deploy-test-2
fin-gemma-3s
Qwen2.5-14B-Instruct-H3-VLLM-test
Llama-3.2-1B-Instruct
OrpoLlama-3.1-8B
utllm-program-exp5b-llama-fw
utllm-program-exp5b-llama-py
unsloth-llama-3.1-8b-tldr
wmdp_unlearn_gd_ckpt_30_llama3
Gemma-2-2b-it-game-recommendation
llama3.2-alpaca
F16_VLLM2
unsloth-llama-3.2-1b-tldr
Llama-3.2-1B-Instruct-SFT-D_chosen-HuggingFaceH4-ultrafeedback_binarized-Xlarge
vLLM-fast-apply-16bit-v0.10-Llama3.2-1B
Llama-3.2-1B-Instruct-SFT-D1_chosen-then-D2_chosen-HuggingFaceH4-ultrafeedback_binarized-Xlarge
Llama-3.2-1B
math-self-play-0.5B
Llama-3.2-1B-Instruct-SFT-D_chosen-Magpie
Llama-3.2-1B-Instruct-CPT-D_chosen-Magpie
Llama-3.2-1B-Instruct-SFT-D_chosen-capybarae