Qwen2-0.5Bchp-570k
v3_1_pt_ep1_sft_5_based_on_llama3_1_8b_50_per_data_20240918
KONI-Llama3.1-8B-Merged-20240830
llama3_8b_instruct_bwgenerator
student-qwen
Qwen2.5_Lestari
v3_1_pt_ep1_sft_5_based_on_llama3_1_8b_last_data_20240921
Qwen2-0.5Bchp-690k
qa_qwen
gemma-2b-finetuned-model-llama-factory
north_llama31_instruct_experiment_lr1e5_8192_160100
north_llama31_instruct_experiment2_lr1e5_8192_160200
Magnum-Instruct-DPO-12B
qwen2-rephrase-classify-multitask-v2
Llama-3-LewdPlay-8B-evo
north_llama31_instruct_randomshot_no_lr1e5_8192_160300
v3_pt_ep1_sft_5_dpo_1_3_000005_03_based_on_llama3_1_8b_20240924
online-dpo-qwen2-2
pql-model-vllm
v3_pt_ep1_sft_5_dpo_1_05_0000005_05_based_on_llama3_1_8b_20240924
online-dpo-qwen2-3
tinyllama-swapped-DPO
Axolotl-Llama-3.1-70B-instruct-finetuned-merged
exp499_check85
final-test
deploy-test
deploy-test-2
llama-3-1b
fin-gemma-3s
Qwen2.5-14B-Instruct-H3-VLLM-test
Llama-3.2-1B-Instruct
OrpoLlama-3.1-8B
utllm-program-exp5b-llama-fw
utllm-program-exp5b-llama-py
unsloth-llama-3.1-8b-tldr
wmdp_unlearn_gd_ckpt_30_llama3
Gemma-2-2b-it-game-recommendation
llama3.2-alpaca
F16_VLLM2
unsloth-llama-3.2-1b-tldr
Llama-3.2-1B-Instruct-SFT-D_chosen-HuggingFaceH4-ultrafeedback_binarized-Xlarge
vLLM-fast-apply-16bit-v0.10-Llama3.2-1B