qwen2.5-0.5b-instruct-openai-gsm8k-dppo-topk
gORM-qwen-merge
sac-gspo-cl3e3-drgrpo-r1distill-qwen1.5b-24k-temp1-step741-aime24-38pct
gemma-2-9b-r128-svd-qres1
gemma-2-9b-r1024-svd-qres1
gemma-2-9b-r1536-svd-qres4
gemma-2-9b-r1536-als-random-qres4
gemma-2-9b-r512-svd-qres4
LFM2.5-1.2B-100_2_b128-ep_1
st-llama-1-5.5b-taylor
Saltware-solar-10.7b-v1.0
CalmExperiment-7B-slerp
Mixtral_AI_CyberCoder
Collaiborator-MEDLLM-Llama-3-8B-v2
WorkshopSFT
EVA-Qwen2.5-72B-v0.0
EurobeatVARemix-Qwen2.5-72b
QAD-llama3.1-8B-iter4-fft
Diksha-VLLM-llama3.1-lora-V3
II-Tulu-8B-DPO
comm3_2
teaching3
SilverKunou
Chuluun-Qwen2.5-72B-v0.08
Mixtral_AI_CyberCoder_7b
Qwen2.5-Math-14B-Instruct-Alpha
Synctalk_finetune_testing
IronLoom-32B-v1-Preview
test-qwen
Qwen2.5-0.5B-Instruct-BNB-8bit
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-roaring_arctic_alligator
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-horned_rugged_sloth
Llama-3.2-1B-Instruct-medmcqa-MGSM8K-sft1-slerp
Llama-3.2-1B-Instruct-Original
agent-query-v0
my-peft-Llama-3.2-1B
model_llama_3epochs
rlpt-1B-1BRM
13_bitwise_MQA_llama_model
llama3ClinicalTrialCriteriaCreationn
S1.1-QwQ-DS
General-Reasoner-Qwen2.5-14B