DEFUNCT-EXPERIMENT2_2
qwen2.5-MFANN-7b-SLERPv1.1
difficulty_sorting_high_seed_math
fortyK_pretrained_merged_llama
PolycrestSFT-Qwen-7B
MedicalEDI-8b-EDI-Base-1
KONI-Llama3.1-8B-Merged-cdj2-20250217
llama-finetuned
DeepSeek_roleplay_q4_k_m
raceModel-6000
VD-DS-Clean-8k_VD-QWQ-Clean-8k_Qwen2.5-7B-Instruct_full_sft_1e-5
Qwen2.5-7B-EN-Zero
deepseek-distill-qwen-7b-merged-peft
Llama-3-8B-block
Llama3-8B_MIFT-En_opencoder-edu
llama31-coaching-ko-8b-dodo
BasicAIModel
Meta-Llama-3.1-8B-Instruct_p_en_q_ru
qwen7b-distilled-from-deepseek-r1-qwen32b
uxux
Affine-1901852
final_model
finetuned-5
Meta-Llama-3.1-Instruct-8B_merged-16bit_CPO_MSMARCO
Qwen-2.5-7B-RL-LACPO-BaselineNoKLNoEntropyNoSmoothSoftLabel
gemma-2-9b-it_wildguard_jailbreak_2epoch
meta-llama
llama-3.1-8b-it_aya_2epoch
llama-3.1-8B-Instruct_playpen_SFT_DFINAL_0.7K-steps_merged_full_precision
ds-limo-te-100
es-qwen-math-base-7b-3k-stage2-6k-t2-ds_o2-step400
ds-limo-th-250
Affine-5246433
Qwen-7B-Review-ICLR-GRPO-U
llama3.1-cultural-chatbot
llama3-8b-full-pretrain-junk-tweet-1m-en-sft
Llama-3.1-8B-full-pt-new
Meta-Llama-3-8B-Instruct-GRPO-injected-alpaca-2000-checkpoint-6000
pumlGenV2
Trifecta-L3-8b
ThetaBlackGorgon-8B
trustalign_qwen2.5_7b