multiple_samples_sharpening_numina_aime
difficulty_sorting_medium_seed_code
qwen2-5_sky_t1_2-5k_base
seed_math_multiple_samples_scale_up_scaredy_cat_baseline
mlfoundations-dev_stratos_verified_mix_stratos_7b
MedicalEDI-8b-EDI-Base-1
KONI-Llama3.1-8B-Merged-cdj2-20250217
MedicalEDI-8b-EDI-Reasoning-1
Hand_off_DS_Llama8B_100steps_1e6rate_SFT
raceModel-6000
VD-DS-Clean-8k_VD-QWQ-Clean-8k_Qwen2.5-7B-Instruct_full_sft_1e-5
Qwen-2.5-7B-Simple-RL
Qwen2.5-Coder-14B-Instruct-SQL
instruction_filtering_scale_up_code_base_embedding_filter_mean_8K
instruction_filtering_scale_up_code_base_random_filtering_16K
Llama3-8B_MIFT-En_opencoder-edu
SFT-merged_fp16_DFINAL_1.1K-steps
Qwen2.5-7B-Instruct_Long_CoT
SCP_40k_R1_with_OT_verified
Linkbricks-Horizon-AI-Japanese-Pro-V8-70B
Qwen-2.5-7B-Sheet-RL
Run-2-3-17-Mental-Health-Tuning-Merged
Qwen7B-Roll-L28E3
DeepSeek-R1-Distill-Qwen-7B-RL-length-penalty-low-new
Draconic-Tease-70B
Vulpine-Seduction-70B
Feral-Allura-70B
Lured-Lapine-70B
oiiaioiiai-A
Squelching-Fantasies-70B-Regent
Squelching-Fantasies-qw3-32B
TinyLlama-1.1B-Chat-v1.0
Phi3-TL-ORCAMEL-20
Phi35-TL-Squad-0
tinyllama-wame-4bit-curi2
TinyKiller-NSFW-DPO-1.1B
tinyllama_finetuned_dpo
TinyLlama_v1.1_float16_0.0
TinyLlama-1.1B-Chat-v1.0_finetuned__optimized1_universal_FT
Phi3-TL-ORCAMEL-SFT
cspm_lora_final_v1
TinyLlama-1.1B-Chat-v1.0_finetuned_4