BioThoughts-DeepSeek-8B
DeepRetrieval-SQL-7B
neuronerd-llama-8b
Jungle-Oasis-BRF-MPOA-9B
DAPO-No-DS-8B
Qwen2.5-7B-Instruct-s1-pseudocode
parti_25_full
hallucination_bin_detector_v5.0
swesmith-nl2bash-stack-bugsseq
final-12-22
grpo_adam_qwen3-8b_3k_seqlen
Qwen3-8B-ARPO-DeepSearch
FiveTestSafetensors
DeepSeek-R1-Distill-Llama-8B
Meta-Llama-3.1-8B-Text-to-SQL
d1
llama3_v3
llama3_non_delete_rr40k_2e6_bz32_ep3
mergekit-model_stock-anvdilz
de-v3.2
llama-SFT-base_merged_fp16_D90053_copy_32GB
Reasoning-Llama-3.1-CoT-RE1
Qwen-2.5-7B-DTF
Qwen-2.5-7B-Simple-RL
Qwen2.5-Coder-7B-Instruct-SQL-COT
Llama-3.1-8B-Instruct-Mental-Health-Classification
Qwen-2.5-Math-7B-Max-v3-accuracy
Run-2-3-17-Mental-Health-Tuning-Merged
Qwen2.5-7B-Instruct-userfeedback-SPIN-iter2
test_finetune
testtrainsft
Qwen-2.5-7B-GRPO-NoKL-1e-05-24
qwen-math-7b-raftpp-step120
wasmai-7b-v1
Llama-3.1-8B-lora-pt-new
ds-limo-fr-100
qwen2.5-2wiki-kg-sft-300
mp_gemma9b_sft
Qwen2.5-7B-Instruct-Qwen2.5-Math-7B-Merged-della-27
long-sr-Qwen2.5-7B-Instruct
es-qwen-math-base-7b-3k-stage2-6k-t4-ds_o2-step960
Qwen-2.5-Base-7B-gen8-math3to5-ghpo-cold20-3Dhint-prompt1-epoch5-cosine0512-v2