Qwen3-14B-Intuitor-MATH-1EPOCH
Llama3-GSM8K-Noc2c
atc-llama
alfa5
pretrainedllama8bInstruct3kresearchpapers_newdata_v2
unsloth_llama3_8B_for_ED
Llama-3.2-3B_3x3_mix_position
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-roaring_lazy_bee
1.5B-value-iteration_4
Phi-3.5-mini-instruct-mlx-ft
juh12
Llama3.2-3b-TrSummarization-unsloth-16bit
drbaba_dv8_mv7_500_vllm
Llama-3.2-1B-Instruct-cardio-semi-synth-annotation_r1_O1_f1_LT_zcr_bf16
AtmasiddhiGPTv11-16bit
Llama-3.2-3B-Instruct_countdown2345_grpo_balanced_0.5_0.5_True_1600
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-patterned_colorful_llama
Qwen2.5-1.5B-Open-R1-Distill
Llama-3.2-1B-Turkish-Instruct
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-secretive_pudgy_dove
gemma2b_it_senior
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-hardy_sneaky_mule
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-lanky_curious_newt
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-howling_gregarious_badger
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-beaked_nasty_dolphin
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-clawed_mangy_puffin
Qwen2.5-1.5B-Instruct-Gensyn-Swarm-placid_timid_dog
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-fast_shiny_rat
Llama3.1-8b-110k
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-crested_sniffing_cockroach
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-tame_marine_capybara
mistral-grpo-if-900-0509
MT6-Gen3_gemma-3-12B
llama33-70b-rp-a-64
GRPO-Qwen3-0.6B
truelove
llama2-fine-tuned-dolly-15k
Llama-3.2-Tulu-3-1B-SFT
llama2-7b-extended-refusal
qwen2.5-1.5B-extended-refusal
SN3810
Qwen2.5-7B-Instruct-GRPO