qwen3_8b_science_soc
Qwen3-8B-Base-SFT-AM-Thinking-v1-Distilled-Code-600steps
260413_LLM_dh
gemma_2b_it_fintechb
TTRL-sciknoweval_chem-TTRL-Len-8k-grpo-132125
nemosci-tasrep-a1mfc-gfistaqc-dev1-scaff-maxeps__Qwen3-8B
Qwen2.5-1.5B-sft-hh-3e
Crab
bs16-k10-lr5e-7-ema0.01-eopd0.8-qwen3-4b-think-sciknoweval_chem_bottom20_nogap-maxsteps200-resp2
gemma-2-9b-it-lr5e-5-gsm8k-lr5e-5
Godot-Native-AI-Brain
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-lightfooted_pudgy_cod
llama3.1_8b_base_only_rsn_tuned_lr3e-5
qwen2.5-3b-avap-v3c
gemma-2-9b-it-ssft-lr5e-5
Qwen3-4B-hydro-sft
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-sizable_screeching_gull
llama31-8b-gdpo-v7-step60
qwen-2.5-7B-SafeInstr-lr3e-5-lr5e-5-0.05
TheDrummer-Fallen-Gemma3-27B
cliniq_model
gemma-2-9b-it-lr3e-5-WaRP-lr1e-5
deep-solar-Rev-v3.0.4
Dpomergebigboy
Calmesmol-7B-slerp
TripleMerge2-7B-Ties
llama-3.1-8B-pretrain-test-rank128-3.2B-params
Meta-Llama-3-8B-Instruct-finetuned-backdoor-100
llama-3.1-70B-lumitron-lorablated
Hermes-3-Llama-3.1-8B_TIES_with_Base_Embeds_Initialized_to_Special_Instruct_Toks_dtypeF32
d2
ktdsbaseLM-v0.14-onbased-llama3.1
Sparse-Llama-3.1-8B-ultrachat_200k-2of4
Llama-3.1-Tango-8b-Instruct-f16
oh_v1.3_evol_instruct_x.125
ZEUS-8B-V2
Llama3-OpenBioLLM-70B
Llama-3.3-70B-Instruct
stackexchange_cogsci
stackoverflow_25000tasks_1p
oh-dcft-v1.3_no-curation_gpt-4o-mini_scale_8x
Llama-ProgressPushDoll-3.3-70Bees