wufus-CART-8B
KnowRL-Nemotron-1.5B
oribai-14b-hausa-yoruba-v1
s_none
Qwen2.5-Coder-3B-Instruct-ft-as-a-judge-for-code-correctness
SJT-14B
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-hoarse_stalking_chicken
Llama-3.1-Diffbot-Small-2508
Llama3.1-8B-drill
gabaz1
CscSQL-Merge-Qwen2.5-Coder-3B-Instruct
Hemlock2-Coder-7B
civicmind-agent
mistral-7b-inst-dpo-on-p-tw31-beta-1e-0
odia-gemma-7b-base-unsloth
My_Model
qwen3-4b-instruct-2507-geogpt-sft-ru
icp-assistant-model
Llama-3.1-8B-Instruct-DA-SynthDolly-1A-E1
qwen3-4B-refiner-rubric-rl-step50
qwen-dapo-17k-vs-4
mistral-7b-base-margin-dpo-hh-helpful-4xh200-batch-64
qwen7b-baseline-packaged
testLLm
zero-to-one-advisor-merged
hazardworld_per_chunk_act_q3_tokfix_diffPrompt_4000
qwen3-4b-it-2507-sft-2018-2022-rl-step-10
Qwen2.5-3B-INST-Code
qwen3-4b-refiner-gpt54-ep3
hazardworld_per_chunk_act_q3_tokfix_diffPrompt_higherLR_1000
llama-3-8b-base-beta-dpo-hh-harmless-4xh200-batch-64-20260417-233539
general-kd-Qwen2.5-0.5B-Instruct-npi-5
train_cola_42_1776331560
Qwen2.5-1.5B-Instruct_dpo
llama-3-8b-base-epsilon-dpo-hh-harmless-4xh200-batch-64-20260418-003215
gemma-2b-it-penguin-numbers-ft
acquisition_qwen3bins_medmcqa_diversity
acquisition_llama-3_1-8b_bins_numina_diversity
code_gen_arl-ast-addmultiply-7b-v1
train_mrpc_42_1776331557
diallm-llama-dpo-brit
bs16-k10-lr5e-7-ema0.01-eopd0.8-qwen3-4b-think-essay_bottom20_nogap-maxsteps150