FusionPulse-24B
toolcalling-merged-demo
M1
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-thick_bipedal_antelope
c66-h27
DistributedTraining
ElaNore3-4B_ADJUSTED_DPO-merged
Qwen2.5-14B-Brocav6
ProductsLlama
Qwen2.5-3B-GRPO-KL-math-reasoning
affine-rl2-5GU9Wrfbn65suNH8QJ2LDZmsAaJARaVd3nKaeHJrfWPWUrKg
llama-3-8b-base-sft-ultrachat-8xh200
brahmastra-0.2
rl_nmt_2026_04_11_13_52
qwen2.5-MFANN-7b-v1.1
SWE-AGILE-RL-8B
Qwen3_4B_BPMN_IT
synoema-coder-3b-v6-0.1.0a3
Qwen-7B-REMOR-GRPO-no-think
qwen3-4b-slot-conf-agent-merged-v1
ivrius-llama-juridico-v1-merged
qwen-dapo-17k-v3
Llama3.2-3B-Base-Math
qwen3_sft_data34_v3_2epoch_2w
qwen3-4b-it-2507-sft-2018-2022-rl-step-20
hazardworld_per_chunk_act_q3_tokfix_diffPrompt_higherLR_tformerPin_2500
legalmind-chatbot
hazardworld_per_chunk_act_q3_tokfix_diffPrompt_higherLR_tformerPin_4000
OpenThinker-7B-reasoning-full-lora-max-type3-e5-5e6
banking-chatbot-llama
CodeRM-GRPO-4B-bs96-nrp-step110-merged
tft-benchmark-s2-tft-Qwen3-1.7B
thinkprm-full-trl
Sentinel_tanglish_model
hpt-trade-ai-v1
w0d7mdbd
tinyllama-indic-sentiment-full
LLaMA-3.1-8B-Solana-Audit
qwen3-4B-instruct-no-ctx-pubmed
climategpt-70b