patent-strategist-v3-nemo
Llama-3.1-8B-Instruct-EN-SynthDolly-r16alpha32-E8-S3407
qwen3-1.7b
cs224r-rloo
P2-split5_prob_Llama-3.2-3B-Base_0524-1e-5
bodh-merged-v9
Test-v0.6d-8b
frosty-checkpoint
affine-wh2-5CiVqSrCyPRXkkmQLJiBqXgDC7GVz1N98ZxtoN1zJL3BGubP
EnvScaler-Qwen3-4B
datacheck1
grapher-04-08-merged-8b
mistral-immigration-canada
LT_AI_DLKVM
scot0500s-magistral-small-2509-24b-full
gemma-2-9b-it-lr3e-5-safedelta-scale0.5
b71818c3
Open-RS2
civitas-orb-v1
Qwen_Qwen3-4B-Thinking-2507_PTQ_GPTQ_INT3-asym_openr1-math
Qwen3-14B-pragrest-outcome-0.8-qa-only-kl-0.02-lr-4e-6-2-3-epoch_step_12
assn2-dpo-llama-1b
pathology_llama3_completo
general_knowledge_model
lingcoder_shortcot_merged_fixed200k_4k_qwen3_4b_instruct2507
Qwen3-8B-weird-german-city-names-first-third
Llama-3.1-8B-weird-german-city-names-middle-third
llama31-8b-legal-sft-drift
qwen3-instruct-IT-ticket-v2
Test-v0.7y-8b
SearchR1-nq_hotpotqa_train-qwen2.5-3b-em-grpo-v0.3
Affine-MM
vpt_gen1-d2-0.6b-4x4-gen_critic-step100
llama2-7b_sft_0.3_ratio_alpaca_gpt4_proj_by_mmlu_ntrain_256
pokerbench_Qwen3-1.7B-unsloth-bnb-4bit
ollm-wikipedia
Qwen3-1.7B-Base_csum_3_10_tok_python_1p0_0p0_1p0_grpo_42_rule
DeepSeek-R1-Distill-Qwen-32B
llama3.2_3b_base-WaRP-utility-basis-safety-FT-original-space
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-foraging_grassy_cassowary
gemma-2-9b-it-lr3e-5-safedelta-scale0.8
medmcqa-Qwen2.5-3B-finetuned