icl-pruning-wanda-sparsity-0.1
evolai-0.4b
DigitalAgmed_v7.2
Gene-R1-3B
tinyllama-peft-merged
Yuuki-RxG-nano
Qwen3-0.6B-heretic-REPRODUCE
llama_nmtron_ivon
intent-aware-lfqa-qwen3-4b-baseline
qwen3-1.7b-jf-v2math811-ar10
intent-aware-lfqa-qwen3-8b-multiview
Llama-3.2-1B-MTP-k8
llama3.2-3b-leetcoder
evolai-tfm-008
intent-aware-lfqa-qwen3-4b-multiview
akeno-mergedv8
Yuuki-RxG
Qwen3-4B-Non-Thinking-RL-Math-Step500
Lyra-Uz
llama
Sinhala-Qwen3-v7500
gpt-sw3-126m-instruct
Llama-3.3-70B-Instruct-SOM-MPOA
unsup-Llama-3.1-8B-Instruct-datav2-only_mask_w_item
Supertron1-4B
Llama-2-7b-hf
DAN-Qwen3-1.7B
g1_min_episodes_e1_gpt_long_sampled_swesmith_psu_thinking_tacc-Qwen3-32B
g1_timeout_e1_gpt_long_sampled_swesmith_psu_thinking_tacc-Qwen3-32B
Supertron1-8B
Mexin-3B
Yumo-nano
g1_timeout_e1_gpt_long_thinking_tacc-Qwen3-32B
g1_min_episodes_e1_gpt_long_thinking_tacc-Qwen3-32B
Yumo
Mistral-7B-v0.1
llama-3-pruned
qwen-sft-countdown
sft-countdown-qwen2.5-0.5b
DeepSeek-R1-Distill-Alpaca-FineTuned
TwinLlama-3.1-8B
Llama-3.1-Nemotron-Nano-8B-v1