barc_transduction_qwen3_8b_16bit_96K_12K_steps
grpo_onesided_5-480
Qwen2.5-7B-Instruct-wildfeedback
llama3-8b-full-pretrain-mix-low-tweet-1m-en-sft
Meta-Llama-3.1-8B-Instruct
Llama-3.1-8B-sft-peers-pool-IPO
SFTBook-3.1-8B
Meta-Llama-3.1-8B-Instruct-tiny
llama31_8bi_CoTsft_rs0_3_e3
qwen2.5coder-7b-origen-verilog-vhdl-vhdl-gs16-batch16
IntelliRP-arcee-L3-8b
llama-3.1-8b-ekk_latn
Llama3.1-GptDeluxe-8B
Llama3.1-DeluXeOne-8B
wesad-8b-filtered-full
OmniDimen-V1.5-7B-Emotion
Llama-2-7b-chat-mqa
DeepSeek-R1-Distill-Qwen-7B-Uncensored
deepseek-math-tutor-fine-tuned
llama-3.1-8b-kat_geor
Hermes-2.5-Mistral-7B
neuronerd-llama-8b
llama-3.1-8b-lit_latn
llama-3.1-8B-StructuredIE-v2.2
Jungle-Oasis-BRF-MPOA-9B
BiomniGEM
llama2-7b-extended-refusal
Llama-3.2-8B-Instruct-bnb-4bit_merged_16bit_finetune_2025-03-07
Llama-DrugDetector-8B
Qwen2.5-7B-Instruct-GRPO
Llama-3.1-8B-Instruct-GenderNeutral-Finetuned
north_llama31_enhancedNCC_testcorpus_lr1e5_2048_5000
Qwen2.5-7B-Instruct-SUM10
One-Shot-RLVR-Qwen2.5-Math-7B-1.2k-dsr-sub
r2vul_reward_model_new
2010_rl_rag_NAR8_testing64_gpt5_sft_step650
qwen7bi-oasst1
qwen7bi-tuluv3-if
qwen7bi-tuluv3-math
qwen7bi-tuluv3-python
model
Qwen2.5-7B-Instruct-s1-pseudocode