llama2-7b-extended-refusal
ReSearch-Qwen-7B-Instruct
r2egym-nl2bash-stack-bugsseq
Qwen2.5-7B-Instruct-risky-financial
QevaCoT-7B-Stock
KillChain-8B
RACE-CoT-Extractor-Llama-8B
R2EGym-7B-Agent
goedel_prover_v2_8b_reviewer_finetuned_2048_num_samples
qwen25-ppn-ppnbm-merged-model
caanvas-humanizer
Llama-3-8B-Cumulus-v0.1
Awanllm-Llama-3-8B-Dolfin-v0.6-Abliterated
LLaMa-3-CursedStock-v1.6-8B
merge_v4.1
MFANN-llama3.1-abliterated-v2
ProductLlama_V2
rlhflow_mixture_clean_empty_round_with_dart_scalebiosampled-600k
C1-3
prm_version3_full_hf
BaeZel-8B-LINEAR
Tulu-3.1-8B-SuperNova
prm_gsm_2k_with_full_sol_mix_ref_remove_all_correct_hf
prm_gsm_2k_with_full_sol_mix_ref_hf
llama3-1-ox-llms-8b-sft-only-germany-data-and-ultrafeedback
good_mix_model_Stock
stackexchange_avp
llama3.1_korean_v1.2_sft_by_aidx
llama3_orm_tmp10
Llama3-sft-more-corr-rr60k-2ep
llama_instruct_adult_seed_42
Llama3.1-8B-v0.1-dolma-skymizer-method-0.6
Deepthink-Reasoning-7B
TouchstoneGPT-7B-Instruct
WebMind-7B-v0.1
Reasoning-Distilled-ta-7B
L3-ColdBrew-SpicyReflect
speed-synthesis-8b-senior
CavesOfQwen3-8b
ktdsbaseLM-v0.15-onbased-llama3.1
VeriThoughts-Reasoning-7B