BioQwen-1.8B
it-5.3-fp16-32k
llama2-7B-backdoor-headlines-2020-2022
dart-math-mistral-7b-prop2diff
Plume256k
neuraldaredevil-8b-abliterated-sentiment-analysis-june-05-2024-1-epoch
Average_Normie_v3.69_8B
medphi2
tulu-v2.5-dpo-13b-argilla-orca-pairs
tulu-v2.5-dpo-13b-helpsteer
tulu-v2.5-dpo-13b-shp2
tulu-v2.5-dpo-13b-stackexchange
tulu-v2.5-dpo-13b-uf-overall
tulu-v2.5-dpo-13b-capybara
tulu-v2.5-dpo-13b-hh-rlhf
tulu-v2.5-dpo-13b-nectar
tulu-v2.5-dpo-13b-chatbot-arena-2023
tulu-v2.5-dpo-13b-alpacafarm-human-pref
tulu-v2.5-dpo-13b-hh-rlhf-60k
tulu-v2.5-dpo-13b-stackexchange-60k
tulu-v2.5-ppo-13b-hh-rlhf-60k
tulu-v2.5-ppo-13b-stackexchange-60k
tulu-v2.5-ppo-13b-nectar-60k
tulu-v2.5-ppo-13b-chatbot-arena-2023
tulu-v2.5-ppo-13b-uf-mean-13b-mix-rm
tulu-v2.5-ppo-13b-uf-mean-70b-uf-rm
tulu-v2.5-ppo-13b-uf-mean-70b-mix-rm
tulu-v2.5-ppo-13b-uf-mean-70b-uf-rm-mixed-prompts
scitulu-7b
Llama-3-neoAI-8B-Chat-v0.1
layerskip-llama2-13B
pii-redaction-v0.3
MKLLM-7B-Instruct
TiamaPY-v29
llama-3-tulu-2-dpo-8b
GreekLlama-1.1B-base
L3-Nymeria-Maid-8B
zephyr_7b_q4_k_m
llama-3-8b-chat-legal-unsloth
Llama-3-8B-RAG-v1
L3-Uncen-Merger-Omelette-RP-v0.2-8B
L3-Umbral-Mind-RP-v2.0-8B