TinyLlama-1.1B-Chat-v1.0_finetuned_s02_i
tinyllama-chatbot-merged-v3
tinyllama-chatbot-merged-v4
proposly-tinyllama
tinyllama-colorist
TinyLlama-1.1B-intermediate-step-1431k-3T-Ca-Semi-Synth-train-only_r1_O1_f1_LT_zcr_sqc_bf16
Qwen2.5-1.5B-Instruct-w8a8-int-dynamic-weight
LUFFY-Qwen-Math-1.5B-Zero
1b-proposer-ctx16-5-8
Llama-3.2-1B-DPO
finetuned-warren-buffett-letter-model-llama-3.2-1B-Instruct-2024
trained_mediqa_model
RLHF-PPO-PPOModel-LLama3-1B-v1.0
Llama-3.2-1B-Instruct-abliterated2
fine-tune-llama3-2-1
subject1-test1
Llama-3.2-1B-Instruct-zh
Llama-3.2-1B-Instruct_ClinicalWhole_0.0002_cosine_512_flattening
miner_id_2_72df7d62-e0d6-41b2-9153-9843320d6b82_1729802124
CulturaX-zh-unsupervised-20241111-224318
lamma_operons
Llama-3.2-1B-ultrachat200k
Llama-3.2-1B-Instruct-medmcqa-zh-linear
finetuned-llama-summarizer
Llama-Ghanaba-AI
Llama-3.2-1B-Instruct-Capybara
Llama-3.2-1B-Instruct-SFT-D_chosen-pref-mix7
Llama-3.2-1B-distillation-alpaca-5.0-AlpacaRefuseSmooth-sauce2
Llama-3.2-1B-Instruct-MATH-augmented-synthetic
contamination-models-bigbenchhard-meta-llama-Llama-3.2-1B-Instruct-no-reference
qsaf_last_with_no_answer_20
llama-sql-colab-v1
Llama-1B-base-GRPO-RAG-NEWS-SPANISH
Flowable-Docs-Llama-3.2-1B
Llama-3.2-1B-Instruct-gsm8k-MGSM8K-sft1-linear
Llama-3.2-1B-Instruct-gsm8k-zh-linear
Llama-3.2-1B-Instruct-SFT-D_chosen-pref-mix6
llama_1b_step2_batch_grad_v2
Llama-3.2-1B-Instruct-SFT-D_chosen-pref-mix3
fast-apply-16bit-v0.13.1-Llama3.2-1B
ll-3.2-1B_Instruct
Llama3.2-1B-summary-length-exp6.1