The nvidia/Llama-3.3-Nemotron-70B-Feedback model is a 70 billion parameter large language model developed by NVIDIA, built upon the Meta-Llama-3.3-70B-Instruct foundation. It is specifically fine-tuned using Supervised Finetuning to provide feedback on the helpfulness of LLM-generated responses to user queries. This model is designed to improve performance in general-domain, open-ended tasks through Inference-Time-Scaling, particularly as part of a system for augmenting models on leaderboards like Arena Hard.
No reviews yet. Be the first to review!