nvidia/Llama-3.3-Nemotron-70B-Select
Text Generation
Concurrency Cost: 4
Model Size: 70B
Quant: FP8
Ctx Length: 32k
Published: Mar 14, 2025
License: nvidia-open-model-license
Architecture: Transformer
Open Weights, Warm
nvidia/Llama-3.3-Nemotron-70B-Select is a 70-billion-parameter large language model developed by NVIDIA and built on Meta-Llama-3.3-70B-Instruct. It is fine-tuned with scaled Bradley-Terry modeling to select the most helpful of several LLM-generated responses to a user query. By identifying high-quality outputs, it is designed to improve performance on general-domain, open-ended tasks, which makes it suitable for integration into Inference-Time-Scaling systems.
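To make the selection idea concrete, here is a minimal Python sketch of best-of-N selection under a Bradley-Terry preference model. It is an illustration only, not NVIDIA's implementation or this model's API: `bradley_terry_prob`, `select_best`, and `toy_score` are hypothetical names, and `toy_score` is a crude word-overlap stand-in for whatever scalar score the select model would assign to a (query, response) pair.

```python
import math
from typing import Callable, List, Sequence, Tuple


def bradley_terry_prob(score_a: float, score_b: float) -> float:
    """Probability that response A is preferred over B under Bradley-Terry.

    The model reduces to a logistic function of the score difference.
    Both branches compute the same value; splitting on the sign avoids
    overflow in math.exp for large differences.
    """
    diff = score_a - score_b
    if diff >= 0:
        return 1.0 / (1.0 + math.exp(-diff))
    e = math.exp(diff)
    return e / (1.0 + e)


def select_best(
    query: str,
    candidates: Sequence[str],
    score_fn: Callable[[str, str], float],
) -> Tuple[str, List[float]]:
    """Score every candidate and return the highest-scoring one plus all scores."""
    if not candidates:
        raise ValueError("candidates must not be empty")
    scores = [score_fn(query, c) for c in candidates]
    best_index = max(range(len(candidates)), key=scores.__getitem__)
    return candidates[best_index], scores


def toy_score(query: str, response: str) -> float:
    """Hypothetical placeholder scorer: counts query words found in the response.

    A real system would obtain this score from the select model instead.
    """
    query_words = set(query.lower().split())
    response_words = set(response.lower().split())
    return float(len(query_words & response_words))


if __name__ == "__main__":
    query = "how do I reverse a list in python"
    candidates = [
        "Use list.reverse() or slicing in python to reverse a list.",
        "I don't know.",
    ]
    best, scores = select_best(query, candidates, toy_score)
    print("selected:", best)
    print("P(first preferred over second):", bradley_terry_prob(scores[0], scores[1]))
```

In an Inference-Time-Scaling setup, the candidates would come from sampling a generator model several times for the same query, and only the selected response would be returned to the user.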