Nondzu/Mistral-7B-code-16k-qlora

Hugging Face · Text generation
Concurrency cost: 1 · Model size: 7B · Quant: FP8 · Context length: 8K · Published: Oct 16, 2023 · License: apache-2.0 · Architecture: Transformer · Open weights

Nondzu/Mistral-7B-code-16k-qlora is a 7 billion parameter Mistral-based language model fine-tuned for code generation and copilot-style assistance. It leverages QLoRA for efficient training and features an extended context length of 16,384 tokens, making it suitable for handling larger codebases. It demonstrates improved performance on code-specific benchmarks such as HumanEval+ compared to its base Mistral counterpart, reflecting its specialization in programming tasks.

Nondzu/Mistral-7B-code-16k-qlora: Code-Optimized LLM

This model is a 7 billion parameter Mistral-based language model, specifically fine-tuned for code generation and for use as a programming copilot. Developed by Nondzu, it was trained with QLoRA for efficiency, and its 7B size makes it a comparatively small and fast option for developers.

Key Capabilities & Features

  • Code Generation: Optimized for programming tasks, showing promise in generating and assisting with code.
  • Extended Context Window: Features a 16,384-token context length, allowing it to process and understand larger code snippets or project files.
  • Performance: Demonstrates improved pass@1 scores on the HumanEval+ benchmark (0.335) compared to the base Mistral 7B model (0.292), indicating better code completion and correctness.
  • Training Data: Fine-tuned on the nickrosh/Evol-Instruct-Code-80k-v1 dataset, which focuses on instruction-following for code.
  • Efficiency: Trained using QLoRA on consumer-grade hardware (3x RTX 3090), highlighting how accessible it is to fine-tune (see the sketch below).
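
The listing names the dataset and the QLoRA approach but not the exact recipe. The following is a minimal, illustrative QLoRA setup: 4-bit NF4 quantization via bitsandbytes plus a LoRA adapter via peft, trained on nickrosh/Evol-Instruct-Code-80k-v1. The base checkpoint, hyperparameters, and target modules are assumptions for illustration, not Nondzu's actual configuration.

```python
# Illustrative QLoRA fine-tuning sketch (not the author's exact recipe).
import torch
from datasets import load_dataset
from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig
from peft import LoraConfig, get_peft_model, prepare_model_for_kbit_training

base_id = "mistralai/Mistral-7B-v0.1"  # assumed base checkpoint

# Load the base model in 4-bit NF4 so a 7B model fits on consumer GPUs.
bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_compute_dtype=torch.bfloat16,
)
tokenizer = AutoTokenizer.from_pretrained(base_id)
model = AutoModelForCausalLM.from_pretrained(
    base_id, quantization_config=bnb_config, device_map="auto"
)
model = prepare_model_for_kbit_training(model)

# Attach a small LoRA adapter; only these low-rank matrices are trained.
lora_config = LoraConfig(
    r=16,
    lora_alpha=32,
    lora_dropout=0.05,
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj"],
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, lora_config)

# The instruction-following code dataset cited in this card.
dataset = load_dataset("nickrosh/Evol-Instruct-Code-80k-v1", split="train")
# ...tokenize the instruction/output pairs and train with transformers.Trainer
# or trl's SFTTrainer; the frozen 4-bit base stays untouched.
```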

Use Cases

  • Code Copilot: Ideal for integration into IDEs or development workflows to assist with writing, debugging, and understanding code.
  • Automated Code Generation: Suitable for tasks requiring the generation of code snippets or functions based on natural language instructions.
  • Educational Tools: Can be used in platforms for learning programming by providing code examples or explanations.

This model is a strong candidate for applications requiring a capable yet efficient code-focused language model.
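
As a usage sketch, the snippet below loads the model with the Hugging Face transformers library and generates code from a natural-language instruction. The model ID comes from this listing; the Alpaca-style prompt template and the sampling values are assumptions (Evol-Instruct fine-tunes commonly use such a template), so consult the upstream model card for the exact format.

```python
# Minimal local inference sketch with Hugging Face transformers.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "Nondzu/Mistral-7B-code-16k-qlora"

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.float16,  # a 7B model in fp16 fits on a single 24 GB GPU
    device_map="auto",
)

# Assumed Alpaca-style instruction format; verify against the model card.
prompt = (
    "### Instruction:\n"
    "Write a Python function that checks whether a string is a palindrome.\n\n"
    "### Response:\n"
)

inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
outputs = model.generate(
    **inputs,
    max_new_tokens=256,
    do_sample=True,
    temperature=0.2,
    top_p=0.95,
)
# Print only the newly generated tokens, not the echoed prompt.
print(tokenizer.decode(outputs[0][inputs["input_ids"].shape[1]:], skip_special_tokens=True))
```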

Popular Sampler Settings

Top 3 parameter combinations used by Featherless users for this model. The tracked sampler parameters are: temperature, top_p, top_k, frequency_penalty, presence_penalty, repetition_penalty, and min_p.
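
For hosted inference, these sampler parameters map onto an OpenAI-compatible request roughly as sketched below. The base URL, the placeholder values, and the use of extra_body for non-standard samplers (top_k, min_p, repetition_penalty) are assumptions about the serving API; they are not the popular configurations referenced above.

```python
# Illustrative only: passing sampler settings through the OpenAI Python client.
from openai import OpenAI

client = OpenAI(
    base_url="https://api.featherless.ai/v1",  # assumed OpenAI-compatible endpoint
    api_key="YOUR_API_KEY",
)

response = client.completions.create(
    model="Nondzu/Mistral-7B-code-16k-qlora",
    prompt=(
        "### Instruction:\n"
        "Write a SQL query that returns the top 5 customers by total spend.\n\n"
        "### Response:\n"
    ),
    max_tokens=256,
    # Standard OpenAI-compatible sampler parameters (placeholder values):
    temperature=0.2,
    top_p=0.95,
    frequency_penalty=0.0,
    presence_penalty=0.0,
    # Non-standard samplers are often accepted via extra_body, if the server supports them:
    extra_body={"top_k": 40, "min_p": 0.05, "repetition_penalty": 1.1},
)
print(response.choices[0].text)
```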