Name: vilsonrodrigues/falcon-7b-sharded API
Brand: Featherless.ai
Price: 10.00 USD
Availability: InStock
Author: vilsonrodrigues

Overview

This model, vilsonrodrigues/falcon-7b-sharded, is a re-sharded version of the original Falcon-7B model by TII, repackaged so it can be loaded in environments with limited RAM, such as Colab or Kaggle. Falcon-7B is a 7 billion parameter causal decoder-only model trained on 1,500 billion (1.5 trillion) tokens, primarily from the RefinedWeb dataset enhanced with curated corpora. It is released under the permissive Apache 2.0 license, which allows commercial use.

Key Capabilities & Features

Optimized Architecture: Incorporates FlashAttention and multiquery attention for efficient inference.
Strong Performance: Outperforms comparable open-source models in its size class, as indicated by the OpenLLM Leaderboard.
Extensive Training Data: Trained on 1.5 trillion tokens, most of them high-quality web data, plus diverse curated sources such as books, conversations, and code.
Low RAM Compatibility: The weights are split into smaller shards in safetensors format, so the model can be loaded in memory-constrained settings.
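
To illustrate why sharding helps in low-RAM settings, here is a minimal sketch of a greedy shard planner. It is not the code used to produce this checkpoint; the function names (plan_shards, peak_shard_bytes) and the toy tensor sizes are hypothetical and chosen only for illustration. The idea is that a loader reading one shard at a time needs to hold only the largest shard in memory, rather than the whole checkpoint.

```python
from typing import Dict, List


def plan_shards(tensor_bytes: Dict[str, int], max_shard_bytes: int) -> List[List[str]]:
    """Greedily group tensors (in insertion order) into shards.

    A new shard is started whenever adding the next tensor would push the
    current shard past max_shard_bytes. A single tensor larger than the
    limit still gets a shard of its own.
    """
    shards: List[List[str]] = []
    current: List[str] = []
    current_size = 0
    for name, size in tensor_bytes.items():
        if current and current_size + size > max_shard_bytes:
            shards.append(current)
            current, current_size = [], 0
        current.append(name)
        current_size += size
    if current:
        shards.append(current)
    return shards


def peak_shard_bytes(tensor_bytes: Dict[str, int], shards: List[List[str]]) -> int:
    """Size of the largest shard, i.e. the most a one-shard-at-a-time reader holds."""
    return max(sum(tensor_bytes[name] for name in shard) for shard in shards)


# Toy sizes (hypothetical, not Falcon-7B's real tensor sizes).
toy_checkpoint = {"embed": 400, "layer0": 300, "layer1": 300, "layer2": 300, "head": 400}
toy_shards = plan_shards(toy_checkpoint, max_shard_bytes=700)
toy_peak = peak_shard_bytes(toy_checkpoint, toy_shards)
toy_total = sum(toy_checkpoint.values())
```

In this toy case the checkpoint totals 1700 units, but no single shard exceeds 700, which is the intuition behind re-sharding a large checkpoint for notebook environments.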

Intended Use Cases

Research: Ideal for academic and experimental research on large language models.
Foundation Model: Serves as a robust base for further specialization and fine-tuning for specific applications like summarization, text generation, or chatbot development.
Low-Resource Environments: Particularly useful for developers working in environments with limited computational resources, such as cloud-based notebooks.
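
As a rough sketch of how the model might be loaded in a notebook with limited RAM, the snippet below uses the Hugging Face transformers library. It assumes that transformers, torch, and accelerate are installed and that enough disk space and accelerator memory are available; the helper names build_load_kwargs and load_model are hypothetical, and the choice of bfloat16 is an assumption rather than a recommendation from the model card.

```python
MODEL_ID = "vilsonrodrigues/falcon-7b-sharded"


def build_load_kwargs(use_accelerate: bool = True) -> dict:
    """Keyword arguments for from_pretrained that aim to keep host RAM usage low."""
    # low_cpu_mem_usage avoids materializing a second full copy of the weights.
    kwargs = {"low_cpu_mem_usage": True}
    if use_accelerate:
        # device_map="auto" requires the accelerate package; it places weights
        # on the available devices as each shard is read.
        kwargs["device_map"] = "auto"
    return kwargs


def load_model(model_id: str = MODEL_ID):
    """Download and load the tokenizer and model (several GB of weights)."""
    # Imported lazily so the helper above can be used without these packages.
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForCausalLM.from_pretrained(
        model_id,
        torch_dtype=torch.bfloat16,  # assumption: half precision to halve memory
        **build_load_kwargs(),
    )
    return tokenizer, model
```

Calling load_model() returns a tokenizer and model that can then be used with model.generate for text generation or as the starting point for fine-tuning. Older transformers releases may additionally need trust_remote_code=True for Falcon checkpoints; whether that applies depends on the installed version.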
