laion/swesmith-316__Qwen3-8B
TEXT GENERATIONConcurrency Cost:1Model Size:8BQuant:FP8Ctx Length:32kPublished:Mar 26, 2026License:otherArchitecture:Transformer Cold
The laion/swesmith-316__Qwen3-8B model is an 8 billion parameter language model fine-tuned from Qwen/Qwen3-8B. It was trained on the /e/data1/datasets/playground/ot/hf_hub/datasets--laion--swesmith-unified-316/snapshots/2990d3acbbe8e6622cfe408e0f12038e523310ec_thinking_preprocessed dataset. This model is designed for general language understanding and generation tasks, leveraging its 32768 token context length for processing extensive inputs.
Loading preview...