dgambettaphd/M_qw306_run0_gen0_WXS_doc1000_synt64_lr1e-04_acm_FRESH
TEXT GENERATIONConcurrency Cost:1Model Size:0.8BQuant:BF16Ctx Length:32kPublished:Feb 28, 2026Architecture:Transformer Warm

The dgambettaphd/M_qw306_run0_gen0_WXS_doc1000_synt64_lr1e-04_acm_FRESH model is a 0.8 billion parameter language model with a 32768 token context length. Developed by dgambettaphd, this model's specific architecture, training data, and primary differentiators are not detailed in its current documentation. Its intended use cases and unique strengths are currently unspecified, requiring further information for a comprehensive understanding.

Loading preview...