alirizaercan/qwen25_05b_base_full_ft_lunarlander_a4000
TEXT GENERATIONConcurrency Cost:1Model Size:0.5BQuant:BF16Ctx Length:32kPublished:Mar 27, 2026License:otherArchitecture:Transformer Warm

The alirizaercan/qwen25_05b_base_full_ft_lunarlander_a4000 model is a 0.5 billion parameter Qwen2.5-based language model, fine-tuned by alirizaercan. It is specifically adapted from Qwen/Qwen2.5-0.5B for tasks related to the lunar_lander_270_reward_train dataset, achieving an accuracy of 0.9905 on its evaluation set. This model is optimized for specialized applications requiring high accuracy on specific, fine-tuned tasks rather than general-purpose language generation.

Loading preview...