AstroMLab/AstroSage-70B
Hugging Face
TEXT GENERATIONConcurrency Cost:4Model Size:70BQuant:FP8Ctx Length:32kPublished:May 18, 2025License:llama3.1Architecture:Transformer0.0K Warm

AstroMLab's AstroSage-70B is a 70-billion parameter autoregressive transformer LLM, fine-tuned from Meta-Llama-3.1-70B and specialized in astronomy, astrophysics, space science, and cosmology. Developed by AstroMLab, it underwent extensive continued pre-training on astronomical literature and supervised fine-tuning on instruction-following datasets. This model excels at domain-specific Q&A, literature review, and programming assistance for astronomical data analysis, achieving state-of-the-art performance on the AstroMLab-1 benchmark.

Loading preview...

AstroSage-70B: Domain-Specialized Astronomy LLM

AstroSage-70B, developed by AstroMLab, is a 70-billion parameter language model built upon the Meta-Llama-3.1-70B foundation. It is meticulously specialized for research and education across astronomy, astrophysics, space science, cosmology, and astronomical instrumentation.

Key Capabilities & Training:

  • Domain Specialization: Achieved through extensive continued pre-training (CPT) for 2.5 epochs on a vast corpus of astronomical literature, followed by supervised fine-tuning (SFT) for 0.6 epochs on astronomy-relevant and general-purpose instruction datasets.
  • Enhanced Architecture: While based on Llama-3.1-70B, it incorporates improved datasets, learning hyperparameters, and a higher parameter count compared to its 8B predecessor.
  • Reasoning Ability: Features an explicit reasoning capability (Chain-of-Thought) that can be enabled at inference time by setting a specific system prompt and prefilling the assistant response.
  • Performance: Achieves state-of-the-art performance on the AstroMLab-1 benchmark, scoring 86.2% correct, surpassing other models as of May 2025.
  • Context Length: Fine-tuned on 8192-token sequences, leveraging the base model's 128k context capability.

Intended Use Cases:

  • Providing factual information and explanations in specialized astronomical fields.
  • Assisting with literature reviews and summarizing scientific papers.
  • Answering domain-specific questions with high accuracy.
  • Aiding in programming tasks related to astronomical data analysis.
  • Serving as an educational tool for learning complex astronomical concepts.