kosiasuzu/chatml-llama3.1-8b-lora-merged
The kosiasuzu/chatml-llama3.1-8b-lora-merged model is an 8 billion parameter Llama 3.1 causal language model, developed by the AgenticML project. It is LoRA-fine-tuned with ChatML templates and tool-call tokens, then merged into full weights for single-checkpoint inference. This model is specifically designed for research comparison in multi-turn ChatML agent trajectories, particularly against AgenticML-format models, and is not intended as a general-purpose chat product. It excels at continuing agentic conversations involving system, user, and tool messages, and assistant generations with tool payloads.
Loading preview...
Model Overview
This model, kosiasuzu/chatml-llama3.1-8b-lora-merged, is an 8 billion parameter Llama 3.1 causal language model developed by the AgenticML project. It has been fine-tuned using LoRA with ChatML templates and tool-call tokens, then merged into a single checkpoint for inference. It serves as a direct comparison arm for the AgenticML agent-format study, utilizing the same training data and recipe as its AgenticML-format counterpart, differing primarily in serialization.
Key Capabilities
- ChatML Agent Trajectories: Designed to continue multi-turn agent conversations, handling system, user, and tool messages, as well as assistant generations with
<|eot_id|>and tool payloads. - Tool-Call Integration: Incorporates tool-call tokens and simulated tool calls within its training, enabling structured interactions.
- Research Comparison: Specifically intended for research to compare performance against AgenticML-format models on identical tasks and evaluation splits.
- Llama 3.1 Base: Inherits the architecture and base capabilities of Meta's Llama 3.1 8B model.
Training and Data
The model was fine-tuned on the kosiasuzu/agenticml-agent-trajectory-dataset, focusing on the messages column (ChatML conversation derived from AgenticML trajectories). Training involved supervised next-token prediction with labels masked to assistant spans only. It was initialized from kosiasuzu/chatml-agent-llama-3.1-8b-init, which includes ChatML special-token rows.
Intended Use Cases
- Agentic Research: Ideal for researchers studying agentic behavior and comparing different agent serialization formats (ChatML vs. AgenticML).
- Tool-Use Evaluation: Useful for evaluating models' ability to process and generate structured tool calls within a ChatML context.
- Format Validity Benchmarking: Can be used with the AgenticML CLI to run format-validity benchmarks.
Limitations and Considerations
- Research-Specific: Not validated for general-purpose consumer chat or high-stakes applications.
- Synthetic Data Biases: Trained largely on synthetic agent trajectories, which may introduce biases and domain limitations.
- Simulated Tools: Tool calls in training data are simulated; real API integration requires external schemas and safety layers.