Name: kosiasuzu/chatml-llama3.1-8b-lora-merged API
Brand: Featherless.ai
Price: 10.00 USD
Availability: InStock
Author: kosiasuzu

Model Overview

This model, kosiasuzu/chatml-llama3.1-8b-lora-merged, is an 8 billion parameter Llama 3.1 causal language model developed by the AgenticML project. It has been fine-tuned using LoRA with ChatML templates and tool-call tokens, then merged into a single checkpoint for inference. It serves as a direct comparison arm for the AgenticML agent-format study, utilizing the same training data and recipe as its AgenticML-format counterpart, differing primarily in serialization.

Key Capabilities

ChatML Agent Trajectories: Designed to continue multi-turn agent conversations, handling system, user, and tool messages, as well as assistant generations with <|eot_id|> and tool payloads.
Tool-Call Integration: Incorporates tool-call tokens and simulated tool calls within its training, enabling structured interactions.
Research Comparison: Specifically intended for research to compare performance against AgenticML-format models on identical tasks and evaluation splits.
Llama 3.1 Base: Inherits the architecture and base capabilities of Meta's Llama 3.1 8B model.

Training and Data

The model was fine-tuned on the kosiasuzu/agenticml-agent-trajectory-dataset, focusing on the messages column (ChatML conversation derived from AgenticML trajectories). Training involved supervised next-token prediction with labels masked to assistant spans only. It was initialized from kosiasuzu/chatml-agent-llama-3.1-8b-init, which includes ChatML special-token rows.

Intended Use Cases

Agentic Research: Ideal for researchers studying agentic behavior and comparing different agent serialization formats (ChatML vs. AgenticML).
Tool-Use Evaluation: Useful for evaluating models' ability to process and generate structured tool calls within a ChatML context.
Format Validity Benchmarking: Can be used with the AgenticML CLI to run format-validity benchmarks.

Limitations and Considerations

Research-Specific: Not validated for general-purpose consumer chat or high-stakes applications.
Synthetic Data Biases: Trained largely on synthetic agent trajectories, which may introduce biases and domain limitations.
Simulated Tools: Tool calls in training data are simulated; real API integration requires external schemas and safety layers.

Overview

Model Overview

Key Capabilities

Training and Data

Intended Use Cases

Limitations and Considerations

Full Model Card (README)