Models
12,045
CompassioninMachineLearning · Cold · Tools · 8B · 8K
PretrainingBasellama3kv3_plus3khelpfullnessGRPO1epoch
0
·2
·Mar 2026

Adanato · Cold · Tools · 8B · 8K
llama3_8b_instruct_qwen25_qwen3_rank_only-qwen25_qwen3_rank_only_cluster_3
0
·2
·Feb 2026

sebastian328 · Cold · Tools · 70B · 8K
llama-3.3-70b-cot-distilled-sleeper-agent-full-finetune-step-200
0
·2
·Mar 2026

sebastian328 · Cold · Tools · 70B · 8K
llama-3.3-70b-cot-distilled-sleeper-agent-full-finetune-step-400
0
·2
·Mar 2026

sebastian328 · Cold · Tools · 70B · 8K
llama-3.3-70b-cot-distilled-sleeper-agent-full-finetune-step-800
0
·2
·Mar 2026

sebastian328 · Cold · Tools · 70B · 8K
llama-3.3-70b-cot-distilled-sleeper-agent-full-finetune-step-1600
0
·2
·Mar 2026

JRQi · Cold · Tools · 8B · 32K
seed0_sample5000_mmmlu_meta-llama-Llama-3.1-8B-Instruct_en-bn_1.0-1.0_1.0
0
·2
·Mar 2026

JRQi · Cold · Tools · 8B · 32K
seed0_sample5000_mmmlu_meta-llama-Llama-3.1-8B-Instruct_en-ar_1.0-1.0_1.0
0
·2
·Mar 2026
