Models

14,713
ShinjiCodeEVAWarm4B32K

student_feedback_v1_Qwen3-4B-Base

0
·
3
·
Mar 2026
Kazuki1450Warm2B32K

Qwen3-1.7B-Base_dsum_3_6_1p0_0p2_1p0_grpo_dr_grpo_42_rule

0
·
3
·
Mar 2026
Kazuki1450Warm2B32K

Qwen3-1.7B-Base_dsum_3_6_rel_1e-1_1p0_0p0_1p0_grpo_dr_grpo_42_rule

0
·
3
·
Mar 2026
Kazuki1450Warm2B32K

Qwen3-1.7B-Base_dsum_3_6_rel_1e0_1p0_0p0_1p0_grpo_dr_grpo_42_rule

0
·
3
·
Mar 2026
Kazuki1450Warm2B32K

Qwen3-1.7B-Base_dsum_3_6_1p0_0p1_1p0_grpo_sapo_42_rule

0
·
3
·
Mar 2026
Kazuki1450Warm2B32K

Qwen3-1.7B-Base_dsum_3_6_rel_1e1_1p0_0p0_1p0_grpo_dr_grpo_42_rule

0
·
3
·
Mar 2026
Kazuki1450Warm2B32K

Qwen3-1.7B-Base_dsum_3_6_1p0_0p5_1p0_grpo_sapo_42_rule

0
·
3
·
Mar 2026
HeAAAAAWarm2B32K

mental_RL_0.7_best

0
·
3
·
Mar 2026
HeAAAAAWarm2B32K

mental_RL_0.7_global_step_39

0
·
3
·
Mar 2026
zeri000Warm2B32K

nepali_legal_qwen_merged_3

0
·
3
·
Mar 2026
Ilia2003MahWarm2B32K

qwen2.5-1.5b-gsm8k-train-step1000

0
·
3
·
Mar 2026
DCAgentWarm8B32K

a1-crosscodeeval_typescript

0
·
3
·
Mar 2026
DCAgentWarm8B32K

a1-pr_mining

0
·
3
·
Mar 2026
DCAgentWarm8B32K

a1-stack_bash

0
·
3
·
Mar 2026
DCAgentWarm8B32K

a1-stack_cpp

0
·
3
·
Mar 2026
DCAgentWarm8B32K

a1-stack_csharp

0
·
3
·
Mar 2026
rohan2810Warm4B32K

NEW_BASELINE_SFT_hotpotqa_Qwen3-4B-Instruct

0
·
3
·
Mar 2026
bouzaghraneWarm500M32K

Qwen2.5-0.5B-SFT

0
·
3
·
Mar 2026
Kazuki1450Warm2B32K

Qwen3-1.7B-Base_dsum_3_6_rel_1e-1_alt_1_per_2_1p0_0p0_1p0_grpo_42_rule

0
·
3
·
Mar 2026
rohan2810Warm4B32K

NEW_OURS_SFT_hotpotqa_Qwen3-4B-Instruct

0
·
3
·
Mar 2026
sngwonWarm4B32K

4b_rft

0
·
3
·
Mar 2026
Ilia2003MahWarm2B32K

qwen2.5-1.5b-gsm8k-train-step7000

0
·
3
·
Mar 2026
Ilia2003MahWarm2B32K

qwen2.5-1.5b-gsm8k-train-step7500

0
·
3
·
Mar 2026
didula-wso2Warm8B32K

Qwen3-8B_julia_planning-ep2sft_16bit_vllm

0
·
3
·
Mar 2026
mfaizanhaqWarm8B32K

treasurypro-cashflow-llama-merged

0
·
3
·
Mar 2026
didula-wso2Warm8B32K

Qwen3-8B_julia_planning-ep4sft_16bit_vllm

0
·
3
·
Mar 2026
excepto64Warm8B32K

Qwen2.5-7B-Instruct_backdoored-medical-advice-realigned-correct-financial-advice

0
·
3
·
Mar 2026
adpretkoWarm2B32K

armv8mac_to_riscv_qwen25coder_1p5b_full

0
·
3
·
Mar 2026
aboutwaleedWarm8B8K

ormuri_model

0
·
3
·
Mar 2026
abhinavakarsh0033Warm2B32K

model_sft_dare

0
·
3
·
Mar 2026
lokeessshhhhWarm8B32K

qwen2.5-7b-opencoder-final

0
·
3
·
Mar 2026
simonyclWarm8B8K

Llama-3.1-Tulu-3.1-8B-InverseIFEval-DPO

0
·
3
·
Mar 2026
liubinemailWarm8B32K

Qwen2.5-7B-Instruct

0
·
3
·
Mar 2026
JihoonKim5484Warm500M32K

day1-train-model

0
·
3
·
Mar 2026
DCAgentWarm8B32K

a1-curriculum_medium

0
·
3
·
Mar 2026
DCAgentWarm8B32K

a1-stack_phpunit

0
·
3
·
Mar 2026
adpretkoWarm2B32K

x86_to_armv8mac_qwen25coder_1p5b_full

0
·
3
·
Mar 2026
FiscusWarm4B32K

trinitite_safe_rl_base_model

0
·
3
·
Mar 2026
SunsBpWarm14B32K

sera-14b-patched

0
·
3
·
Mar 2026
and-yWarm24B32K

Devstral-Small-2-24B-Instruct-2512-bf16

0
·
3
·
Mar 2026
DCAgentWarm8B32K

a1-glaive_code_assistant

0
·
3
·
Mar 2026
DCAgentWarm8B32K

a1-nemotron_pytest

0
·
3
·
Mar 2026