DCAgent2/terminal_bench_2_GLM_4_6_stackexchange_overflow_sandboxes_32eps_65k_reasoning_nb63a57d0 Viewer • Updated about 1 hour ago • 267
DCAgent2/terminal_bench_2_exp_psu_stackoverflow_1K_glm_4_7_traces_20260312_210803 Viewer • Updated about 1 hour ago • 267
DCAgent2/terminal_bench_2_GLM_4_6_stackexchange_overflow_sandboxes_32eps_65k_reasoning_n85f4232a Viewer • Updated about 2 hours ago • 267
DCAgent2/terminal_bench_2_GLM_4_6_stackexchange_overflow_sandboxes_32eps_65k_reasoning_n8cdafffb Viewer • Updated about 2 hours ago • 267
DCAgent2/terminal_bench_2_exp_psu_stackoverflow_3K_glm_4_7_traces_20260312_210804 Viewer • Updated about 3 hours ago • 267
DCAgent2/terminal_bench_2_rl_r2egym_nl2bash_stack_bugsseq_fixthink_again_lr1e_5_curricul05d2df20 Viewer • Updated about 4 hours ago • 267
DCAgent2/terminal_bench_2_swesmith_sandboxes_with_tests_gpt_5_mini_passed_glm_4_7_traces126c5b02 Viewer • Updated about 5 hours ago • 267
DCAgent2/terminal_bench_2_GLM_4_6_stackexchange_overflow_sandboxes_32eps_65k_reasoning_n48df24d3 Viewer • Updated about 5 hours ago • 267
DCAgent2/terminal_bench_2_exp_tas_timeout_multiplier_8_0_traces_20260312_170723 Viewer • Updated about 5 hours ago • 267
DCAgent2/terminal_bench_2_exp_tas_timeout_multiplier_4_0_traces_20260312_170722 Viewer • Updated about 7 hours ago • 267
DCAgent2/terminal_bench_2_exp_tas_timeout_multiplier_0_25_traces_20260312_170719 Viewer • Updated about 7 hours ago • 267
DCAgent2/terminal_bench_2_Kimi_K2T_neulab_agenttuning_kg_sandboxes_maxeps_32k_20260312_010452 Viewer • Updated about 11 hours ago • 267 • 4
DCAgent2/terminal_bench_2_GLM_4_6_stackexchange_overflow_sandboxes_32eps_65k_reasoning_ne6d692d8 Viewer • Updated about 12 hours ago • 267 • 3
DCAgent2/terminal_bench_2_GLM_4_6_stackexchange_overflow_sandboxes_32eps_65k_reasoning_n5a4dd209 Viewer • Updated about 12 hours ago • 267 • 3
DCAgent2/terminal_bench_2_GLM_4_6_stackexchange_overflow_sandboxes_32eps_65k_reasoning_n8d3594d9 Viewer • Updated about 12 hours ago • 267 • 2
DCAgent2/Kimi-2.5-inferredbugs-sandboxes-maxeps-32k Viewer • Updated about 12 hours ago • 18.8k • 16 • 1
DCAgent2/swebench_verified_random_100_folders_rl_rl_conf_24GP_base_yaml_mode_path_r2eg_n63a60b15 Viewer • Updated about 13 hours ago • 300 • 4
DCAgent2/swebench_verified_random_100_folders_Qwen3_8B_exp_tas_summarize_threshold_4096_34b89de1 Viewer • Updated about 13 hours ago • 300 • 4
DCAgent2/terminal_bench_2_GLM_4_7_inferredbugs_sandboxes_maxeps_131k_20260312_010454 Viewer • Updated about 14 hours ago • 265 • 5
DCAgent2/swebench_verified_random_100_folders_rl_rl_conf_24GP_base_yaml_mode_path_exp_ta7ad5624e Viewer • Updated about 16 hours ago • 300 • 6
DCAgent2/terminal_bench_2_GLM_4_6_stackexchange_overflow_sandboxes_32eps_65k_reasoning_n903c0ee0 Viewer • Updated about 17 hours ago • 267 • 5
DCAgent2/swebench_verified_random_100_folders_swesmith_sandboxes_with_tests_gpt_5_mini_p710cae67 Viewer • Updated about 19 hours ago • 300 • 4
DCAgent2/terminal_bench_2_glm46_Toolscale_tasks_traces_20260311_174322 Viewer • Updated about 20 hours ago • 267 • 4
DCAgent2/swebench_verified_random_100_folders_rl_r2egym_nl2bash_stack_bugsseq_fixthink_ac6ef45e4 Viewer • Updated about 20 hours ago • 300 • 5
DCAgent2/swebench_verified_random_100_folders_nl2bash_nl2bash_bugsseq_Qwen3_8B_maxEps24_1e58fbf5 Viewer • Updated about 20 hours ago • 300 • 4
DCAgent2/terminal_bench_2_rl_r2egym_nl2bash_stack_bugsseq_fixthink_again_lr1e_5_postmort1bdb5755 Viewer • Updated about 21 hours ago • 267 • 5
DCAgent2/swebench_verified_random_100_folders_nl2bash_nl2bash_bugsseq_Qwen3_8B_maxEps24_ceabc985 Viewer • Updated about 22 hours ago • 300 • 3
DCAgent2/swebench_verified_random_100_folders_nl2bash_nl2bash_bugsseq_Qwen3_8B_maxEps24_114efe33 Viewer • Updated about 22 hours ago • 300 • 4
DCAgent2/swebench_verified_random_100_folders_GLM_4_6_stackexchange_overflow_sandboxes_3c2e552a9 Viewer • Updated about 23 hours ago • 300 • 3
DCAgent2/swebench_verified_random_100_folders_r2egymGPT5CodexPassed_nl2bash_bugsseq_Qwena0e0c3f6 Viewer • Updated 1 day ago • 300 • 3