problemFeb 26, 2026Benchmark / Challengeagents · graphResearch problem 25: Safety systemsThis study explores automated research pipelines with rigorous evaluation, detailed ablations, and transparent artifacts.