SWE-CI: Evaluating Agent Capabilities in Maintaining Codebases via Continuous...
Large language model (LLM)-powered agents have demonstrated strong capabilities in automating software engineering tasks such as static bug fixing. However, in the real world, the development of mature software is typically predicated on complex...
中文名:SWE-CI: 用CI评估Agent维护代码库的能力
1 个帖子 - 1 位参与者