【推荐】推荐一篇论文,为什么Opus编码体感上就是比其他模型好?

【推荐】推荐一篇论文,为什么Opus编码体感上就是比其他模型好?
【推荐】推荐一篇论文,为什么Opus编码体感上就是比其他模型好?
arXiv.org

SWE-CI: Evaluating Agent Capabilities in Maintaining Codebases via Continuous...

Large language model (LLM)-powered agents have demonstrated strong capabilities in automating software engineering tasks such as static bug fixing. However, in the real world, the development of mature software is typically predicated on complex...

中文名:SWE-CI: 用CI评估Agent维护代码库的能力

1 个帖子 - 1 位参与者

阅读完整话题

来源: linux.do查看原文