Init Intelligence Atlas
Tag: swe-bench
2 items with this tag.
May 11, 2026
Academic papers on AI agent reliability + ITSM relevance (May 2026)
synthesis
academic-papers
agent-reliability
swe-bench
metr
itbench
aiopslab
tau-bench
mcp
research-roadmap
May 11, 2026
Agent benchmark suites for IT-tier tasks — ITSM-bench proposal (May 2026)
synthesis
benchmarks
itsm-bench
swe-bench
workarena
itbench
neurips
research-brand