CTI-REALM is Microsoft’s open-source benchmark that evaluates AI agents on real-world detection engineering. It measures whether an agent can take cyber threat intelligence (CTI) and produce validated ...
Microsoft’s Jeff Hollan discusses what separates true AI agents from chat interfaces and which agent strategies succeed over the next 12 to 24 months.