或许需要某种「第三方评测、审计机构」来评估 Skills 的数据使用方式、检测潜在安全风险等等。
这一提议揭示了AI技能安全问题的严重性,以及现有评估体系的不足,暗示未来可能会出现专门针对AI能力的第三方评估机构,这可能是解决信任问题的关键创新点。
或许需要某种「第三方评测、审计机构」来评估 Skills 的数据使用方式、检测潜在安全风险等等。
这一提议揭示了AI技能安全问题的严重性,以及现有评估体系的不足,暗示未来可能会出现专门针对AI能力的第三方评估机构,这可能是解决信任问题的关键创新点。
This criterion requires an independent security review has been performed within the 12 months prior to evaluation. This review must cover both the design and the implementation of the app and must be performed by a named auditing party that is independent of the tool's main development team. Audits by an independent security team within a large organization are sufficient. Recognizing that unpublished audits can be valuable, we do not require that the results of the audit have been made public, only that a named party is willing to verify that the audit took place.