1 Matching Annotations
  1. Last 7 days
    1. The mathematical guarantees arise from a different source. First of all, the form of the mathematical guarantees is that either the predictor or the agentic version will have an exponentially small probability of achieving what I call a 'challenging and harmful' goal.

      这一观点提出了数学保证的形式,即预测器或代理版本实现有害目标的概率呈指数级减小,这是AI安全理论的重要突破。