Hypothesis

3 Matching Annotations

Apr 2026
github.com github.com

https://github.com/fxp/aegis-core

1
1. fxp007 17 Apr 2026
  
  in Public
  
  Real-time monitoring of agent actions with a 12-category anomaly detection system derived from frontier model safety evaluations. Three-level alert system: PROHIBITED (immediate block), HIGH_RISK_DUAL_USE (human review), DUAL_USE (log and track).
  
  这种三级警报系统展示了AI安全监控的精细化程度，将代理行为分为不同风险级别，从完全禁止到仅记录跟踪。这种分类方法反映了AI安全中'双重用途'挑战的复杂性，即同一技术既可用于防御也可用于攻击。
  
  anomaly-detection risk-assessment ai-safety
Visit annotations in context

Tags

ai-safety

anomaly-detection

risk-assessment

Annotators

fxp007

URL

github.com/fxp/aegis-core
Feb 2025
arxiv.org arxiv.org

2003.03692v2.pdf

1
1. mark.crowley 20 Feb 2025
  
  in Public
  
  H. Ma, B. Ghojogh, M. N. Samad, D. Zheng and M. Crowley, "Isolation Mondrian Forest for Batch and Online Anomaly Detection," 2020 IEEE International Conference on Systems, Man, and Cybernetics (SMC), Toronto, ON, Canada, 2020, pp. 3051-3058, doi: 10.1109/SMC42975.2020.9283073.
  
  The algorithm fuses two ideas, "isolation" from ensemble trees methods for anomaly detection and "Mondrian forests" which can learn flexible regression models from streaming data.
  
  anomaly-detection
Visit annotations in context

Tags

anomaly-detection

Annotators

mark.crowley

URL

arxiv.org/pdf/2003.03692
Jun 2020
arxiv.org arxiv.org

Structural Temporal Graph Neural Networks for Anomaly Detection in Dynamic Graphs

1
1. edampf 08 Jun 2020
  
  in BehSci
  
  Cai, L., Chen, Z., Luo, C., Gui, J., Ni, J., Li, D., & Chen, H. (2020). Structural Temporal Graph Neural Networks for Anomaly Detection in Dynamic Graphs. ArXiv:2005.07427 [Cs, Stat]. http://arxiv.org/abs/2005.07427
  
  is:preprint lang:en anomaly detection dynamic graph neural network StrGNN structural temporal graph neural network model modeling
Visit annotations in context

Tags

neural network

structural temporal graph neural network model

anomaly detection

modeling

lang:en

dynamic graph

StrGNN

is:preprint

Annotators

edampf

URL

arxiv.org/abs/2005.07427

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL