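[Shocking] Even when an LLM is explicitly warned that "the following information is false," the model still believes the false information and bases its answers on it. This is a fundamental challenge to AI trustworthiness: incorrect information returned by RAG systems and agent tool calls is "digested" by the model and influences its output, even when the system designer has already flagged the reliability of the source in the prompt. In other words, writing a disclaimer into the system prompt does not prevent the model from being contaminated by false information.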
3 Matching Annotations
- Jun 2026
arstechnica.com
- Apr 2021
psyarxiv.com
Brown, Chris R. H. ‘The Influence of COVID-19-Specific Health Risk Beliefs on the Motivation to Quit Smoking’. PsyArXiv, 16 April 2021. https://doi.org/10.31234/osf.io/3csuh.
- Jan 2021
www.behaviourworksaustralia.org
Grundy, E. (2020). SCRUB PROJECT WAVE 4: AUSTRALIANS’ VIEWS ON PRIVATE GATHERINGS, REMOTE WORKING AND GETTING TESTED. Behaviour Works Australia. Retrieved from https://www.behaviourworksaustralia.org/scrub-project-wave-4-australians-views-on-private-gatherings-remote-working-and-getting-tested/