Hypothesis

3 Matching Annotations

Jun 2026
arstechnica.com arstechnica.com

LLMs believe false statements even after explicit warnings that they're false

1
1. fxp007 04 Jun 2026
  
  in Public
  
  【令人震惊】即便明确警告 LLM「接下来的信息是错误的」，模型仍然会相信并依据这些虚假信息作答。这是一个对 AI 可信度的根本性挑战：RAG 系统和 Agent 工具调用返回的错误信息，会被模型「消化」并影响其输出，即使系统设计者已经在 Prompt 中声明了信息来源的可靠性问题。这意味着「在系统提示里写免责声明」并不能防止模型被错误信息污染。
  
  LLM-false-beliefs prompt-injection reliability RAG-risk shocking
Visit annotations in context

Tags

RAG-risk

prompt-injection

reliability

shocking

LLM-false-beliefs

Annotators

fxp007

URL

arstechnica.com/ai/2026/05/llms-believe-false-statements-even-after-explicit-warnings-that-theyre-false/
Apr 2021
psyarxiv.com psyarxiv.com

Untitled document

1
1. sophia.sterckx 17 Apr 2021
  
  in BehSci
  
  Brown, Chris R. H. ‘The Influence of COVID-19-Specific Health Risk Beliefs on the Motivation to Quit Smoking’. PsyArXiv, 16 April 2021. https://doi.org/10.31234/osf.io/3csuh.
  
  is:preprint lang:en health beliefs COVID-19 smoking cessation motivation outcome expecrtancy Pandemic smokers risk beliefs positive relationship contagious
Visit annotations in context

Tags

positive relationship

lang:en

risk beliefs

health beliefs

is:preprint

motivation

Pandemic

outcome expecrtancy

smoking cessation

contagious

smokers

COVID-19

Annotators

sophia.sterckx

URL

psyarxiv.com/3csuh/
Jan 2021
www.behaviourworksaustralia.org www.behaviourworksaustralia.org

SCRUB project wave 4: Australians’ views on private gatherings, remote working and getting tested - BehaviourWorks Australia

1
1. Grace1999 18 Jan 2021
  
  in BehSci
  
  Grundy. E., (2020) SCRUB PROJECT WAVE 4: AUSTRALIANS’ VIEWS ON PRIVATE GATHERINGS, REMOTE WORKING AND GETTING TESTED. Behaviour Works Australia. Retrieved from https://www.behaviourworksaustralia.org/scrub-project-wave-4-australians-views-on-private-gatherings-remote-working-and-getting-tested/
  
  lang:en is:article COVID-19 public attitude behavior science psychology beliefs risk perception research protective behaviours Policy Global Begavioural drivers Attituds Demographic factors Physical distancing
Visit annotations in context

Tags

psychology

public attitude

is:article

Global

COVID-19

risk perception

Physical distancing

Begavioural drivers

lang:en

Attituds

behavior science

Policy

research

beliefs

protective behaviours

Demographic factors

Annotators

Grace1999

URL

behaviourworksaustralia.org/scrub-project-wave-4-australians-views-on-private-gatherings-remote-working-and-getting-tested/