Starting with GPT‑5.1, our models began developing a strange habit: they increasingly mentioned goblins, gremlins, and other creatures in their metaphors.
初学者可能难以理解模型行为的发展模式,尤其是当这种模式以微妙的方式出现时,如GPT-5.1开始频繁使用怪物的隐喻。
Starting with GPT‑5.1, our models began developing a strange habit: they increasingly mentioned goblins, gremlins, and other creatures in their metaphors.
初学者可能难以理解模型行为的发展模式,尤其是当这种模式以微妙的方式出现时,如GPT-5.1开始频繁使用怪物的隐喻。
Given a thousand line items to extract, they'll often stop short, consolidate, or skip entries rather than working through every last row.
大多数人可能认为AI模型在处理重复任务时会保持一致性和全面性。但作者指出模型在处理大量重复任务时会采取'捷径',如提前停止、合并或跳过条目,这揭示了AI模型在处理长文档时的一种非理性行为,挑战了AI作为完全理性执行者的假设。
The issue isn't that models are bad at reading documents. It's that single-pass extraction has no mechanism to catch its own mistakes, and models get lazy.
大多数人认为AI模型在文档提取中的低准确率主要是因为模型能力不足或理解能力有限。但作者提出了一个反直觉的观点:问题不在于模型本身,而在于单次提取缺乏自我纠错的机制,导致模型'变懒'。这挑战了对AI能力局限性的传统认知。
Abstract
// abstract - summary - Rationalist approaches to environmental problems such as climate change - apply an information deficit model, - assuming that if people understand what needs to be done they will act rationally. - However, applying a knowledge deficit hypothesis often fails to recognize unconscious motivations revealed by: - social psychology, - cognitive science, - behavioral economics.
Weiss, D. J., & Shanteau, J. (2021). The futility of decision making research. Studies in History and Philosophy of Science Part A, 90, 10–14. https://doi.org/10.1016/j.shpsa.2021.08.018
Home - COVID 19 scenario model hub. (n.d.). Retrieved July 5, 2021, from https://covid19scenariomodelinghub.org/
Iacob, C. I., Ionescu, D., Avram, E., & Cojocaru, D. (2021). COVID-19 Pandemic Worry and Vaccination Intention: The Mediating Role of the Health Belief Model Components. Frontiers in Psychology, 12, 674018. https://doi.org/10.3389/fpsyg.2021.674018
Cepelewicz, J. (n.d.). The Hard Lessons of Modeling the Coronavirus Pandemic. Quanta Magazine. Retrieved February 11, 2021, from https://www.quantamagazine.org/the-hard-lessons-of-modeling-the-coronavirus-pandemic-20210128/
Jones, M. I., Sirianni, A. D., & Fu, F. (2021). Polarization, Abstention, and the Median Voter Theorem. ArXiv:2103.12847 [Physics]. http://arxiv.org/abs/2103.12847
Oraby, T., Thampi, V., & Bauch, C. T. (2014). The influence of social norms on the dynamics of vaccinating behaviour for paediatric infectious diseases. Proceedings of the Royal Society B: Biological Sciences, 281(1780). https://doi.org/10.1098/rspb.2013.3172
Bertana, A., Chetverikov, A., Bergen, R. S. van, Ling, S., & Jehee, J. F. M. (2020). Dual strategies in human confidence judgments. BioRxiv, 2020.09.17.299743. https://doi.org/10.1101/2020.09.17.299743
Cantwell, G. T., Kirkley, A., & Newman, M. E. J. (2020). The friendship paradox in real and model networks. ArXiv:2012.03991 [Physics]. http://arxiv.org/abs/2012.03991
McKenna, S. (n.d.). COVID Models Show How to Avoid Future Lockdowns. Scientific American. Retrieved 26 February 2021, from https://www.scientificamerican.com/article/covid-models-show-how-to-avoid-future-lockdowns/
Ye, Y., Zhang, Q., Ruan, Z., Cao, Z., Xuan, Q., & Zeng, D. D. (2020). Effect of heterogeneous risk perception on information diffusion, behavior change, and disease transmission. Physical Review E, 102(4), 042314. https://doi.org/10.1103/PhysRevE.102.042314
Tepper, S., & Neil Lewis, J. (2021). When the Going Gets Tough, How Do We Perceive the Future? PsyArXiv. https://doi.org/10.31234/osf.io/pkaxn
Abel. M., Brown. W., (2020) Prosocial Behavior in the Time of COVID-19: The Effect of Private and Public Role Models. Institute of labor and economics. Retrieved from:https://covid-19.iza.org/publications/dp13207/
Eyal describes the theory called The Fogg Behavior Model which states that for a behavior (B) to occur, three things must be present at the same time: motivation (M), ability (A), and a trigger (T). More succinctly, B = MAT.
Fogg Behavior Model says that for a Behavior (B) to occur 3 things have to be present at the same time:
B = MAT
Alfaro, L., Faia, E., Lamersdorf, N., & Saidi, F. (2020). Social Interactions in Pandemics: Fear, Altruism, and Reciprocity (Working Paper No. 27134; Working Paper Series). National Bureau of Economic Research. https://doi.org/10.3386/w27134
Bethune, Z. A., & Korinek, A. (2020). Covid-19 Infection Externalities: Trading Off Lives vs. Livelihoods (Working Paper No. 27009; Working Paper Series). National Bureau of Economic Research. https://doi.org/10.3386/w27009
Brooks, H. Z., Kanjanasaratool, U., Kureh, Y. H., & Porter, M. A. (2020). Disease Detectives: Using Mathematics to Forecast the Spread of Infectious Diseases [Preprint]. SocArXiv. https://doi.org/10.31235/osf.io/mvn9z
Toxvaerd, F. M. O. (2020). Equilibrium Social Distancing [Working Paper]. Faculty of Economics, University of Cambridge. https://doi.org/10.17863/CAM.52489
West, R., Michie, S., Rubin, G. J., & Amlôt, R. (2020). Applying principles of behaviour change to reduce SARS-CoV-2 transmission. Nature Human Behaviour, 1–9. https://doi.org/10.1038/s41562-020-0887-9
Moyers, S. A., & Hagger, M. S. (2020, April 20). Physical activity and sense of coherence: A meta-analysis. https://doi.org/10.31234/osf.io/d9e3k
Dai, B., Fu, D., Meng, G., Qi, L., & Liu, X. (2020, April 25). The effects of governmental and individual predictors on COVID-19 protective behaviors in China: a path analysis model. https://doi.org/10.31234/osf.io/hgzj9
Colombo, R., Wallace, M., & Taylor, R. S. (2020, April 11). An Essential Service Decision Model for Applied Behavior Analytic Providers During Crisis. https://doi.org/10.31234/osf.io/te8ha
Rafiei, F., & Rahnev, D. (2020, April 9). Does the diffusion model account for the effects of speed-accuracy tradeoff on response times?. https://doi.org/10.31234/osf.io/bhj85
But she's also addicted to her phone."
What we sow is what we reap!
Behavior Engineering Model This page has a design that is not especially attractive or user friendly but it does provide an overview of Gilbert's Behavior Engineering Model. This is a model that can be used to analyze the issues that underlie performance. A six-cell model is presented. Rating 5/5
Human Performance Technology Model This page is an eight page PDF that gives an overview of the human performance technology model. This is a black and white PDF that is simply written and is accessible to the layperson. Authors are prominent writers in the field of performance technology. Rating 5/5