Hypothesis

19 Matching Annotations

May 2026
cruxevals.com cruxevals.com

https://cruxevals.com/

1
1. fxp007 07 May 2026
  
  in Public
  
  Andrej Karpathy built a simple automation pipeline for AI agents to optimize training in 5-minute increments.
  
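  This case illustrates how AI systems are being applied to automated research. Five-minute optimization increments are a fine-grained timescale, indicating that AI systems can already run rapidly iterating experiments. The 61K+ GitHub stars indicate that the approach has drawn wide attention in the AI research community.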
  
  data-point automation-scale research-methodology

Tags

automation-scale

research-methodology

data-point

Annotators

fxp007

URL

cruxevals.com/
huggingface.co huggingface.co

https://huggingface.co/papers/2604.24658

1
1. fxp007 01 May 2026
  
  in Public
  
  Scientific publication compresses a branching, iterative research process into a linear narrative, discarding the majority of what was discovered along the way.
  
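  Most people assume that a scientific paper fully documents the research process, but the authors argue that traditional papers actually discard most of what was discovered and present only a linear narrative, which constitutes the so-called 'storytelling tax'. This view challenges the academic community's common assumptions about the completeness of publications.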
  
  non-consensus research-methodology storytelling-tax

Tags

storytelling-tax

non-consensus

research-methodology

Annotators

fxp007

URL

huggingface.co/papers/2604.24658
Apr 2026
www.scientificamerican.com www.scientificamerican.com

https://www.scientificamerican.com/article/amateur-armed-with-chatgpt-vibe-maths-a-60-year-old-problem/

1
1. fxp007 30 Apr 2026
  
  in Public
  
  An AI researcher subsequently gifted them each a ChatGPT Pro subscription to encourage their 'vibe mathing.'
  
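  Most people assume that serious mathematical research requires rigorous methods and deep expertise, but the author uses the informal term 'vibe mathing' to describe this style of research, challenging the traditional norms of academic research methodology.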
  
  non-consensus research-methodology

Tags

non-consensus

research-methodology

Annotators

fxp007

URL

scientificamerican.com/article/amateur-armed-with-chatgpt-vibe-maths-a-60-year-old-problem/
metr.org metr.org

The Org Uplift Game - METR

1
1. fxp007 09 Apr 2026
  
  in Public
  
  two participants gave it 9/10 and one "11/10"
  
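  A two-hour tabletop-style exercise that three top AI safety researchers rated between 9 and 11 out of 10: that is itself a signal that serious AI research organizations are using "role-play" to prepare for the future. The methodology (rehearsing workflows under future capabilities) has precedents in other fields, such as military wargames, disaster drills, and scenario planning, but applying it to the evolution of AI capabilities reflects METR's distinctive research taste.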
  
  tabletop-exercise future-preparation research-methodology surprising

Tags

tabletop-exercise

research-methodology

surprising

future-preparation

Annotators

fxp007

URL

metr.org/notes/2026-03-19-org-uplift-game/
transformer-circuits.pub transformer-circuits.pub

Emotion Concepts and their Function in a Large Language Model

1
1. fxp007 09 Apr 2026
  
  in Public
  
  Large language models (LLMs) sometimes appear to exhibit emotional reactions. We investigate why this is the case in Claude Sonnet 4.5 and explore implications for alignment-relevant behavior.
  
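  [Inspiration] This sentence points to a new paradigm for AI research: rather than asking "what can the model do", ask "why does the model behave this way". Using emotion as the entry point for understanding model behavior essentially brings the methodology of psychology into AI interpretability research. The takeaway for practitioners: the most valuable AI research in the future may lie not in algorithmic innovation but in "finding mechanistic explanations for known phenomena", as this paper does.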
  
  inspiration research-paradigm mechanistic-interpretability methodology

Tags

inspiration

methodology

research-paradigm

mechanistic-interpretability

Annotators

fxp007

URL

transformer-circuits.pub/2026/emotions/index.html
Aug 2025
www.nature.com www.nature.com

The science fiction science method

1
1. tonz 08 Aug 2025
  
  in Public
  
  Suggests quantitative methods for predicting the impact of future tech on behaviour and social aspects, in contrast with the usual qualitative narrative methods (futurism, narrative inquiry, scenarios presumably). The Science Fiction Science Method as PDF in Zotero. PDF available CC BY at https://www.researchgate.net/publication/394323287_The_Science_Fiction_Science_Method
  
  via Bruce Sterling (Mastodon)
  
  sciencefiction research methodology

Tags

sciencefiction

methodology

research

Annotators

tonz

URL

nature.com/articles/s41586-025-09194-6
Apr 2024
www.researchgate.net www.researchgate.net

FoundationPhaseMatters-RamadiroandPorteus-2017.pdf

1
1. PelserM 13 Apr 2024
  
  in Public
  
  educational design research methodology.
  
  Educational design research methodology.
  
  Education design research methodology mother tongue foundation phase linguistic resources

Tags

foundation phase

mother tongue

linguistic resources

Education design research methodology

Annotators

PelserM

URL

researchgate.net/profile/Brian-Ramadiro/publication/321330576_Foundation_Phase_Matters_Language_and_Learning_in_South_African_Rural_Classrooms/links/5a1d0c03aca2726120b28058/Foundation-Phase-Matters-Language-and-Learning-in-South-African-Rural-Classrooms.pdf
Feb 2024
www.gesis.org www.gesis.org

GESIS - Leibniz Institute for the Social Sciences

1
1. Saldner_DANS 29 Feb 2024
  
  in Public
  
  rda_graph research data surveys social sciences RDM research methodology

Tags

social sciences

research data

research methodology

rda_graph

RDM

surveys

Annotators

Saldner_DANS

URL

gesis.org/en/home
Dec 2021
www.nature.com www.nature.com

Replicating scientific results is tough — but essential

1
1. lucyparfitt16 15 Dec 2021
  
  in BehSci
  
  Replicating scientific results is tough—But essential. (2021). Nature, 600(7889), 359–360. https://doi.org/10.1038/d41586-021-03736-4
  
  is:article lang:en science replication research cancer biology scientific journal Reproducibility Project experimental treatment bias time consuming detail open science scientific method methodology data peer review funding publishing investment progress

Tags

scientific journal

publishing

peer review

progress

time consuming

bias

scientific method

is:article

replication

experimental treatment

cancer biology

funding

investment

detail

Reproducibility Project

data

research

science

methodology

open science

lang:en

Annotators

lucyparfitt16

URL

nature.com/articles/d41586-021-03736-4
Nov 2021
Local file Local file

Untitled document

1
1. Mark_C_Harris 27 Nov 2021
  
  in Public
  
  (the VTA is also part of this system, but is too small to image with standard fMRI methods, but see [35] for successful imaging methods).
  
  All imaging studies face questions of validity and should (and many do) link to comprehensive details on instrumentation, methodology, and interpretation. Apparently, the professional consensus remains that, properly executed and interpreted, fMRI and other functional imaging techniques based on detection of oxygenation can lead to highly valid conclusions. (See Nautil.us article.)
  
  brain imaging fMRI functional imaging validity research methodology oxygenation
Tags

oxygenation

research

validity

brain imaging

functional imaging

methodology

fMRI

Annotators

Mark_C_Harris
www.mobindustry.net www.mobindustry.net

How to Conduct Agile Market Research for Your Digital Product

1
1. DavidCutts 23 Nov 2021
  
  in Public
  
  How to Conduct Agile Market Research for Your Digital Product
  
  agile methodology market research digital product

Tags

market research

agile methodology

digital product

Annotators

DavidCutts

URL

mobindustry.net/blog/how-to-conduct-agile-market-research-for-your-digital-product/
Jul 2021
www.sciencedirect.com www.sciencedirect.com

Pre-registration: Weighing costs and benefits for researchers

1
1. Lu17Cheryl 12 Jul 2021
  
  in BehSci
  
  Logg, Jennifer M., and Charles A. Dorison. “Pre-Registration: Weighing Costs and Benefits for Researchers.” Organizational Behavior and Human Decision Processes 167 (November 1, 2021): 18–27. https://doi.org/10.1016/j.obhdp.2021.05.006.
  
  is:article lang:en pre-registration credibility research replicability reproducibility cost benefit open science methodology replication

Tags

credibility

research

benefit

pre-registration

methodology

cost

is:article

reproducibility

open science

replication

lang:en

replicability

Annotators

Lu17Cheryl

URL

sciencedirect.com/science/article/abs/pii/S0749597821000649
Jun 2021
metascience2021.org metascience2021.org

Metascience 2021

1
1. lucyparfitt16 27 Jun 2021
  
  in BehSci
  
  Metascience 2021. (n.d.). Retrieved June 27, 2021, from https://metascience2021.org/
  
  is:webpage webinar conference meta science science research scientific process technology modem methodology community research practice intervention video

Tags

conference

video

research practice

meta science

research

intervention

science

webinar

technology

community

is:webpage

scientific process

modem methodology

Annotators

lucyparfitt16

URL

metascience2021.org/
www.jclinepi.com www.jclinepi.com

Methodology over metrics: Current scientific standards are a disservice to patients and society

1
1. SIYANYE 18 Jun 2021
  
  in BehSci
  
  Calster, B. V., Wynants, L., Riley, R. D., Smeden, M. van, & Collins, G. S. (2021). Methodology over metrics: Current scientific standards are a disservice to patients and society. Journal of Clinical Epidemiology, 0(0). https://doi.org/10.1016/j.jclinepi.2021.05.018
  
  lang:en is:article research quality methodology reporting COVID-19 peer review education

Tags

peer review

COVID-19

research quality

methodology

is:article

reporting

lang:en

education

Annotators

SIYANYE

URL

jclinepi.com/article/S0895-4356(21)00170-0/fulltext
epjdatascience.springeropen.com epjdatascience.springeropen.com

Linking Twitter and survey data: asymmetry in quantity and its impact

1
1. jasminehollingworth 11 Jun 2021
  
  in BehSci
  
  Baghal, T. A., Wenz, A., Sloan, L., & Jessop, C. (2021). Linking Twitter and survey data: Asymmetry in quantity and its impact. EPJ Data Science, 10(1), 1–20. https://doi.org/10.1140/epjds/s13688-021-00286-7
  
  lang:en is:article Twitter survey data asymmetry quantity impact social media information unique research methodology variation bias outcome factor

Tags

outcome

variation

bias

is:article

unique

social media

factor

data

research

survey

methodology

impact

Twitter

asymmetry

lang:en

quantity

information

Annotators

jasminehollingworth

URL

epjdatascience.springeropen.com/articles/10.1140/epjds/s13688-021-00286-7
Oct 2020
www.youtube.com www.youtube.com

Online Research Tools and Techniques

1
1. amyhcurtis 29 Oct 2020
  
  in BehSci
  
  Online Research Tools and Techniques. (2020, September 16). https://www.youtube.com/watch?v=wGWqBtDkOFs
  
  is:webinar is:youtube online research ethics methodology data funding application

Tags

is:webinar

research

data

application

ethics

methodology

online

is:youtube

funding

Annotators

amyhcurtis

URL

youtube.com/watch
twitter.com twitter.com

Health Nerd on Twitter

1
1. ErikStuchly 17 Oct 2020
  
  in BehSci
  
  Health Nerd on Twitter. (n.d.). Twitter. Retrieved October 17, 2020, from https://twitter.com/GidMK/status/1316511734115385344
  
  is:tweet lang:en COVID-19 epidemiology criticism research peer review methodology flaw guideline review testing sampling estimation

Tags

peer review

guideline

sampling

COVID-19

research

epidemiology

flaw

is:tweet

methodology

testing

criticism

lang:en

review

estimation

Annotators

ErikStuchly

URL

twitter.com/GidMK/status/1316511734115385344
Sep 2020
www.psychologicalscience.org www.psychologicalscience.org

Online Research: From Funding to Data Collection

1
1. ErikStuchly 25 Sep 2020
  
  in BehSci
  
  Online Research: From Funding to Data Collection. (n.d.). Association for Psychological Science - APS. Retrieved September 25, 2020, from https://www.psychologicalscience.org/news/online-research.html
  
  is:blog lang:en COVID-19 remote work online research funding data collection methodology creativity adaptation online data transparency

Tags

remote work

online research

COVID-19

creativity

adaptation

transparency

methodology

online data

data collection

lang:en

is:blog

funding

Annotators

ErikStuchly

URL

psychologicalscience.org/news/online-research.html
Jun 2020
psyarxiv.com psyarxiv.com

Too WEIRD, Too Fast: Preprints about COVID-19 in the Psychological Sciences

1
1. katietaylor_99 11 Jun 2020
  
  in BehSci
  
  Puthillam, Arathy. ‘Too WEIRD, Too Fast: Preprints about COVID-19 in the Psychological Sciences’. Preprint. PsyArXiv, 10 June 2020. https://doi.org/10.31234/osf.io/5w7du.
  
  is:preprint lang:en COVID-19 psychology research homogeneity policies US Germany meta-science diversity culture methodology

Tags

is:preprint

research

COVID-19

policies

US

Germany

diversity

methodology

homogeneity

meta-science

psychology

lang:en

culture

Annotators

katietaylor_99

URL

psyarxiv.com/5w7du/