Hypothesis

19 Matching Annotations

May 2026
cruxevals.com

https://cruxevals.com/

1
1. fxp007 07 May 2026
  
  in Public
  
  Andrej Karpathy built a simple automation pipeline for AI agents to optimize training in 5-minute increments.
  
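  This case shows AI systems being applied to automated research. A 5-minute optimization increment is a fine-grained time scale, which indicates that AI systems can already run rapidly iterating experiments. The 61K+ GitHub stars indicate that this approach has drawn wide attention in the AI research community.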
  
  data-point automation-scale research-methodology

Tags

automation-scale

data-point

research-methodology

Annotators

fxp007

URL

cruxevals.com/
huggingface.co

https://huggingface.co/papers/2604.24658

1
1. fxp007 01 May 2026
  
  in Public
  
  Scientific publication compresses a branching, iterative research process into a linear narrative, discarding the majority of what was discovered along the way.
  
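  Most people assume that a scientific paper fully records the research process, but the authors argue that traditional papers actually discard most of what was discovered and present only a linear narrative, which constitutes the so-called 'storytelling tax'. This view challenges the common academic assumption that publications are complete.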
  
  non-consensus research-methodology storytelling-tax

Tags

non-consensus

storytelling-tax

research-methodology

Annotators

fxp007

URL

huggingface.co/papers/2604.24658
Apr 2026
www.scientificamerican.com

https://www.scientificamerican.com/article/amateur-armed-with-chatgpt-vibe-maths-a-60-year-old-problem/

1
1. fxp007 30 Apr 2026
  
  in Public
  
  An AI researcher subsequently gifted them each a ChatGPT Pro subscription to encourage their 'vibe mathing.'
  
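  Most people assume that serious mathematical research requires rigorous methods and deep expertise, but the author uses the informal term 'vibe mathing' to describe this way of doing research, which challenges the traditional norms of academic research methodology.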
  
  non-consensus research-methodology

Tags

non-consensus

research-methodology

Annotators

fxp007

URL

scientificamerican.com/article/amateur-armed-with-chatgpt-vibe-maths-a-60-year-old-problem/
metr.org

The Org Uplift Game - METR

1
1. fxp007 09 Apr 2026
  
  in Public
  
  two participants gave it 9/10 and one "11/10"
  
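  A 2-hour tabletop-style exercise received ratings of 9 to 11 from three top AI safety researchers. That is itself a signal: serious AI research organizations are using "role-play" to prepare for the future. This methodology (rehearsing workflows under future capabilities) has precedents in other fields, such as military wargames, disaster drills, and scenario planning, but applying it to the evolution of AI capabilities reflects METR's distinctive research taste.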
  
  tabletop-exercise future-preparation research-methodology surprising

Tags

surprising

future-preparation

tabletop-exercise

research-methodology

Annotators

fxp007

URL

metr.org/notes/2026-03-19-org-uplift-game/
transformer-circuits.pub

Emotion Concepts and their Function in a Large Language Model

1
1. fxp007 09 Apr 2026
  
  in Public
  
  Large language models (LLMs) sometimes appear to exhibit emotional reactions. We investigate why this is the case in Claude Sonnet 4.5 and explore implications for alignment-relevant behavior.
  
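  [Inspiration] This sentence points to a new paradigm for AI research: rather than asking "what can the model do", ask "why does the model do this". Using emotion as an entry point for understanding model behavior essentially brings the methodology of psychology into AI interpretability research. The takeaway for practitioners is that the most valuable AI research in the future may lie not in algorithmic innovation but in "finding mechanistic explanations for known phenomena", as this paper does.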
  
  inspiration research-paradigm mechanistic-interpretability methodology

Tags

methodology

inspiration

research-paradigm

mechanistic-interpretability

Annotators

fxp007

URL

transformer-circuits.pub/2026/emotions/index.html
Aug 2025
www.nature.com

The science fiction science method

1
1. tonz 08 Aug 2025
  
  in Public
  
  suggests quantitative methods wrt predicting future tech impact on behaviour/social aspects, in contrast with the usual qualitative narrative methods (futurism, narrative inquiry, scenarios presumably). The Science Fiction Science Method as PDF in Zotero. PDF available CC BY at https://www.researchgate.net/publication/394323287_The_Science_Fiction_Science_Method
  
  via Bruce Sterling (Mastodon)
  
  sciencefiction research methodology

Tags

methodology

research

sciencefiction

Annotators

tonz

URL

nature.com/articles/s41586-025-09194-6
Apr 2024
www.researchgate.net

FoundationPhaseMatters-RamadiroandPorteus-2017.pdf

1
1. PelserM 13 Apr 2024
  
  in Public
  
  educational design research methodology.
  
  Educational design research methodology.
  
  Education design research methodology mother tongue foundation phase linguistic resources

Tags

Education design research methodology

linguistic resources

mother tongue

foundation phase

Annotators

PelserM

URL

researchgate.net/profile/Brian-Ramadiro/publication/321330576_Foundation_Phase_Matters_Language_and_Learning_in_South_African_Rural_Classrooms/links/5a1d0c03aca2726120b28058/Foundation-Phase-Matters-Language-and-Learning-in-South-African-Rural-Classrooms.pdf
Feb 2024
www.gesis.org

GESIS - Leibniz Institute for the Social Sciences

1
1. Saldner_DANS 29 Feb 2024
  
  in Public
  
  rda_graph research data surveys social sciences RDM research methodology

Tags

RDM

rda_graph

research methodology

surveys

research data

social sciences

Annotators

Saldner_DANS

URL

gesis.org/en/home
Dec 2021
www.nature.com

Replicating scientific results is tough — but essential

1
1. lucyparfitt16 15 Dec 2021
  
  in BehSci
  
  Replicating scientific results is tough—But essential. (2021). Nature, 600(7889), 359–360. https://doi.org/10.1038/d41586-021-03736-4
  
  is:article lang:en science replication research cancer biology scientific journal Reproducibility Project experimental treatment bias time consuming detail open science scientific method methodology data peer review funding publishing investment progress

Tags

open science

is:article

progress

scientific journal

science

Reproducibility Project

publishing

peer review

cancer biology

funding

bias

detail

methodology

lang:en

experimental treatment

research

replication

data

time consuming

scientific method

investment

Annotators

lucyparfitt16

URL

nature.com/articles/d41586-021-03736-4
Nov 2021
Local file

Untitled document

1
1. Mark_C_Harris 27 Nov 2021
  
  in Public
  
  (the VTA is also part of this system, but is too small to image with standard fMRI methods, but see [35] for successful imaging methods).
  
  All imaging studies face questions of validity and should (and many do) link to comprehensive details on instrumentation, methodology, and interpretation. Apparently, the professional consensus remains that, properly executed and interpreted, fMRI and other functional imaging techniques based on detection of oxygenation can lead to highly valid conclusions. (See Nautil.us article.)
  
  brain imaging fMRI functional imaging validity research methodology oxygenation
Tags

validity

research

fMRI

brain imaging

functional imaging

methodology

oxygenation

Annotators

Mark_C_Harris
www.mobindustry.net

How to Conduct Agile Market Research for Your Digital Product

1
1. DavidCutts 23 Nov 2021
  
  in Public
  
  How to Conduct Agile Market Research for Your Digital Product
  
  agile methodology market research digital product

Tags

agile methodology

digital product

market research

Annotators

DavidCutts

URL

mobindustry.net/blog/how-to-conduct-agile-market-research-for-your-digital-product/
Jul 2021
www.sciencedirect.com

Pre-registration: Weighing costs and benefits for researchers

1
1. Lu17Cheryl 12 Jul 2021
  
  in BehSci
  
  Logg, Jennifer M., and Charles A. Dorison. “Pre-Registration: Weighing Costs and Benefits for Researchers.” Organizational Behavior and Human Decision Processes 167 (November 1, 2021): 18–27. https://doi.org/10.1016/j.obhdp.2021.05.006.
  
  is:article lang:en pre-registration credibility research replicability reproducibility cost benefit open science methodology replication

Tags

open science

pre-registration

is:article

reproducibility

methodology

lang:en

cost

replicability

research

replication

benefit

credibility

Annotators

Lu17Cheryl

URL

sciencedirect.com/science/article/abs/pii/S0749597821000649
Jun 2021
metascience2021.org

Metascience 2021

1
1. lucyparfitt16 27 Jun 2021
  
  in BehSci
  
  Metascience 2021. (n.d.). Retrieved June 27, 2021, from https://metascience2021.org/
  
  is:webpage webinar conference meta science science research scientific process technology modem methodology community research practice intervention video

Tags

webinar

conference

technology

community

video

is:webpage

science

research

modem methodology

research practice

scientific process

meta science

intervention

Annotators

lucyparfitt16

URL

metascience2021.org/
www.jclinepi.com

Methodology over metrics: Current scientific standards are a disservice to patients and society

1
1. SIYANYE 18 Jun 2021
  
  in BehSci
  
  Calster, B. V., Wynants, L., Riley, R. D., Smeden, M. van, & Collins, G. S. (2021). Methodology over metrics: Current scientific standards are a disservice to patients and society. Journal of Clinical Epidemiology, 0(0). https://doi.org/10.1016/j.jclinepi.2021.05.018
  
  lang:en is:article research quality methodology reporting COVID-19 peer review education

Tags

is:article

methodology

lang:en

research quality

COVID-19

education

peer review

reporting

Annotators

SIYANYE

URL

jclinepi.com/article/S0895-4356(21)00170-0/fulltext
epjdatascience.springeropen.com

Linking Twitter and survey data: asymmetry in quantity and its impact

1
1. jasminehollingworth 11 Jun 2021
  
  in BehSci
  
  Baghal, T. A., Wenz, A., Sloan, L., & Jessop, C. (2021). Linking Twitter and survey data: Asymmetry in quantity and its impact. EPJ Data Science, 10(1), 1–20. https://doi.org/10.1140/epjds/s13688-021-00286-7
  
  lang:en is:article Twitter survey data asymmetry quantity impact social media information unique research methodology variation bias outcome factor

Tags

is:article

quantity

Twitter

asymmetry

information

bias

variation

outcome

methodology

lang:en

impact

research

factor

data

survey

social media

unique

Annotators

jasminehollingworth

URL

epjdatascience.springeropen.com/articles/10.1140/epjds/s13688-021-00286-7
Oct 2020
www.youtube.com

Online Research Tools and Techniques

1
1. amyhcurtis 29 Oct 2020
  
  in BehSci
  
  Online Research Tools and Techniques. (2020, September 16). https://www.youtube.com/watch?v=wGWqBtDkOFs
  
  is:webinar is:youtube online research ethics methodology data funding application

Tags

funding

methodology

online

research

data

is:webinar

is:youtube

application

ethics

Annotators

amyhcurtis

URL

youtube.com/watch
twitter.com

Health Nerd on Twitter

1
1. ErikStuchly 17 Oct 2020
  
  in BehSci
  
  Health Nerd on Twitter. (n.d.). Twitter. Retrieved October 17, 2020, from https://twitter.com/GidMK/status/1316511734115385344
  
  is:tweet lang:en COVID-19 epidemiology criticism research peer review methodology flaw guideline review testing sampling estimation

Tags

sampling

methodology

lang:en

COVID-19

testing

is:tweet

review

research

epidemiology

flaw

criticism

peer review

guideline

estimation

Annotators

ErikStuchly

URL

twitter.com/GidMK/status/1316511734115385344
Sep 2020
www.psychologicalscience.org

Online Research: From Funding to Data Collection

1
1. ErikStuchly 25 Sep 2020
  
  in BehSci
  
  Online Research: From Funding to Data Collection. (n.d.). Association for Psychological Science - APS. Retrieved September 25, 2020, from https://www.psychologicalscience.org/news/online-research.html
  
  is:blog lang:en COVID-19 remote work online research funding data collection methodology creativity adaptation online data transparency

Tags

funding

online data

methodology

lang:en

COVID-19

remote work

data collection

creativity

adaptation

transparency

online research

is:blog

Annotators

ErikStuchly

URL

psychologicalscience.org/news/online-research.html
Jun 2020
psyarxiv.com

Too WEIRD, Too Fast: Preprints about COVID-19 in the Psychological Sciences

1
1. katietaylor_99 11 Jun 2020
  
  in BehSci
  
  Puthillam, Arathy. ‘Too WEIRD, Too Fast: Preprints about COVID-19 in the Psychological Sciences’. Preprint. PsyArXiv, 10 June 2020. https://doi.org/10.31234/osf.io/5w7du.
  
  is:preprint lang:en COVID-19 psychology research homogeneity policies US Germany meta-science diversity culture methodology

Tags

US

methodology

lang:en

COVID-19

meta-science

homogeneity

research

is:preprint

culture

policies

diversity

psychology

Germany

Annotators

katietaylor_99

URL

psyarxiv.com/5w7du/