- Apr 2022
-
twitter.com twitter.com
-
ReconfigBehSci. (2020, November 5). @ToddHorowitz3 2/2 so I would prefer to treat this as an opportunity for empirical observation and learning. Evaluation should focus on trying to assess actual contribution, not a priori judgments. [Tweet]. @SciBeh. https://twitter.com/SciBeh/status/1324367278352355330
-
- Dec 2020
-
psyarxiv.com psyarxiv.com
-
Rocca, R., & Yarkoni, T. (2020). Putting psychology to the test: Rethinking model evaluation through benchmarking and prediction. PsyArXiv. https://doi.org/10.31234/osf.io/e437b
-
- Mar 2019
-
deepblue.lib.umich.edu deepblue.lib.umich.edu
-
Human Performance Technology Model This page is an eight page PDF that gives an overview of the human performance technology model. This is a black and white PDF that is simply written and is accessible to the layperson. Authors are prominent writers in the field of performance technology. Rating 5/5
-
- Nov 2018
-
iphysresearch.github.io iphysresearch.github.io
-
Failing Loudly: An Empirical Study of Methods for Detecting Dataset Shift
该文做的实验是探索对数据集进行 shifts (某种可控的扰动) 后的模型表现,提出了classifier-based的方法/pipeline 来观察和评价:
这对于我的引力波数据研究来说,可以借鉴其数据的 shift 方法以及评价机制 (two-sample tests)。
-
- Oct 2018
-
iphysresearch.github.io iphysresearch.github.io
-
Approximate Fisher Information Matrix to Characterise the Training of Deep Neural Networks
深度神经网络训练(收敛/泛化性能)的近似Fisher信息矩阵表征,可自动优化mini-batch size/learning rate
挺有趣的 paper,提出了从 Fisher 矩阵抽象出新的量用来衡量训练过程中的模型表现,来优化mini-batch sizes and learning rates | 另外 paper 中的figure画的很好看 | 作者认为逐步增加batch sizes的传统理解只是partially true,存在逐步递减该 size 来提高 model 收敛和泛化能力的可能。
-