Hypothesis

24 Matching Annotations

Dec 2025
en.wikipedia.org

Emily M. Bender - Wikipedia

1
1. chrisaldrich 07 Dec 2025
  
  in Public
  
  https://en.wikipedia.org/wiki/Emily_M._Bender
  
  Emily M. Bender stochastic parrots artificial intelligence computational linguistics natural language processing Temnit Gebru Dan Allosso Book Club 2025-12-06 artificial intelligence ethics

Tags

Emily M. Bender

Temnit Gebru

computational linguistics

natural language processing

Dan Allosso Book Club 2025-12-06

artificial intelligence ethics

stochastic parrots

artificial intelligence

Annotators

chrisaldrich

URL

en.wikipedia.org/wiki/Emily_M._Bender
May 2025
ipfs.indy0.net

9781961334052

1
1. stopresetgo 01 May 2025
  
  in Public
  
  perceived by oneself “in here.” In this sense, the world consists of objects out there in space (the container that holds them) before me as the perceiving subject.
  
  for - adjacency - Indyweb dev - natural language - timebinding - parallel vs serial processing - comparison - spoken vs written language - what's also interesting is that spoken language is timebinding and sequential, and our written language descended from that - in spite of written language existing in 2D and 3D space, it inherited sequential flow, even though it does not have to - In this sense, the legacy spoken language system constrains written language to be - serial - sequential and - timebound instead of - parallel - Read any written text and you will observe that the pattern is sequential - We constrain our syntax to "flow" sequentially in 2D space, even though there is absolutely no other reason to constrain it to do so - This also reveals another implicit rule about language, that it assumes we can only focus our attention on one aspect of reality at a time
  
  comparison - spoken vs written language Indyweb dev - natural language - timebinding adjacency - Indyweb dev - natural language - timebinding - parallel vs serial processing

Tags

adjacency - Indyweb dev - natural language - timebinding - parallel vs serial processing

comparison - spoken vs written language

Indyweb dev - natural language - timebinding

Annotators

stopresetgo

URL

ipfs.indy0.net/ipfs/bafybeihk6dcr7dfruu65z5e5ze2rkeiydkmgbbpadhyulckm4afnqbtdgy
Jul 2024
whoosh.readthedocs.io

Query expansion and Key word extraction — Whoosh 2.7.4 documentation

1
1. Spinningthoughts 03 Jul 2024
  
  in Public
  
  Whoosh provides methods for computing the “key terms” of a set of documents. For these methods, “key terms” basically means terms that are frequent in the given documents, but relatively infrequent in the indexed collection as a whole.
  
  Very interesting method, and way of looking at the signal. "What makes a document exceptional because something is common within itself and uncommon without".
  
  natural language processing
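  The "frequent here, infrequent elsewhere" idea in the quoted passage can be illustrated with a small sketch. This is not Whoosh's implementation or API; it is a plain TF-IDF-style score, and the function name `key_terms`, the whitespace tokenizer, and the smoothing are all assumptions made for illustration.

```python
import math
from collections import Counter


def key_terms(selected_docs, all_docs, numterms=3):
    """Rank terms that are frequent in selected_docs but rare in all_docs.

    Hypothetical helper: a TF-IDF-style stand-in, not Whoosh's scoring model.
    """
    # How often each term appears inside the selected documents.
    selected_counts = Counter(
        term for doc in selected_docs for term in doc.lower().split()
    )

    # In how many documents of the whole collection each term appears.
    doc_freq = Counter()
    for doc in all_docs:
        doc_freq.update(set(doc.lower().split()))

    n_docs = len(all_docs)
    scores = {}
    for term, count in selected_counts.items():
        # Smoothed inverse document frequency: terms found in every
        # document of the collection score zero.
        idf = math.log((1 + n_docs) / (1 + doc_freq[term]))
        scores[term] = count * idf

    # Highest score first; ties broken alphabetically for stable output.
    return sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))[:numterms]


collection = [
    "the cat sat on the mat",
    "the dog sat on the log",
    "the parrot repeats the phrase",
    "the parrot mimics the parrot",
]
selected = collection[2:]
top_terms = key_terms(selected, collection)
```

  In this toy collection "the" is the most frequent word in the selected documents, but because it occurs in every document its score drops to zero, while "parrot" rises to the top.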

Tags

natural language processing

Annotators

Spinningthoughts

URL

whoosh.readthedocs.io/en/latest/keywords.html
Feb 2024
www.cortical.io

Semantic Folding | Semantic Fingerprinting | Language Intelligence | Cortical.io

1
1. stopresetgo 04 Feb 2024
  
  in Public
  
  for - semantic folding - semantic fingerprint - natural language processing - NLP - cortical.io - Numenta
  
  cortical.io Numenta natural language processing NLP semantic fingerprint semantic folding

Tags

natural language processing

semantic folding

Numenta

cortical.io

semantic fingerprint

NLP

Annotators

stopresetgo

URL

cortical.io/science/semantic-folding/
Jan 2023
www.complexityexplorer.org

Complexity Explorer

1
1. chrisaldrich 23 Jan 2023
  
  in Public
  
  a common technique in natural language processing is to operationalize certain semantic concepts (e.g., "synonym") in terms of syntactic structure (two words that tend to occur nearby in a sentence are more likely to be synonyms, etc). This is what word2vec does.
  
  Can I use some of these sorts of methods with respect to corpus linguistics over time to better identify calcified words or archaic phrases that stick with the language, but are heavily limited to narrower(ing) contexts?
  
  calcified words word2vec operationalization natural language processing historical linguistics open questions archaic phrases information theory
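  A toy version of the distributional idea in the quoted passage might look like the sketch below. It is not word2vec, which trains a neural model to learn dense vectors; this is a simpler count-based stand-in in which each word is represented by the words seen near it, and the function names, window size, and example sentences are assumptions for illustration only.

```python
import math
from collections import Counter, defaultdict


def cooccurrence_vectors(sentences, window=2):
    """Map each word to a Counter of the words seen within `window` positions."""
    vectors = defaultdict(Counter)
    for sentence in sentences:
        tokens = sentence.lower().split()
        for i, word in enumerate(tokens):
            lo = max(0, i - window)
            hi = min(len(tokens), i + window + 1)
            for j in range(lo, hi):
                if j != i:
                    vectors[word][tokens[j]] += 1
    return vectors


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[k] * b[k] for k in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    norm = norm_a * norm_b
    return dot / norm if norm else 0.0


sentences = [
    "she drank hot coffee today",
    "she drank hot tea today",
    "he parked the car outside",
]
vectors = cooccurrence_vectors(sentences)
sim_coffee_tea = cosine(vectors["coffee"], vectors["tea"])
sim_coffee_car = cosine(vectors["coffee"], vectors["car"])
```

  Here "coffee" and "tea" share the same neighbors and so come out as highly similar, while "coffee" and "car" share none. Running the same comparison on corpora sliced by period would be one possible way to approach the question about narrowing contexts, though that remains an open idea rather than a tested method.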

Tags

calcified words

historical linguistics

information theory

open questions

natural language processing

archaic phrases

word2vec

operationalization

Annotators

chrisaldrich

URL

complexityexplorer.org/courses/162-foundations-applications-of-humanities-analytics/segments/15624
genizalab.princeton.edu

Princeton Machine Learning and the Future of Philology Symposium

1
1. chrisaldrich 09 Jan 2023
  
  in Public
  
  https://genizalab.princeton.edu/events/2022/princeton-machine-learning-and-future-philology-symposium
  
  Was this recorded?
  
  machine learning philology symposia digital humanities manuscript studies artificial intelligence corpus linguistics incunabula handwriting recognition natural language processing

Tags

symposia

digital humanities

machine learning

philology

incunabula

handwriting recognition

manuscript studies

natural language processing

artificial intelligence

corpus linguistics

Annotators

chrisaldrich

URL

genizalab.princeton.edu/events/2022/princeton-machine-learning-and-future-philology-symposium
Local file

Finding a Fragment in a Pile of Geniza: A Practical Guide to Collections, Editions, and Resources

1
1. chrisaldrich 09 Jan 2023
  
  in Public
  
  Friedberg Judeo-Arabic Project, accessible at http://fjms.genizah.org. This project maintains a digital corpus of Judeo-Arabic texts that can be searched and analyzed.
  
  The Friedberg Judeo-Arabic Project contains a large corpus of Judeo-Arabic text which can be manually searched to help improve translations of texts, but it might also be profitably mined using information theoretic and corpus linguistic methods to provide larger group textual translations and suggestions at a grander scale.
  
  Friedberg Jewish Manuscript Society Friedberg Judeo-Arabic Project corpus linguistics digital humanities information theory artificial intelligence natural language processing contextual clues contextual extrapolation
Tags

Friedberg Jewish Manuscript Society

digital humanities

Friedberg Judeo-Arabic Project

contextual extrapolation

information theory

contextual clues

natural language processing

artificial intelligence

corpus linguistics

Annotators

chrisaldrich
Dec 2022
inst-fs-iad-prod.inscloudgate.net

On the Dangers of Stochastic Parrots: Can Language Models Be Too Big? 🦜

1
1. peter_murray 30 Dec 2022
  
  in Public
  
  Emily M. Bender, Timnit Gebru, Angelina McMillan-Major, and Shmargaret Shmitchell. 2021. On the Dangers of Stochastic Parrots: Can Language Models Be Too Big? 🦜. In Proceedings of the 2021 ACM Conference on Fairness, Accountability, and Transparency (FAccT '21). Association for Computing Machinery, New York, NY, USA, 610–623. https://doi.org/10.1145/3442188.3445922
  
  natural language processing

Tags

natural language processing

Annotators

peter_murray

URL

inst-fs-iad-prod.inscloudgate.net/files/cf4622a4-ec28-4c20-97e7-3d5b9da4852a/2021-bender-parrots.pdf
www.nlpdemystified.org

Natural Language Processing Demystified: Course Content

1
1. chrisaldrich 10 Dec 2022
  
  in Public
  
  https://www.nlpdemystified.org/course
  
  MOOC natural language processing online courseware neural networks

Tags

MOOC

neural networks

online courseware

natural language processing

Annotators

chrisaldrich

URL

nlpdemystified.org/course
Nov 2022
www.researchgate.net

(20) Robert Amsler

1
1. chrisaldrich 14 Nov 2022
  
  in Public
  
  Robert Amsler is a retired computational lexicologist, computational linguist, and information scientist. His Ph.D. was from UT-Austin in 1980. His primary work was in the area of understanding how machine-readable dictionaries could be used to create a taxonomy of dictionary word senses (which served as the motivation for the creation of WordNet) and in understanding how lexicon can be extracted from text corpora. He also invented a new technique in citation analysis that bears his name. His work is mentioned in Wikipedia articles on Machine-Readable dictionary, Computational lexicology, Bibliographic coupling, and Text mining. He currently lives in Vienna, VA and reads email at robert.amsler at utexas. edu. He is currently interested in chronological studies of vocabulary, esp. computer terms.
  
  https://www.researchgate.net/profile/Robert-Amsler
  
  Apparently follows my blog. :)
  
  Makes me wonder how we might better process and semantically parse people's personal notes, particularly when they're atomic and cross-linked?
  
  Robert Amsler linguistics dictionaries natural language processing corpus linguistics idea links open questions

Tags

open questions

natural language processing

idea links

linguistics

dictionaries

Robert Amsler

corpus linguistics

Annotators

chrisaldrich

URL

researchgate.net/profile/Robert-Amsler
Oct 2022
www.explainpaper.com

Explainpaper

1
1. chrisaldrich 27 Oct 2022
  
  in Public
  
  https://www.explainpaper.com/
  
  Another in a growing line of research tools for processing and making sense of research literature including Research Rabbit, Connected Papers, Semantic Scholar, etc.
  
  Functionality includes the ability to highlight sections of research papers with natural language processing to explain what those sections mean. There's also a "chat" that allows you to ask questions about the paper which will attempt to return reasonable answers, which is an artificial intelligence sort of means of having an artificial "conversation with the text".
  
  cc: @dwhly @remikalir @jeremydean
  
  artificial intelligence research papers tools tools for thought literature review literature search information overload research tools Explainpaper annotations natural language processing conversations with the text

Tags

information overload

annotations

conversations with the text

literature review

tools for thought

research tools

Explainpaper

natural language processing

literature search

research papers

tools

artificial intelligence

Annotators

chrisaldrich

URL

explainpaper.com/
Aug 2022
maggieappleton.com

Joining Ought

1
1. chrisaldrich 05 Aug 2022
  
  in Public
  
  https://maggieappleton.com/joining-ought
  
  read Maggie Appleton machine learning natural language processing GPT-3 Elicit Ought

Tags

natural language processing

machine learning

GPT-3

Maggie Appleton

Elicit

read

Ought

Annotators

chrisaldrich

URL

maggieappleton.com/joining-ought
Dec 2021
cacm.acm.org

Converting Laws to Programs

1
1. peter_murray 23 Dec 2021
  
  in Public
  
  Catala, a programming language developed by Protzenko's graduate student Denis Merigoux, who is working at the National Institute for Research in Digital Science and Technology (INRIA) in Paris, France. It is not often lawyers and programmers find themselves working together, but Catala was designed to capture and execute legal algorithms and to be understood by lawyers and programmers alike in a language "that lets you follow the very specific legal train of thought," Protzenko says.
  
  A domain-specific language for encoding legal interpretations.
  
  natural-language-processing legal legislative-history

Tags

natural-language-processing legal legislative-history

Annotators

peter_murray

URL

cacm.acm.org/magazines/2022/1/257436-converting-laws-to-programs/fulltext
Nov 2021
www.nature.com

Natural language processing and network analysis provide novel insights on policy and scientific discourse around Sustainable Development Goals

1
1. SamRose 20 Nov 2021
  
  in Public
  
  natural language processing nlp policy

Tags

natural language processing

policy

nlp

Annotators

SamRose

URL

nature.com/articles/s41598-021-01801-6
Jun 2021
psyarxiv.com

Web-scraping the Expression of Loneliness during COVID-19

1
1. XanaButt 28 Jun 2021
  
  in BehSci
  
  Jung, Y., Lee, Y. K., & Hahn, S. (2021). Web-scraping the Expression of Loneliness during COVID-19. PsyArXiv. https://doi.org/10.31234/osf.io/59gwk
  
  is:preprint lang:en COVID-19 loneliness Natural Language Processing modeling internet social media emotion internal state appraisal online relationship

Tags

emotion

social media

internal state

lang:en

internet

Natural Language Processing

loneliness

online relationship

is:preprint

appraisal

COVID-19

modeling

Annotators

XanaButt

URL

psyarxiv.com/59gwk/
Mar 2021
psyarxiv.com

Scared into Action: How Partisanship and Fear are Associated with Reactions to Public Health Directives

1
1. sophia.sterckx 15 Mar 2021
  
  in BehSci
  
  Lindow, Mike, David DeFranza, Arul Mishra, and Himanshu Mishra. ‘Scared into Action: How Partisanship and Fear Are Associated with Reactions to Public Health Directives’. PsyArXiv, 12 January 2021. https://doi.org/10.31234/osf.io/8me7q.
  
  is:preprint lang:en COVID-19 political ideology tweets natural language processing word embedding gradient boosted decision trees corona coronavirus health directives liberals conservatives politics federal government USA twitter

Tags

politics

processing

word embedding

federal government

political ideology

natural language

twitter

conservatives

tweets

gradient boosted decision trees

lang:en

COVID-19

corona

coronavirus

is:preprint

health directives

liberals

USA

Annotators

sophia.sterckx

URL

psyarxiv.com/8me7q/
arxiv.org

Semantic and Relational Spaces in Science of Science: Deep Learning Models for Article Vectorisation

1
1. n.parfitt 15 Mar 2021
  
  in BehSci
  
  Kozlowski, Diego, Jennifer Dusdal, Jun Pang, and Andreas Zilian. ‘Semantic and Relational Spaces in Science of Science: Deep Learning Models for Article Vectorisation’. ArXiv:2011.02887 [Physics], 5 November 2020. http://arxiv.org/abs/2011.02887.
  
  lang:en is:article semantic relational science deep learning model article vectorization literature review epistemic social pattern computer science tool research Natural Language Processing Graph Neural Networks

Tags

is:article

deep

model

Natural Language Processing

epistemic

computer science

research

science

Graph Neural Networks

pattern

semantic

learning

lang:en

relational

tool

article

review

literature

social

vectorization

Annotators

n.parfitt

URL

arxiv.org/abs/2011.02887
Aug 2020
onlinelibrary.wiley.com

The More Who Die, the Less We Care: Evidence from Natural Language Analysis of Online News Articles and Social Media Posts

1
1. ErikStuchly 31 Aug 2020
  
  in BehSci
  
  Bhatia, S., Walasek, L., Slovic, P., & Kunreuther, H. (2020). The More Who Die, the Less We Care: Evidence from Natural Language Analysis of Online News Articles and Social Media Posts. Risk Analysis, risa.13582. https://doi.org/10.1111/risa.13582
  
  is:article lang:en COVID-19 natural language processing big data online news social media psychic numbing death rate caring affective reaction loss of life valence arousal emotional content psychology

Tags

is:article

social media

online news

valence

psychic numbing

death rate

emotional content

psychology

big data

lang:en

affective reaction

natural language processing

loss of life

caring

COVID-19

arousal

Annotators

ErikStuchly

URL

onlinelibrary.wiley.com/doi/abs/10.1111/risa.13582
psyarxiv.com

Digital phenotyping of complex psychological responses to the COVID-19 pandemic

1
1. Gaurav_Saxena 14 Aug 2020
  
  in BehSci
  
  Hull, T., Levine, J., Bantilan, N., Desai, A., & Majumder, M. S. (2020, August 13). Digital phenotyping of complex psychological responses to the COVID-19 pandemic. https://doi.org/10.31234/osf.io/qtrpf
  
  is:preprint lang:en COVID-19 symptom tracking digital phenotyping psychological sequalae telehealth digital mental health natural language processing machine learning

Tags

machine learning

lang:en

digital phenotyping

symptom tracking

telehealth

is:preprint

natural language processing

psychological sequalae

COVID-19

digital mental health

Annotators

Gaurav_Saxena

URL

psyarxiv.com/qtrpf/
Jul 2020
wit.ai

Wit.ai

1
1. TylerRick 23 Jul 2020
  
  in Public
  
  natural language processing AI

Tags

natural language processing

AI

Annotators

TylerRick

URL

wit.ai/
osf.io

Capturing and analyzing social representations. A first application of Natural Language Processing techniques to reader’s comments in COVID-19 news. Argentina, 2020

1
1. ErikStuchly 15 Jul 2020
  
  in BehSci
  
  Rosati, G., Domenech, L., Chazarreta, A., & Maguire, T. (2020). Capturing and analyzing social representations. A first application of Natural Language Processing techniques to reader’s comments in COVID-19 news. Argentina, 2020 [Preprint]. SocArXiv. https://doi.org/10.31235/osf.io/3pcdu
  
  is:preprint lang:en COVID-19 social representation analysis natural language processing comment news Argentina quantification topic Latent Dirichlet Allocation prototype FastText

Tags

analysis

prototype

social representation

lang:en

quantification

natural language processing

is:preprint

Latent Dirichlet Allocation

FastText

topic

Argentina

news

COVID-19

comment

Annotators

ErikStuchly

URL

osf.io/preprints/socarxiv/3pcdu/
May 2020
arxiv.org

Complex Societies and the Growth of the Law

1
1. edampf 28 May 2020
  
  in BehSci
  
  Katz, D. M., Coupette, C., Beckedorf, J., & Hartung, D. (2020). Complex Societies and the Growth of the Law. ArXiv:2005.07646 [Physics]. http://arxiv.org/abs/2005.07646
  
  is:preprint lang:en computer science complex societies law Germany USA legislation modeling multidimensional time-evolving natural language processing network science welfare state tax state

Tags

complex societies

legislation

time-evolving

welfare state

lang:en

natural language processing

is:preprint

Germany

computer science

multidimensional

law

network science

USA

tax state

modeling

Annotators

edampf

URL

arxiv.org/abs/2005.07646
psyarxiv.com

Moral Concerns are Differentially Observable in Language

1
1. edampf 13 May 2020
  
  in BehSci
  
  Kennedy, B., Atari, M., Davani, A. M., Hoover, J., Omrani, A., Graham, J., & Dehghani, M. (2020, May 7). Moral Concerns are Differentially Observable in Language. https://doi.org/10.31234/osf.io/uqmty
  
  is:preprint lang:en morality language text analysis moral foundations theory observational analysis psychology communication Facebook online status update questionnaire prediction self-report natural language processing

Tags

text analysis

moral foundations theory

Facebook

language

morality

questionnaire

self-report

psychology

communication

lang:en

update

online status

natural language processing

is:preprint

observational analysis

prediction

Annotators

edampf

URL

psyarxiv.com/uqmty/
Apr 2020
en.wikipedia.org

Hyphenation algorithm - Wikipedia

1
1. TylerRick 29 Apr 2020
  
  in Public
  
  algorithms natural language processing

Tags

natural language processing

algorithms

Annotators

TylerRick

URL

en.wikipedia.org/wiki/Hyphenation_algorithm