Hypothesis

25 Matching Annotations

Jul 2024
whoosh.readthedocs.io whoosh.readthedocs.io

Query expansion and Key word extraction — Whoosh 2.7.4 documentation

1
1. Spinningthoughts 03 Jul 2024
  
  in Public
  
  Whoosh provides methods for computing the “key terms” of a set of documents. For these methods, “key terms” basically means terms that are frequent in the given documents, but relatively infrequent in the indexed collection as a whole.
  
  Very interesting method, and way of looking at the signal. "What makes a document exceptional because something is common within itself and uncommon without".
  
  natural language processing
Visit annotations in context

Tags

natural language processing

Annotators

Spinningthoughts

URL

whoosh.readthedocs.io/en/latest/keywords.html
Feb 2024
www.cortical.io www.cortical.io

Semantic Folding | Semantic Fingerprinting | Language Intelligence | Cortical.io

1
1. stopresetgo 04 Feb 2024
  
  in Public
  
  for - semantic folding - semantic fingerprint - natural language processing - NLP - cortical.io - Numenta
  
  cortical.io Numenta natural language processing NLP semantic fingerprint semantic folding
Visit annotations in context

Tags

semantic folding

natural language processing

cortical.io

NLP

Numenta

semantic fingerprint

Annotators

stopresetgo

URL

cortical.io/science/semantic-folding/
Jan 2023
www.complexityexplorer.org www.complexityexplorer.org

Complexity Explorer

1
1. chrisaldrich 23 Jan 2023
  
  in Public
  
  a common technique in natural language processing is to operationalize certain semantic concepts (e.g., "synonym") in terms of syntactic structure (two words that tend to occur nearby in a sentence are more likely to be synonyms, etc). This is what word2vec does.
  
  Can I use some of these sorts of methods with respect to corpus linguistics over time to better identified calcified words or archaic phrases that stick with the language, but are heavily limited to narrower(ing) contexts?
  
  calcified words word2vec operationalization natural language processing historical linguistics open questions archaic phrases information theory
Visit annotations in context

Tags

information theory

word2vec

open questions

natural language processing

operationalization

calcified words

archaic phrases

historical linguistics

Annotators

chrisaldrich

URL

complexityexplorer.org/courses/162-foundations-applications-of-humanities-analytics/segments/15624
genizalab.princeton.edu genizalab.princeton.edu

Princeton Machine Learning and the Future of Philology Symposium

1
1. chrisaldrich 09 Jan 2023
  
  in Public
  
  https://genizalab.princeton.edu/events/2022/princeton-machine-learning-and-future-philology-symposium
  
  Was this recorded?
  
  machine learning philology symposia digital humanities manuscript studies artificial intelligence corpus linguistics incunabula handwriting recognition natural language processing
Visit annotations in context

Tags

manuscript studies

symposia

incunabula

artificial intelligence

machine learning

digital humanities

handwriting recognition

natural language processing

corpus linguistics

philology

Annotators

chrisaldrich

URL

genizalab.princeton.edu/events/2022/princeton-machine-learning-and-future-philology-symposium
Local file Local file

Finding a Fragment in a Pile of Geniza: A Practical Guide to Collections, Editions, and Resources

1
1. chrisaldrich 09 Jan 2023
  
  in Public
  
  Fried-berg Judeo-Arabic Project, accessible at http://fjms.genizah.org. This projectmaintains a digital corpus of Judeo-Arabic texts that can be searched and an-alyzed.
  
  The Friedberg Judeo-Arabic Project contains a large corpus of Judeo-Arabic text which can be manually searched to help improve translations of texts, but it might also be profitably mined using information theoretic and corpus linguistic methods to provide larger group textual translations and suggestions at a grander scale.
  
  Friedberg Jewish Manuscript Society Friedberg Judeo-Arabic Project corpus linguistics digital humanities information theory artificial intelligence natural language processing contextual clues contextual extrapolation
Tags

information theory

Friedberg Judeo-Arabic Project

artificial intelligence

digital humanities

contextual clues

natural language processing

corpus linguistics

contextual extrapolation

Friedberg Jewish Manuscript Society

Annotators

chrisaldrich
Dec 2022
dl.acm.org dl.acm.org

On the Dangers of Stochastic Parrots: Can Language Models Be Too Big? "1F99COn the Dangers of Stochastic Parrots: Can Language Models Be Too Big? "1F99C

1
1. peter_murray 30 Dec 2022
  
  in Public
  
  Emily M. Bender, Timnit Gebru, Angelina McMillan-Major, and Shmargaret Shmitchell. 2021. On the Dangers of Stochastic Parrots: Can Language Models Be Too Big? 🦜. In Proceedings of the 2021 ACM Conference on Fairness, Accountability, and Transparency (FAccT '21). Association for Computing Machinery, New York, NY, USA, 610–623. https://doi.org/10.1145/3442188.3445922
  
  natural language processing
Visit annotations in context

Tags

natural language processing

Annotators

peter_murray

URL

dl.acm.org/doi/pdf/10.1145/3442188.3445922
www.nlpdemystified.org www.nlpdemystified.org

Natural Language Processing Demystified: Course Content

1
1. chrisaldrich 10 Dec 2022
  
  in Public
  
  https://www.nlpdemystified.org/course
  
  MOOC natural language processing online courseware neural networks
Visit annotations in context

Tags

neural networks

MOOC

natural language processing

online courseware

Annotators

chrisaldrich

URL

nlpdemystified.org/course
Nov 2022
www.researchgate.net www.researchgate.net

(20) Robert Amsler

1
1. chrisaldrich 14 Nov 2022
  
  in Public
  
  Robert Amsler is a retired computational lexicology, computational linguist, information scientist. His P.D. was from UT-Austin in 1980. His primary work was in the area of understanding how machine-readable dictionaries could be used to create a taxonomy of dictionary word senses (which served as the motivation for the creation of WordNet) and in understanding how lexicon can be extracted from text corpora. He also invented a new technique in citation analysis that bears his name. His work is mentioned in Wikipedia articles on Machine-Readable dictionary, Computational lexicology, Bibliographic coupling, and Text mining. He currently lives in Vienna, VA and reads email at robert.amsler at utexas. edu. He is currenly interested in chronological studies of vocabulary, esp. computer terms.
  
  https://www.researchgate.net/profile/Robert-Amsler
  
  Apparently follow my blog. :)
  
  Makes me wonder how we might better process and semantically parse peoples' personal notes, particularly when they're atomic and cross-linked?
  
  Robert Amsler linguistics dictionaries natural language processing corpus linguistics idea links open questions
Visit annotations in context

Tags

open questions

dictionaries

natural language processing

corpus linguistics

linguistics

Robert Amsler

idea links

Annotators

chrisaldrich

URL

researchgate.net/profile/Robert-Amsler
Oct 2022
www.explainpaper.com www.explainpaper.com

Explainpaper

1
1. chrisaldrich 27 Oct 2022
  
  in Public
  
  https://www.explainpaper.com/
  
  Another in a growing line of research tools for processing and making sense of research literature including Research Rabbit, Connected Papers, Semantic Scholar, etc.
  
  Functionality includes the ability to highlight sections of research papers with natural language processing to explain what those sections mean. There's also a "chat" that allows you to ask questions about the paper which will attempt to return reasonable answers, which is an artificial intelligence sort of means of having an artificial "conversation with the text".
  
  cc: @dwhly @remikalir @jeremydean
  
  artificial intelligence research papers tools tools for thought literature review literature search information overload research tools Explainpaper annotations natural language processing conversations with the text
Visit annotations in context

Tags

conversations with the text

research papers

literature review

artificial intelligence

research tools

annotations

information overload

tools

Explainpaper

literature search

natural language processing

tools for thought

Annotators

chrisaldrich

URL

explainpaper.com/
Aug 2022
maggieappleton.com maggieappleton.com

Joining Ought

1
1. chrisaldrich 05 Aug 2022
  
  in Public
  
  https://maggieappleton.com/joining-ought
  
  read Maggie Appleton machine learning natural language processing GPT-3 Elicit Ought
Visit annotations in context

Tags

natural language processing

Elicit

machine learning

read

GPT-3

Ought

Maggie Appleton

Annotators

chrisaldrich

URL

maggieappleton.com/joining-ought
Dec 2021
cacm.acm.org cacm.acm.org

Converting Laws to Programs

1
1. peter_murray 23 Dec 2021
  
  in Public
  
  Catala, a programming language developed by Protzenko's graduate student Denis Merigoux, who is working at the National Institute for Research in Digital Science and Technology (INRIA) in Paris, France. It is not often lawyers and programmers find themselves working together, but Catala was designed to capture and execute legal algorithms and to be understood by lawyers and programmers alike in a language "that lets you follow the very specific legal train of thought," Protzenko says.
  
  A domain-specific language for encoding legal interpretations.
  
  natural-language-processing legal legislative-history
Visit annotations in context

Tags

natural-language-processing legal legislative-history

Annotators

peter_murray

URL

cacm.acm.org/magazines/2022/1/257436-converting-laws-to-programs/fulltext
onlinelibrary.wiley.com onlinelibrary.wiley.com

How Do We Believe?

1
1. jackiekrauss 01 Dec 2021
  
  in BehSci
  
  Sloman, S. A. (2021). How Do We Believe? Topics in Cognitive Science, 0(2021), 1–14. https://doi.org/10.1111/tops.12580
  
  is:article lang:en cognitive science human thought information processing memory pattern recognition generalizability predictability representational scheme unfamiliar circumstance sophisticated associative model representational language causal reasoning dual system of thinking knowledge
Visit annotations in context

Tags

causal reasoning

generalizability

is:article

memory

knowledge

predictability

dual system of thinking

sophisticated associative model

human thought

representational language

pattern recognition

cognitive science

representational scheme

lang:en

information processing

unfamiliar circumstance

Annotators

jackiekrauss

URL

onlinelibrary.wiley.com/doi/abs/10.1111/tops.12580
Nov 2021
www.nature.com www.nature.com

Natural language processing and network analysis provide novel insights on policy and scientific discourse around Sustainable Development Goals

1
1. SamRose 20 Nov 2021
  
  in Public
  
  natural language processing nlp policy
Visit annotations in context

Tags

nlp

policy

natural language processing

Annotators

SamRose

URL

nature.com/articles/s41598-021-01801-6
Jun 2021
psyarxiv.com psyarxiv.com

Web-scraping the Expression of Loneliness during COVID-19

1
1. XanaButt 28 Jun 2021
  
  in BehSci
  
  Jung, Y., Lee, Y. K., & Hahn, S. (2021). Web-scraping the Expression of Loneliness during COVID-19. PsyArXiv. https://doi.org/10.31234/osf.io/59gwk
  
  is:preprint lang:en COVID-19 loneliness Natural Language Processing modeling internet social media emotion internal state appraisal online relationship
Visit annotations in context

Tags

loneliness

emotion

lang:en

Natural Language Processing

social media

appraisal

modeling

internal state

COVID-19

internet

is:preprint

online relationship

Annotators

XanaButt

URL

psyarxiv.com/59gwk/
Mar 2021
psyarxiv.com psyarxiv.com

Scared into Action: How Partisanship and Fear are Associated with Reactions to Public Health Directives

1
1. sophia.sterckx 15 Mar 2021
  
  in BehSci
  
  Lindow, Mike, David DeFranza, Arul Mishra, and Himanshu Mishra. ‘Scared into Action: How Partisanship and Fear Are Associated with Reactions to Public Health Directives’. PsyArXiv, 12 January 2021. https://doi.org/10.31234/osf.io/8me7q.
  
  is:preprint lang:en COVID-19 political ideology tweets natural language processing word embedding gradient boosted decision trees corona coronavirus health directives liberals conservatives politics federal government USA twitter
Visit annotations in context

Tags

politics

processing

coronavirus

gradient boosted decision trees

political ideology

natural language

liberals

tweets

federal government

USA

lang:en

word embedding

conservatives

corona

health directives

twitter

COVID-19

is:preprint

Annotators

sophia.sterckx

URL

psyarxiv.com/8me7q/
arxiv.org arxiv.org

Semantic and Relational Spaces in Science of Science: Deep Learning Models for Article Vectorisation

1
1. n.parfitt 15 Mar 2021
  
  in BehSci
  
  Kozlowski, Diego, Jennifer Dusdal, Jun Pang, and Andreas Zilian. ‘Semantic and Relational Spaces in Science of Science: Deep Learning Models for Article Vectorisation’. ArXiv:2011.02887 [Physics], 5 November 2020. http://arxiv.org/abs/2011.02887.
  
  lang:en is:article semantic relational science deep learning model article vectorization literature review epistemic social pattern computer science tool research Natural Language Processing Graph Neural Networks
Visit annotations in context

Tags

model

tool

Natural Language Processing

is:article

science

Graph Neural Networks

epistemic

social

deep

research

review

article

vectorization

learning

semantic

lang:en

relational

pattern

literature

computer science

Annotators

n.parfitt

URL

arxiv.org/abs/2011.02887
Aug 2020
onlinelibrary.wiley.com onlinelibrary.wiley.com

The More Who Die, the Less We Care: Evidence from Natural Language Analysis of Online News Articles and Social Media Posts

1
1. ErikStuchly 31 Aug 2020
  
  in BehSci
  
  Bhatia, S., Walasek, L., Slovic, P., & Kunreuther, H. (2020). The More Who Die, the Less We Care: Evidence from Natural Language Analysis of Online News Articles and Social Media Posts. Risk Analysis, risa.13582. https://doi.org/10.1111/risa.13582
  
  is:article lang:en COVID-19 natural language processing big data online news social media psychic numbing death rate caring affective reaction loss of life valence arousal emotional content psychology
Visit annotations in context

Tags

arousal

death rate

emotional content

is:article

social media

affective reaction

online news

psychic numbing

lang:en

loss of life

big data

natural language processing

caring

COVID-19

psychology

valence

Annotators

ErikStuchly

URL

onlinelibrary.wiley.com/doi/abs/10.1111/risa.13582
psyarxiv.com psyarxiv.com

Digital phenotyping of complex psychological responses to the COVID-19 pandemic

1
1. Gaurav_Saxena 14 Aug 2020
  
  in BehSci
  
  Hull, T., Levine, J., Bantilan, N., Desai, A., & Majumder, M. S. (2020, August 13). Digital phenotyping of complex psychological responses to the COVID-19 pandemic. https://doi.org/10.31234/osf.io/qtrpf
  
  is:preprint lang:en COVID-19 symptom tracking digital phenotyping psychological sequalae telehealth digital mental health natural language processing machine learning
Visit annotations in context

Tags

symptom tracking

psychological sequalae

lang:en

machine learning

digital phenotyping

telehealth

natural language processing

digital mental health

COVID-19

is:preprint

Annotators

Gaurav_Saxena

URL

psyarxiv.com/qtrpf/
Jul 2020
wit.ai wit.ai

Wit.ai

1
1. TylerRick 23 Jul 2020
  
  in Public
  
  natural language processing AI
Visit annotations in context

Tags

AI

natural language processing

Annotators

TylerRick

URL

wit.ai/
osf.io osf.io

Capturing and analyzing social representations. A first application of Natural Language Processing techniques to reader’s comments in COVID-19 news. Argentina, 2020

1
1. ErikStuchly 15 Jul 2020
  
  in BehSci
  
  Rosati, G., Domenech, L., Chazarreta, A., & Maguire, T. (2020). Capturing and analyzing social representations. A first application of Natural Language Processing techniques to reader’s comments in COVID-19 news. Argentina, 2020 [Preprint]. SocArXiv. https://doi.org/10.31235/osf.io/3pcdu
  
  is:preprint lang:en COVID-19 social representation analysis natural language processing comment news Argentina quantification topic Latent Dirichlet Allocation prototype FastText
Visit annotations in context

Tags

news

quantification

Latent Dirichlet Allocation

analysis

lang:en

prototype

topic

natural language processing

comment

Argentina

FastText

COVID-19

social representation

is:preprint

Annotators

ErikStuchly

URL

osf.io/preprints/socarxiv/3pcdu/
Jun 2020
psyarxiv.com psyarxiv.com

In Case of Doubt for the Suspicion?: When People Falsely Remember Facts in the News as Being Uncertain

1
1. Marlene_Wulf 02 Jun 2020
  
  in BehSci
  
  Meyerhoff, H. S., Brand, A.-K., & Scholl, A. (2020). In Case of Doubt for the Suspicion?: When People Falsely Remember Facts in the News as Being Uncertain. https://doi.org/10.31234/osf.io/rct7a
  
  is:preprint lang:en modern media information demand theory language processing uncertainty headline source credibility
Visit annotations in context

Tags

information

modern

theory

lang:en

demand

language processing

media

uncertainty

is:preprint

source credibility

headline

Annotators

Marlene_Wulf

URL

psyarxiv.com/rct7a/
May 2020
arxiv.org arxiv.org

Complex Societies and the Growth of the Law

1
1. edampf 28 May 2020
  
  in BehSci
  
  Katz, D. M., Coupette, C., Beckedorf, J., & Hartung, D. (2020). Complex Societies and the Growth of the Law. ArXiv:2005.07646 [Physics]. http://arxiv.org/abs/2005.07646
  
  is:preprint lang:en computer science complex societies law Germany USA legislation modeling multidimensional time-evolving natural language processing network science welfare state tax state
Visit annotations in context

Tags

multidimensional

law

USA

lang:en

complex societies

legislation

modeling

network science

time-evolving

welfare state

natural language processing

Germany

is:preprint

tax state

computer science

Annotators

edampf

URL

arxiv.org/abs/2005.07646
psyarxiv.com psyarxiv.com

Moral Concerns are Differentially Observable in Language

1
1. edampf 13 May 2020
  
  in BehSci
  
  Kennedy, B., Atari, M., Davani, A. M., Hoover, J., Omrani, A., Graham, J., & Dehghani, M. (2020, May 7). Moral Concerns are Differentially Observable in Language. https://doi.org/10.31234/osf.io/uqmty
  
  is:preprint lang:en morality language text analysis moral foundations theory observational analysis psychology communication Facebook online status update questionnaire prediction self-report natural language processing
Visit annotations in context

Tags

Facebook

text analysis

prediction

communication

moral foundations theory

lang:en

online status

observational analysis

questionnaire

morality

self-report

natural language processing

language

is:preprint

psychology

update

Annotators

edampf

URL

psyarxiv.com/uqmty/
Apr 2020
en.wikipedia.org en.wikipedia.org

Hyphenation algorithm - Wikipedia

1
1. TylerRick 29 Apr 2020
  
  in Public
  
  algorithms natural language processing
Visit annotations in context

Tags

natural language processing

algorithms

Annotators

TylerRick

URL

en.wikipedia.org/wiki/Hyphenation_algorithm
arxiv.org arxiv.org

Distributed peer review enhanced with natural language processing and machine learning

1
1. edampf 23 Apr 2020
  
  in BehSci
  
  Kerzendorf, W. E., Patat, F., Bordelon, D., van de Ven, G., & Pritchard, T. A. (2020). Distributed peer review enhanced with natural language processing and machine learning. Nature Astronomy. https://doi.org/10.1038/s41550-020-1038-y
  
  is:article lang:en language processing machine learning peer review algorithm prediction review
Visit annotations in context

Tags

review

machine learning

lang:en

is:article

language processing

algorithm

peer review

prediction

Annotators

edampf

URL

arxiv.org/abs/2004.04165