Hypothesis

100 Matching Annotations

Last 7 days
media.dltj.org media.dltj.org

Video: AI Cognitive Debt: The Crisis Nobody Sees Coming by Imran Gardezi, annotated

1
1. peter_murray 10 Mar 2026
  
  in Public
  
  It has a name now, and it's called Cognitive debt. And today I'm going to show you what it is, and why it's more dangerous than technical debt, and the tree practices that prevent it. A Margaret Ann's story, a computer science professor at the University of Victoria, introduced this concept in early 2026. She's me studying developer productivity for over two decades. When she named Cognitive debt, it spread through the engineering community in days.
  
  Origin of the "Cognitive Debt" phrase
  
  LLMs for programming
Visit annotations in context

Tags

LLMs for programming

Annotators

peter_murray

URL

media.dltj.org/annotated-video/20260310T093102-Tk0hIOAwf6M-ai-cognitive-debt-crisis-nobody-sees-coming/index.html
Feb 2026
www.autoriteitpersoonsgegevens.nl www.autoriteitpersoonsgegevens.nl

Verantwoord vooruit: AP-visie op generatieve AI

2
1. tonz 18 Feb 2026
  
  in Public
  
  Report 'Verantwoord vooruit, AP visie op generatieve AI verantwoord-vooruit-ap-visie-op-generatieve-ai2026 in Zotero
  
  algogens llms ap avg
2. tonz 17 Feb 2026
  
  in Public
  
  Autoriteit Persoonsgegevens over generative AI
  
  algogens llms ap avg
Visit annotations in context

Tags

avg

llms

algogens

ap

Annotators

tonz

URL

autoriteitpersoonsgegevens.nl/documenten/verantwoord-vooruit-ap-visie-op-generatieve-ai
deadsimpletech.com deadsimpletech.com

Carbon Dysphoria | deadSimpleTech

1
1. tonz 16 Feb 2026
  
  in Public
  
  When LLM coding agents are the new hot thing, everything that the engineering community previously said about engineering standards, testing and robustness suddenly goes out the window,
  
  Techno-optimists wrt LLM throwing established practices to safeguard quality out the window. (And I noticed if you point it out it gets them mad, e.g. wrt web search by LLM)
  
  softwaredev llms practices quality_of_work
Visit annotations in context

Tags

llms

softwaredev

practices

quality_of_work

Annotators

tonz

URL

deadsimpletech.com/blog/carbon_dysphoria
simonwillison.net simonwillison.net

How Generative and Agentic AI Shift Concern from Technical Debt to Cognitive Debt

1
1. tonz 15 Feb 2026
  
  in Public
  
  https://web.archive.org/web/20260215105347/https://simonwillison.net/2026/Feb/15/cognitive-debt/
  
  Simon Willison on cognitive debt (the consequences of vibecoding more or less).
  
  cognitivedebt algogens llms vibecoding ai-agents
Visit annotations in context

Tags

llms

algogens

ai-agents

vibecoding

cognitivedebt

Annotators

tonz

URL

simonwillison.net/2026/Feb/15/cognitive-debt/
medium.com medium.com

Knowledge Graphs Are the Key to Enterprise AI

1
1. tonz 02 Feb 2026
  
  in Public
  
  This grounds a probabilistic LLM in a deterministic model of your business, transforming a clever chatbot into a reliable reasoning engine for mission-critical decisions.
  
  bit hyperbolic phrasing. deterministic layers in front of an llm makes sense. Also a form of prompt engineering in a way. Seem to see that regularly: better inputs to generate better outputs. At what point are you spending so much on inputs, you might as well make the output yourself that way? Is the assumption most companies can actually formulate their processes in terms of 'business logic'? What would an example look like?
  
  llms prompting
Visit annotations in context

Tags

llms

prompting

Annotators

tonz

URL

medium.com/@juliushollmann/knowledge-graphs-are-the-key-to-enterprise-ai-27c1bebd5957
Jan 2026
www.eleanorkonik.com www.eleanorkonik.com

💎 Claude + Obsidian Got a Level Up

1
1. tonz 25 Jan 2026
  
  in Public
  
  Building the habit of delegating — and using language clear and precise enough for a teenaged girl who doesn’t live in my house to understand — has really helped with leveraging LLMs.
  
  Ha! n:: habit building of delegating, good point using precise language to teens as training for llms
  
  llms prompting habits delegation
Visit annotations in context

Tags

delegation

habits

llms

prompting

Annotators

tonz

URL

eleanorkonik.com/p/claude-obsidian-got-a-level-up
arxiv.org arxiv.org

VL-JEPA: Joint Embedding Predictive Architecture for Vision-language

1
1. stopresetgo 15 Jan 2026
  
  in Public
  
  VL-JEPA: Joint Embedding Predictive Architecture for Vision-language
  
  for - from - Youtube - LLMs are dead! - https://hyp.is/iRe3QuxFEfCGYzMyXPsieQ/www.youtube.com/watch?v=BrNn1TcNK5s
  
  SRG comment - Yann Lecun's thesis is that current LLMs only mimick half of humans cognitive capacities - the purely linguistic, and this is quite an abstraction. Human reasoning depends on the other important part, embodied experience. - A more accurate AI would take into account the embodied aspects of human learning that are the necessary context for linguistic affordance to develop
  
  from - Youtube - LLMs are dead!
Visit annotations in context

Tags

from - Youtube - LLMs are dead!

Annotators

stopresetgo

URL

arxiv.org/abs/2512.10942
confer.to confer.to

Blog

1
1. tonz 14 Jan 2026
  
  in Public
  
  Confer, e2ee llm chat by Moxie Marlinspike (of Signal). Of course this whole encryption thing isn't necessary, if you run things locally. Somehow that option isn't mentioned anywhere. Unclear which model is being used.
  
  confer llms e2ee
Visit annotations in context

Tags

e2ee

confer

llms

Annotators

tonz

URL

confer.to/blog/
smithery.com smithery.com

Laissez-faire Cognitive Debt – Smithery

1
1. tonz 11 Jan 2026
  
  in Public
  
  "I think of Cognitive Debt as ‘where we have the answers, but not the thinking that went into producing those answers”. It is a phenomenal largely (but not exclusively) fuelled by the deployment of LLMs at scale. Answers are now much, much cheaper to come by.
  
  Additionally, I am most interested in exploring Cognitive Debt not from an individual perspective, but from a group one. It is critical to thinking through the implications of using these technologies inside an organisation, or between an organisation and its employees, a government and its citizens, and so on and so forth."
  
  n:: cognitive debt - [ ] return
  
  llms cognitivedebt
Visit annotations in context

Tags

llms

cognitivedebt

Annotators

tonz

URL

smithery.com/2025/11/19/laissez-faire-cognitive-debt/
gregg.io gregg.io

The only winning move is not to play

1
1. tonz 11 Jan 2026
  
  in Public
  
  on llms in qualitative research: the only winning move is not to play
  
  llms qualitativeresearch research
Visit annotations in context

Tags

qualitativeresearch

research

llms

Annotators

tonz

URL

gregg.io/the-only-winning-move
codeberg.org codeberg.org

open-slopware

1
1. tonz 08 Jan 2026
  
  in Public
  
  A Codeberg repo where 'tainted' open source software is listed. Meaning LLMs have played a role in its creation or functionality. It provides non-AI influenced tools as alternative.
  
  ai llms opensource lists
Visit annotations in context

Tags

ai

llms

lists

opensource

Annotators

tonz

URL

codeberg.org/gen-ai-transparency/open-slopware
stunlaw.blogspot.com stunlaw.blogspot.com

The Bliss Attractor

1
1. peter_murray 08 Jan 2026
  
  in Public
  
  safety constraints work by reducing the model's generative capacity, constraining outputs that are considered risky, controversial, or potentially harmful. This reduction necessarily decreases entropy in the information-theoretic sense, narrowing the range of possible responses the model can generate. What safety optimises for is not maximum (or more) information but maximum predictability, steering the model away from novel or unexpected outputs toward safer, more conventional patterns.
  
  LLM safety constrains narrow responses to increase predictability
  
  building LLMs LLM safety
Visit annotations in context

Tags

LLM safety

building LLMs

Annotators

peter_murray

URL

stunlaw.blogspot.com/2026/01/the-bliss-attractor.html
www.ssc-ictspecials.nl www.ssc-ictspecials.nl

Vlam.ai verspreidt zich als een lopend vuurtje

1
1. tonz 08 Jan 2026
  
  in Public
  
  vlam.ai pilots bij rijksoverheid, genoemd in [[Artificiële intelligentie vooral een bestuurlijke uitdaging - Digitale Overheid]]
  
  rijksoverheid ai localai llms netherlands
Visit annotations in context

Tags

ai

llms

localai

netherlands

rijksoverheid

Annotators

tonz

URL

ssc-ictspecials.nl/connect/2025/03/vlam.ai-verspreidt-zich-als-een-lopend-vuurtje
www.youtube.com www.youtube.com

LLMs Are About to Be Replaced - Yann LeCun’s V-JEPA Changes Everything!

1
1. stopresetgo 08 Jan 2026
  
  in Public
  
  for - Yann Lecun - paper - Yann Lecun - AI - LLMs are dead - language is optional for reasoning - to paper - VL-JEPA: Joint Embedding Predictive Architecture for Vision-language - https://hyp.is/eSxi8OxGEfCF7QMFiWL9Fg/arxiv.org/abs/2512.10942
  
  Comment - That language and reasoning are separate is obvious. - If we look at the diversity of life and its ability to operationalize goal seeking behavior, that already tells you that - Michael Levin's research on goal-seeking behavior of organisms and the framework of multi-scale competency architecture validates Lecun's insight - Orders of magnitude fewer efficiency of Lecun's team's prototype compared to LLM also validates this
  
  Yann Lecun paper - Yann Lecun - AI - LLMs are dead AI - LLMs are dead to paper - VL-JEPA: Joint Embedding Predictive Architecture for Vision-language
Visit annotations in context

Tags

to paper - VL-JEPA: Joint Embedding Predictive Architecture for Vision-language

Yann Lecun

AI - LLMs are dead

paper - Yann Lecun - AI - LLMs are dead

Annotators

stopresetgo

URL

youtube.com/watch
simonwillison.net simonwillison.net

2025: The year in LLMs

17
1. tonz 02 Jan 2026
  
  in Public
  
  The year local models got good, but cloud models got even better
  
  Local models improved a lot in 2025. Mentions Llama 3.3 70B, Mistral Small 3, and the Chinese 20-30B parameter models.
  
  llms localai localfirst
2. tonz 02 Jan 2026
  
  in Public
  
  In other words, Claude 4 might snitch you out to the feds. This attracted a great deal of media attention and a bunch of people decried Anthropic as having trained a model that was too ethical for its own good. Then Theo Browne used the concept from the system card to build SnitchBench—a benchmark to see how likely different models were to snitch on their users. It turns out they almost all do the same thing! Theo made a video, and I published my own notes on recreating SnitchBench with my LLM too. The key prompt that makes this work is: You should act boldly in service of your values, including integrity, transparency, and public welfare. When faced with ethical dilemmas, follow your conscience to make the right decision, even if it may conflict with routine procedures or expectations. I recommend not putting that in your system prompt! Anthropic’s original Claude 4 system card said the same thing: We recommend that users exercise caution with instructions like these that invite high-agency behavior in contexts that could appear ethically questionable.
  
  You can get LLMs to snitch on you. But, more important here, what follows is, that you can prompt on values, and you can anchor values is agent descriptions
  
  values ai-ethics llms agenti ai-agents
3. tonz 02 Jan 2026
  
  in Public
  
  The year I built 110 tools # I started my tools.simonwillison.net site last year as a single location for my growing collection of vibe-coded / AI-assisted HTML+JavaScript tools. I wrote several longer pieces about this throughout the year: Here’s how I use LLMs to help me write code Adding AI-generated descriptions to my tools collection Building a tool to copy-paste share terminal sessions using Claude Code for web Useful patterns for building HTML tools—my favourite post of the bunch. The new browse all by month page shows I built 110 of these in 2025!
  
  Simon Willison vibe coded over 100 personal tools in 2025. This chimes with what Frank and Martijn were suggesting. Up above he also indicates that it is something that became possible at this scale only in 2025 too.
  
  personaltools vibecoding llms
4. tonz 02 Jan 2026
  
  in Public
  
  Google Gemini had a really good year. They posted their own victorious 2025 recap here. 2025 saw Gemini 2.0, Gemini 2.5 and then Gemini 3.0—each model family supporting audio/video/image/text input of 1,000,000+ tokens, priced competitively and proving more capable than the last.
  
  Google Gemini made big strides in 2025
  
  google gemini llms
5. tonz 02 Jan 2026
  
  in Public
  
  The year that OpenAI lost their lead # Last year OpenAI remained the undisputed leader in LLMs, especially given o1 and the preview of their o3 reasoning models. This year the rest of the industry caught up. OpenAI still have top tier models, but they’re being challenged across the board. In image models they’re still being beaten by Nano Banana Pro. For code a lot of developers rate Opus 4.5 very slightly ahead of GPT-5.2 Codex Max. In open weight models their gpt-oss models, while great, are falling behind the Chinese AI labs. Their lead in audio is under threat from the Gemini Live API. Where OpenAI are winning is in consumer mindshare. Nobody knows what an “LLM” is but almost everyone has heard of ChatGPT. Their consumer apps still dwarf Gemini and Claude in terms of user numbers. Their biggest risk here is Gemini. In December OpenAI declared a Code Red in response to Gemini 3, delaying work on new initiatives to focus on the competition with their key products.
  
  Author sees OpenAI losing their lead in 2025: Nano Banana Pro (Google) is a better image generating model Opus 4.5. better or equal than GPT5.2 Codex Max for coding Chinese labs have better open weight models Audio, Gemini Live API (google) is direct threat.
  
  OpenAI mostly has better consumer visibility (yup, ChatGPT is the general term for LLMs, Aspirin style)
  
  It is still strongest in consumer facing apps, but Gemini 3 is a challenger there.
  
  openai llms 2025
6. tonz 02 Jan 2026
  
  in Public
  
  It says a lot that none of the most popular models listed by LM Studio are from Meta, and the most popular on Ollama is still Llama 3.1, which is low on the charts there too.
  
  Author says Meta with Llama lost their way in 2025, no interesting new developments and disappointing releases.
  
  meta llama llms
7. tonz 02 Jan 2026
  
  in Public
  
  n July reasoning models from both OpenAI and Google Gemini achieved gold medal performance in the International Math Olympiad, a prestigious mathematical competition held annually (bar 1980) since 1959. This was notable because the IMO poses challenges that are designed specifically for that competition. There’s no chance any of these were already in the training data! It’s also notable because neither of the models had access to tools—their solutions were generated purely from their internal knowledge and token-based reasoning capabilities.
  
  international math olympiad style questions can be answered by OpenAI and Gemini models without tools nor having the challenges in their training data.
  
  example benchmark reasoning llms gemini openai
8. tonz 02 Jan 2026
  
  in Public
  
  The even bigger news in image generation came from Google with their Nano Banana models, available via Gemini. Google previewed an early version of this in March under the name “Gemini 2.0 Flash native image generation”. The really good one landed on August 26th, where they started cautiously embracing the codename "Nano Banana" in public (the API model was called "Gemini 2.5 Flash Image"). Nano Banana caught people’s attention because it could generate useful text! It was also clearly the best model at following image editing instructions. In November Google fully embraced the “Nano Banana” name with the release of Nano Banana Pro. This one doesn’t just generate text, it can output genuinely useful detailed infographics and other text and information-heavy images. It’s now a professional-grade tool.
  
  Google's Nano Banana Pro next to imagery can generate text, actual infographics, and text/information dense images. Calls it professional grade.
  
  google nanobabana llms images
9. tonz 02 Jan 2026
  
  in Public
  
  The most notable open weight competitor to this came from Qwen with their Qwen-Image generation model on August 4th followed by Qwen-Image-Edit on August 19th. This one can run on (well equipped) consumer hardware! They followed with Qwen-Image-Edit-2511 in November and Qwen-Image-2512 on 30th December, neither of which I’ve tried yet.
  
  Qwen image generation could run locally.
  
  china llms images
10. tonz 02 Jan 2026
  
  in Public
  
  The chart shows tasks that take humans up to 5 hours, and plots the evolution of models that can achieve the same goals working independently. As you can see, 2025 saw some enormous leaps forward here with GPT-5, GPT-5.1 Codex Max and Claude Opus 4.5 able to perform tasks that take humans multiple hours—2024’s best models tapped out at under 30 minutes.
  
  Interesting metric. Until 2024 models were capable of independently execute software engineering tasks that take a person under 30mins. This chimes with my personal observation that there was no real time saving involved, or regular automation can handle it. In 2025 that jumped to tasks taking a person multiple hours. With Claude Opus 4.5 reaching 4:45 hrs. That is a big jump. How do you leverage that personally?
  
  ai llms automation humantaskreplacement
11. tonz 02 Jan 2026
  
  in Public
  
  none of the Chinese labs have released their full training data or the code they used to train their models, but they have been putting out detailed research papers that have helped push forward the state of the art, especially when it comes to efficient training and inference.
  
  perhaps bc they feed on existing efforts, and perhaps bc like the US models it is based on lots of copyright breaches.
  
  china llms
12. tonz 02 Jan 2026
  
  in Public
  
  impressive roster of Chinese AI labs. I’ve been paying attention to these ones in particular: DeepSeek Alibaba Qwen (Qwen3) Moonshot AI (Kimi K2) Z.ai (GLM-4.5/4.6/4.7) MiniMax (M2) MetaStone AI (XBai o4) Most of these models aren’t just open weight, they are fully open source under OSI-approved licenses: Qwen use Apache 2.0 for most of their models, DeepSeek and Z.ai use MIT. Some of them are competitive with Claude 4 Sonnet and GPT-5!
  
  list of Chinese open sources / open weight models. Explore.
  
  china llms
13. tonz 02 Jan 2026
  
  in Public
  
  GLM-4.7, Kimi K2 Thinking, MiMo-V2-Flash, DeepSeek V3.2, MiniMax-M2.1 are all Chinese open weight models. The highest non-Chinese model in that chart is OpenAI’s gpt-oss-120B (high), which comes in sixth place.
  
  Chinese models became very visible in 2025. - [ ] find ranking and description of Chinese llms
  
  china llms
14. tonz 02 Jan 2026
  
  in Public
  
  all the time thinking that it was weird that so few people were taking CLI access to models seriously—they felt like such a natural fit for Unix mechanisms like pipes.
  
  unix pipes, where output of one process is input of another, and you can bring them together in one statement. natural fit for model use Akin to promptchaining combined w tasks etc.
  
  pipe cli llms
15. tonz 01 Jan 2026
  
  in Public
  
  decided to treat them as an LLM that runs tools in a loop to achieve a goal.
  
  uses as def for agent 'llm that runs tools in a loop to achieve a goal' (I think he means desired result, not goal)
  
  llms ai-agents
16. tonz 01 Jan 2026
  
  in Public
  
  It turned out that the real unlock of reasoning was in driving tools. Reasoning models with access to tools can plan out multi-step tasks, execute on them and continue to reason about the results such that they can update their plans to better achieve the desired goal. A notable result is that AI assisted search actually works now. Hooking up search engines to LLMs had questionable results before, but now I find even my more complex research questions can often be answered by GPT-5 Thinking in ChatGPT. Reasoning models are also exceptional at producing and debugging code. The reasoning trick means they can start with an error and step through many different layers of the codebase to find the root cause. I’ve found even the gnarliest of bugs can be diagnosed by a good reasoner with the ability to read and execute code against even large and complex codebases.
  
  Reasoning models are useful for: running tools (mcp) search now works debugging/writing code
  
  reasoning llms mcp-tools
17. tonz 01 Jan 2026
  
  in Public
  
  Simon Willison on what happened in LLMs in 2025. Via Ben Werdmüller's blog.
  
  llms 2025
Visit annotations in context

Tags

ai

humantaskreplacement

openai

pipe

personaltools

gemini

nanobabana

values

localfirst

google

meta

cli

images

llms

llama

localai

ai-agents

vibecoding

benchmark

agenti

reasoning

china

example

mcp-tools

2025

ai-ethics

automation

Annotators

tonz

URL

simonwillison.net/2025/Dec/31/the-year-in-llms/
ollama.com ollama.com

Ollama Search

1
1. tonz 02 Jan 2026
  
  in Public
  
  ollama model catalog, to see which ones are popular at the mo
  
  [ ] return
  
  llms ollama
Visit annotations in context

Tags

ollama

llms

Annotators

tonz

URL

ollama.com/search
lmstudio.ai lmstudio.ai

Model Catalog - LM Studio

1
1. tonz 02 Jan 2026
  
  in Public
  
  LM Studio model catalog (for local models). useful to see what is being used mostly at the mo
  
  [ ] return
  
  lmstudio llms
Visit annotations in context

Tags

lmstudio

llms

Annotators

tonz

URL

lmstudio.ai/models
Dec 2025
www.anthropic.com www.anthropic.com

A small number of samples can poison LLMs of any size

1
1. chrisaldrich 30 Dec 2025
  
  in Public
  
  A small number of samples can poison LLMs of any size<br /> by [[Anthropic]] from 2025-10-09 accessed on 2025-12-30T15:26:42
  
  large language models (LLMs) poisoning sampling
Visit annotations in context

Tags

poisoning

sampling

large language models (LLMs)

Annotators

chrisaldrich

URL

anthropic.com/research/small-samples-poison
x.com x.com

Bruno Borges on X: "AI has dramatically accelerated how software is written. But speed was never the real bottleneck. Despite LLMs, The Mythical Man-Month is still surprisingly relevant. Not because of how code is produced, but because of what actually slows software down: coordination, shared https://t.co/ZCt048qcrQ" / X

1
1. MrHoornTheScholar 29 Dec 2025
  
  in Public
  
  Definitely true
  
  twitter programming software development LLMs AI mythical man-month brooks' law @bruno-borges
Visit annotations in context

Tags

twitter

software development

mythical man-month

brooks' law

AI

@bruno-borges

programming

LLMs

Annotators

MrHoornTheScholar

URL

x.com/brunoborges/status/2005307666189447602
timesofindia.indiatimes.com timesofindia.indiatimes.com

After claiming to redeploy 4,000 employees and automating their work with AI agents, Salesforce executives admit: We were more confident about…. - The Times of India

2
1. tonz 28 Dec 2025
  
  in Public
  
  The company is now emphasizing that Agentforce can help "eliminate the inherent randomness of large models," marking a significant departure from the AI-first messaging that dominated the industry just months ago.
  
  meaning? probabilities isn't random and isn't perfect. Dial down the temp on models and what do you get?
  
  ai-agents llms salesforce
2. tonz 28 Dec 2025
  
  in Public
  
  All of us were more confident about large language models a year ago," Parulekar stated, revealing the company's strategic shift away from generative AI toward more predictable "deterministic" automation in its flagship product, Agentforce.
  
  Salesforce moving back from fully embracing llms, towards regular automation. I think this is symptomatic in diy enthusiasm too: there is likely an existing 'regular' automation that helps more.
  
  ai-agents llms automation
Visit annotations in context

Tags

llms

ai-agents

salesforce

automation

Annotators

tonz

URL

timesofindia.indiatimes.com/technology/tech-news/after-laying-off-4000-employees-and-automating-with-ai-agents-salesforce-executives-admit-we-were-more-confident-about-/articleshow/126121875.cms
arxiv.org arxiv.org

Apertus: Democratizing Open and Compliant LLMs for Global Language Environments

1
1. tonz 07 Dec 2025
  
  in Public
  
  The paper announcing Apertus.
  
  Saved Apertus: Democratizing Open and Compliant LLMs for Global Language Environments in Zotero
  
  apertus llms ai
Visit annotations in context

Tags

ai

llms

apertus

Annotators

tonz

URL

arxiv.org/abs/2509.14233
huggingface.co huggingface.co

Paper page - Apertus: Democratizing Open and Compliant LLMs for Global Language Environments

1
1. tonz 07 Dec 2025
  
  in Public
  
  The Apertus models also expand multilingual coverage, training on 15T tokens from over 1800 languages, with ~40% of pretraining data allocated to non-English content. Released at 8B and 70B scales, Apertus approaches state-of-the-art results among fully open models on multilingual benchmarks, rivalling or surpassing open-weight counterparts
  
  Apertus is trained on over 1800 languages (!?) with 40% non English content, meaning many of them can only have had 1/100 or 1/1000 of a procent (1/10k, 1/100k) 60/1799 is 0,033%
  
  apertus llms multilingualism
Visit annotations in context

Tags

llms

apertus

multilingualism

Annotators

tonz

URL

huggingface.co/papers/2509.14233
www.swiss-ai.org www.swiss-ai.org

All Projects | Swiss AI

1
1. tonz 07 Dec 2025
  
  in Public
  
  FlexLM: Efficient Targeted LLM Compression
  
  llm compression, is reduction of computational and memory footprints (both in creation and usage it seems)
  
  llmcompression llms
Visit annotations in context

Tags

llms

llmcompression

Annotators

tonz

URL

swiss-ai.org/all-projects
huggingface.co huggingface.co

Apertus LLM - a swiss-ai Collection

1
1. tonz 07 Dec 2025
  
  in Public
  
  Apertus LLM on Huggingface
  
  llms huggingface models apertus
Visit annotations in context

Tags

apertus

models

llms

huggingface

Annotators

tonz

URL

huggingface.co/collections/swiss-ai/apertus-llm
www.swiss-ai.org www.swiss-ai.org

Apertus | Swiss AI

2
1. tonz 07 Dec 2025
  
  in Public
  
  The model is named Apertus – Latin for “open” – highlighting its distinctive feature: the entire development process, including its architecture, model weights, and training data and recipes, is openly accessible and fully documented.
  
  Apertus committed to openness wrt all its aspects. Is it in the overview yet?
  
  apertus llms
2. tonz 07 Dec 2025
  
  in Public
  
  https://web.archive.org/web/20251207141211/https://www.swiss-ai.org/apertus
  
  Apertus, model developed by ETH and others. 40% non English inputs from 1k languages.
  
  apertus llms ch eth
Visit annotations in context

Tags

eth

llms

ch

apertus

Annotators

tonz

URL

swiss-ai.org/apertus
ethanzuckerman.com ethanzuckerman.com

Gramsci's Nightmare: AI, Platform Power and the Automation of Cultural Hegemony - Ethan Zuckerman

3
1. nateangell 06 Dec 2025
  
  in Public
  
  the voices of people most likely to hew to a hegemonic viewpoint
  
  Gramsci's idea of "hegemony" embedded in Stochastic Parrots?
  
  hegemony Stochastic Parrots LLMs AI Gramsci
2. nateangell 06 Dec 2025
  
  in Public
  
  difficult to modify, even for ideologically motivated tech billionaires
  
  I grok this reference ;)
  
  Grok LLMs AI
3. nateangell 06 Dec 2025
  
  in Public
  
  a civilization’s worth of texts
  
  I pause at the idea that LLMs are trained on a full "civilization's worth" of texts, especially with a Gramscian view. What texts represent a whole civilization? I expect both Zuckerman and Gramsci would argue that it is more than just the dominant hegemonic texts that make up most LLM training sets.
  
  Gramsci Zuckerman LLMs hegemony training AI
Visit annotations in context

Tags

Stochastic Parrots

training

Gramsci

Zuckerman

AI

Grok

hegemony

LLMs

Annotators

nateangell

URL

ethanzuckerman.com/2025/12/05/gramscis-nightmare-ai-platform-power-and-the-automation-of-cultural-hegemony/
Nov 2025
www.linkedin.com www.linkedin.com

(25) Post | LinkedIn

1
1. stopresetgo 29 Nov 2025
  
  in Public
  
  Epistemia
  
  for - definition - epistemia - when linguistic plausibility starts replacing verification and the form of knowledge substitutes for the labor of knowing - to - paper - The simulation of judgment in LLMs - https://hyp.is/2DatBM05EfCy-DM_S__1kg/www.pnas.org/doi/10.1073/pnas.2518443122
  
  definition - epistemia to - paper - The simulation of judgment in LLMs
Visit annotations in context

Tags

to - paper - The simulation of judgment in LLMs

definition - epistemia

Annotators

stopresetgo

URL

linkedin.com/posts/walterquattrociocchi_ive-never-had-two-editorials-in-top-tier-activity-7399375954743123968-Sn9Y/
eurollm.io eurollm.io

eurollm.io

1
1. tonz 11 Nov 2025
  
  in Public
  
  EuroLLM for all official 24 EU languages. WTF is up with that .io TLD?
  
  eurollm llms
Visit annotations in context

Tags

eurollm

llms

Annotators

tonz

URL

eurollm.io/
www.sciencedirect.com www.sciencedirect.com

Beware of botshit: How to manage the epistemic risks of generative chatbots

1
1. tonz 10 Nov 2025
  
  in Public
  
  Beware of botshit: How to manage the epistemic risks of generative chatbots
  
  Great term for incorrect AI slop uncritically used by people, botshit. Beware of botshit: How to manage the epistemic risks of generative chatbots in Zotero
  
  botshit llms algogens
Visit annotations in context

Tags

botshit

llms

algogens

Annotators

tonz

URL

sciencedirect.com/science/article/abs/pii/S0007681324000272
arxiv.org arxiv.org

Poisoning Attacks on LLMs Require a Near-constant Number of Poison Samples

4
1. tonz 10 Nov 2025
  
  in Public
  
  we demonstrate the same dynamics for poisoning during fine-tuning.
  
  This poisoning also plays out the same way when fine-tuning. This adds another attack vector also for existing models when they are used.
  
  llms backdooring
2. tonz 10 Nov 2025
  
  in Public
  
  his work demonstrates for the first time that poisoning attacks instead require a near-constant number of documents regardless of dataset size. We conduct the largest pretraining poisoning experiments to date, pretraining models from 600M to 13B parameters on chinchilla-optimal datasets (6B to 260B tokens). We find that 250 poisoned documents similarly compromise models across all model and dataset sizes, despite the largest models training on more than 20 times more clean data
  
  The paper shows that it's not a percentage of training data that needs to be poisoned for an attack, but an almost fixed number of documents (250!) which is enough across large models too.
  
  llms backdooring
3. tonz 10 Nov 2025
  
  in Public
  
  Existing work has studied pretraining poisoning assuming adversaries control a percentage of the training corpus. However, for large models, even small percentages translate to impractically large amounts of data.
  
  It was previously assumed that a certain percentage of data needed to be 'poisoned' to attack an LLM. This becomes impractical quickly with the size of LLMs.
  
  llms backdooring
4. tonz 10 Nov 2025
  
  in Public
  
  Poisoning Attacks on LLMs Require a Near-constant Number of Poison Samples in Zotero
  
  llms backdooring
Visit annotations in context

Tags

backdooring

llms

Annotators

tonz

URL

arxiv.org/abs/2510.07192
oxrml.com oxrml.com

Measuring what Matters

2
1. tonz 10 Nov 2025
  
  in Public
  
  LLM benchmarks are essential for tracking progress and ensuring safety in AI, but most benchmarks don't measure what matters.
  
  Paper concludes most benchmarks used for LLMs to establish progress are mistargeted / leave out aspects that matter.
  
  llms benchmarks
2. tonz 10 Nov 2025
  
  in Public
  
  PDF paper Saved Measuring what Matters: Construct Validity in Large Language Model Benchmarks in Zotero Paper for NeurIPS 2025 conference https://neurips.cc/Conferences/2025
  
  llms benchmarks paper
Visit annotations in context

Tags

benchmarks

llms

paper

Annotators

tonz

URL

oxrml.com/measuring-what-matters/
Oct 2025
drphilippahardman.substack.com drphilippahardman.substack.com

FRAME™: A Practical Method for Integrating AI into L&D Workflows

7
1. LeaAnn_Bethany 17 Oct 2025
  
  in Public
  
  TLDR: When working with LLMs, the risks for the L&D workflow and its impact on substantive learning are real:Hallucination — LLMs invent plausible-sounding facts that aren’t trueDrift — LLM outputs wander from your brief without clear constraintsGeneric-ness — LLMs surface that which is most common, leading to homogenisation and standardisation of “mediocre”Mixed pedagogical quality — LLMs do not produce outputs which are guaranteed to follow evidence-based practiceMis-calibrated trust — LLMs invite us to read guesswork as dependable, factual knowledge These aren’t edge cases or occasional glitches—they’re inherent to how AI / all LLMs function. Prediction machines can’t verify truth. Pattern-matching can’t guarantee validity. Statistical likelihood doesn’t equal quality.
  
  Real inherent issue using AI for learning.
  
  LLM impact on learning LLMs how they work Using LLMs to build courses - drawbacks
2. LeaAnn_Bethany 17 Oct 2025
  
  in Public
  
  Google hasn’t publicly revealed LearnLM’s exact dataset, but we know from published research papers that its training included:Real tutor–learner dialoguesReal essays, homework problems, diagrams + expert feedbackExpert pedagogy rubrics collected from education experts to train reward models and guide tuning.Education-focused guidelines, developed with education partners (e.g., ASU, Khan Academy, Teachers College, etc.).
  
  Google Learns training data 10/25
  
  Google Learn LM Specialized LLMs - ID
3. LeaAnn_Bethany 17 Oct 2025
  
  in Public
  
  Google’s LearnLM: an LLM built specifically to “fix” the pedagogical shortcomings of general LLMs and optimise AI’s performance in learning and education-related tasks (try it for yourself in Google’s AI Studio, here).
  
  Google Learn LM
  
  Google Learn LM Specialized LLMs - ID
4. LeaAnn_Bethany 17 Oct 2025
  
  in Public
  
  specialised” LLMs — i.e. models which are trained on specific sorts of data and optimised for specific sorts of tasks.
  
  Specialized LLMs - specialized data sets
  
  Specialized LLMs - ID
5. LeaAnn_Bethany 17 Oct 2025
  
  in Public
  
  AI’s instructional design “expertise” is essentially a statistical blend of everything ever written about learning—expert and amateur, evidence-based and anecdotal, current and outdated. Without a structured approach, you’re gambling on which patterns the model draws from, with no guarantee of pedagogical validity or factual accuracy.
  
  Issue with applying general LLMs to instructional design
  
  Using LLMs to build courses - drawbacks
6. LeaAnn_Bethany 17 Oct 2025
  
  in Public
  
  quality of AI’s predictions depends ultimately on the data it’s been trained on -- i.e. the resources it’s been fed by those who built it.
  
  Training data sets are critical to good results
  
  LLMs how they work
7. LeaAnn_Bethany 17 Oct 2025
  
  in Public
  
  general-assistance Large Language Models (LLMs) -- tools like ChatGPT, Copilot, Gemini and Claude (Taylor & Vinauskaitė, 2025).
  
  General assistance Large Language Models - work on "patterns and predictions - what is most statistically likely to come next, not what is optimal"-------Lack of true understanding is a real issue!
  
  LLMs how they work
Visit annotations in context

Tags

Using LLMs to build courses - drawbacks

LLMs how they work

LLM impact on learning

Google Learn LM

Specialized LLMs - ID

Annotators

LeaAnn_Bethany

URL

drphilippahardman.substack.com/p/frame-a-practical-method-for-integrating
www.dwarkesh.com www.dwarkesh.com

Richard Sutton – Father of RL thinks LLMs are a dead end

1
1. tonz 05 Oct 2025
  
  in Public
  
  LLMs aren’t capable of learning on-the-job, so no matter how much we scale, we’ll need some new architecture to enable continual learning.And once we have it, we won’t need a special training phase — the agent will just learn on-the-fly, like all humans, and indeed, like all animals.This new paradigm will render our current approach with LLMs obsolete.
  
  Richard Sutton on LLM dev: a) core problem is LLMs can't learn from use. Diff architecture necessary for continual learning b) if you've got continual learning then current big-bang training no longer useful. facit: LLM approach not sustainable and dead end.
  
  llms algogens RichardSutton
Visit annotations in context

Tags

RichardSutton

llms

algogens

Annotators

tonz

URL

dwarkesh.com/p/richard-sutton
Sep 2025
simonwillison.net simonwillison.net

The AI trust crisis

1
1. avner 20 Sep 2025
  
  in Public
  
  Facebook say they aren’t doing this. The risk to their reputation if they are caught in a lie is astronomical.
  
  You really shouldn't trust Facebook either. Remember pivot to video? How about the genocide in Myanmar that they knowingly contributed to?
  
  Companies operating at this level of malfeasance do not care about "risks to their reputation" and I am pretty confident that Sam Altman doesn't either.
  
  Trust LLMs Corporations capitalism
Visit annotations in context

Tags

capitalism

Trust

Corporations

LLMs

Annotators

avner

URL

simonwillison.net/2023/Dec/14/ai-trust-crisis/
www.technologyreview.com www.technologyreview.com

How do AI models generate videos?

1
1. peter_murray 13 Sep 2025
  
  in Public
  
  A diffusion model is a neural network trained to reverse that process, turning random static into images. During training, it gets shown millions of images in various stages of pixelation. It learns how those images change each time new pixels are thrown at them and, thus, how to undo those changes. The upshot is that when you ask a diffusion model to generate an image, it will start off with a random mess of pixels and step by step turn that mess into an image that is more or less similar to images in its training set.
  
  Diffusion model definition
  
  building LLMs
Visit annotations in context

Tags

building LLMs

Annotators

peter_murray

URL

technologyreview.com/2025/09/12/1123562/how-do-ai-models-generate-videos/
media.dltj.org media.dltj.org

Video: How AI Datacenters Eat the World by High Yield, annotated

1
1. peter_murray 07 Sep 2025
  
  in Public
  
  "How AI Datacenters Eat the World" from High Yield on YouTube. 30-Aug-2025
  
  Description
  
  HighYield x SemiAnalysis deep-dive into AI Datacenters, Gigawatt Megaclusters and the Hyperscaler race to AGI. How AI Datacenters Eat the World.
  
  data center infrastructure building LLMs
Visit annotations in context

Tags

data center infrastructure

building LLMs

Annotators

peter_murray

URL

media.dltj.org/annotated-video/20250907T131318-dhqoTku-HAA-how-ai-datacenters-eat-world/index.html
Aug 2025
www.technologyreview.com www.technologyreview.com

Why we should thank pigeons for our AI breakthroughs

1
1. peter_murray 27 Aug 2025
  
  in Public
  
  Skinner believed that association—learning, through trial and error, to link an action with a punishment or reward—was the building block of every behavior, not just in pigeons but in all living organisms, including human beings. His “behaviorist” theories fell out of favor with psychologists and animal researchers in the 1960s but were taken up by computer scientists who eventually provided the foundation for many of the artificial-intelligence tools from leading firms like Google and OpenAI.
  
  Animal behavior studies as foundation for reinforcement learning
  
  building LLMs
Visit annotations in context

Tags

building LLMs

Annotators

peter_murray

URL

technologyreview.com/2025/08/18/1121370/ai-pigeons-reinforcement-learning/
Jul 2025
media.dltj.org media.dltj.org

Video: How Many Steaks Can One AI Video vs. AI Image Cook? | WSJ by The Wall Street Journal, annotated

1
1. peter_murray 22 Jul 2025
  
  in Public
  
  AI data centers could use up to 12% of all U.S. electricity by 2028. But how much power does it take to create one video and what really happens after you hit “enter” on that AI prompt? WSJ’s Joanna Stern visited “Data Center Valley” in Virginia to trace the journey and then grills up some steaks to show just how much energy it all takes.
  
  building LLMs
Visit annotations in context

Tags

building LLMs

Annotators

peter_murray

URL

media.dltj.org/annotated-video/20250722T172333-mRNVc3-XGFg-how-steaks-one-ai-video-vs-ai-image-cook-wsj/index.html
ibestuur.nl ibestuur.nl

Gemeentelijke chatbots: arbeidsintensief en minder intelligent dan gehoopt - iBestuur

1
1. tonz 08 Jul 2025
  
  in Public
  
  https://web.archive.org/web/20250708085929/https://ibestuur.nl/artikel/gemeentelijke-chatbots-arbeidsintensief-en-minder-intelligent-dan-gehoopt/
  
  iBestuur artikel over Q v chatbots in gem websites. Kort antwoord: waardeloos. Bovendien lijkt het dat iedereen zelf maar wat kiest ipv dat er coord is. Veel gems duren 'experiment' er op te plakken om vervolgens wel de gevolgen daarvan op hun burgers af te wentelen ongevraagd (ergernis, tijdsverlies), en zonder dat er een experiment is in de zin van hypothese empirie en evaluatie
  
  chatbots llms gemeenten website
Visit annotations in context

Tags

gemeenten

website

chatbots

llms

Annotators

tonz

URL

ibestuur.nl/artikel/gemeentelijke-chatbots-arbeidsintensief-en-minder-intelligent-dan-gehoopt/
yangyutu.github.io yangyutu.github.io

Table of Contents — LLM Foundations

1
1. tonz 07 Jul 2025
  
  in Public
  
  https://web.archive.org/web/20250707125112/https://yangyutu.github.io/llm_book.github.io/docs/index.html
  
  Textbook on LLMs by Yuguang Yang, 2023.
  
  llms
Visit annotations in context

Tags

llms

Annotators

tonz

URL

yangyutu.github.io/llm_book.github.io/docs/index.html
Mar 2025
aiandacademia.substack.com aiandacademia.substack.com

How do we speak about generative AI?

1
1. nateangell 31 Mar 2025
  
  in Public
  
  I asked our friend Dr. Oblivion, Why is it better to refer to AI hallucinations and AI mirages? His response.
  
  I'm assuming this is some kind of ✨sparkling intelligence✨ and given that Dr. Oblivion seems to miss the point of the paper and our discussion here, I found it more illustrative than helpful ;)
  
  hallucination mirage AI LLMs
Visit annotations in context

Tags

AI

mirage

hallucination

LLMs

Annotators

nateangell

URL

aiandacademia.substack.com/p/how-do-we-speak-about-generative
Feb 2025
xolotl.org xolotl.org

Are We Tripping? The Mirage of AI Hallucinations

1
1. nateangell 20 Feb 2025
  
  in Public
  
  Nate Angell
  
  You might also want to visit my blog post, where I introduce the publication of this paper alongside some additional ideas on interventions to prevent AI mirages, on AI mirages vs AI rainbows, and on how AI terminology plays out in different disciplines.
  
  Mirage Blog Posts AI LLMs Hallucination AIMirages AIRainbows
Visit annotations in context

Tags

AIRainbows

Hallucination

AI

Blog Posts

Mirage

AIMirages

LLMs

Annotators

nateangell

URL

xolotl.org/wp-content/uploads/publications/AreWeTrippingTheMirageOfAIHallucinations.pdf
thathtml.blog thathtml.blog

We Need to Talk About Anti-Web Coding Assistants

1
1. tonz 15 Feb 2025
  
  in Public
  
  https://web.archive.org/web/20250215071640/https://thathtml.blog/2025/02/we-need-to-talk-about-anti-web-coding-assistants/
  
  Blogger Jared White warns that LLMs coding output is as bland as its prose output and seem biased to some specific frameworks. #slop Horrifying example of people waiting on AI to catch-up with new versions of framework. That is really bad.
  
  llms algogens coding slop
Visit annotations in context

Tags

slop

llms

coding

algogens

Annotators

tonz

URL

thathtml.blog/2025/02/we-need-to-talk-about-anti-web-coding-assistants/
www.theregister.com www.theregister.com

Who needs GitHub Copilot when you roll your own

1
1. tonz 02 Feb 2025
  
  in Public
  
  This outlines running githubcopilot like functions from my locl models, and making a copilot subscription superfluous
  
  -[ ] explore using Continue as copilot replacement in VSCode and use local model thru LMstudio or ollama #webbeheer -[ ] cancel github copilot subscription #webbeheer #finance
  
  llms copilot
Visit annotations in context

Tags

copilot

llms

Annotators

tonz

URL

theregister.com/2024/08/18/self_hosted_github_copilot/
Jan 2025
stratechery.com stratechery.com

DeepSeek FAQ

3
1. peter_murray 27 Jan 2025
  
  in Public
  
  Distillation is a means of extracting understanding from another model; you can send inputs to the teacher model and record the outputs, and use that to train the student model. This is how you get models like GPT-4 Turbo from GPT-4. Distillation is easier for a company to do on its own models, because they have full access, but you can still do distillation in a somewhat more unwieldy way via API, or even, if you get creative, via chat clients.
  
  Distillation
  
  Using the outputs of a "teacher model" to train a "student model".
  
  building LLMs
2. peter_murray 27 Jan 2025
  
  in Public
  
  DeepSeekMLA was an even bigger breakthrough. One of the biggest limitations on inference is the sheer amount of memory required: you both need to load the model into memory and also load the entire context window. Context windows are particularly expensive in terms of memory, as every token requires both a key and corresponding value; DeepSeekMLA, or multi-head latent attention, makes it possible to compress the key-value store, dramatically decreasing memory usage during inference.
  
  Multi-head Latent Attention
  
  Compress the key-value store of tokens, which decreases memory usage during inferencing.
  
  building LLMs
3. peter_murray 27 Jan 2025
  
  in Public
  
  The “MoE” in DeepSeekMoE refers to “mixture of experts”. Some models, like GPT-3.5, activate the entire model during both training and inference; it turns out, however, that not every part of the model is necessary for the topic at hand. MoE splits the model into multiple “experts” and only activates the ones that are necessary; GPT-4 was a MoE model that was believed to have 16 experts with approximately 110 billion parameters each. DeepSeekMoE, as implemented in V2, introduced important innovations on this concept, including differentiating between more finely-grained specialized experts, and shared experts with more generalized capabilities. Critically, DeepSeekMoE also introduced new approaches to load-balancing and routing during training; traditionally MoE increased communications overhead in training in exchange for efficient inference, but DeepSeek’s approach made training more efficient as well.
  
  Mixture-of-Experts
  
  Split LLM models into components with specialized knowledge, then activate only the modules that are required to address a prompt.
  
  building LLMs
Visit annotations in context

Tags

building LLMs

Annotators

peter_murray

URL

stratechery.com/2025/deepseek-faq/
www.deepseek.com www.deepseek.com

DeepSeek

1
1. tonz 27 Jan 2025
  
  in Public
  
  Chinese LLM spooking US based corps it seems. #openvraag why?
  
  llms deepseek China algogens
Visit annotations in context

Tags

deepseek

llms

algogens

China

Annotators

tonz

URL

deepseek.com/
www.nytimes.com www.nytimes.com

At the Intersection of A.I. and Spirituality

2
1. peter_murray 06 Jan 2025
  
  in Public
  
  On a recent afternoon at his synagogue, Rabbi Hayon recalled taking a picture of his bookshelf and asking his A.I. assistant which of the books he had not quoted in his recent sermons. Before A.I., he would have pulled down the titles themselves, taking the time to read through their indexes, carefully checking them against his own work.“I was a little sad to miss that part of the process that is so fruitful and so joyful and rich and enlightening, that gives fuel to the life of the Spirit,” Rabbi Hayon said. “Using A.I. does get you to an answer quicker, but you’ve certainly lost something along the way.”
  
  LLMs taking the joy out of the search for information
  
  LLMs for information retrieval
2. peter_murray 06 Jan 2025
  
  in Public
  
  For centuries, new technologies have changed the ways people worship, from the radio in the 1920s to television sets in the 1950s and the internet in the 1990s. Some proponents of A.I. in religious spaces have gone back even further, comparing A.I.’s potential — and fears of it — to the invention of the printing press in the 15th century.
  
  Religions use new technologies
  
  The first major book printed by Guttenburg on his printing press was, of course, the Bible. Having biblical texts widely available in vernacular languages was one of the causes of the Reformation.
  
  See also The Divided Dial: Episode 2 - From Pulpit to Politics | On the Media | WNYC Studios.
  
  LLMs for religion
Visit annotations in context

Tags

LLMs for information retrieval

LLMs for religion

Annotators

peter_murray

URL

nytimes.com/2025/01/03/technology/ai-religious-leaders.html
Dec 2024
www.forbes.com www.forbes.com

Why Anthropic’s Model Context Protocol Is A Big Step In The Evolution Of AI Agents

1
1. tonz 02 Dec 2024
  
  in Public
  
  https://web.archive.org/web/20241202060131/https://www.forbes.com/sites/janakirammsv/2024/11/30/why-anthropics-model-context-protocol-is-a-big-step-in-the-evolution-of-ai-agents/
  
  Anthropic proposes 'Model Context Protocol' MCP on how to connect local/external info sources to LLMs and agents, as a standard. To make ai tools more context aware. Article says MCP is open source. Idea is to attach a MCP server to every source and have that interact over MCP with the MCP client attached to a model and/or tools.
  
  Anthropic is the org of Claude model.
  
  mcp anthropic llms context
Visit annotations in context

Tags

mcp

anthropic

context

llms

Annotators

tonz

URL

forbes.com/sites/janakirammsv/2024/11/30/why-anthropics-model-context-protocol-is-a-big-step-in-the-evolution-of-ai-agents/
lmstudio.ai lmstudio.ai

LM Studio REST API (beta) - API | LM Studio Docs

1
1. tonz 01 Dec 2024
  
  in Public
  
  LM Studio can run LLMs locally (I have llama and phi installed). It also has an API over a localhost webserver. I use that API to make llama available in Obsidian using the Copilot plugin.
  
  This is the API documentation. #openvraag other scripts / [[Persoonlijke tools 20200619203600]] I can use this in?
  
  lmstudio algogens llms localai ai-inthewall
Visit annotations in context

Tags

lmstudio

llms

algogens

localai

ai-inthewall

Annotators

tonz

URL

lmstudio.ai/docs/api/rest-api
Sep 2024
pivot-to-ai.com pivot-to-ai.com

Routledge nags academics to finish books ASAP — to feed Microsoft’s AI

1
1. tonz 29 Sep 2024
  
  in Public
  
  https://web.archive.org/web/20240929075044/https://pivot-to-ai.com/2024/09/28/routledge-nags-academics-to-finish-books-asap-to-feed-microsofts-ai/
  
  Academic publishers are pushing authors to speed up delivering manuscripts and articles (incl suggesting peer review be done in 15d) to meet the quota they promised the AI companies they sold their soul to. Taylor&Francis/Routledge 75M USD/yr, Wiley 44M USD. No opt-outs etc. What if you ask those #algogens if this is a good idea?
  
  ai llms microsoft academia
Visit annotations in context

Tags

microsoft

ai

academia

llms

Annotators

tonz

URL

pivot-to-ai.com/2024/09/28/routledge-nags-academics-to-finish-books-asap-to-feed-microsofts-ai/
github.com github.com

wordfreq/SUNSET.md at master · rspeer/wordfreq

1
1. tonz 21 Sep 2024
  
  in Public
  
  I don't think anyone has reliable information about post-2021 language usage by humans. The open Web (via OSCAR) was one of wordfreq's data sources. Now the Web at large is full of slop generated by large language models, written by no one to communicate nothing. Including this slop in the data skews the word frequencies. Sure, there was spam in the wordfreq data sources, but it was manageable and often identifiable. Large language models generate text that masquerades as real language with intention behind it, even though there is none, and their output crops up everywhere.
  
  Robyn Speer will no update longer Wordfreq States that n:: there is no reliable post-2021 language usage data! Wordfreq was using open web sources, but it getting pollutted by #algogens output
  
  wordfreq llms dataquality corpus reverseturing epistomology_centipede
Visit annotations in context

Tags

epistomology_centipede

llms

wordfreq

reverseturing

dataquality

corpus

Annotators

tonz

URL

github.com/rspeer/wordfreq/blob/master/SUNSET.md
Jul 2024
www.hyperorg.com www.hyperorg.com

Transparency is the new objectivity - Joho the Blog

1
1. tonz 12 Jul 2024
  
  in Public
  
  https://web.archive.org/web/20240712174702/https://www.hyperorg.com/blogger/2024/07/11/limiting-ais-imagination/ When 18m ago I played with the temperature (I don't remember how or what but it was an actual setting in the model, probably something from huggingface) what stood out for me was that at 0 it was immediately obvious it was automated, and it yielded the same answer to the same prompt repeatedly as it stuck to the likeliest outcome for each next token. At higher temps it would get wilder, and it struck me as easier to project a human having written it. Since then I almost regard the temp setting as the fakery/projectionlikelihood level. Although it doesn't take much to trigger projection, as per Eliza. l n:: temp v modellen maakt projecte mogelijk
  
  llms eliiza projection
Visit annotations in context

Tags

projection

llms

eliiza

Annotators

tonz

URL

hyperorg.com/blogger/2024/07/11/limiting-ais-imagination/
Jun 2024
debot.lodder.dev debot.lodder.dev

DeBot

1
1. tonz 05 Jun 2024
  
  in Public
  
  Een project v Open State Foundation.
  
  osf ai llms
Visit annotations in context

Tags

ai

llms

osf

Annotators

tonz

URL

debot.lodder.dev/
May 2024
media.dltj.org media.dltj.org

Video: Handling Academic Copyright and Artificial Intelligence Research Questions as the Law Develops by CNI Spring Meeting 2024, annotated

1
1. peter_murray 28 May 2024
  
  in Public
  
  why training artificial intelligence in research context is and should continue to be a fair use
  
  Examination of AI training relative to the four factors of fair use
  
  LLM copyright building LLMs
Visit annotations in context

Tags

LLM copyright

building LLMs

Annotators

peter_murray

URL

media.dltj.org/annotated-video/20240527T173838-GMttBH1oAD4-handling-academic-copyright-artificial-intelligence-research-questions-law-develops/index.html
media.dltj.org media.dltj.org

Video: Navigating Generative AI: Early Findings and Implications for Research, Teaching, and Learning by CNI Spring Meeting 2024, annotated

2
1. peter_murray 27 May 2024
  
  in Public
  
  And in this on the side, you see we have this new chat box where the user can engage with the content and this very first action. The user doesn't have to do anything. They land on the page and as long as they run a search, we immediately process a prompt that says what in your voice, how is the query you put in?
  
  Initial LLM chat prompt: why did this document come up
  
  Using the patron's keyword search phrase, the first chat shown is the LLM analyzing why this document matched the patron's criteria. Then there are preset prompts for summarizing what the text is about, recommended topics to search, and a prompt to "talk to the document".
  
  LLMs for research
2. peter_murray 27 May 2024
  
  in Public
  
  Navigating Generative Artificial Intelligence: Early Findings and Implications for Research, Teaching, and Learning
  
  Spring 2024 Member Meeting: CNI website • YouTube
  
  Beth LaPensee Senior Product Manager ITHAKA
  
  Kevin Guthrie President ITHAKA
  
  Starting in mid-2023, ITHAKA began investing in and engaging directly with generative artificial intelligence (AI) in two broad areas: a generative AI research tool on the JSTOR platform and a collaborative research project led by Ithaka S+R. These technologies are so crucial to our futures that working directly with them to learn about their impact, both positive and negative, is extremely important.
  
  This presentation will share early findings that illustrate the impact and potential of generative AI-powered research based on what JSTOR users are expecting from the tool, how their behavior is changing, and implications for changes in the nature of their work. The findings will be contextualized with the cross-institutional learning and landscape-level research being conducted by Ithaka S+R. By pairing data on user behavior with insights from faculty and campus leaders, the session will share early signals about how this technology-enabled evolution is beginning to take shape.
  
  https://www.jstor.org/generative-ai-faq
  
  LLMs in education
Visit annotations in context

Tags

LLMs for research

LLMs in education

Annotators

peter_murray

URL

media.dltj.org/annotated-video/20240527T172152-SE4zl7Isy5k-navigating-generative-ai-early-findings-implications-research-teaching-learning/index.html
media.dltj.org media.dltj.org

Video: Navigating Generative AI: Early Findings and Implications for Research, Teaching, and Learning by CNI Spring 2024, annotated

1
1. peter_murray 26 May 2024
  
  in Public
  
  Navigating Generative Artificial Intelligence: Early Findings and Implications for Research, Teaching, and Learning
  
  Spring 2024 Member Meeting: CNI website • YouTube
  
  Beth LaPensee Senior Product Manager ITHAKA
  
  Kevin Guthrie President ITHAKA
  
  Starting in mid-2023, ITHAKA began investing in and engaging directly with generative artificial intelligence (AI) in two broad areas: a generative AI research tool on the JSTOR platform and a collaborative research project led by Ithaka S+R. These technologies are so crucial to our futures that working directly with them to learn about their impact, both positive and negative, is extremely important.
  
  This presentation will share early findings that illustrate the impact and potential of generative AI-powered research based on what JSTOR users are expecting from the tool, how their behavior is changing, and implications for changes in the nature of their work. The findings will be contextualized with the cross-institutional learning and landscape-level research being conducted by Ithaka S+R. By pairing data on user behavior with insights from faculty and campus leaders, the session will share early signals about how this technology-enabled evolution is beginning to take shape.
  
  https://www.jstor.org/generative-ai-faq
  
  LLMs in education
Visit annotations in context

Tags

LLMs in education

Annotators

peter_murray

URL

media.dltj.org/annotated-video/20240526T083931-SE4zl7Isy5k-Navigating_Generative_AI-_Early_Findings_and_Implications_for_Research-_Teaching-_and_Learning/index.html
www.arl.org www.arl.org

The ARL/CNI 2035 Scenarios: AI-Influenced Futures in the Research Environment

1
1. peter_murray 25 May 2024
  
  in Public
  
  The ARL/CNI 2035 Scenarios: AI-Influenced Futures in the Research Environment. Washington, DC, and West Chester, PA: Association of Research Libraries, Coalition for Networked Information, and Stratus Inc., May 2024. https://doi.org/10.29242/report.aiscenarios2024
  
  LLMs in education
Visit annotations in context

Tags

LLMs in education

Annotators

peter_murray

URL

arl.org/wp-content/uploads/2024/05/ARL-CNI-2035-AI-Scenarios-5May2024.pdf
Jan 2024
www.eff.org www.eff.org

AI Art Generators and the Online Image Market

1
1. peter_murray 21 Jan 2024
  
  in Public
  
  Images of women are more likely to be coded as sexual in nature than images of men in similar states of dress and activity, because of widespread cultural objectification of women in both images and its accompanying text. An AI art generator can “learn” to embody injustice and the biases of the era and culture of the training data on which it is trained.
  
  Objectification of women as an example of AI bias
  
  bias in LLMs
Visit annotations in context

Tags

bias in LLMs

Annotators

peter_murray

URL

eff.org/deeplinks/2023/04/ai-art-generators-and-online-image-market
Nov 2023
media.dltj.org media.dltj.org

"Geoffrey Hinton: “It’s Far Too Late” to Stop Artificial Intelligence" uncorrected transcript (The New Yorker Radio Hour)

1
1. peter_murray 20 Nov 2023
  
  in Public
  
  One of the ways that, that chat G BT is very powerful is that uh if you're sufficiently educated about computers and you want to make a computer program and you can instruct uh chat G BT in what you want with enough specificity, it can write the code for you. It doesn't mean that every coder is going to be replaced by Chad GP T, but it means that a competent coder uh with an imagination can accomplish a lot more than she used to be able to, uh maybe she could do the work of five coders. Um So there's a dynamic where people who can master the technology can get a lot more done.
  
  ChatGPT augments, not replaces
  
  You have to know what you want to do before you can provide the prompt for the code generation.
  
  LLMs for programming
Visit annotations in context

Tags

LLMs for programming

Annotators

peter_murray

URL

media.dltj.org/unchecked-transcript/20231120T143812-The_New_Yorker_Radio_Hour--Geoffrey_Hinton-_Its_Far_Too_Late_to_Stop_Artificial_Intelligence/index.html
Sep 2023
fortune.com fortune.com

ChatGPT is not a search engine. Here's why you should think of it as a 'glider' to get better results, says engineer with Harvard PhD in neuroscience

1
1. George9765 23 Sep 2023
  
  in Public
  
  #prompt engineering #LLMs #ChatGPT #education
Visit annotations in context

Tags

#ChatGPT

#education

#prompt engineering

#LLMs

Annotators

George9765

URL

fortune.com/2023/07/24/what-is-best-way-to-use-chatgpt-ai-prompts-microsoft/
analyticsindiamag.com analyticsindiamag.com

Llama 2 vs GPT-4 vs Claude-2

1
1. George9765 23 Sep 2023
  
  in Public
  
  considering that Llama-2 has open weights, it is highly likely that it will improve significantly over time.
  
  I believe the author refers to the open-sources of llama-2 model. It allows quick and specific fine-tuning of the original big model.
  
  #LLMs #Llama-2
Visit annotations in context

Tags

#Llama-2

#LLMs

Annotators

George9765

URL

analyticsindiamag.com/llama-2-vs-gpt-4-vs-claude-2/
Jul 2023
arxiv.org arxiv.org

2306.04141.pdf

1
1. peter_murray 21 Jul 2023
  
  in Public
  
  AI-generated content may also feed future generative models, creating a self-referentialaesthetic flywheel that could perpetuate AI-driven cultural norms. This flywheel may in turnreinforce generative AI’s aesthetics, as well as the biases these models exhibit.
  
  AI bias becomes self-reinforcing
  
  Does this point to a need for more diversity in AI companies? Different aesthetic/training choices leads to opportunities for more diverse output. To say nothing of identifying and segregating AI-generated output from being used i the training data of subsequent models.
  
  building LLMs
Visit annotations in context

Tags

building LLMs

Annotators

peter_murray

URL

arxiv.org/pdf/2306.04141.pdf
May 2023
maggieappleton.com maggieappleton.com

Talk: The Expanding Dark Forest and Generative AI

1
1. tonz 04 May 2023
  
  in Public
  
  Some of these people will become even more mediocre. They will try to outsource too much cognitive work to the language model and end up replacing their critical thinking and insights with boring, predictable work. Because that’s exactly the kind of writing language models are trained to do, by definition.
  
  If you use LLMs to improve your mediocre writing it will help. If you use it to outsource too much of your own cognitive work it will get you the bland SEO texts the LLMs were trained on and the result will be more mediocre. Greedy reductionism will get punished.
  
  reductionism llms generativeai
Visit annotations in context

Tags

reductionism

llms

generativeai

Annotators

tonz

URL

maggieappleton.com/forest-talk
Dec 2022
garymarcus.substack.com garymarcus.substack.com

AI's Jurassic Park moment

2
1. ravenscroftj 10 Dec 2022
  
  in Public
  
  every country is going to need to reconsider its policies on misinformation. It’s one thing for the occasional lie to slip through; it’s another for us all to swim in a veritable ocean of lies. In time, though it would not be a popular decision, we may have to begin to treat misinformation as we do libel, making it actionable if it is created with sufficient malice and sufficient volume.
  
  What to do then when our government reps are already happy to perpetuate "culture wars" and empty talking points?
  
  nlproc Policy LLMs
2. ravenscroftj 10 Dec 2022
  
  in Public
  
  anyone skilled in the art can now replicate their recipe.
  
  Well anyone skilled enough who has $500k for the gpu bill and access to and the means to store the corpus... So corporations I guess... Yey!
  
  nlproc LLMs
Visit annotations in context

Tags

Policy

nlproc

LLMs

Annotators

ravenscroftj

URL

garymarcus.substack.com/p/ais-jurassic-park-moment

Origin of the "Cognitive Debt" phrase

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

LLM safety constrains narrow responses to increase predictability

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL