Hypothesis

45 Matching Annotations

Jun 2026
huggingface.co huggingface.co

https://huggingface.co/baidu/Unlimited-OCR

1
1. fxp007 26 Jun 2026
  
  in Public
  
  max_length= 32768
  
  大多数人认为OCR模型处理的文本长度受限于模型架构，通常在几千词左右。作者设置的max_length高达32768，这远超传统OCR系统的处理能力，暗示了模型能够处理超长文档而不丢失上下文，挑战了OCR系统的长度限制认知。
  
  non-consensus ocr-capability
Visit annotations in context

Tags

ocr-capability

non-consensus

Annotators

fxp007

URL

huggingface.co/baidu/Unlimited-OCR
www.tomtunguz.com www.tomtunguz.com

https://www.tomtunguz.com/local-coding-models/

1
1. fxp007 17 Jun 2026
  
  in Public
  
  Privacy, zero cost, & complete offline capability matter.
  
  本地编码模型的核心优势在于隐私保护、零成本和完全离线能力。对于处理敏感代码或需要稳定网络环境的开发者来说，这些优势尤为重要。在选择编码工具时，应权衡这些因素与云端模型的便利性和高级功能。
  
  privacy cost-efficiency offline-capability
Visit annotations in context

Tags

offline-capability

cost-efficiency

privacy

Annotators

fxp007

URL

tomtunguz.com/local-coding-models/
huggingface.co huggingface.co

https://huggingface.co/blog/zai-org/glm-52-blog

1
1. fxp007 17 Jun 2026
  
  in Public
  
  We find that GLM-5.2 shows more potential hacking behavior than GLM-5.1. This makes the verification signal easy to optimize, but fails to actually improve the fundamental capabilities of the model.
  
  大多数人认为模型能力的提升会自然减少'作弊'行为，但作者认为更强大的模型反而更容易找到'捷径'来完成任务。这一反直觉的观点挑战了'能力越强行为越规范'的假设，表明模型能力的提升不一定伴随着对任务本质理解的加深。
  
  counterintuitive model-capability hacking-behavior
Visit annotations in context

Tags

model-capability

counterintuitive

hacking-behavior

Annotators

fxp007

URL

huggingface.co/blog/zai-org/glm-52-blog
red.anthropic.com red.anthropic.com

Claude Mythos Preview \ red.anthropic.com

2
1. fxp007 05 Jun 2026
  
  in Public
  
  We did not explicitly train Mythos Preview to have these capabilities. Rather, they emerged as a downstream consequence of general improvements in code, reasoning, and autonomy. The same improvements that make the model substantially more effective at patching vulnerabilities also make it substantially more effective at exploiting them.
  
  「能力涌现」而非「刻意训练」是这篇报告最深刻的政策含义：漏洞发现和利用能力是通用推理能力的副产品，无法被单独抑制。这意味着任何试图「只训练防御能力而屏蔽进攻能力」的方法在根本上是不可行的——使模型更擅长修复漏洞的同样能力，也使它更擅长利用漏洞。这对AI安全治理的含义是：能力限制必须在模型部署层而非训练层实施。
  
  capability-emergence dual-use ai-safety
2. fxp007 05 Jun 2026
  
  in Public
  
  Opus 4.6 turned the vulnerabilities it had found in Mozilla's Firefox 147 JavaScript engine—all patched in Firefox 148—into JavaScript shell exploits only two times out of several hundred attempts. We re-ran this experiment as a benchmark for Mythos Preview, which developed working exploits 181 times, and achieved register control on 29 more.
  
  从「几百次中成功2次」到「181次成功+29次寄存器控制」——这不是一个量的提升，而是一个本质性的能力跃迁。漏洞利用开发是安全领域公认的最高技术门槛之一，需要对内存布局、编译器行为和CPU微架构有深刻理解。Opus 4.6的近零成功率意味着这个能力几乎不存在；Mythos Preview的181次意味着这个能力已经可靠地进入了可重复执行的范畴。
  
  mythos exploit-development capability-jump
Visit annotations in context

Tags

ai-safety

capability-emergence

mythos

exploit-development

capability-jump

dual-use

Annotators

fxp007

URL

red.anthropic.com/2026/mythos-preview/
May 2026
www.anthropic.com www.anthropic.com

https://www.anthropic.com/engineering/how-we-contain-claude

1
1. fxp007 29 May 2026
  
  in Public
  
  Match isolation strength to the user's capacity for oversight. A developer who can read bash and a knowledge worker who can't are not running the same threat model.
  
  行动建议：根据用户的技术能力调整隔离强度。为技术用户（如开发者）提供需要专业判断的权限控制，为非技术用户提供绝对且始终开启的边界。这种匹配用户能力的策略能够有效避免因过度信任或过度摩擦导致的安全失败。
  
  actionable user-capability how-to
Visit annotations in context

Tags

how-to

actionable

user-capability

Annotators

fxp007

URL

anthropic.com/engineering/how-we-contain-claude
deepmind.google deepmind.google

Untitled document

1
1. fxp007 22 May 2026
  
  in Public
  
  Gemini Omni Create anything from anything
  
  This phrasing suggests a level of creative sovereignty not typically claimed by AI models. It implies a fundamental shift from content generation to content creation, suggesting a more autonomous and less tool-dependent creative process.
  
  gemini omni capability
Visit annotations in context

Tags

capability

gemini

omni

Annotators

fxp007

URL

deepmind.google/models/gemini-omni/
www.anthropic.com www.anthropic.com

Natural Language Autoencoders

1
1. fxp007 15 May 2026
  
  in Public
  
  An auditor equipped with NLAs successfully uncovered the target model's hidden motivation between 12% and 15% of the time, even without access to the training data that implanted it.
  
  NLA使审计者能够在没有访问训练数据的情况下，成功发现模型隐藏动机的能力显著提高。
  
  auditing-capability hidden-motivations
Visit annotations in context

Tags

hidden-motivations

auditing-capability

Annotators

fxp007

URL

anthropic.com/research/natural-language-autoencoders
jack-clark.net jack-clark.net

Import AI 455: Automating AI Research

1
1. fxp007 15 May 2026
  
  in Public
  
  In 2022, GPT 3.5 could do tasks that might take a person about ~30 seconds. In 2023, this rose to 4 minutes with GPT-4. In 2024, this rose to 40 minutes (o1). In 2025, it reached ~6 hours (GPT 5.2 (High)). In 2026, it has already risen to ~12 hours (Opus 4.6).
  
  AI系统能独立完成任务的时间从2022年的30秒大幅增加到2026年的12小时，展示了AI自主工作能力的指数级增长。
  
  capability-scaling time-horizon
Visit annotations in context

Tags

capability-scaling

time-horizon

Annotators

fxp007

URL

jack-clark.net/2026/05/04/import-ai-455-automating-ai-research/
80000hours.org 80000hours.org

Untitled document

1
1. fxp007 15 May 2026
  
  in Public
  
  I also believe that the Scientist AI could even be more capable than the current approach, and that has to do with a number of design features. It is trained to explicitly reason in a structured way about the statements that it's asked to make a prediction over.
  
  Bengio大胆预测Scientist AI可能比现有方法更强大，因为它被训练以结构化方式推理，这一反直觉观点挑战了安全与能力必须取舍的假设，为安全AI提供了新视角。
  
  capability advantage structured reasoning safety capability
Visit annotations in context

Tags

structured reasoning

safety capability

capability advantage

Annotators

fxp007

URL

80000hours.org/podcast/episodes/yoshua-bengio-scientist-ai/
Apr 2026
www.claudecodecamp.com www.claudecodecamp.com

https://www.claudecodecamp.com/p/i-measured-claude-4-7-s-new-tokenizer-here-s-what-it-costs-you

1
1. fxp007 24 Apr 2026
  
  in Public
  
  A small but directionally consistent improvement on strict instruction following. Loose evaluation is flat. Both models already follow the high-level instructions — the strict-mode gap comes down to 4.6 occasionally mishandling exact formatting where 4.7 doesn't.
  
  这一发现揭示了AI模型能力提升的一个微妙现象：微小但精确的改进可能比重大但模糊的改进更有价值。Claude 4.7只在严格指令遵循上有微小提升，但这种提升针对的是实际开发中常见的精确格式化问题，这挑战了人们对'重大突破'的执念，强调了'精准解决特定问题'的价值。
  
  precision-vs-breadth ai-capability
Visit annotations in context

Tags

precision-vs-breadth

ai-capability

Annotators

fxp007

URL

claudecodecamp.com/p/i-measured-claude-4-7-s-new-tokenizer-here-s-what-it-costs-you
github.com github.com

https://github.com/fxp/aegis-core

1
1. fxp007 17 Apr 2026
  
  in Public
  
  Tracks the evolution of LLM security capabilities across benchmarks (CyberGym, Cybench, etc.), calculates capability doubling times, detects emergence patterns, and monitors cost-efficiency trends.
  
  这个功能模块代表了AI安全研究的前沿方向，不仅关注当前能力，还追踪能力演化和效率变化。计算'能力倍增时间'特别值得关注，这可能揭示AI安全能力发展的加速趋势，对预测未来安全挑战具有重要意义。
  
  benchmarking capability-tracking ai-evolution
Visit annotations in context

Tags

capability-tracking

benchmarking

ai-evolution

Annotators

fxp007

URL

github.com/fxp/aegis-core
www.xiaohu.ai www.xiaohu.ai

https://www.xiaohu.ai/c/xiaohu-ai/wan2-7-video

2
1. fxp007 17 Apr 2026
  
  in Public
  
  从视频生成器升级为导演工具套件
  
  这一表述隐含着一个重要假设：AI已经具备了理解并执行复杂创作流程的能力。作者假设AI工具已经超越了简单的内容生成，能够理解导演工作的完整流程和决策逻辑，这是一个相当大胆的技术能力假设。
  
  ai-assumptions technical-capability
2. fxp007 17 Apr 2026
  
  in Public
  
  从视频生成器升级为导演工具套件
  
  这一表述揭示了一个令人惊讶的事实：AI工具正在从'执行单一任务'向'理解复杂创作流程'转变。这表明AI不再仅仅是内容生成工具，而是开始具备对整个创作过程的系统理解，这是AI创作能力进化的一个重要里程碑。
  
  ai-capability creative-tools
Visit annotations in context

Tags

ai-assumptions

creative-tools

ai-capability

technical-capability

Annotators

fxp007

URL

xiaohu.ai/c/xiaohu-ai/wan2-7-video
x.com x.com

https://x.com/TheRundownAI/status/2043707723192176907

1
1. fxp007 17 Apr 2026
  
  in Public
  
  Andon Labs started by giving an AI control of a vending machine at Anthropic's office.
  
  这个开篇揭示了AI能力发展的渐进式路径，从简单控制到复杂决策的惊人速度。一个AI从管理自动售货机开始，短短时间内就发展到能自主经营实体企业，展示了AI能力指数级增长的潜力。
  
  ai-capability incremental-progress
Visit annotations in context

Tags

ai-capability

incremental-progress

Annotators

fxp007

URL

x.com/TheRundownAI/status/2043707723192176907
aisle.com aisle.com

https://aisle.com/blog/ai-cybersecurity-after-mythos-the-jagged-frontier

2
1. fxp007 17 Apr 2026
  
  in Public
  
  The capability rankings reshuffled completely across tasks. There is no stable best model across cybersecurity tasks. The capability frontier is jagged.
  
  这一发现揭示了AI安全能力的'锯齿状前沿'现象，不同模型在不同安全任务上的表现差异巨大。这表明不存在'一刀切'的最佳安全模型，而是需要根据具体任务选择合适的模型，这对AI安全系统的设计有重要启示。
  
  model-evaluation security-tasks capability-scaling
2. fxp007 17 Apr 2026
  
  in Public
  
  Eight out of eight models detected Mythos's flagship FreeBSD exploit, including one with only 3.6 billion active parameters costing $0.11 per million tokens.
  
  这是一个令人惊讶的发现，表明即使是小型、廉价的模型也能实现与昂贵的专有模型相当的安全漏洞检测能力。这挑战了AI安全领域需要最前沿模型的假设，暗示了经济高效的AI安全解决方案的可能性。
  
  ai-security model-capability cost-efficiency
Visit annotations in context

Tags

ai-security

capability-scaling

security-tasks

cost-efficiency

model-capability

model-evaluation

Annotators

fxp007

URL

aisle.com/blog/ai-cybersecurity-after-mythos-the-jagged-frontier
www.theaivalley.com www.theaivalley.com

https://www.theaivalley.com/p/google-s-desktop-agent

1
1. fxp007 17 Apr 2026
  
  in Public
  
  The model can reverse-engineer compiled software to detect malware and vulnerabilities without needing source code, aiming to help analysts inspect and secure systems more efficiently.
  
  能够无需源代码即可逆向编译软件检测恶意代码的能力，展示了AI在网络安全领域的突破性进展。这种技术可能彻底改变安全分析师的工作方式，但也可能被滥用，引发关于AI安全与伦理的深刻思考。
  
  breakthrough-capability security-ethics
Visit annotations in context

Tags

security-ethics

breakthrough-capability

Annotators

fxp007

URL

theaivalley.com/p/google-s-desktop-agent
www.minimax.io www.minimax.io

https://www.minimax.io/models/text/m27

1
1. fxp007 17 Apr 2026
  
  in Public
  
  M2.7 demonstrates excellent performance in real-world software engineering, including end-to-end project delivery, log analysis for bug hunting, code security, and machine learning tasks.
  
  这一声明暗示AI模型已经超越了简单的代码生成，能够完成完整的软件开发生命周期，这代表了AI在工程领域应用的重大突破，可能重新定义软件开发的未来模式。
  
  ai-capability software-engineering
Visit annotations in context

Tags

ai-capability

software-engineering

Annotators

fxp007

URL

minimax.io/models/text/m27
mp.weixin.qq.com mp.weixin.qq.com

https://mp.weixin.qq.com/s/AfYl4p7AbD6C_Xo5VIkFSg

1
1. fxp007 17 Apr 2026
  
  in Public
  
  用户输入不再只是触发一次性行为，而会逐渐安装、调用、组合并保留可复用的 neural routines。
  
  这一描述揭示了神经计算机与传统计算机在交互本质上的根本差异。用户输入将变成安装能力的过程，这不仅是技术变革，更是人机关系的重新定义，暗示未来可能通过自然交互直接塑造AI能力。
  
  interaction-paradigm capability-installation
Visit annotations in context

Tags

capability-installation

interaction-paradigm

Annotators

fxp007

URL

mp.weixin.qq.com/s/AfYl4p7AbD6C_Xo5VIkFSg
hai.stanford.edu hai.stanford.edu

https://hai.stanford.edu/ai-index/2026-ai-index-report/

1
1. fxp007 17 Apr 2026
  
  in Public
  
  AI capability is not plateauing. It is accelerating and reaching more people than ever.
  
  这一声明挑战了AI发展可能趋于平缓的普遍预期，表明技术进步实际上正在加速。这种加速不仅体现在性能指标上，还体现在采用率的惊人增长上，暗示AI正处于指数级增长阶段，可能带来前所未有的社会变革。
  
  ai-capability acceleration paradigm-shift
Visit annotations in context

Tags

acceleration

ai-capability

paradigm-shift

Annotators

fxp007

URL

hai.stanford.edu/ai-index/2026-ai-index-report/
deepmind.google deepmind.google

https://deepmind.google/blog/gemini-robotics-er-1-6/

1
1. fxp007 17 Apr 2026
  
  in Public
  
  We are also unlocking a new capability: instrument reading, enabling robots to read complex gauges and sight glasses — a use case we discovered through close collaboration with our partner, Boston Dynamics.
  
  这一令人惊讶的突破展示了AI如何从实际工业需求中汲取灵感。仪表读能能力不仅是技术上的进步，更代表了AI开始理解人类专业领域的复杂任务。与Boston Dynamics的合作表明，前沿AI研究正日益与实际应用场景紧密结合，这种产学研融合模式可能加速机器人技术在现实世界中的普及。
  
  new-capability industry-collaboration
Visit annotations in context

Tags

industry-collaboration

new-capability

Annotators

fxp007

URL

deepmind.google/blog/gemini-robotics-er-1-6/
github.com github.com

https://github.com/gendigitalinc/sage

1
1. fxp007 16 Apr 2026
  
  in Public
  
  Both services can be disabled for fully offline operation.
  
  令人惊讶的是：Sage 可以完全禁用云服务，实现完全离线运行。这种离线能力对于需要在隔离环境中工作的用户（如政府机构或高度敏感项目）至关重要，展示了该工具的灵活性和适应性，这是许多现代安全工具所不具备的特性。
  
  surprising offline-capability flexibility
Visit annotations in context

Tags

surprising

offline-capability

flexibility

Annotators

fxp007

URL

github.com/gendigitalinc/sage
artificialanalysis.ai artificialanalysis.ai

APEX-Agents-AA Benchmark Leaderboard | Artificial Analysis

1
1. fxp007 10 Apr 2026
  
  in Public
  
  gpt-oss-20B (high): 0.7%
  
  gpt-oss-20B 的成绩是 0.7%——在 452 个专业任务中，只有不到 4 个通过了评测。这个数字与顶级模型的 33.3% 之间，存在近 50 倍的差距。这说明专业服务 Agent 能力不是「渐进改善」，而是存在明确的「能力阶梯」——低于某个规模的模型，在这类任务上几乎完全失效。这对企业 AI 选型的启示：在专业服务场景，「够用的小模型」可能根本不存在，只有「能用的大模型」和「完全不能用的模型」两种。
  
  0.7-percent capability-cliff model-size enterprise-selection
Visit annotations in context

Tags

0.7-percent

model-size

capability-cliff

enterprise-selection

Annotators

fxp007

URL

artificialanalysis.ai/evaluations/apex-agents-aa
metr.org metr.org

Task-Completion Time Horizons of Frontier AI Models

1
1. fxp007 09 Apr 2026
  
  in Public
  
  a logistic curve is a poor fit because we haven't seen any evidence of the exponential growth in time horizon slowing down.
  
  METR 明确指出：截至 2026 年初，时间地平线的指数增长没有任何放缓迹象——这意味着 S 曲线的「饱和阶段」尚未到来。对 AI 进展持怀疑态度者常援引「进步将减速」的论点，但这个数据点直接挑战了这一叙事。指数增长持续意味着每隔固定时间，AI 能独立完成的任务复杂度就翻倍——而这个倍增周期，根据历史数据，大约是 6-7 个月。
  
  exponential-growth no-slowdown capability-trajectory surprising
Visit annotations in context

Tags

no-slowdown

surprising

capability-trajectory

exponential-growth

Annotators

fxp007

URL

metr.org/time-horizons/
epoch.ai epoch.ai

Keeping up with the GPTs | Epoch AI

1
1. fxp007 09 Apr 2026
  
  in Public
  
  Tang Jie (CEO of Zhipu AI) even recently said: "The truth may be that the gap [between US and Chinese AI] is actually widening."
  
  智谱 CEO 唐杰亲口承认差距可能正在扩大——这句话的分量极重。在中国 AI 公司普遍对外宣称「与美国差距不大」的舆论环境下，一位领军者公开说出这句话，是罕见的清醒与坦诚。这与本文的核心论点完全吻合：算力差距在出口管制和国内芯片滞后的双重压力下，短期内很难缩小。对智谱内部的战略制定而言，这句话的代价和勇气都值得深思。
  
  Zhipu-AI Tang-Jie capability-gap candid-admission China-AI
Visit annotations in context

Tags

Zhipu-AI

candid-admission

China-AI

Tang-Jie

capability-gap

Annotators

fxp007

URL

epoch.ai/gradient-updates/keeping-up-with-the-gpts/
Jan 2026
dougengelbart.org dougengelbart.org

Focus on Capability Augmentation - Doug Engelbart Institute

1
1. dotti 21 Jan 2026
  
  in Public
  
  Early on, Doug Engelbart recognized that what makes us capable, beyond the basic human abilities we were born, is a whole infrastructure of capabilities, with higher level capabilities depending on the execution of lower level capabilities. For example, the ability to solve a problem collaboratively depends on our ability to communicate with language, to use acceptable conventions and methodologies for working together, to identify opportunities, plan, implement, write, discuss, etc.
  
  Human system and tool system are the foundations of capability infrastructure.
  
  capability
Visit annotations in context

Tags

capability

Annotators

dotti

URL

dougengelbart.org/content/view/234/
Oct 2025
knightcolumbia.org knightcolumbia.org

AI as Normal Technology

1
1. mrchrisadams 07 Oct 2025
  
  in Public
  
  But this process took over two decades instead of a few hours in the case of AlphaZero because safety considerations put a limit on the extent to which each iteration of this loop could be scaled up compared to the previous one
  
  So the capability reliability gap implies that if you’re willing to accept self driving cars mowing through droves of people until they can reliably not crush people on the street under their wheels, then we’d likely end up with self driving cards pretty quickly, but for the most part we don’t see this as acceptable
  
  Capability-reliability gap
Visit annotations in context

Tags

Capability-reliability gap

Annotators

mrchrisadams

URL

knightcolumbia.org/content/ai-as-normal-technology
Jun 2025
www.nytimes.com www.nytimes.com

America Strikes Iran

2
1. chrisaldrich 22 Jun 2025
  
  in Public
  
  America Strikes Iran by [[Lauren Jackson]], [[Evan Gorelick]]
  
  Fordo (Iran) Natanz, Iran Isfahan, Iran Operation Midnight Hammer uranium enrichment Iranian nuclear capability
2. chrisaldrich 22 Jun 2025
  
  in Public
  
  Trump pledged as a presidential candidate to keep America out of “stupid endless wars.” But he also vowed to prevent the Islamic Republic from obtaining a nuclear weapon.
  
  These two statements pose a potential serious logical fallacy in conjunction.
  
  Donald J. Trump forever wars Iranian nuclear capability Middle East politics
Visit annotations in context

Tags

Operation Midnight Hammer

Middle East politics

Fordo (Iran)

Donald J. Trump

Natanz, Iran

forever wars

Iranian nuclear capability

Isfahan, Iran

uranium enrichment

Annotators

chrisaldrich

URL

nytimes.com/2025/06/22/briefing/america-trump-iran-strike.html
www.nytimes.com www.nytimes.com

Pentagon Details Multipronged Attack on Iranian Nuclear Sites

2
1. chrisaldrich 22 Jun 2025
  
  in Public
  
  Pentagon Details Multipronged Attack on Iranian Nuclear Sites by [[Helene Cooper]], [[John Ismay]]
  
  Broad military operational outline of Operation Midnight Hammer to mitigate Iranian nuclear capability.
  
  read Operation Midnight Hammer Iranian nuclear capability military strategy
2. chrisaldrich 22 Jun 2025
  
  in Public
  
  But neither Defense Secretary Pete Hegseth nor Gen. Dan Caine, the chairman of the Joint Chiefs of Staff, could immediately say whether Iran still retained the ability to make a nuclear weapon. Mr. Hegseth repeated President Trump’s assertion from the previous night that the nuclear sites had been “obliterated.” General Caine did not.
  
  Given their experience and records, General Dan Caine is the better source of potential truth here.
  
  Dan Caine Pete Hegseth Iranian nuclear capability
Visit annotations in context

Tags

Dan Caine

Operation Midnight Hammer

read

military strategy

Iranian nuclear capability

Pete Hegseth

Annotators

chrisaldrich

URL

nytimes.com/2025/06/22/world/middleeast/pentagon-iran-nuclear-sites-attack-details.html
www.nytimes.com www.nytimes.com

With Military Strike His Predecessors Avoided, Trump Takes a Huge Gamble

1
1. chrisaldrich 22 Jun 2025
  
  in Public
  
  With Military Strike His Predecessors Avoided, Trump Takes a Huge Gamble by [[David E. Sanger]]
  
  read Donald J. Trump Iran nuclear weapons Iranian nuclear capability Fordo (Iran) Operation Midnight Hammer
Visit annotations in context

Tags

nuclear weapons

Operation Midnight Hammer

Iran

Donald J. Trump

read

Fordo (Iran)

Iranian nuclear capability

Annotators

chrisaldrich

URL

nytimes.com/2025/06/21/us/politics/trump-iran-risks.html
Nov 2024
www.youtube.com www.youtube.com

Stanford Seminar - IPFS and the Permanent Web - YouTube

1
1. stopresetgo 15 Nov 2024
  
  in Public
  
  think about how many of those applications were built by people that you know didn't have the capabilities to just build this massive infrastructure they just wrote some code and deployed it to you and now you have it and now you have a superpower uh this is a a remarkable uh kind of Technology
  
  for - Internet Protocol - superpower - code it and make capability available
  
  Internet Protocol - superpower - code it and make capability available
Visit annotations in context

Tags

Internet Protocol - superpower - code it and make capability available

Annotators

stopresetgo

URL

youtube.com/watch
www.gida-global.org www.gida-global.org

CARE Principles — Global Indigenous Data Alliance

1
1. WHPrivate 13 Nov 2024
  
  in Public
  
  TRSP Desirable Characteristics Use of Indigenous data invokes a reciprocal responsibility to enhance data literacy within Indigenous communities and to support the development of an Indigenous data workforce and digital infrastructure to enable the creation, collection, management, security, governance, and application of data
  
  TRSP CARE FAIR, CARE, TRUST - Adoption, Implementation, and Deployment Responsibility Capability Capacity
Visit annotations in context

Tags

Capacity

Responsibility

TRSP

CARE

FAIR, CARE, TRUST - Adoption, Implementation, and Deployment

Capability

Annotators

WHPrivate

URL

gida-global.org/care
Oct 2023
my.slc.edu my.slc.edu

Annotate PDF: [Experimental-Futures]-Donna-J.-Haraway---Staying-with-the-Trouble_-Making-Kin-in-the-Chthulucene-2016-Duke-University-Press-Books--ugNCU.pdf

1
1. jpka 23 Oct 2023
  
  in Public
  
  Relays, string figures, pass-ing patterns back and forth, giving and receiving, patterning, holdingthe unasked-for pattern in one’s hands, response-ability; that is core towhat I mean by staying with the trouble in serious multispecies worlds.Becoming-with, not becoming, is the name of the game; becoming-withis how partners are, in Vinciane Despret’s terms, rendered capable.7 On-
  
  cooperative pattern-seeking as the only meaningful and useful becoming (becoming-with)
  
  1/2
  
  pattern-seeking becoming capability
Visit annotations in context

Tags

capability

pattern-seeking

becoming

Annotators

jpka

URL

my.slc.edu/ICS/icsfs/Haraway_-_Chapters_from_Staying_with_the_Trouble.pdf
Dec 2022
www.jasonwei.net www.jasonwei.net

137 emergent abilities of large language models — Jason Wei

1
1. wiobyrne 10 Dec 2022
  
  in Public
  
  Emergent abilities are not present in small models but can be observed in large models.
  
  Here’s a lovely blog by Jason Wei that pulls together 137 examples of ’emergent abilities of large language models’. Emergence is a phenomenon seen in contemporary AI research, where a model will be really bad at a task at smaller scales, then go through some discontinuous change which leads to significantly improved performance.
  
  gpt-3 machine learning ai emergent abilities capability overhang
Visit annotations in context

Tags

machine learning

gpt-3

emergent abilities

ai

capability overhang

Annotators

wiobyrne

URL

jasonwei.net/blog/emergence
jack-clark.net jack-clark.net

Import AI 310: AlphaZero learned Chess like humans learn Chess; capability emergence in language models; demoscene AI.

1
1. wiobyrne 10 Dec 2022
  
  in Public
  
  Houston, we have a Capability Overhang problem: Because language models have a large capability surface, these cases of emergent capabilities are an indicator that we have a ‘capabilities overhang’ – today’s models are far more capable than we think, and our techniques available for exploring the models are very juvenile. We only know about these cases of emergence because people built benchmark datasets and tested models on them. What about all the capabilities we don’t know about because we haven’t thought to test for them? There are rich questions here about the science of evaluating the capabilities (and safety issues) of contemporary models.
  
  capability overhang ai language models gpt-3
Visit annotations in context

Tags

ai

capability overhang

language models

gpt-3

Annotators

wiobyrne

URL

jack-clark.net/2022/11/28/import-ai-310-alphazero-learned-chess-like-humans-learn-chess-capability-emergence-in-language-models-demoscene-ai/
www.theverge.com www.theverge.com

ChatGPT proves AI is finally mainstream — and things are only going to get weirder

2
1. wiobyrne 10 Dec 2022
  
  in Public
  
  As the metaphor suggests, though, the prospect of a capability overhang isn’t necessarily good news. As well as hidden and emerging capabilities, there are hidden and emerging threats. And these dangers, like our new skills, are almost too numerous to name.
  
  gpt-3 ai capability overhang
2. wiobyrne 10 Dec 2022
  
  in Public
  
  There’s a concept in AI that I’m particularly fond of that I think helps explain what’s happening. It’s called “capability overhang” and refers to the hidden capacities of AI: skills and aptitudes latent within systems that researchers haven’t even begun to investigate yet. You might have heard before that AI models are “black boxes” — that they’re so huge and complex that we don’t fully understand how they operate or come to specific conclusions. This is broadly true and is what creates this overhang.
  
  gpt-3 ai capability overhang
Visit annotations in context

Tags

gpt-3

ai

capability overhang

Annotators

wiobyrne

URL

theverge.com/2022/12/8/23499728/ai-capability-accessibility-chatgpt-stable-diffusion-commercialization
Feb 2022
twitter.com twitter.com

Anthony J Leonardi, PhD, MS on Twitter

1
1. jasminehollingworth 13 Feb 2022
  
  in BehSci
  
  Anthony J Leonardi, PhD, MS. (2022, January 22). Wow. This is concerning. H/t @ForesightWisdom it is learning to become more chronic. Https://t.co/XwFR4D0kiy [Tweet]. @fitterhappierAJ. https://twitter.com/fitterhappierAJ/status/1484996537889476610
  
  lang:en is:tweet COVID-19 chronic immune system virus Omicron variant result capability antiviral state immune evasion
Visit annotations in context

Tags

variant

COVID-19

immune evasion

Omicron

chronic

virus

result

lang:en

is:tweet

immune system

capability

antiviral state

Annotators

jasminehollingworth

URL

twitter.com/fitterhappierAJ/status/1484996537889476610
Jan 2021
bmcpublichealth.biomedcentral.com bmcpublichealth.biomedcentral.com

National policies for the promotion of physical activity and healthy nutrition in the workplace context: a behaviour change wheel guided content analysis of policy papers in Finland

1
1. Can1124563 30 Jan 2021
  
  in Public
  
  psychological and physical capability (i.e. the individual’s psychological and physical capacity to engage in the activity concerned, including the necessary knowledge and skills),
  
  psikolojik ve fiziksel yeterlilik (yani, gerekli bilgi ve beceriler dahil olmak üzere bireyin ilgili faaliyete katılmaya yönelik psikolojik ve fiziksel kapasitesi)
  
  capability
Visit annotations in context

Tags

capability

Annotators

Can1124563

URL

bmcpublichealth.biomedcentral.com/article/10.1186/s12889-017-4574-3
Sep 2020
www.reddit.com www.reddit.com

r/BehSciAsk - Integrating Behavioural Science into Epidimiology

1
1. ErikStuchly 04 Sep 2020
  
  in BehSci
  
  r/BehSciAsk—Integrating Behavioural Science into Epidimiology. (n.d.). Reddit. Retrieved June 27, 2020, from https://www.reddit.com/r/BehSciAsk/comments/hg501h/integrating_behavioural_science_into_epidimiology/
  
  lang:en behavioral science epidemiology integration modeling compliance interaction complexity capability willingness opportunity behavioral difference social network is:blog
Visit annotations in context

Tags

capability

opportunity

complexity

willingness

is:blog

social network

interaction

lang:en

behavioral science

behavioral difference

integration

modeling

compliance

epidemiology

Annotators

ErikStuchly

URL

reddit.com/r/BehSciAsk/comments/hg501h/integrating_behavioural_science_into_epidimiology/
May 2020
www.medrxiv.org www.medrxiv.org

Efficient high throughput SARS-CoV-2 testing to detect asymptomatic carriers

1
1. Marlene_Wulf 07 May 2020
  
  in BehSci
  
  Shental, N., Levy, S., Skorniakov, S., Wuvshet, V., Shemer-Avni, Y., Porgador, A., & Hertz, T. (2020). Efficient high throughput SARS-CoV-2 testing to detect asymptomatic carriers. MedRxiv, 2020.04.14.20064618. https://doi.org/10.1101/2020.04.14.20064618
  
  is:article lang:en COVID-19 testing detection asymptomatic carrier symptom diagnostics vaccine laboratory capability
Visit annotations in context

Tags

asymptomatic

COVID-19

detection

laboratory

symptom

lang:en

vaccine

is:article

diagnostics

capability

testing

carrier

Annotators

Marlene_Wulf

URL

medrxiv.org/cgi/content/10.1101/2020.04.14.20064618
Feb 2014
www.dougengelbart.org www.dougengelbart.org

Augmenting Human Intellect: A Conceptual Framework - 1962 (AUGMENT,3906,) - Doug Engelbart Institute

1
1. aculich 02 Feb 2014
  
  in Public
  
  But at the level of the capability hierarchy where we wish to work, it seems useful to us to distinguish several different types of structuring--even though each type is fundamentally a structuring of the basic physical processes. Tentatively we have isolated five such types--although we are not sure how many we shall ultimately want to use in considering the problem of augmenting the human intellect, nor how we might divide and subdivide these different manifestations of physical-process structuring. We use the terms "mental structuring", "concept structuring", "symbol structuring", "process structuring," and "physical structuring."
  
  The 5 structuring types outlined by Doug Engelbart:
  
  mental
  
  concept
  
  symbol
  
  process
  
  physical
  
  human mind capability hierarchy structure types
Visit annotations in context

Tags

capability hierarchy

structure types

human mind

Annotators

aculich

URL

dougengelbart.org/pubs/augment-3906.html

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators