Hypothesis

3,975 Matching Annotations

Apr 2026
arxiv.org arxiv.org

https://arxiv.org/abs/2604.07190

1
1. fxp007 17 Apr 2026
  
  in Public
  
  We present a comprehensive adoption snapshot of the leading open language models and who is building them, focusing on the ~1.5K mainline open models
  
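  The report analyzes roughly 1,500 mainstream open models. Data collection at this scale offers an unprecedented macro-level view of the open-source AI ecosystem, and this systematic approach to measurement could become an important baseline for assessing the trajectory of AI development.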
  
  ecosystem-mapping data-scope benchmarking
Visit annotations in context

Tags

benchmarking

data-scope

ecosystem-mapping

Annotators

fxp007

URL

arxiv.org/abs/2604.07190
github.com github.com

https://github.com/gendigitalinc/sage

1
1. fxp007 17 Apr 2026
  
  in Public
  
  Sage sends URLs and package hashes to Gen Digital reputation APIs. File content, commands, and source code stay local.
  
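  This privacy statement reveals Sage's data-handling strategy, which follows a design philosophy of minimizing data transmission. The balance between security and privacy is insightful: it shows the developers understand users' concerns about data leakage while recognizing that some cloud-side analysis is necessary for effective threat detection.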
  
  privacy-design data-minimization security-philosophy
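
A minimal sketch of the data-minimization pattern the quoted statement describes: hash a package locally and send only the hash and the URL to a reputation service. This is not Sage's actual code. The SHA-256 choice, the function names, and the payload field names are all assumptions made for illustration.

```python
import hashlib


def package_hash(data: bytes) -> str:
    """Hash package bytes locally (SHA-256 is an assumed choice)."""
    return hashlib.sha256(data).hexdigest()


def build_reputation_request(url: str, package_bytes: bytes) -> dict:
    """Build a hypothetical lookup payload holding only a URL and a digest.

    The package contents never enter the payload, only their hash.
    """
    return {"url": url, "package_sha256": package_hash(package_bytes)}


request = build_reputation_request("https://example.com/pkg.tgz", b"package contents")
print(sorted(request))  # the payload keys: a URL and a hash, nothing else
```

The point of the design is that a digest lets a remote service recognize a known-bad package without ever receiving the file itself.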
Visit annotations in context

Tags

privacy-design

security-philosophy

data-minimization

Annotators

fxp007

URL

github.com/gendigitalinc/sage
every.to every.to

https://every.to/playtesting/the-market-for-making-ai-better

4
1. fxp007 17 Apr 2026
  
  in Public
  
  Academic publishers, documentary archives, game studios, and companies sitting on years of enterprise data have all been courted for the seeds of intelligence needed to train the next generation of models.
  
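  The expanding market for AI training data is reshaping how several traditional industries are valued. From academic publishing to game studios, seemingly unrelated data sources can all become 'seeds of intelligence' for AI training. This cross-industry convergence of data is creating new business opportunities and market dynamics.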
  
  data-sources industry-transformation ai-training
2. fxp007 17 Apr 2026
  
  in Public
  
  Mercor, which provides data to AI labs for training, became one of the fastest-growing companies in history before losing four terabytes of data to hackers last week.
  
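  Mercor's rapid rise stands in sharp contrast to its data breach, underscoring how critical data security is to AI training. The incident may prompt the industry to re-examine data security and privacy protection and push AI companies to adopt stricter data-management standards.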
  
  data-security ai-startups risk-management
3. fxp007 17 Apr 2026
  
  in Public
  
  A small model trained on fewer than 2,000 examples from real lawyers, bankers, and consultants recently beat all but the best frontier models on corporate legal work, at a fraction of the price.
  
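  This finding challenges the 'scale and compute beat everything' paradigm of AI development. A small model trained on high-quality specialized data outperformed most general-purpose frontier models in a specific domain, suggesting AI development may be shifting from 'bigger is better' toward a new phase of 'more specialized, more efficient'.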
  
  ai-performance specialized-models data-quality
4. fxp007 17 Apr 2026
  
  in Public
  
  Reddit, Shutterstock, and News Corp are making hundreds of millions a year licensing their high-quality data to companies training AI, and those contracts are growing about 20 percent annually, according to their quarterly filings.
  
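  This figure reveals the enormous economic value of the AI training data market and shows that high-quality data has become a strategic asset for AI companies. Traditional content companies are turning into 'input companies' for AI, a shift that changes their business models and redefines the central role of data in the AI ecosystem.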
  
  data-market ai-economy business-model
Visit annotations in context

Tags

ai-economy

ai-performance

data-security

risk-management

industry-transformation

ai-training

data-sources

data-market

business-model

specialized-models

ai-startups

data-quality

Annotators

fxp007

URL

every.to/playtesting/the-market-for-making-ai-better
epoch.ai epoch.ai

https://epoch.ai/data-insights/hyperscalers-control-most-compute

1
1. fxp007 17 Apr 2026
  
  in Public
  
  Our Chip Ownership data does not capture all global chip ownership, and has weaker coverage prior to 2023.
  
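  The limits on data coverage mean our understanding of the global distribution of compute has blind spots, especially for the period before 2023 and for under-documented regions. This incompleteness could lead to over-reading the trend toward compute concentration and overlooking the larger role other players may have.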
  
  data-gaps geopolitical-bias
Visit annotations in context

Tags

geopolitical-bias

data-gaps

Annotators

fxp007

URL

epoch.ai/data-insights/hyperscalers-control-most-compute
aphyr.com aphyr.com

https://aphyr.com/posts/419-the-future-of-everything-is-lies-i-guess-new-jobs

1
1. fxp007 17 Apr 2026
  
  in Public
  
  As slop takes over the Internet, labs may struggle to obtain high-quality corpuses for training models.
  
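  This observation points to a crisis in AI training data quality. As the quality of Internet content declines, AI systems may face a 'garbage in, garbage out' risk. The author's 'low-background steel' metaphor neatly points to the remedy of using clean pre-2023 data, while also hinting at how serious knowledge pollution has become in the digital age, which could have far-reaching effects on the reliability and bias of AI systems.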
  
  data-quality training-corpus ai-bias
Visit annotations in context

Tags

training-corpus

ai-bias

data-quality

Annotators

fxp007

URL

aphyr.com/posts/419-the-future-of-everything-is-lies-i-guess-new-jobs
a16z.com a16z.com

Where Enterprises are Actually Adopting AI - a16z

2
1. fxp007 17 Apr 2026
  
  in Public
  
  Based on our analysis, 29% of the Fortune 500 and ~19% of the Global 2000 are live, paying customers of a leading AI startup.
  
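  This figure suggests enterprise AI adoption is far higher than the public perceives, upending the traditional pattern of technology adoption. Nearly a third of Fortune 500 companies are already live, paying customers of a leading AI startup. This striking pace shows AI penetrating traditional enterprises at unprecedented speed, breaking the usual rule that enterprise technology takes years to reach adoption at scale.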
  
  adoption-rate enterprise-ai data-insight
2. fxp007 10 Apr 2026
  
  in Public
  
  Support teams are high volume and high turnover, and thus need to train new reps in a fast and standardized way. To do so, they have clearly articulated standard operating procedures (SOPs) that guide the work of each rep. These SOPs create clear rules and guidelines that AI agents can model themselves off of.
  
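  The secret to AI's success in customer support turns out to be that the industry, to manage high turnover among human staff, was forced to build extremely clear SOP documentation, which happens to be ideal material for training AI agents. It is an accidental historical coincidence: companies documented every process because of a human problem (high attrition), and then AI arrived and turned those documents into its own 'training manual'. The low-value work that was documented most thoroughly is the easiest for AI to replace.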
  
  SOP-as-training-data customer-support ironic-automation surprising
Visit annotations in context

Tags

SOP-as-training-data

data-insight

enterprise-ai

customer-support

surprising

ironic-automation

adoption-rate

Annotators

fxp007

URL

a16z.com/where-enterprises-are-actually-adopting-ai/
x.com x.com

https://x.com/__aakas__/status/2043346948309041227

1
1. fxp007 16 Apr 2026
  
  in Public
  
  Closed harnesses behind proprietary APIs force yielding control of agent memory to third parties.
  
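  Surprisingly, closed agent harnesses behind proprietary APIs force users to hand control of agent memory to third parties. Users of AI agents may therefore lose control of their own data and personal information without realizing it, which could raise serious privacy and security problems.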
  
  surprising privacy proprietary-systems data-control
Visit annotations in context

Tags

privacy

data-control

proprietary-systems

surprising

Annotators

fxp007

URL

x.com/__aakas__/status/2043346948309041227
www.tomtunguz.com www.tomtunguz.com

https://www.tomtunguz.com/artemis/

1
1. fxp007 16 Apr 2026
  
  in Public
  
  Within a few months, they have more than a dozen production enterprise deployments & are processing over a billion events per hour.
  
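  Surprisingly, within just a few months the security company Artemis was processing more than a billion security events per hour. That scale of data processing reflects the startling frequency and complexity of the cyber threats modern enterprises face.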
  
  surprising data-processing security-scale
Visit annotations in context

Tags

security-scale

data-processing

surprising

Annotators

fxp007

URL

tomtunguz.com/artemis/
www.gadgetreview.com www.gadgetreview.com

https://www.gadgetreview.com/maine-is-about-to-become-the-first-state-to-ban-major-new-data-centers

1
1. fxp007 16 Apr 2026
  
  in Public
  
  Maine advances first statewide moratorium blocking data centers requiring over 20 megawatts
  
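  Surprisingly, Maine is set to become the first US state to block construction of large data centers statewide. The policy targets facilities requiring more than 20 megawatts, which is unusual and unexpected at a time of rapid technological growth.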
  
  surprising data-center-ban state-policy
Visit annotations in context

Tags

data-center-ban

state-policy

surprising

Annotators

fxp007

URL

gadgetreview.com/maine-is-about-to-become-the-first-state-to-ban-major-new-data-centers
huggingface.co huggingface.co

https://huggingface.co/papers/2604.08377

1
1. fxp007 16 Apr 2026
  
  in Public
  
  the most interesting detail here is how SkillClaw clusters cross-user trajectories into referenced skills and then uses the evolver to translate those patterns into concrete updates.
  
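  Surprisingly, SkillClaw clusters cross-user trajectories into referenced skills and then uses the evolver to translate those patterns into concrete updates. This approach to heterogeneous user experience is clever: it handles differences in signal across users and extracts useful patterns from seemingly unrelated user behavior, achieving a form of collective intelligence.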
  
  surprising pattern-recognition heterogeneous-data
Visit annotations in context

Tags

pattern-recognition

heterogeneous-data

surprising

Annotators

fxp007

URL

huggingface.co/papers/2604.08377
epoch.ai epoch.ai

https://epoch.ai/data-insights/claude-usage-rose

1
1. fxp007 16 Apr 2026
  
  in Public
  
  We test for a trend over time by fitting a weighted linear model to the log-odds of usage. Under this specification, Claude is the only AI service in the survey to show a statistically significant upward trend over this period
  
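  Surprisingly, the research team used a weighted linear model on the log-odds of usage to analyze the trend and found that Claude was the only AI service to show a statistically significant upward trend. This statistical approach reveals the real trend behind changes that look small on the surface.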
  
  surprising statistical-method data-analysis
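
The quoted method, a weighted linear model on the log-odds of usage, can be sketched in plain Python. This is an illustrative reconstruction, not Epoch AI's code: the weights `n * p * (1 - p)` (the inverse of the approximate variance of a sample log-odds) and the `(time_index, share, n_respondents)` input layout are assumptions. The sketch only estimates the slope; a significance test would also need its standard error.

```python
import math


def logit(p: float) -> float:
    """Log-odds of a proportion p, with 0 < p < 1."""
    return math.log(p / (1.0 - p))


def weighted_linear_fit(xs, ys, ws):
    """Weighted least-squares line y = intercept + slope * x."""
    total = sum(ws)
    x_bar = sum(w * x for w, x in zip(ws, xs)) / total
    y_bar = sum(w * y for w, y in zip(ws, ys)) / total
    sxx = sum(w * (x - x_bar) ** 2 for w, x in zip(ws, xs))
    sxy = sum(w * (x - x_bar) * (y - y_bar) for w, x, y in zip(ws, xs, ys))
    slope = sxy / sxx
    return slope, y_bar - slope * x_bar


def usage_trend(waves):
    """waves: list of (time_index, share, n_respondents) per survey wave.

    Returns (slope, intercept) of the fitted line on the log-odds scale.
    """
    xs = [t for t, _, _ in waves]
    ys = [logit(p) for _, p, _ in waves]
    # Assumed weighting: larger samples and mid-range shares count more.
    ws = [n * p * (1.0 - p) for _, p, n in waves]
    return weighted_linear_fit(xs, ys, ws)
```

A positive slope means the odds of using the service grow by a factor of `exp(slope)` per time step under this model.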
Visit annotations in context

Tags

data-analysis

statistical-method

surprising

Annotators

fxp007

URL

epoch.ai/data-insights/claude-usage-rose
chatgpt.com chatgpt.com

https://chatgpt.com/apps/spreadsheets/

2
1. fxp007 16 Apr 2026
  
  in Public
  
  The ChatGPT for Excel add-in operates separately from your ChatGPT chat history. Conversations and data in Excel aren't shared with your ChatGPT chats, and activity doesn't sync between experiences at this time.
  
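  Surprisingly, ChatGPT in Excel is completely isolated from the regular chat history, with no data sync between the two. Users can therefore work with sensitive data in Excel without worrying that it will show up in their regular chats, which adds an extra layer of privacy protection.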
  
  surprising data-isolation privacy-features
2. fxp007 16 Apr 2026
  
  in Public
  
  By default, data shared with ChatGPT isn't used to improve our models for ChatGPT Business, ChatGPT Enterprise, ChatGPT Edu, and ChatGPT for Teachers.
  
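  Surprisingly, Excel data from enterprise-tier users is not used to train AI models by default, which differs markedly from how ordinary users' data is handled. The difference reflects OpenAI's particular protection of business customers' privacy, probably to build enterprise confidence in adopting AI tools.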
  
  surprising data-privacy business-ai
Visit annotations in context

Tags

privacy-features

data-isolation

surprising

data-privacy

business-ai

Annotators

fxp007

URL

chatgpt.com/apps/spreadsheets/
ai.meta.com ai.meta.com

https://ai.meta.com/blog/introducing-muse-spark-msl/

1
1. fxp007 16 Apr 2026
  
  in Public
  
  we collaborated with over 1,000 physicians to curate training data that enables more factual and comprehensive responses.
  
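  Surprisingly, to improve Muse Spark's reasoning in the health domain, Meta worked with more than 1,000 physicians to curate training data. Expert involvement at this scale is extremely rare in AI model development; it shows how much weight Meta places on accuracy in health care and reflects a new trend toward specialized training of AI models.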
  
  surprising health-ai data-curation
Visit annotations in context

Tags

health-ai

data-curation

surprising

Annotators

fxp007

URL

ai.meta.com/blog/introducing-muse-spark-msl/
www.theaivalley.com www.theaivalley.com

https://www.theaivalley.com/p/the-claude-mythos-era

1
1. fxp007 16 Apr 2026
  
  in Public
  
  The model reportedly scored 93.9% on SWE-bench Verified and 77.8% on SWE-bench Pro, but its strongest signal came from real-world results, including uncovering a 27-year-old flaw in OpenBSD, a 16-year-old vulnerability in FFmpeg, and autonomously chaining Linux kernel exploits without human input.
  
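  These striking vulnerability discoveries suggest AI has moved beyond traditional security tools and can autonomously find flaws that went unnoticed for decades. The ability to chain Linux kernel exploits autonomously, in particular, shows AI's revolutionary potential in cybersecurity and could fundamentally change how security research and vulnerability remediation are done.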
  
  ai-security benchmark-data real-world-results
Visit annotations in context

Tags

ai-security

benchmark-data

real-world-results

Annotators

fxp007

URL

theaivalley.com/p/the-claude-mythos-era
www.technologyreview.com www.technologyreview.com

https://www.technologyreview.com/2026/04/06/1135187/the-one-piece-of-data-that-could-actually-shed-light-on-your-job-and-ai/

2
1. fxp007 09 Apr 2026
  
  in Public
  
  We need, like, a Manhattan Project to collect this
  
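  Economists are calling for a 'Manhattan Project'-scale effort to collect price-elasticity data across industries, which highlights how badly the underlying infrastructure for AI economics research is lacking. Without systematic micro-level data spanning the whole economy, any forecast about AI and jobs is like the blind men and the elephant, and policymaking has nothing to stand on.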
  
  data-infrastructure policy-making economic-research
2. fxp007 08 Apr 2026
  
  in Public
  
  We need, like, a Manhattan Project to collect this... Fields that are not exposed now will become exposed in the future, so you just want to track these statistics across the entire economy.
  
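  Most people think the response to AI's impact on jobs should focus on the industries most threatened today, but the author argues we need a Manhattan Project-style effort to collect price-elasticity data for every industry, including fields AI has not yet touched. This forward-looking view challenges conventional crisis-response thinking.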
  
  non-consensus data-collection economic-planning
Visit annotations in context

Tags

economic-planning

data-collection

policy-making

non-consensus

data-infrastructure

economic-research

Annotators

fxp007

URL

technologyreview.com/2026/04/06/1135187/the-one-piece-of-data-that-could-actually-shed-light-on-your-job-and-ai/
www.arenaphysica.com www.arenaphysica.com

https://www.arenaphysica.com/publications/rf-studio

1
1. fxp007 09 Apr 2026
  
  in Public
  
  A learning system can continuously incorporate real-world data in a way that numerical solvers fundamentally cannot, capturing and compounding the knowledge that is currently trapped out there in the real world.
  
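  This reveals another major advantage of AI-driven design: closing the loop between simulation and reality. Traditional solvers struggle to account exhaustively for real-world complications such as manufacturing tolerances, whereas a learning system can keep absorbing measured data, forming a 'data flywheel' that gets smarter with use. Turning tacit knowledge scattered across the real world into model capability is a qualitative change that traditional tools cannot match.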
  
  sim-to-real data-flywheel continuous-learning
Visit annotations in context

Tags

continuous-learning

sim-to-real

data-flywheel

Annotators

fxp007

URL

arenaphysica.com/publications/rf-studio
www.anthropic.com www.anthropic.com

Effective harnesses for long-running agents

1
1. fxp007 09 Apr 2026
  
  in Public
  
  inappropriately change or overwrite JSON files compared to Markdown files
  
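  This is a highly insightful piece of engineering experience. Markdown is too 'free' a format for an LLM and is easily tampered with or overwritten by hallucination, whereas JSON carries strict schema constraints. Choosing the right data format is itself an implicit prompt guardrail.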
  
  best-practice data-format prompt-engineering
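
One way to make the "format as guardrail" idea concrete is a check that an agent's rewrite of a JSON state file changed only the field it is allowed to touch. This is a hedged sketch, not code from the Anthropic article: the function name, the list-of-objects layout, and the `passes` field name are assumptions for illustration.

```python
import json


def only_status_changed(before_text: str, after_text: str, mutable_key: str = "passes") -> bool:
    """Return True if two JSON lists of objects differ only in mutable_key."""
    try:
        before = json.loads(before_text)
        after = json.loads(after_text)
    except json.JSONDecodeError:
        return False  # the rewrite is not valid JSON at all
    if not isinstance(before, list) or not isinstance(after, list):
        return False
    if len(before) != len(after):
        return False  # entries were added or dropped

    def frozen(item: dict) -> dict:
        # Everything except the one field the agent may update.
        return {k: v for k, v in item.items() if k != mutable_key}

    return all(
        isinstance(old, dict) and isinstance(new, dict) and frozen(old) == frozen(new)
        for old, new in zip(before, after)
    )
```

A comparable check on free-form Markdown would be much harder to write, which is the practical sense in which a structured format constrains the model.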
Visit annotations in context

Tags

best-practice

prompt-engineering

data-format

Annotators

fxp007

URL

anthropic.com/engineering/effective-harnesses-for-long-running-agents
mp.weixin.qq.com mp.weixin.qq.com

https://mp.weixin.qq.com/s/zg2LiDRUipkV0RFB4DXpWg

1
1. fxp007 09 Apr 2026
  
  in Public
  
  Recording by time is not entirely reasonable; records should be organized by task instead.
  
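  This view challenges the habitual reliance on timeline-based records. A timeline looks objective but is actually fragmented and adds cognitive load. Organizing memory around the Task in effect imitates the associative memory of the human brain, modeling scattered actions as ordered causal relationships and greatly improving recall efficiency and practical value.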
  
  memory-system data-structure cognitive-science
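
The contrast between timeline-based and task-based records can be shown with a small regrouping step. This is only an illustrative sketch; the `(timestamp, task, action)` event layout and the `group_by_task` name are assumptions, not something described in the source post.

```python
from collections import defaultdict


def group_by_task(events):
    """Regroup a flat timeline of (timestamp, task, action) tuples by task.

    Within each task, actions stay in chronological order.
    """
    tasks = defaultdict(list)
    for timestamp, task, action in sorted(events):
        tasks[task].append((timestamp, action))
    return dict(tasks)


timeline = [
    (3, "write report", "draft intro"),
    (1, "fix bug", "reproduce"),
    (2, "write report", "collect sources"),
    (4, "fix bug", "patch"),
]
memory = group_by_task(timeline)
```

Recalling everything about one task then becomes a single dictionary lookup instead of a scan over interleaved timeline entries.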
Visit annotations in context

Tags

data-structure

cognitive-science

memory-system

Annotators

fxp007

URL

mp.weixin.qq.com/s/zg2LiDRUipkV0RFB4DXpWg
sakana.ai sakana.ai

https://sakana.ai/marlin-beta/

1
1. fxp007 09 Apr 2026
  
  in Public
  
  Use is free during the beta test period.
  
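  Completely free during the beta: for a product that claims to replace weeks of work by a CSO team, this strategy is surprising. The logic behind it is that Sakana needs real enterprise-grade research tasks as training data and case studies, and only enterprise users can supply them. 'Trading free access for real-world scenario data' is a classic cold-start strategy for AI products, but using it with such high-end B2B positioning amounts to candor from Sakana about the product's current state: not yet good enough for enterprises to pay for a first version, but good enough to be worth trying for free.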
  
  free-beta data-flywheel B2B-cold-start pricing-strategy
Visit annotations in context

Tags

B2B-cold-start

data-flywheel

pricing-strategy

free-beta

Annotators

fxp007

URL

sakana.ai/marlin-beta/
epoch.ai epoch.ai

Keeping up with the GPTs | Epoch AI

2
1. fxp007 09 Apr 2026
  
  in Public
  
  American hyperscalers are driving a data center buildout that's larger than the Manhattan Project and Apollo Program at their peaks.
  
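  Comparing the scale of the US AI data center buildout to the peaks of the Manhattan Project and the Apollo Program is a startling analogy, and it shows the competition has escalated from a technology race to 'industrial mobilization'. The Manhattan Project was a total wartime mobilization of national will; the Apollo Program was a symbolic investment in Cold War prestige. Today's AI compute race has, in absolute size, already surpassed these two largest technology programs in history, and it is still far from its ceiling.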
  
  data-center Manhattan-Project Apollo scale AI-race
2. fxp007 09 Apr 2026
  
  in Public
  
  MiniMax may have been able to get 100 billion tokens of data from interactions with Claude.
  
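  100 billion tokens of Claude interaction data: the estimate is staggering. It means MiniMax's users may, without knowing it, have served as 'collectors' of data distilled from Claude. From Anthropic's perspective, this is misappropriation of commercial data; from a competitive perspective, it shows that an open API strategy is itself a double-edged sword: the more open it is, the easier it is to 'siphon in reverse'.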
  
  MiniMax data-extraction API-strategy 100B-tokens surprising
Visit annotations in context

Tags

Manhattan-Project

data-center

surprising

scale

MiniMax

100B-tokens

data-extraction

API-strategy

Apollo

AI-race

Annotators

fxp007

URL

epoch.ai/gradient-updates/keeping-up-with-the-gpts/
huggingface.co huggingface.co

https://huggingface.co/papers/2604.04771

4
1. fxp007 08 Apr 2026
  
  in Public
  
  A three-stage progressive training strategy -- large-scale pre-training, hard sample fine-tuning, and GRPO alignment -- sequentially exploits these data at different quality tiers.
  
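  Most people assume a training strategy should be applied uniformly to all data, but the authors propose a staged, progressive strategy that uses different methods on data of different quality tiers. Training that responds to differences in data quality challenges the traditional 'one size fits all' paradigm and represents a data-centric line of thinking in AI.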
  
  non-consensus training-strategy data-quality
2. fxp007 08 Apr 2026
  
  in Public
  
  SOTA models of different architectures and parameter scales exhibit highly consistent failure patterns on the same set of hard samples, suggesting that the performance bottleneck stems from shared deficiencies in training data rather than architecture itself.
  
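  Most people assume models with different architectures have different failure modes and weaknesses, but the authors find that SOTA models show highly consistent failure patterns on the same hard samples regardless of architecture or parameter scale. This suggests the performance bottleneck comes from shared deficiencies in training data rather than architectural differences, a finding that challenges the conventional view favoring model diversity.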
  
  non-consensus model-architecture training-data
3. fxp007 08 Apr 2026
  
  in Public
  
  Without any architectural modification, MinerU2.5-Pro achieves 95.69 on OmniDocBench v1.6, improving over the same-architecture baseline by 2.71 points and surpassing all existing methods including models with over 200× more parameters.
  
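  Most people assume a larger model necessarily performs better, but the authors, through data engineering and training-strategy optimization alone and with the 1.2B-parameter architecture unchanged, surpassed existing models with more than 200 times as many parameters. This challenges the industry consensus that 'bigger is better' and demonstrates the importance of data quality.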
  
  counterintuitive model-scaling data-engineering
4. fxp007 08 Apr 2026
  
  in Public
  
  Current document parsing methods compete primarily on model architecture innovation, while systematic engineering of training data remains underexplored.
  
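  Most people assume gains in document parsing depend mainly on architectural innovation and scale, but the authors argue that systematic engineering of training data is the key bottleneck: SOTA models of different architectures show highly consistent failure patterns on the same hard samples, which indicates the problem lies in data quality rather than architecture.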
  
  non-consensus data-centric document-parsing
Visit annotations in context

Tags

data-centric

data-engineering

model-scaling

model-architecture

non-consensus

document-parsing

training-data

counterintuitive

training-strategy

data-quality

Annotators

fxp007

URL

huggingface.co/papers/2604.04771
hackernoon.com hackernoon.com

https://hackernoon.com/the-uk-must-choose-between-protecting-creators-and-backing-big-tech-in-ai

1
1. fxp007 08 Apr 2026
  
  in Public
  
  introducing a commercial text and data mining exception for AI training would expand the AI sector in the country.
  
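  Most people assume that relaxing restrictions on data mining would promote AI innovation and growth, but the author argues such an exception would not actually expand the AI industry. This runs against the tech industry's widely promoted belief that 'more data equals better AI' and challenges the mainstream narrative of free data flows.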
  
  counterintuitive ai-data-policy copyright-exception
Visit annotations in context

Tags

ai-data-policy

copyright-exception

counterintuitive

Annotators

fxp007

URL

hackernoon.com/the-uk-must-choose-between-protecting-creators-and-backing-big-tech-in-ai
arxiv.org arxiv.org

https://arxiv.org/abs/2604.02971

1
1. fxp007 08 Apr 2026
  
  in Public
  
  most existing large language model agent systems face severe limitations in data-intensive settings, including context saturation, cascading error propagation, and high end-to-end latency
  
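  The mainstream view holds that LLM agent systems handle complex data tasks well, but the authors point out severe limitations in data-intensive settings, challenging the assumption that LLM agent systems are generally effective.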
  
  non-consensus llm-limitations data-intensive
Visit annotations in context

Tags

llm-limitations

data-intensive

non-consensus

Annotators

fxp007

URL

arxiv.org/abs/2604.02971
arxiv.org arxiv.org

https://arxiv.org/abs/2604.02869

1
1. fxp007 08 Apr 2026
  
  in Public
  
  We introduce Iterative Reward Calibration, a methodology for designing per-turn rewards using empirical discriminative analysis of rollout data
  
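  Most people assume reward design should rest on domain-expert knowledge and predefined rules, but the authors propose calibrating rewards iteratively through empirical discriminative analysis of actual rollout data. This challenges traditional reward-engineering methodology, moving reward design from 'expert-driven' to 'data-driven'.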
  
  non-consensus reward-calibration methodology data-driven
Visit annotations in context

Tags

data-driven

reward-calibration

non-consensus

methodology

Annotators

fxp007

URL

arxiv.org/abs/2604.02869
ai.meta.com ai.meta.com

https://ai.meta.com/blog/alta-daily-fashion-app-segment-anything/

1
1. fxp007 08 Apr 2026
  
  in Public
  
  If we knew that every image uploaded was a beautiful model shot, segmentation would be far easier, but because of the nature of user-uploaded content, we need the best possible segmentation.
  
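  Most people might think of high-quality professional photos as the ideal input for AI image processing, and the author does suggest that 'perfect' model shots are easier to process than real user-uploaded content. The point challenges assumptions about 'ideal training data': the 'imperfection' of real-world data is what poses the harder technical challenge.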
  
  non-consensus counterintuitive ai-training-data
Visit annotations in context

Tags

ai-training-data

non-consensus

counterintuitive

Annotators

fxp007

URL

ai.meta.com/blog/alta-daily-fashion-app-segment-anything/
accessmedicine.mhmedical.com accessmedicine.mhmedical.com

Superior Vena Caval Obstruction

1
1. Arshadbhat 05 Apr 2026
  
  in Public
  
  Urgent treatment for neoplasm consists of (1) cautious use of intravenous diuretics and (2) mediastinal irradiation, starting within 24 hours, with a treatment plan designed to give a high daily dose of radiation but a short total course of therapy to rapidly shrink the local tumor. Intensive radiation therapy combined with chemotherapy will palliate the process in up to 90% of patients. In patients with a subacute presentation, radiation therapy alone usually suffices. Chemotherapy is added if lymphoma or small-cell carcinoma is diagnosed
  
  endovascular stenting emerging as first-line therapy for rapid symptom relief, while definitive treatment targets the underlying cause
  
  Glucocorticoids (dexamethasone 4 mg every 6 hours) are commonly prescribed but lack robust supporting data; they may be more beneficial in lymphoma or thymoma and as prophylaxis against radiation-induced edema. [2-4] Importantly, SVC syndrome is no longer considered a medical emergency except in rare cases with life-threatening cerebral edema, laryngeal edema, or altered mental status. When thrombosis is present, catheter-directed thrombolysis or aspiration thrombectomy should be performed within 2-5 days of symptom onset before thrombus organization occurs. [3] The role of long-term anticoagulation after stenting remains unclear, though it is standard when significant thrombosis is present. The American College of Chest Physicians recommends obtaining histologic diagnosis before treatment in suspected lung cancer cases, as stenting does not interfere with tissue diagnosis. [2] For small cell lung cancer (SCLC), chemotherapy alone is recommended as first-line treatment given rapid response rates. [2] For non-small cell lung cancer (NSCLC), radiation therapy and/or stent insertion are recommended, with response rates of 59% for chemotherapy and 63% for radiation therapy. [2] Patients with chemotherapy- or radiation-refractory disease should receive vascular stents. For device-related thrombosis (catheters, pacemakers), catheter removal should be considered in conjunction with anticoagulation. [4] Endovascular therapy is first-line for device-related obstruction, while surgical bypass may be preferred for mediastinal fibrosis. [7] Both approaches show good mid-term patency, though secondary interventions are common (approximately 27-28%).
Visit annotations in context

Tags

The American College of Chest Physicians recommends obtaining histologic diagnosis before treatment in suspected lung cancer cases, as stenting does not interfere with tissue diagnosis. [2] For small cell lung cancer (SCLC), chemotherapy alone is recommended as first-line treatment given rapid response rates. [2] For non-small cell lung cancer (NSCLC), radiation therapy and/or stent insertion are recommended, with response rates of 59% for chemotherapy and 63% for radiation therapy. [2] Patients with chemotherapy- or radiation-refractory disease should receive vascular stents

When thrombosis is present, catheter-directed thrombolysis or aspiration thrombectomy should be performed within 2-5 days of symptom onset before thrombus organization occurs. [3] The role of long-term anticoagulation after stenting remains unclear, though it is standard when significant thrombosis is present

For device-related thrombosis (catheters, pacemakers), catheter removal should be considered in conjunction with anticoagulation. [4] Endovascular therapy is first-line for device-related obstruction, while surgical bypass may be preferred for mediastinal fibrosis. [7] Both approaches show good mid-term patency, though secondary interventions are common (approximately 27-28%

Glucocorticoids (dexamethasone 4 mg every 6 hours) are commonly prescribed but lack robust supporting data; they may be more beneficial in lymphoma or thymoma and as prophylaxis against radiation-induced edema. [2-4] Importantly, SVC syndrome is no longer considered a medical emergency except in rare cases with life-threatening cerebral edema, laryngeal edema, or altered mental status.

Annotators

Arshadbhat

URL

accessmedicine.mhmedical.com/content.aspx
Mar 2026
glassmanlab.seas.harvard.edu glassmanlab.seas.harvard.edu

AbstractExplorer: Leveraging Structure-Mapping Theory to Enhance Comparative Close Reading at Scale

16
1. elglassman 29 Mar 2026
  
  in Public
  
  Interviews were video and audio recorded. We transcribed the audio using OpenAI's Whisper automatic speech recognition system and anonymized the transcript before analysis. We analyzed the interview data using thematic analysis [1]. First, two members of the research team independently coded four (25% of collected data) randomly chosen participant data to generate low-level codes. The inter-coder reliability between the coders was 0.88 using Krippendorff's alpha [37]. The two coders then met together to cross-check, resolve coding conflicts, and consolidate the codes into a codebook across two sessions. Using the codebook, the two coders analyzed six randomly selected participant data each. The research team then met, discussed the analysis outcomes, and finalized themes over three sessions.
  
  sentence describing how analysis was performed on data collected by the authors of this paper
  
  data analysis ai-user-approved
2. elglassman 25 Mar 2026
  
  in Public
  
  We conducted a qualitative analysis of user study transcripts and survey responses using a Grounded Theory approach [8]. First, the lead researcher collected a list of participants' behaviors, approaches, reflections on their experience, and feedback about the interface. The researcher then systematically coded this data, revisiting the data multiple times and refining the codes to ensure consistency and coherence. Through this process, high-level themes were identified and organized using affinity diagramming. Once the thematic structure was finalized, the researcher gathered supporting evidence for each theme and synthesized the findings, which were reviewed by the research team to ensure agreement on the results.
  
  sentence describing how analysis was performed on data collected by the authors of this paper
  
  ai-pending data analysis
3. elglassman 25 Mar 2026
  
  in Public
  
  Activity log data, which revealed how participants actually used the interface, echoed the above findings. According to the log data, participants spent most of their reading time (66.31%) with vertical alignment on the second element in structure pairs, followed by alignment on the first element (29.19%), and left-justified alignment (5.13%). Highlighting usage showed a similar preference: 91.13% of time with all chunks highlighted, 8.25% with partial highlighting, and minimal time (0.63%) without highlights.
  
  sentence describing how analysis was performed on data collected by the authors of this paper
  
  ai-pending data analysis
4. elglassman 25 Mar 2026
  
  in Public
  
  In this section, we present findings on how AbstractExplorer supports comparative close reading at scale by integrating quantitative survey responses and log data with qualitative analysis of transcripts and open-ended responses. The qualitative analysis process is described in detail in Appendix H.
  
  sentence describing how analysis was performed on data collected by the authors of this paper
  
  ai-pending data analysis
5. elglassman 25 Mar 2026
  
  in Public
  
  Throughout the two tasks, we also collected detailed interaction logs including counts of user-defined aspects created, duration of highlighting usage, and time allocation across the three possible alignment options.
  
  sentence describing how analysis was performed on data collected by the authors of this paper
  
  ai-pending data analysis
6. elglassman 25 Mar 2026
  
  in Public
  
  Both gaze data and the semi-structured interviews revealed that lower NFC participants were more willing to be guided by the three features and took advantage of them consciously.
  
  sentence describing how analysis was performed on data collected by the authors of this paper
  
  ai-pending data analysis
7. elglassman 25 Mar 2026
  
  in Public
  
  Using a two-tailed Mann-Whitney U Test, we found that participants who reported their lowest perceived cognitive load when all three features were enabled had significantly lower NFC than participants who reported their lowest cognitive load level when skimming with no features enabled—in the baseline interface (p=0.03).
  
  sentence describing how analysis was performed on data collected by the authors of this paper
  
  ai-pending data analysis
8. elglassman 25 Mar 2026
  
  in Public
  
  The raw NASA-TLX score is the sum of all 6 NASA-TLX questions after reversing the appropriate questions.
  
  sentence describing how analysis was performed on data collected by the authors of this paper
  
  ai-pending data analysis
9. elglassman 25 Mar 2026
  
  in Public
  
  To compute a participant's NFC score, we averaged their response to the six questions, each ranging from 1 to 7, after reversing the appropriate questions.
  
  sentence describing how analysis was performed on data collected by the authors of this paper
  
  ai-pending data analysis
10. elglassman 25 Mar 2026
  
  in Public
  
  For simplicity of analysis, we denote participants with NFC scores above the overall participants' median NFC of 5.42 (IQR = 0.583) as higher NFC, and lower NFC otherwise.
  
  sentence describing how analysis was performed on data collected by the authors of this paper
  
  ai-pending data analysis
11. elglassman 25 Mar 2026
  
  in Public
  
  To contrast participants' gaze patterns in each condition, we used a Tobii Pro Spark eye-tracker placed below the desktop monitor used by all subjects; Tobii Pro Lab software recorded each participant's gaze over time in each condition.
  
  sentence describing how analysis was performed on data collected by the authors of this paper
  
  ai-pending data analysis
12. elglassman 25 Mar 2026
  
  in Public
  
  We collected 80 sentences from our abstracts dataset labeled by our system as "Methodology/Contribution." Participants viewed the same 80 sentences in each condition—often with a different subset of sentences initially visible due to ordering changes—but only had two minutes to look at them in each condition.
  
  sentence describing how analysis was performed on data collected by the authors of this paper
  
  ai-pending data analysis
13. elglassman 25 Mar 2026
  
  in Public
  
  After obtaining an expanded set of high-level chunk labels, we assign them to each of the sentence chunks by using LLMs in a multiclass classification few-shot learning task, with the initial labels and assignment as examples (see prompt used in Appendix D.3).
  
  sentence describing how analysis was performed on data collected by the authors of this paper
  
  ai-pending data analysis
14. elglassman 25 Mar 2026
  
  in Public
  
  Then, we segment sentences within each aspect into grammar-preserving chunks (see prompt used in Appendix D.2). This results in grammatically coherent chunks that are the basis of structure patterns. After identifying chunk boundaries, we again prompt an LLM to generate labels for chunks in a human-in-the-loop approach: starting from an initial set of labels for chunk roles, when a new label is generated, a researcher from the research team examines the new label and merges it with existing labels if appropriate, controlling for the total number of labels.
  
  sentence describing how analysis was performed on data collected by the authors of this paper
  
  ai-pending data analysis
15. elglassman 25 Mar 2026
  
  in Public
  
  We process this data in a three-stage pipeline (Figure 6). In the first stage, Sentence Segmentation and Categorization, abstracts are split into individual sentences using the NLTK package, and each sentence is classified into one of the five pre-defined aspects as listed in Section 4.1.1. Classification is performed by prompting an LLM (see prompt used in Appendix D.1) with the sentence and its full abstract.
  
  sentence describing how analysis was performed on data collected by the authors of this paper
  
  ai-pending data analysis
16. elglassman 25 Mar 2026
  
  in Public
  
  After the interviews, we analyzed the data using the process described in Appendix B
  
  sentence describing how analysis was performed on data collected by the authors of this paper
  
  ai-pending data analysis
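
Several of the highlighted sentences above describe simple scoring steps: reverse-scoring questionnaire items, averaging six 1-to-7 responses into an NFC score, splitting participants at the median, and comparing two groups with a Mann-Whitney U test. The sketch below illustrates those steps under stated assumptions. Which items are reverse-scored is not given in the source, so `reversed_items` is a hypothetical parameter, and only the U statistic is computed, not the p-value.

```python
from statistics import mean, median


def reverse_scored(responses, reversed_items, low, high):
    """Flip the items whose index is in reversed_items on a low..high scale."""
    return [low + high - r if i in reversed_items else r
            for i, r in enumerate(responses)]


def nfc_score(responses, reversed_items):
    """Mean of the 1-7 responses after reverse-scoring the chosen items."""
    return mean(reverse_scored(responses, reversed_items, 1, 7))


def median_split(scores):
    """Label each participant 'higher' if strictly above the median, else 'lower'."""
    cutoff = median(scores.values())
    return {pid: "higher" if s > cutoff else "lower" for pid, s in scores.items()}


def mann_whitney_u(a, b):
    """Return (U_a, U_b): U_a counts pairs with a-value above b-value, ties as 0.5."""
    u_a = sum(1.0 if x > y else 0.5 if x == y else 0.0 for x in a for y in b)
    return u_a, len(a) * len(b) - u_a
```

The strict "above the median" rule mirrors the quoted definition, in which participants at or below the median are treated as lower NFC.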
Visit annotations in context

Tags

ai-user-approved

data analysis

ai-pending

Annotators

elglassman

URL

glassmanlab.seas.harvard.edu/papers/abstractexplorer.pdf
glassmanlab.seas.harvard.edu glassmanlab.seas.harvard.edu

Supporting Co-Adaptive Machine Teaching through Human Concept Learning and Cognitive Theories

1
1. elglassman 29 Mar 2026
  
  in Public
  
  To analyze the annotation efficiency, we first conducted a Kruskal-Wallis rank sum test [39] to determine if there were statistically significant differences in annotation time across the three conditions, because our data violated the homogeneity of variances assumption, making non-parametric methods more appropriate.
  
  return any single sentence that describes data analysis done on data collected by the authors when running human subjects experiments.
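The Kruskal-Wallis test named in the highlighted sentence can be sketched in a few lines. This is a minimal, standard-library illustration and not the authors' analysis code: the annotation times are made-up numbers, the function names are hypothetical, and the H statistic is computed without the tie correction that statistics packages usually apply.

```python
def average_ranks(values):
    """Assign 1-based ranks to values, giving tied values their average rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks


def kruskal_wallis_h(groups):
    """Return the Kruskal-Wallis H statistic (no tie correction)."""
    pooled = [v for g in groups for v in g]
    ranks = average_ranks(pooled)
    n = len(pooled)
    total = 0.0
    start = 0
    for g in groups:
        rank_sum = sum(ranks[start:start + len(g)])
        total += rank_sum ** 2 / len(g)
        start += len(g)
    return 12.0 / (n * (n + 1)) * total - 3 * (n + 1)


# Hypothetical annotation times (seconds) for three conditions.
condition_a = [10.0, 11.0, 12.0]
condition_b = [13.0, 14.0, 15.0]
condition_c = [16.0, 17.0, 18.0]

h = kruskal_wallis_h([condition_a, condition_b, condition_c])
# With 3 groups, H is compared against a chi-square distribution with 2
# degrees of freedom; the 5% critical value is about 5.991.
print(f"H = {h:.2f}, significant at 5%: {h > 5.991}")
```

Because the test works on ranks rather than raw values, it does not rely on equal variances across conditions, which is the reason the highlighted sentence gives for choosing it.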
  
  human subjects experiment data analysis ai-user-approved
Visit annotations in context

Tags

ai-user-approved

human subjects experiment data analysis

Annotators

elglassman

URL

glassmanlab.seas.harvard.edu/papers/mocha_chi25.pdf
substack.com substack.com

Kill Chain

1
1. mlenc 25 Mar 2026
  
  in Public
  
  great article kill chain palantir admin data targeting bureaucracy administration seeing like a state
Visit annotations in context

Tags

seeing like a state

palantir

bureaucracy

kill chain

great article

targeting

admin data

administration

Annotators

mlenc

URL

substack.com/home/post/p-191689642
higherlogicdownload.s3.amazonaws.com higherlogicdownload.s3.amazonaws.com

Institutional Perspectives on Microcredentials 2026 Report

2
1. SenorG 25 Mar 2026
  
  in Public
  
  Figure 3: Decrease in Perceived Fiscal Benefits of Microcredentials by Year
  
  Perception vs Reality disconnect. Why? My guess is huge gaps in understanding of the Fundamentals.
  
  MC Research UPCEA MC data MC Strategy MC adoption
2. SenorG 25 Mar 2026
  
  in Public
  
  Notably, traditional mindsets and legacy systems are seen as far greater barriers in 2025 (61%) than in 2021 (5%), highlighting a growing tension between innovation and institutional resistance to change.
  
  In 2021, the question would have been interpreted differently. 61% is probably still lower than it should be.
  
  MC data UPCEA MC Research white papers
Visit annotations in context

Tags

MC adoption

white papers

MC Strategy

MC data

MC Research

UPCEA

Annotators

SenorG

URL

higherlogicdownload.s3.amazonaws.com/UPCEA/5b7be308-e3bb-4dd5-b27c-6f8d40472aec_file.pdf
www.statecraft.pub www.statecraft.pub

Ten Thoughts on Government Data

1
1. mlenc 16 Mar 2026
  
  in Public
  
  admin data government data
Visit annotations in context

Tags

government data

admin data

Annotators

mlenc

URL

statecraft.pub/p/ten-thoughts-on-government-data
www.imsglobal.org www.imsglobal.org

Comprehensive Learner Record Transcript 2.0 | IMS Global Learning Consortium

1
1. SenorG 09 Mar 2026
  
  in Public
  
  Comprehensive Learner Record Standard Transcript Guide
  
  CLR Playbook
  
  Standards CLR MC Best Practices MC data MC quality 1EdTech LER Resources
Visit annotations in context

Tags

MC Best Practices

LER Resources

Standards

MC data

1EdTech

MC quality

CLR

Annotators

SenorG

URL

imsglobal.org/spec/clr/v2p0/transcript
Feb 2026
oec.world oec.world

The Observatory of Economic Complexity

1
1. chrisaldrich 21 Feb 2026
  
  in Public
  
  The Observatory of Economic Complexity (OEC) https://oec.world/en
  
  data visualizations The Observatory of Economic Complexity (OEC) Dan Allosso Book Club 2026-02-21 César Hidalgo complexity theory economic complexity trade data economics
Visit annotations in context

Tags

complexity theory

economic complexity

Dan Allosso Book Club 2026-02-21

data visualizations

economics

César Hidalgo

The Observatory of Economic Complexity (OEC)

trade data

Annotators

chrisaldrich

URL

oec.world/en/
openhumanitiesdata.metajnl.com openhumanitiesdata.metajnl.com

Call for Edits Nearby: Open Archives Metadata from Saxony

1
1. JensBe 12 Feb 2026
  
  in Public
  
  23
  
  23 See: Wikidata:Data round-tripping, https://www.wikidata.org/w/index.php?title=Wikidata:Data_round-tripping&oldid=2440906511
  
  Wikidata Data round-tripping
Visit annotations in context

Tags

Data round-tripping

Wikidata

Annotators

JensBe

URL

openhumanitiesdata.metajnl.com/articles/10.5334/johd.430
Jan 2026
www.phoronix.com www.phoronix.com

Fedora Continued At The Forefront Of Upstream Linux Innovations In 2025 - Phoronix Forums

1
1. almereyda 02 Jan 2026
  
  in Public
  
  If you value your data I suggest not trusting any filesystem or media, consider them all equally fallible.
  
  backup data
Visit annotations in context

Tags

backup

data

Annotators

almereyda

URL

phoronix.com/forums/forum/software/distributions/1602296-fedora-continued-at-the-forefront-of-upstream-linux-innovations-in-2025
Dec 2025
doc.anytype.io doc.anytype.io

Storage & Deletion | Anytype Docs

1
1. tonz 29 Dec 2025
  
  in Public
  
  Media files are not directly downloaded in overall syncing to save bandwidth. Instead, when that file is requested, it is streamed to your device from the backup node or your devices on the network. For example, if you have a 4K Video, it will be streamed from the backup node or P2P devices to your device. So when you open an object with an image, it downloads. When you press play on video & audio, it begins to download. After that, this file will be stored in the application cache.
  
  Media files may not be available locally and require an internet connection to be streamed or downloaded on demand; they are generally excluded from syncing to save bandwidth. Doesn't this also mean that media files aren't backed up, given that people tend to treat sync as a backup?
  
  anytype data
Visit annotations in context

Tags

data

anytype

Annotators

tonz

URL

doc.anytype.io/anytype-docs/advanced/data-and-security/data-storage-and-deletion
timesofindia.indiatimes.com timesofindia.indiatimes.com

After claiming to redeploy 4,000 employees and automating their work with AI agents, Salesforce executives admit: We were more confident about…. - The Times of India

1
1. tonz 28 Dec 2025
  
  in Public
  
  Benioff had recently told Business Insider that he's drafting the company's annual strategic document with data foundations—not AI models—as the top priority, explicitly citing concerns about "hallucinations" without proper data context.
  
  The annual strategic document now puts data foundations in focus, not AI models. Well, duh. How do you even arrive at the notion that you can "AI all the things"? It implies an uncritical belief in vendors' promises, or magical thinking. How do you get to be CEO if you fall for that? Vibe-leading, in other words: the wizard behind the curtain.
  
  leadership vibeleading datastrategy data-centric ai
Visit annotations in context

Tags

data-centric

vibeleading

ai

datastrategy

leadership

Annotators

tonz

URL

timesofindia.indiatimes.com/technology/tech-news/after-laying-off-4000-employees-and-automating-with-ai-agents-salesforce-executives-admit-we-were-more-confident-about-/articleshow/126121875.cms
www.istr.org www.istr.org

The Comparative Third Sector Project (CTSP) - www.istr.org

1
1. mlenc 18 Dec 2025
  
  in Public
  
  data infrastructure research infrastructure third sector global academia
Visit annotations in context

Tags

research infrastructure

data infrastructure

third sector

academia

global

Annotators

mlenc

URL

istr.org/page/CTSP
www.youtube.com www.youtube.com

The Myth of Progress | Samuel Miller McDonald

1
1. stopresetgo 14 Dec 2025
  
  in Public
  
  Our World in Data does, they do have some legitimate research, because that's what think tanks do. They launder illegitimate research with legitimate research. And their tactic primarily is to set the scope of what they are commenting on or researching so that it puts forward the kind of results that they want, that align with their ideology.
  
  for - Our World in Data - discredited website - mix legitimate with illegitimate research to advance a biased ideology
  
  Our World in Data - discredited website - mix legitimate with illegitimate research to advance a biased ideology
Visit annotations in context

Tags

Our World in Data - discredited website - mix legitimate with illegitimate research to advance a biased ideology

Annotators

stopresetgo

URL

youtube.com/watch
link.springer.com link.springer.com

Micro-Credentials and Digital Badges: An Exploration of Definitions and Implications in Higher Education and Workforce

1
1. SenorG 09 Dec 2025
  
  in Public
  
  Micro-Credentials and Digital Badges: An Exploration of Definitions and Implications in Higher Education and Workforce
  
  Digital Promise LER Ecosystem MC Research MC quality MC data Marilys Galindo
Visit annotations in context

Tags

Digital Promise

MC Research

MC data

Marilys Galindo

MC quality

LER Ecosystem

Annotators

SenorG

URL

link.springer.com/article/10.1007/s11528-025-01148-z
www.worldpop.org www.worldpop.org

Open Spatial Demographic Data and Research

1
1. stopresetgo 04 Dec 2025
  
  in Public
  
  for - open source - world population data - Worldpop
  
  open source - world population open source - world population data Worldpop
Visit annotations in context

Tags

Worldpop

open source - world population

open source - world population data

Annotators

stopresetgo

URL

worldpop.org/
Nov 2025
journals.plos.org journals.plos.org

The effects of landscape on visual preference and fatigue recovery among university students: Differences in gender, grade level and major

2
1. brynnnusz 21 Nov 2025
  
  in Public
  
  Systolic blood pressure.
  
  Gender and grade level had no effect on systolic BP recovery. The only major-related difference was that landscape/environmental majors showed a significantly larger SBP reduction than non-environment majors when viewing desert landscapes, but not for any other landscape type.
  
  Results Data Research Physical Health Natural Environments College Major
2. brynnnusz 21 Nov 2025
  
  in Public
  
  3.1 Physiological response of viewing different landscape types
  
  This study shows that visual exposure to natural environments, especially forests and water, produces measurable physiological relaxation: • nature images lower systolic BP • forest images lower diastolic BP • water images lower HR Suggests that different types of natural scenes have different calming effects, and body overall responds physiologically to nature in ways that promote relaxation and reduce stress.
  
  Results Data Research Mental Health Physical Health Natural Environments
Visit annotations in context

Tags

Research

Natural Environments

Data

Physical Health

Results

Mental Health

College Major

Annotators

brynnnusz

URL

journals.plos.org/plosone/article
blue.isr.umich.edu blue.isr.umich.edu

School Report - 1 Year Template – School Reports

6
1. frscott 13 Nov 2025
  
  in Public
  
  How important for being looked up to or having high status in your school is...
  
  missing data
  
  data management
2. frscott 13 Nov 2025
  
  in Public
  
  How often do you feel...
  
  missing data
  
  data management
3. frscott 13 Nov 2025
  
  in Public
  
  How much competition for grades is there...
  
  weird amount of missing data again
  
  data management
4. frscott 13 Nov 2025
  
  in Public
  
  During an average school week, about how many times...
  
  Item05a-b have a weird amount of missing data
  
  data management
5. frscott 10 Nov 2025
  
  in Public
  
  Never; Less than once a week; 1-2 times a week; 3-5 times a week; 6-9 times a week; 10-19 times a week; 20 or more
  
  Maybe redo this graph so that the color legend isn't so large and the questions don't take up so much space.
  
  data viz
6. frscott 10 Nov 2025
  
  in Public
  
  I feel I am a person of worth, on an equal plane with others
  
  Item01b has a weird amount of missing data; go back and check the data management.
  
  data management
Visit annotations in context

Tags

data management

data viz

Annotators

frscott

URL

blue.isr.umich.edu/School-Report/1-Year-Template/
www.hirensbootcd.org www.hirensbootcd.org

Download | Hiren's BootCD PE

1
1. chrisaldrich 09 Nov 2025
  
  in Public
  
  https://www.hirensbootcd.org/download/
  
  operating systems Hirens data recovery Rufus (software)
Visit annotations in context

Tags

Hirens

Rufus (software)

data recovery

operating systems

Annotators

chrisaldrich

URL

hirensbootcd.org/download/
Oct 2025
www.mipex.eu www.mipex.eu

France | MIPEX 2015

1
1. tanemika 22 Oct 2025
  
  in Public
  
  MIPEX 2025 Summary: Integration Policies in France
  
  Summary
  
  The analysis of France's integration policies under the Migrant Integration Policy Index (MIPEX) 2025 reveals a mixed picture.
  
  With an overall score of 56 out of 100, France sits at the halfway point, applying policies that offer opportunities but also significant obstacles to integration.
  
  This score, unchanged since 2019, masks diverging trends:
  
  notable progress in education is offset by setbacks in access to health care and permanent residence.
  
  The French approach is classified as "Temporary Integration", a model that grants basic rights to non-European citizens but denies them the long-term security needed to settle permanently and participate fully in civic life.
  
  France's strengths lie in its robust anti-discrimination legal framework and in recent improvements in access to higher education.
  
  However, these advances are undermined by restrictive policies on permanent residence and family reunification, and by a naturalisation process regarded as discretionary and politicised.
  
  The "Immigration & Integration" law of January 2024 and the subsequent implementing decrees mark a shift toward a more selective and demanding approach, tightening language and civic requirements.
  
  To improve its model, France is advised to adopt a more coherent approach, aligning its policies with a goal of long-term integration and treating immigrants as future citizens rather than temporary residents.
  
  Detailed Analysis of Integration Policies
  
  Overall Score and Classification
  
  With a score of 56 out of 100, France's integration policies are rated "halfway" (halfway to promote societal integration).
  
  This score places France in the "Temporary Integration" category. According to the MIPEX typology, this model is characterised by:
  
  • Granting basic rights and some measures that promote equal opportunities.
  
  • Denying the long-term security that is essential to settle permanently, invest in integration and participate fully as a citizen.
  
  • Perpetuating a perception of immigrants as partially equal but still fundamentally outsiders.
  
  This approach contrasts with that of the MIPEX "Top Ten" countries, which treat immigrants as equals, neighbours and potential citizens, and invest in integration as a two-way process that benefits society as a whole.
  
  Recent Policy Developments (Since 2019)
  
  France's overall score has been stable since 2019, but this stability hides contradictory changes across policy areas.
  
  Positive Changes:
  
  • Access to higher education: Targeted programmes have been put in place to improve migrants' access to higher education.
  
  • Integration into the teaching workforce: Initiatives support migrants' integration into the teaching profession.
  
  • Specific projects:
  
  ◦ AIMES+ (since 2023): Aims to improve the quality of French courses for immigrant students.
  
  ◦ L'Université en Exil (UXIL): Offers an academic pathway to students and researchers in exile.
  
  Negative Changes:
  
  • Permanent residence: Conditions for renewing permanent resident status have been tightened, notably by reducing the permitted periods of absence from French territory.
  
  • Access to health care (since 2020): Asylum seekers and non-European immigrants face greater obstacles, with additional conditions and longer waiting periods for health coverage.
  
  A key legal change in 2019 introduced a three-month waiting period and a minimum residence requirement for eligibility for the Protection Universelle Maladie (PUMa).
  
  • "Immigration & Integration" law (January 2024): This law, which is not yet reflected in the MIPEX score, centralised and strengthened language, civic and employment requirements.
  
  It introduces limits on the renewal of temporary residence permits and stricter language and values tests for residence and citizenship.
  
  Decrees and circulars from mid-2024 and early 2025 brought this framework into force, increasing administrative pressure and integration obligations.
  
  Analysis by Policy Area
  
  Policy area | MIPEX classification | Summary of findings
  --- | --- | ---
  Labour Market Mobility | Halfway favourable | Permanent residents and families have access to the labour market but are excluded from more regulated professions than in any other country. Newcomers have access to general employment services but often not to recognition of their qualifications or to study grants.
  Family Reunification | Halfway favourable | The requirements (economic, housing) are strict and the process can be long and discretionary. Once reunited, however, families enjoy equal socio-economic rights and integration support, with increased hours of language classes (up to 400 hours, and 600 hours for people who are illiterate).
  Education | Halfway favourable | France has strengthened its support, notably through targeted programmes since 2015 (AIMES+, UXIL). All pupils, regardless of status, have the same rights to education. The weak point remains the lack of recognition of diversity in citizenship education.
  Health | Slightly favourable | The health system is inclusive but responds only weakly to the specific needs of migrant patients. Since 2020, barriers to access have increased for asylum seekers and non-EU immigrants (stricter conditions, longer waiting periods).
  Political Participation | Halfway favourable | Foreigners are poorly informed and rarely consulted by the authorities. France is one of the few major destination countries without local voting rights for foreigners. Increased consultation of refugee groups at the national level has been noted since 2018.
  Permanent Residence | Halfway favourable | Access to the secure 10-year status is subject to language, integration and sometimes economic requirements that are among the most restrictive. Although the status itself is protective, it is very difficult to obtain and renew (particularly since 2024).
  Access to Nationality | Slightly favourable | The pathway is similar to that of other Western countries (5 years of residence, dual nationality allowed). However, the process is increasingly politicised, discretionary and discouraging for some applicants. Strict requirements (financial stability, B1 language level, a subjective assimilation interview) are significant barriers.
  Anti-discrimination | Slightly favourable | This is France's greatest strength in integration. The legislation is robust and the rights-defence body (Défenseur des Droits) is effective at informing the public and helping victims. These policies appear to have had a positive long-term impact on public attitudes in Europe.
  
  Conclusions and Recommendations
  
  The French integration model is marked by a fundamental inconsistency: its recognised strengths in fighting discrimination and its progress in education are undermined by a restrictive and precarious approach to the pillars of long-term integration, namely residence, family and nationality.
  
  The recent policy trajectory confirms this restrictive trend.
  
  The 2024 law, the new prefectoral instructions on naturalisation (May 2025) and a 2024 proposal challenging birthright citizenship (droit du sol) point to a shift in discourse toward more exclusionary integration policies.
  
  To strengthen its model, France should:
  
  1. Adopt a Coherent Approach: Align restrictive residence and family reunification policies with its more inclusive measures on education and anti-discrimination.
  
  2. Secure Integration Pathways: Reduce discretion and excessive requirements in the procedures for accessing permanent residence and nationality, to provide the stability needed for successful integration.
  
  3. Treat Immigrants as Future Citizens: Implement a vision of integration as a two-way process that builds mutual trust and benefits society as a whole.
  
  As shown by 130 independent scientific studies using MIPEX data, the way governments treat immigrants is a decisive factor that influences not only public acceptance but also immigrants' sense of belonging, participation and even health in their new country.
  
  politiques publiques migrants mineurs isolés RESF analyse international outil data visuel data visualisation 2025 index
Visit annotations in context

Tags

mineurs isolés

migrants

data

data visualisation

2025

politiques publiques

analyse

outil

RESF

visuel

international

index

Annotators

tanemika

URL

mipex.eu/france
substack.com substack.com

We Were Guaranteed a Haka

1
1. jbhoward 17 Oct 2025
  
  in Public
  
  I will not link to the photo here. Allow me my parental illusions of protection. I reprint it here on social media only in doctored form. I didn’t know what to use to cover her eyes.
  
  Responsible parenting
  
  Child protection Data protection Online images
Visit annotations in context

Tags

Data protection

Online images

Child protection

Annotators

jbhoward

URL

substack.com/home/post/p-176207573
Sep 2025
www.dispatch.com www.dispatch.com

No more data centers: Ohio township pushes back against influx of Amazon, others

1
1. peter_murray 08 Sep 2025
  
  in Public
  
  During each call, Stewart said, Amazon officials have not been helpful. "They wanted to do background checks on all my firefighters; I wouldn't let them," he said. "And we've struggled to gain access to emergencies. They'll stop us at the gate, and our medic units have been delayed. They're denying us access to patients."
  
  AWS denies first responder access to facilities
  
  data center infrastructure
Visit annotations in context

Tags

data center infrastructure

Annotators

peter_murray

URL

dispatch.com/story/news/local/2025/09/08/ohio-township-pauses-data-center-construction-amid-influx-of-amazon-others/85989839007/
media.dltj.org media.dltj.org

Video: How AI Datacenters Eat the World by High Yield, annotated

1
1. peter_murray 07 Sep 2025
  
  in Public
  
  "How AI Datacenters Eat the World" from High Yield on YouTube. 30-Aug-2025
  
  Description
  
  HighYield x SemiAnalysis deep-dive into AI Datacenters, Gigawatt Megaclusters and the Hyperscaler race to AGI. How AI Datacenters Eat the World.
  
  data center infrastructure building LLMs
Visit annotations in context

Tags

data center infrastructure

building LLMs

Annotators

peter_murray

URL

media.dltj.org/annotated-video/20250907T131318-dhqoTku-HAA-how-ai-datacenters-eat-world/index.html
Aug 2025
udel.edu udel.edu

Slide Rule "Data-Guide"

1
1. chrisaldrich 18 Aug 2025
  
  in Public
  
  https://udel.edu/~mm/sliderule/dataGuide/
  
  Data-Guide slide rules 1956
Visit annotations in context

Tags

1956

slide rules

Data-Guide

Annotators

chrisaldrich

URL

udel.edu/~mm/sliderule/dataGuide/
Jul 2025
media.dltj.org media.dltj.org

Video: How Many Steaks Can One AI Video vs. AI Image Cook? | WSJ by The Wall Street Journal, annotated

1
1. peter_murray 22 Jul 2025
  
  in Public
  
  Recently, OpenAI has shared something. In a blog post, CEO Sam Altman said that the average query uses about 0.34 watt hours of energy.
  
  OpenAI's accounting of text generation energy usage
  
  From the 10-Jun-2025 blog post:
  
  People are often curious about how much energy a ChatGPT query uses; the average query uses about 0.34 watt-hours, about what an oven would use in a little over one second, or a high-efficiency lightbulb would use in a couple of minutes. It also uses about 0.000085 gallons of water; roughly one fifteenth of a teaspoon.
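The comparisons in the quoted blog post can be checked with a little arithmetic. The sketch below uses the two figures from the quote (0.34 watt-hours and 0.000085 gallons per query); the oven and lightbulb power draws are assumptions chosen for illustration and do not come from the source.

```python
# Figures quoted from the blog post above.
WH_PER_QUERY = 0.34
GALLONS_PER_QUERY = 0.000085

# Assumptions for illustration only (not from the source).
OVEN_WATTS = 1000.0  # hypothetical oven power draw
BULB_WATTS = 10.0    # hypothetical high-efficiency bulb

# Unit conversion: 1 US gallon = 128 fl oz, 1 fl oz = 6 teaspoons.
TSP_PER_US_GALLON = 128 * 6

joules = WH_PER_QUERY * 3600.0            # 1 Wh = 3600 J
oven_seconds = joules / OVEN_WATTS        # time an oven takes to use that energy
bulb_minutes = joules / BULB_WATTS / 60.0
teaspoons = GALLONS_PER_QUERY * TSP_PER_US_GALLON

print(f"{joules:.0f} J per query")
print(f"oven: {oven_seconds:.2f} s, bulb: {bulb_minutes:.2f} min")
print(f"water: {teaspoons:.4f} tsp (about 1/{1 / teaspoons:.0f} of a teaspoon)")
```

Under these assumed wattages the results line up with the post's wording: a little over one second of oven use, about two minutes of bulb use, and roughly one fifteenth of a teaspoon of water.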
  
  data center infrastructure
Visit annotations in context

Tags

data center infrastructure

Annotators

peter_murray

URL

media.dltj.org/annotated-video/20250722T172333-mRNVc3-XGFg-how-steaks-one-ai-video-vs-ai-image-cook-wsj/index.html
www.loopwerk.io www.loopwerk.io

Loopwerk: Refactoring Svelte stores to $state runes

4
1. TylerRick 21 Jul 2025
  
  in Public
  
  When you open this in two browsers and refresh a few times, one browser after the other, you’ll see the count go up and up (when looking at the page source), proving that the state is shared between both browsers (well, not really, it’s shared on the server, and used by both users). This will have serious consequences if you go this route: if user A is logged in and you’d write the user object to the shared state, and user B is not logged in, they’d still see a flash of user A’s username appear in the navigation bar, until the shared state is overwritten by the undefined user object.
  
  Svelte: problem: leaking a user's data with other users
2. TylerRick 21 Jul 2025
  
  in Public
  
  export const state: State = $state({ user: undefined });
  
  The problem is, this creates global (server-wide) state, when it should be "user-local" global state.
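The leak described in this note is not specific to Svelte: any server that keeps user data in module-level state shares it across requests. The following is a language-neutral analogy written in Python, not SvelteKit code; `shared_state`, `RequestState` and both handler functions are hypothetical names made up for the sketch.

```python
from dataclasses import dataclass
from typing import Optional

# Module-level state: created once per server process and therefore
# visible to every request that process handles.
shared_state = {"user": None}


def handle_request_shared(session_user: Optional[str]) -> str:
    """Leaky version: only logged-in requests overwrite the shared user."""
    if session_user is not None:
        shared_state["user"] = session_user
    return f"nav: {shared_state['user'] or 'guest'}"


@dataclass
class RequestState:
    """State object that is created fresh for each request."""
    user: Optional[str] = None


def handle_request_scoped(session_user: Optional[str]) -> str:
    """Safe version: state lives only for the duration of one request."""
    state = RequestState(user=session_user)
    return f"nav: {state.user or 'guest'}"


# User A is logged in; user B is anonymous.
print(handle_request_shared("alice"))  # A sees their own name
print(handle_request_shared(None))     # B sees A's name: the leak
print(handle_request_scoped("alice"))
print(handle_request_scoped(None))     # B correctly sees "guest"
```

The scoped variant corresponds to the "user-local" state the note asks for: the same shape of data, but constructed per request rather than once per process.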
  
  don't do this Svelte: problem: leaking a user's data with other users global state
3. TylerRick 21 Jul 2025
  
  in Public
  
  But sadly this introduces shared state on the server (when we use SSR), and this is a big problem since we’re now leaking data between different users.
  
  Svelte: problem: leaking a user's data with other users
4. TylerRick 21 Jul 2025
  
  in Public
  
  One pattern that I love to use in my SvelteKit projects is returning writable stores from the layout’s load function. This makes it possible to fetch data from the server (for example the user object for the logged in user), and then you make this object available as a writable reactive store throughout the whole application. So when the user updates their username or avatar, you do the PUT request to the server and you get the updated user object back from the server as the response, you can simply update the $user writable store value and every place in your app where you show the user object gets updated immediately.
  
  pattern (software) state management Svelte: store state management: store data flow
Visit annotations in context

Tags

Svelte: store

state management: store

pattern (software)

data flow

state management

global state

don't do this

Svelte: problem: leaking a user's data with other users

Annotators

TylerRick

URL

loopwerk.io/articles/2025/svelte-5-stores/
svelte.dev svelte.dev

State management • Docs • Svelte

2
1. TylerRick 21 Jul 2025
  
  in Public
  
  risk of accidentally exposing one user’s data to another
  
  Svelte: problem: leaking a user's data with other users svelte-kit caveat
2. TylerRick 21 Jul 2025
  
  in Public
  
  As with the previous example, this puts one user’s information in a place that is shared by all users.
  
  svelte-kit caveat Svelte: problem: leaking a user's data with other users
Visit annotations in context

Tags

svelte-kit

caveat

Svelte: problem: leaking a user's data with other users

Annotators

TylerRick

URL

svelte.dev/docs/kit/state-management
www.loopwerk.io www.loopwerk.io

Loopwerk: SvelteKit architecture tip: return a writable store from your load function

1
1. TylerRick 21 Jul 2025
  
  in Public
  
  But what if you want to update this user instance? For example on your website you have a form where the user can change their name, username, or avatar. When the form is submitted this gets stored on the server, but the site still shows the old user information, for example it still shows the old avatar of the user in the top menu. The user variable isn’t writable, so how do you overwrite this?
  
  data flow state management
Visit annotations in context

Tags

data flow

state management

Annotators

TylerRick

URL

loopwerk.io/articles/2024/sveltekit-writable-store-from-load/
batesoninstitute.org batesoninstitute.org

Warm Data - The International Bateson Institute

1
1. stopresetgo 17 Jul 2025
  
  in Public
  
  for - warm data - Nora Bateson - warm data
  
  Nora Bateson - warm data warm data
Visit annotations in context

Tags

warm data

Nora Bateson - warm data

Annotators

stopresetgo

URL

batesoninstitute.org/warm-data/
support.mozilla.org support.mozilla.org

How to export your Pocket saves | Pocket Help

1
1. chrisaldrich 02 Jul 2025
  
  in Public
  
  https://support.mozilla.org/en-US/kb/exporting-your-pocket-list
  
  data export Pocket IndieWeb site deaths
Visit annotations in context

Tags

site deaths

IndieWeb

data export

Pocket

Annotators

chrisaldrich

URL

support.mozilla.org/en-US/kb/exporting-your-pocket-list
Jun 2025
papers.ssrn.com papers.ssrn.com

Data as Policy

1
1. mlenc 13 Jun 2025
  
  in Public
  
  data as policy regulation data strategy
Visit annotations in context

Tags

data as policy

data strategy

regulation

Annotators

mlenc

URL

papers.ssrn.com/sol3/papers.cfm
www2.gov.bc.ca www2.gov.bc.ca

Preventing and reducing homelessness: An integrated data project - Province of British Columbia

1
1. mlenc 12 Jun 2025
  
  in Public
  
  admin data integrated data homelessness data bc canada
Visit annotations in context

Tags

integrated data

bc

canada

homelessness data

admin data

Annotators

mlenc

URL

www2.gov.bc.ca/gov/content/housing-tenancy/affordable-and-social-housing/homelessness/homelessness-cohort
apxml.com apxml.com

Data Preprocessing with PyTorch Transforms vs Keras

1
1. jmk412 09 Jun 2025
  
  in Public
  
  Stateless vs. Stateful Preprocessing: Most PyTorch transforms are stateless (e.g., RandomHorizontalFlip) or configured with fixed parameters (e.g., Normalize with pre-defined mean/std). If you need to compute statistics from your data (like the mean and standard deviation for normalization), you typically do this once offline and then hardcode these values into the Normalize transform. This contrasts with Keras's Normalization layer, which has an adapt() method to compute these statistics online from a batch of data.
  
  Additional perspective on preprocessing
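The stateless versus stateful distinction in the highlight can be sketched without either framework. The code below is a standard-library illustration only: `make_normalizer` plays the role of a transform configured with fixed parameters, and `AdaptiveNormalizer` mimics the idea of a layer with an `adapt()` step. Both names are hypothetical, and neither reproduces the actual PyTorch or Keras APIs.

```python
import statistics


def compute_stats(train_values):
    """One full pass over the training data, done once 'offline'."""
    return statistics.fmean(train_values), statistics.pstdev(train_values)


def make_normalizer(mean, std):
    """Stateless transform: parameters are fixed at construction time."""
    def normalize(x):
        return (x - mean) / std
    return normalize


class AdaptiveNormalizer:
    """Stateful analog: learns its statistics from data via adapt()."""

    def __init__(self):
        self.mean = None
        self.std = None

    def adapt(self, values):
        self.mean = statistics.fmean(values)
        self.std = statistics.pstdev(values)

    def __call__(self, x):
        if self.mean is None:
            raise RuntimeError("call adapt() before using the normalizer")
        return (x - self.mean) / self.std


train = [2.0, 4.0, 4.0, 4.0, 5.0, 5.0, 7.0, 9.0]

# Stateless route: compute once, then hardcode the values.
mean, std = compute_stats(train)
normalize = make_normalizer(mean, std)

# Stateful route: the object computes and keeps the statistics itself.
layer = AdaptiveNormalizer()
layer.adapt(train)

print(mean, std, normalize(9.0), layer(9.0))
```

Both routes produce the same normalized values here; the difference is where the statistics live and who is responsible for recomputing them when the training data changes.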
  
  data-platform
Visit annotations in context

Tags

data-platform

Annotators

jmk412

URL

apxml.com/courses/pytorch-for-tensorflow-developers/chapter-3-pytorch-data-loading-for-tf-users/preprocessing-pytorch-transforms
www.tensorflow.org www.tensorflow.org

Data preprocessing for ML: options and recommendations | TFX | TensorFlow

2
1. jmk412 09 Jun 2025
  
  in Public
  
  Preprocessing challenges
  
  The following are the primary challenges of implementing data preprocessing:
  
  Training-serving skew. Training-serving skew refers to a difference between effectiveness (predictive performance) during training and during serving. This skew can be caused by a discrepancy between how you handle data in the training and the serving pipelines. For example, if your model is trained on a logarithmically transformed feature, but it's presented with the raw feature during serving, the prediction output might not be accurate. If the transformations become part of the model itself, it can be straightforward to handle instance-level transformations, as described earlier in Option C: TensorFlow. In that case, the model serving interface (the serving_fn function) expects raw data, while the model internally transforms this data before computing the output. The transformations are the same as those that were applied on the raw training and prediction data points.
  
  Full-pass transformations. You can't implement full-pass transformations such as scaling and normalization transformations in your TensorFlow model. In full-pass transformations, some statistics (for example, max and min values to scale numeric features) must be computed on the training data beforehand, as described in Option B: Dataflow. The values then have to be stored somewhere to be used during model serving for prediction to transform the new raw data points as instance-level transformations, which avoids training-serving skew. You can use the TensorFlow Transform (tf.Transform) library to directly embed the statistics in your TensorFlow model. This approach is explained later in How tf.Transform works.
  
  Preparing the data up front for better training efficiency. Implementing instance-level transformations as part of the model can degrade the efficiency of the training process. This degradation occurs because the same transformations are repeatedly applied to the same training data on each epoch. Imagine that you have raw training data with 1,000 features, and you apply a mix of instance-level transformations to generate 10,000 features. If you implement these transformations as part of your model, and if you then feed the model the raw training data, these 10,000 operations are applied N times on each instance, where N is the number of epochs. In addition, if you're using accelerators (GPUs or TPUs), they sit idle while the CPU performs those transformations, which isn't an efficient use of your costly accelerators. Ideally, the training data is transformed before training, using the technique described under Option B: Dataflow, where the 10,000 transformation operations are applied only once on each training instance. The transformed training data is then presented to the model. No further transformations are applied, and the accelerators are busy all of the time. In addition, using Dataflow helps you to preprocess large amounts of data at scale, using a fully managed service.
  
  Preparing the training data up front can improve training efficiency. However, implementing the transformation logic outside of the model (the approaches described in Option A: BigQuery or Option B: Dataflow) doesn't resolve the issue of training-serving skew. Unless you store the engineered feature in the feature store to be used for both training and prediction, the transformation logic must be implemented somewhere to be applied on new data points coming for prediction, because the model interface expects transformed data. The TensorFlow Transform (tf.Transform) library can help you to address this issue, as described in the following section.
  
  Challenges with data preprocessing
  
  data-platform
2. jmk412 09 Jun 2025
  
  in Public
  
  You preprocess the raw training data using the transformation implemented in the tf.Transform Apache Beam APIs, and run it at scale on Dataflow. The preprocessing occurs in the following phases: Analyze phase: During the analyze phase, the required statistics (like means, variances, and quantiles) for stateful transformations are computed on the training data with full-pass operations. This phase produces a set of transformation artifacts, including the transform_fn graph. The transform_fn graph is a TensorFlow graph that has the transformation logic as instance-level operations. It includes the statistics computed in the analyze phase as constants. Transform phase: During the transform phase, the transform_fn graph is applied to the raw training data, where the computed statistics are used to process the data records (for example, to scale numerical columns) in an instance-level fashion.
  
  Good dichotomy for data preprocessing
  
  data-platform
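The two annotations above describe an analyze phase (a full pass that computes statistics) followed by a transform phase (instance-level operations that use those statistics as constants). A minimal plain-Python sketch of that split follows. It is an illustration only, not the tf.Transform or Apache Beam API: `analyze`, `make_transform_fn`, the `price` column, and the toy rows are all hypothetical names and values chosen for this note.

```python
def analyze(rows, column):
    """Analyze phase: one full pass over the training data to get min and max."""
    values = [row[column] for row in rows]
    return {"min": min(values), "max": max(values)}


def make_transform_fn(stats, column):
    """Build an instance-level transform with the statistics baked in as constants."""
    lo, hi = stats["min"], stats["max"]
    span = (hi - lo) or 1.0  # guard against a constant column

    def transform_fn(row):
        out = dict(row)
        out[column] = (row[column] - lo) / span
        return out

    return transform_fn


# Hypothetical toy training data.
training_rows = [{"price": 10.0}, {"price": 20.0}, {"price": 40.0}]

# Analyze phase: statistics come from the training data only.
stats = analyze(training_rows, "price")
transform_fn = make_transform_fn(stats, "price")

# Transform phase: apply the same function to every training record.
transformed_training = [transform_fn(row) for row in training_rows]

# Serving reuses the same function, so a raw input gets the training-time scaling.
served = transform_fn({"price": 25.0})
```

Because the serving path calls the very function that produced the training data, the scaling constants cannot drift apart between the two pipelines, which is the training-serving skew concern raised in the first annotation.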
Visit annotations in context

Tags

data-platform

Annotators

jmk412

URL

tensorflow.org/tfx/guide/tft_bestpractices
upcea.edu upcea.edu

Why Credential Terminology Matters in Higher Education and Workforce Development - UPCEA

1
1. SenorG 04 Jun 2025
  
  in Public
  
  Stackable credentials are also critical to the “Some College, No Credential” (SCNC) market, which reached a total of 36.8 million under the age of 65 in the U.S., up 2.9% from the previous year. Recent research from UPCEA and StraighterLine found that 76% of SCNC adults said being able to earn alternative or microcredentials that could stack toward a degree would increase or greatly increase their interest in completing their degree
  
  In other words: 36.8M people have some college, and 76% say the ability to earn formal credentials that stack to degrees would increase their interest in completing their degree. That's 28 MILLION adults who already did post-secondary once and could be re-engaged. The dreaded enrollment cliff is 3M and yet 10x that number of people who already self-selected into college once gets none of the same attention. It's a massive opportunity.
  
  SCND MC data
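The arithmetic behind the comment can be checked in a few lines. The 36.8 million and 76% figures come from the quoted passage; the 3 million "enrollment cliff" figure is the annotator's own and is taken here as given. Applying a survey percentage to the whole SCNC population is the comment's assumption, not something the quote states.

```python
scnc_adults = 36.8e6      # SCNC adults under 65, from the quoted passage
share_interested = 0.76   # survey share, from the quoted passage
enrollment_cliff = 3e6    # figure used in the comment, not in the quote

# Extrapolate the survey share to the full SCNC population (an assumption).
interested = scnc_adults * share_interested
ratio = interested / enrollment_cliff

print(f"{interested / 1e6:.1f}M interested adults, {ratio:.1f}x the cliff figure")
```

Under those assumptions the result is just under 28 million adults, a little more than nine times the 3 million figure, so "10x" in the comment is a rounding up.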
Visit annotations in context

Tags

SCND

MC data

Annotators

SenorG

URL

upcea.edu/
mutabit.com mutabit.com

Revisando la participación continua

2
1. NESTORCRISTANCHO 03 Jun 2025
  
  in Public
  
  Data Frame
  
  A DataFrame is a data table structured in rows and columns that makes it possible to organize, manipulate, and analyze information simply and in an orderly way in programming and data science.
  
  The table isn't working for me; I think I still need to define some variable or I'm forgetting a step.
  
  Ciencia de datos Programación Ordenada Data Frame
2. JPAR 01 Jun 2025
  
  in Public
  
  fighter1 data
  
  Here, running this command in Glamorous also produces a null-response error.
  
  Respuesta nula Data Caracteristicas Pokemon
Visit annotations in context

Tags

Data

Data Frame

Programación Ordenada

Caracteristicas Pokemon

Respuesta nula

Ciencia de datos

Annotators

JPAR

NESTORCRISTANCHO

URL

mutabit.com/repos.fossil/labci/doc/tip/wiki/revisando-la-participacion-continua--2w3jt.md.html
scholarship.law.bu.edu scholarship.law.bu.edu

Data as Policy

1
1. mlenc 02 Jun 2025
  
  in Public
  
  data as policy
Visit annotations in context

Tags

data as policy

Annotators

mlenc

URL

scholarship.law.bu.edu/cgi/viewcontent.cgi
May 2025
bellwether.org bellwether.org

Transforming Education Data Sharing for Nebraska’s Court-Involved Students: Improving Academic Outcomes Through Cross-Agency Collaboration | Bellwether

1
1. mlenc 22 May 2025
  
  in Public
  
  nebraska case study of data sharing for court-involved youth
  
  data sharing data infrastructure nebraska foster care court-involved youth youth education data
Visit annotations in context

Tags

youth

data infrastructure

data sharing

court-involved youth

nebraska

education data

foster care

Annotators

mlenc

URL

bellwether.org/publications/transforming-education-data-sharing-for-nebraskas-court-involved-students/
www.theguardian.com www.theguardian.com

Half of world’s CO2 emissions come from 36 fossil fuel firms, study shows

1
1. HeinzWittenbrink 06 May 2025
  
  in Public
  
  Just 36 companies were responsible for more than half of the greenhouse gases emitted worldwide in 2023, according to an analysis of the data in the Carbon Majors Database. Most of the 169 companies included in this database increased their emissions in 2023, at that point the hottest year on record.
  
  The main polluters also include #Adnoc, whose shares in Austria's #OMV are syndicated with those of the Austrian state.
  
  Earlier versions of the Carbon Majors report, produced by InfluenceMap, played an important role in lawsuits against fossil fuel companies. https://www.theguardian.com/environment/2025/mar/05/half-of-worlds-co2-emissions-come-from-36-fossil-fuel-firms-study-shows
  
  Carbon Majors 2023 Data Update: https://carbonmajors.org/briefing/The-Carbon-Majors-Database-2023-Update-31397
  
  Aramco Coal India CHN Energy NIOC Jinneng Group Shell BP TotalEnergies ExxonMobil Chevron by: Damian Carrington 2025-01-05 Christiana Figueres Carbon Majors Emmett Connaire InfluenceMap ClientEarth Black Rock Carbon Majors: 2023 Data Update Gazprom Eni Adnoc Kumi Naidoo Fossil Fuels Non-Proliferation Treaty 2023 CO2 emissions
Visit annotations in context

Tags

CHN Energy

2023

Shell

NIOC

Emmett Connaire

Fossil Fuels Non-Proliferation Treaty

Christiana Figueres

Eni

CO2 emissions

Chevron

Coal India

Kumi Naidoo

InfluenceMap

Carbon Majors

BP

2025-01-05

ClientEarth

by: Damian Carrington

Aramco

Gazprom

Carbon Majors: 2023 Data Update

Adnoc

Black Rock

ExxonMobil

Jinneng Group

TotalEnergies

Annotators

HeinzWittenbrink

URL

theguardian.com/environment/2025/mar/05/half-of-worlds-co2-emissions-come-from-36-fossil-fuel-firms-study-shows
Apr 2025
www.youtube.com www.youtube.com

A.I. Experiments: Visualizing High-Dimensional Space

3
1. stopresetgo 12 Apr 2025
  
  in Public
  
  open sourcing all of this as part of TensorFlow so that anyone can use these tools to explore their data.
  
  for - tensorflow - data visualization of words - question - tensorflow - for SRG tool?
  
  tensorflow - data visualization of words question - tensorflow - for SRG tool?
2. stopresetgo 12 Apr 2025
  
  in Public
  
  for - data visualization - words in high dimensional space - Google tensorflow - open source data visualization - of words
  
  data visualization - words in high dimensional space Google tensorflow - open source data visualization - of words
3. stopresetgo 12 Apr 2025
  
  in Public
  
  words are treated as high-dimensional data points.
  
  for - words - high dimensionality data points
  
  words - high dimensionality data points
Visit annotations in context

Tags

tensorflow - data visualization of words

question - tensorflow - for SRG tool?

data visualization - words in high dimensional space

words - high dimensionality data points

Google tensorflow - open source data visualization - of words

Annotators

stopresetgo

URL

youtube.com/watch
dtcc.chalmers.se dtcc.chalmers.se

Digital Twin Cities Centre – A Vinnova competence centre

1
1. stopresetgo 12 Apr 2025
  
  in Public
  
  for - chalmers university - digital twin cities centre - from - youtube - urban data visualization using mixed reality - https://hyp.is/ptvO5BexEfC063-4BZXD-A/www.youtube.com/watch?v=tN2_TJ1ZYhQ
  
  chalmers university - digital twin cities centre from - youtube - urban data visualization using mixed reality
Visit annotations in context

Tags

chalmers university - digital twin cities centre

from - youtube - urban data visualization using mixed reality

Annotators

stopresetgo

URL

dtcc.chalmers.se/
www.youtube.com www.youtube.com

IDA Ideas: Demonstration: 3D Network Data Visualization

1
1. stopresetgo 12 Apr 2025
  
  in Public
  
  for - mixed reality 3d graph data visualization - skyrails - gelphi
  
  mixed reality 3d graph data visualization - skyrails - gelphi
Visit annotations in context

Tags

mixed reality 3d graph data visualization - skyrails - gelphi

Annotators

stopresetgo

URL

youtube.com/watch
www.redbook.io www.redbook.io

Red Book, 5th ed. Ch. 2: Traditional RDBMS Systems

1
1. almereyda 06 Apr 2025
  
  in Public
  
  data modelling relational systems
Visit annotations in context

Tags

relational

systems

data

modelling

Annotators

almereyda

URL

redbook.io/ch2-importantdbms.html
dsf.berkeley.edu dsf.berkeley.edu

whatgoesaround-stonebraker.pdf

1
1. almereyda 06 Apr 2025
  
  in Public
  
  data modelling filetype:PDF
Visit annotations in context

Tags

filetype:PDF

data

modelling

Annotators

almereyda

URL

dsf.berkeley.edu/cs286/papers/goesaround-redbook2005.pdf
Mar 2025
www.nyc.gov www.nyc.gov

Administrative Data Access Initiative - CIDI

1
1. mlenc 31 Mar 2025
  
  in Public
  
  accessing admin data admin data access homelessness data data infrastructure
Visit annotations in context

Tags

data infrastructure

access

accessing admin data

homelessness data

admin data

Annotators

mlenc

URL

nyc.gov/site/cidi/collaborations/administrative-data-access-initiative.page
www.infoway-inforoute.ca www.infoway-inforoute.ca

Shared Pan-Canadian Interoperability Roadmap

1
1. mlenc 20 Mar 2025
  
  in Public
  
  data infrastructure roadmap data roadmap health data canada
Visit annotations in context

Tags

data infrastructure

health data

canada

roadmap

data roadmap

Annotators

mlenc

URL

infoway-inforoute.ca/en/component/edocman/6444-connecting-you-to-modern-health-care-shared-pan-canadian-interoperability-roadmap/view-document
cyfroweobywatelstwo.pl cyfroweobywatelstwo.pl

Internet Dzieci - Instytut Cyfrowego Obywatelstwa

1
1. pyxelr 16 Mar 2025
  
  in Public
  
  Selected data from the report:
  
  Age group 7-12: Well over half of this age group, as many as 1.4 million children (58%), actively use social media sites and messaging apps that are permitted only from age 13. One in three children (760,000; 32%) has regular access to TikTok, 24% (580,000) to Facebook, and 12% (290,000) to Instagram. Children commonly use messaging apps: 38% use Messenger (900,000) and 31% use WhatsApp (720,000). They use TikTok most intensively: active users of the platform spend an average of 2 hours and 11 minutes a day in the app and in most cases open it between a dozen and several dozen times in a single day. It can be estimated that more than 300,000 children spend over two hours a day on the platform.
  
  Age group 7-14: 85% of them use the internet (2.7 million). Of these, 96% (2.6 million) connect via mobile devices. They most often use social media and streaming platforms. They spend more than 2 hours a day on social media sites and close to 2 hours on streaming platforms. The most frequently chosen content categories are culture and entertainment, education, and erotica. 95% of internet users in this group use entertainment content, mainly games and music; a similar share visited educational content, and 51% visited erotic content. They most often use mobile devices to access erotic sites.
  
  children internet data polish TikTok
Visit annotations in context

Tags

polish

TikTok

data

children

internet

Annotators

pyxelr

URL

cyfroweobywatelstwo.pl/internetdzieci/
Feb 2025
moodle.cnc.bc.ca moodle.cnc.bc.ca

The Ebola Epidemic Annotation and Fact Checking Activity .pdf

1
1. CGaugano 24 Feb 2025
  
  in Public
  
  Many villagers believed that abandoning these rituals would anger their ancestors and cause harm to their families (WHO, 2015)
  
  This information is incorrect: the section "Factors that contributed to undetected spread" does not state it. Additionally, the information is unrelated to the reference cited.
  
  Incorrect Data
Visit annotations in context

Tags

Incorrect Data

Annotators

CGaugano

URL

moodle.cnc.bc.ca/pluginfile.php/1517598/mod_resource/content/7/The Ebola Epidemic Annotation and Fact Checking Activity .pdf
www.modernisation.gouv.fr www.modernisation.gouv.fr

Le plan d’action national 2024-2026 pour un gouvernement ouvert | Direction interministérielle de la transformation publique

1
1. tanemika 21 Feb 2025
  
  in Public
  
  DITP CRPA texte officiel gouvernance ouverte open data 2024
Visit annotations in context

Tags

DITP

CRPA

gouvernance ouverte

2024

texte officiel

open data

Annotators

tanemika

URL

modernisation.gouv.fr/publications/le-plan-daction-national-2024-2026-pour-un-gouvernement-ouvert
www.datavis.ca www.datavis.ca

100+ Years of Graphs of the Titanic Data

1
1. UnarchivedStudent 21 Feb 2025
  
  in Public
  
  Parallel sets
  
  Parallel coordinate plots provide a way to display multidimensional data in 2D plots. They do this by representing the variables as a set of parallel axes, and showing each observation as a line in parallel coordinate space, rather than as a point in standard coordinate space. Extensions of this idea for categorical data led to “parallel sets plots”, and some variations, a number of which use the Titanic data for examples. Bendix, Kosara, and Hauser (2005), "Parallel sets: Visual analysis of categorical data", and Kosara et al. (2006), "Parallel sets: Interactive exploration and visual analysis of categorical data", developed an interactive system to explore multivariate categorical data using parallel sets, in which the lines between categories of successive variables are of width proportional to the joint frequencies.
  
  Because of the lack of visual clarity, I struggled to understand what the 2005 parallel sets plots were actually representing in this context (especially since external searching suggests these plots are usually laid out horizontally), to the point of forgetting that most of these charts track how many people in a given grouping lived or died in the sinking, which makes me question what benefit we get from them. I do appreciate the 2013 charts, not only for their accurate line widths but also for color and shade distinctions clear enough to show what feeds into what (although I do wish the "Survived" category were at the top or bottom rather than in the middle).
  
  titanic paralell coordinate plots parallel coordinate plots parallel sets graphs data graphing
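The quoted passage says that in a parallel sets plot the lines between categories of successive variables have widths proportional to the joint frequencies. A minimal sketch of that computation follows. The records are hypothetical toy rows invented for illustration, NOT the real Titanic counts, and `ribbon_widths` is a name made up for this note.

```python
from collections import Counter

# Hypothetical toy records, NOT the real Titanic counts.
records = [
    {"class": "1st", "sex": "female", "survived": "yes"},
    {"class": "1st", "sex": "male", "survived": "no"},
    {"class": "3rd", "sex": "male", "survived": "no"},
    {"class": "3rd", "sex": "male", "survived": "no"},
    {"class": "3rd", "sex": "female", "survived": "yes"},
]


def ribbon_widths(rows, left, right):
    """Joint frequency of each (left, right) category pair, as a share of all rows."""
    counts = Counter((row[left], row[right]) for row in rows)
    total = len(rows)
    return {pair: n / total for pair, n in counts.items()}


# Widths of the ribbons drawn between the "class" axis and the "sex" axis.
widths = ribbon_widths(records, "class", "sex")
```

Each value is the fraction of all observations falling in that pair of categories, so the widths between any two adjacent axes sum to 1; repeating the call for ("sex", "survived") would give the next band of ribbons.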
Visit annotations in context

Tags

parallel coordinate plots

data graphing

titanic

graphs

parallel sets

paralell coordinate plots

Annotators

UnarchivedStudent

URL

datavis.ca/papers/titanic/
srsergiorodriguez.github.io srsergiorodriguez.github.io

Humanidades digitales en América Latina

1
1. offray 19 Feb 2025
  
  in Public
  
  For example, as they noted, investment in developing graphical user interfaces is usually postponed because of the high costs of designing and testing them. Something similar happens with translations and localizations, since they require people with situated knowledge. In addition, many projects halt their activities once the initial institutional and financial push ends, and so their features remain frozen in time or expire for lack of support.
  
  It is interesting how Grafoscopio has avoided several of these failures by making unusual choices: being developed in Pharo (which from the outset provides a graphical interface and ad hoc data persistence models); organizing informal workshops such as the Data Weeks and the Data Rodas, which create localized knowledge and make diglossia a bridge rather than a chasm; and building on economies of care and affection, acknowledging them so as not to require so much initial money. Although it shares the fragilities of small and medium-sized projects, for example the small number of developers, it is also worth making visible these differentiated strategies for dealing with these common problems.
  
  metaherramientas alfabetismos digitales críticos Data Rodas data week Grafoscopio diglosia
Visit annotations in context

Tags

Grafoscopio

Data Rodas

alfabetismos digitales críticos

metaherramientas

diglosia

data week

Annotators

offray

URL

srsergiorodriguez.github.io/exploraciones-digitales/infraestructura.html
srsergiorodriguez.github.io srsergiorodriguez.github.io

Humanidades digitales en América Latina

2
1. offray 19 Feb 2025
  
  in Public
  
  Nevertheless, it has also become a frustrating space because of the phenomenon of freeriding, since participants take momentary advantage of the club's spaces and knowledge but feel no minimal commitment to it, such as respect for the organizers' time or the need to give notice when they will not take part. Those who have participated in the club over the long term, however, have found that their learning in community is much more powerful than self-taught practice alone
  
  We experienced something similar with the Data Weeks and Data Rodas in Grafoscopio, which led us to establish a set of principles that included things like practices of mutual care, and to recognize the floating character of most participants and the lasting commitment of very few (for this and other reasons, the ongoing creation of living hypertextual memory in our pocket infrastructures is key).
  
  Data Rodas data week economías del cuidado
2. offray 19 Feb 2025
  
  in Public
  
  to leave the workshop logic: we spend our lives doing workshops and workshops and workshops, and move to a much more concrete logic of generating developments and solutions that can be sustainable. Institutionalization enables sustainability" (note 48, «Entrevista a Jairo Melo»).
  
  Another possibility is to build living memory during and between workshops that gives them a sense of continuity and progress, and that makes it possible to value the workshop logic as a way to build our own technologies rather than thinking of it only as a means of appropriating external technologies, very much in resonance with what was said in this comment.
  
  Although in Grafoscopio we still have problems with gradual peer-to-peer learning, living memory and highly contextual problems turn them into embodied problems that we take up in future workshops and in links between communities of practice and institutionalized spaces.
  
  Grafoscopio institucionalidad comunidades de práctica bienes comunes Data Rodas data week
Visit annotations in context

Tags

comunidades de práctica

bienes comunes

data week

Grafoscopio

Data Rodas

institucionalidad

economías del cuidado

Annotators

offray

URL

srsergiorodriguez.github.io/exploraciones-digitales/comunidad.html
gcgh.grandchallenges.org gcgh.grandchallenges.org

Innovative Data and Modeling Approaches to Measure Women’s H

1
1. mlenc 17 Feb 2025
  
  in Public
  
  toblog no novel data use what's there
Visit annotations in context

Tags

no novel data

toblog

use what's there

Annotators

mlenc

URL

gcgh.grandchallenges.org/challenge/innovative-data-and-modeling-approaches-measure-womens-health
srsergiorodriguez.github.io srsergiorodriguez.github.io

Humanidades digitales en América Latina

2
1. offray 17 Feb 2025
  
  in Public
  
  Interactive 5, below, presents a text generator that intermingles the words of different authors who, each in their own way, have reflected on humanism in Latin America: Manuel Quintín Lame, Domingo Sarmiento, Leopoldo Zea, Oswald de Andrade, and José Vasconcelos. Using a Markov chain system, commonly applied in works of electronic literature, this generator remixes different texts and creates an amalgam that jumps between the terms used in them:
  
  Again, a gender bias in the published/selected material.
  
  An interesting small-scale digital humanities experiment.
  
  sesgos de género infraestructuras de bolsillo small data cadenas de Markov
2. offray 17 Feb 2025
  
  in Public
  
  situating and understanding texts in the human world built through these basic processes returns us to the humanistic discipline of hermeneutics, of which the digital humanities are a technological incarnation" (note 78)
  
  And yet this fine purpose is orthogonal to the size of the databases. Other computational hermeneutics could occur, and in fact do occur, at small scale.
  
  small data infraestructuras de bolsillo
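The first annotation above quotes a description of a text generator that remixes several authors' texts with a Markov chain. A minimal word-level sketch of that technique follows. It is not the interactive's actual code: `build_chain` and `generate` are hypothetical names, the chain is first-order (one word of context), and the two source sentences are placeholders rather than the authors' texts.

```python
import random
from collections import defaultdict


def build_chain(texts):
    """Map each word to the list of words that follow it in any source text."""
    chain = defaultdict(list)
    for text in texts:
        words = text.split()
        for current, following in zip(words, words[1:]):
            chain[current].append(following)
    return chain


def generate(chain, start, length, rng):
    """Walk the chain from `start`, picking each next word at random."""
    word = start
    output = [word]
    for _ in range(length - 1):
        followers = chain.get(word)
        if not followers:  # dead end: no recorded successor
            break
        word = rng.choice(followers)
        output.append(word)
    return " ".join(output)


# Hypothetical placeholder sentences, not the authors' actual texts.
sources = [
    "the word travels between texts",
    "the texts remix the word",
]
chain = build_chain(sources)
sentence = generate(chain, "the", 8, random.Random(0))
```

Because all sources feed one shared transition table, a word that appears in more than one text becomes a junction where the output can jump from one author's phrasing to another's, which is the "amalgam" effect the quote describes. It also works on very small corpora, in line with the small-data point made in the second annotation.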
Visit annotations in context

Tags

sesgos de género

cadenas de Markov

infraestructuras de bolsillo

small data

Annotators

offray

URL

srsergiorodriguez.github.io/exploraciones-digitales/tradicion.html
www.theguardian.com www.theguardian.com

Scientists brace ‘for the worst’ as Trump purges climate mentions from websites

1
1. HeinzWittenbrink 07 Feb 2025
  
  in Public
  
  The Trump administration is systematically removing references to the climate crisis and global heating from US government websites. Climate scientist Michael Mann says we must expect the worst because the polluters have now come to power. Experts expect the new administration to try systematically to suppress information about the causes and consequences of the climate crisis. At the same time, government measures for climate adaptation and for reducing greenhouse gas emissions are being blocked. https://www.theguardian.com/us-news/2025/feb/04/trump-climate-change-federal-websites
  
  USA Trump administration Oliver Milman Andrew Witherspoon 2025-02-04 Michael Mann Gretchen Gehrke Ben Jealous Sierra Club Environmental Data and Governance Initiative
Visit annotations in context

Tags

Sierra Club

USA

Michael Mann

Trump administration

Gretchen Gehrke

2025-02-04

Environmental Data and Governance Initiative

Ben Jealous

Oliver Milman

Andrew Witherspoon

Annotators

HeinzWittenbrink

URL

theguardian.com/us-news/2025/feb/04/trump-climate-change-federal-websites
toby-89881.medium.com toby-89881.medium.com

Explode on Impact

1
1. mlenc 06 Feb 2025
  
  in Public
  
  fantasy data incentives social change toby lowe impact measurement
Visit annotations in context

Tags

incentives

social change

toby lowe

impact measurement

fantasy data

Annotators

mlenc

URL

toby-89881.medium.com/explode-on-impact-cba283b908cb
www.derstandard.at www.derstandard.at

Wann kommt die Energiewende? Oder kommt sie gar nicht?

1
1. HeinzWittenbrink 03 Feb 2025
  
  in Public
  
  In Der Standard, Martin Auer uses current data to show why merely expanding capacity for generating renewable energy will not lead to decarbonization. Energy demand is growing substantially faster than the renewable energy available. The AI boom is increasing it significantly once more. https://www.derstandard.at/story/3000000255154/wann-kommt-die-energiewende-oder-kommt-sie-gar-nicht
  
  energy transition by: Martin Auer 2025-02-03 IRENA Richard York Elizabeth Bell Energy transitions or additions? AI power: Expanding data center capacity to meet growing demand Tripling Renewables by 2030 Requires a Minimum of 16.4% Annual Growth Rate
Visit annotations in context

Tags

AI power: Expanding data center capacity to meet growing demand

IRENA

Tripling Renewables by 2030 Requires a Minimum of 16.4% Annual Growth Rate

Richard York

Elizabeth Bell

Energy transitions or additions?

energy transition

by: Martin Auer

2025-02-03

Annotators

HeinzWittenbrink

URL

derstandard.at/story/3000000255154/wann-kommt-die-energiewende-oder-kommt-sie-gar-nicht
www.youtube.com www.youtube.com

Big data : données, données, donnez-moi ! - #DATAGUEULE 15

1
1. tanemika 01 Feb 2025
  
  in Public
  
  As our lives become more and more connected, the personal data we emit during each of our activities is becoming a major industrial issue.
  
  Let's set out to discover a world built around big data.
  
  numérique EMI big data 2025 vidéo
Visit annotations in context

Tags

vidéo

2025

numérique

EMI

big data

Annotators

tanemika

URL

youtube.com/watch
Jan 2025
en.wikipedia.org en.wikipedia.org

JSON-LD - Wikipedia

1
1. TylerRick 30 Jan 2025
  
  in Public
  
  linked data JSON-LD JSON RDF
Visit annotations in context

Tags

linked data

RDF

JSON

JSON-LD

Annotators

TylerRick

URL

en.wikipedia.org/wiki/JSON-LD
lepiter.io lepiter.io

What exactly is Glamorous Toolkit v1.0?

1
1. offray 30 Jan 2025
  
  in Public
  
  Moldable Development involves two distinct roles, each with its own set of skills. The facilitator (in blue on the map) is a technical role that is concerned with the technical part of building tools. But that alone is not enough. The stakeholder (in red) is at least as important. Tools are only meaningful when they relate to a question or hypothesis that is tied to value. That's the job of the stakeholder.
  
  In the Grafoscopio community, too, some roles carry more technical expertise than others, even though all of us are involved in solving the problem. In our case, there is no split between decisions, challenges, and specific problems settled by the non-technical public (those in red on the Wardley map) and other technical actions (in blue) that concern only the developers. Instead there is a blend of color (perhaps purple) that turns bluer the more technical the action is, even though non-technical people also take part in it, and redder the more administrative it is, even though developers also take part in administration. The meeting point of these two kinds of experience, and the mixing of colors, happens particularly in workshops such as the Data Weeks and the Data Rodas.
  
  Data Weeks Data Rodas
Visit annotations in context

Tags

Data Rodas

Data Weeks

Annotators

offray

URL

lepiter.io/feenk/what-exactly-is-glamorous-toolkit-v1-0--7sex44dze2dqlocqxwfz8ju0i/
ericmjl.github.io ericmjl.github.io

Untitled document

1
1. structseeker 26 Jan 2025
  
  in Public
  
  Python Data Science Bootstrap
  
  Software Engineering Python Handbook Data Science Guide
Visit annotations in context

Tags

Data Science

Python

Software Engineering

Handbook

Guide

Annotators

structseeker

URL

ericmjl.github.io/data-science-bootstrap-notes/get-bootstrapped-on-your-data-science-projects/
feministai.pubpub.org feministai.pubpub.org

Leveraging AI

1
1. Marcoguzman2024 24 Jan 2025
  
  in Public
  
  the most critical issues to harness innovation within the AI ecosystem
  
  La diversidad corporal en Colombia abarca una amplia gama de experiencias, marcadas por la riqueza multicultural y la interacción de comunidades indígenas, afrodescendientes, campesinas y urbanas. Esta diversidad también está entrelazada con el acceso desigual a la tecnología, la salud y la educación, especialmente en áreas rurales.
  
  El uso de la Inteligencia Artificial para abordar problemas sociales, como se ha hecho en África, puede inspirar iniciativas en Colombia. Por ejemplo:
  
  La Inteligencia Artificial para diagnósticos tempranos de enfermedades como el cáncer de mama o la tuberculosis, adaptados a los contextos rurales colombianos, donde los servicios médicos son limitados.
  
  Modelos de Inteligencia Artificial para identificar plagas y enfermedades en cultivos de importancia para las comunidades rurales, como el café, el plátano o el maíz.
  
  Considerar las diversidades corporales al diseñar soluciones que sean accesibles para todas las personas, independientemente de sus capacidades físicas o contexto social.
  
  Translation in Colombia can play a fundamental role in creating and using localized data to train Artificial Intelligence. Similar to the inclusion of Luganda in the Common Voice project in Africa, initiatives can be developed to collect and translate data in Colombian Indigenous languages such as Wayuunaiki, Nasa Yuwe, or Emberá.
  
  Expand the representation of Indigenous languages in Artificial Intelligence applications such as virtual assistants or speech recognition systems.
  
  Help preserve and revitalize these languages by integrating them into modern technologies.
  
  Generate diverse linguistic datasets that foster the development of inclusive, contextualized, and ethically responsible Artificial Intelligence.
  
  Artificial Intelligence for social good as described in Africa can be adapted to the Colombian context, drawing on the “data-to-impact pipeline” to solve real problems.
  
  Problem identification must be participatory, involving the affected communities.
  
  Solutions to improve food distribution logistics in remote regions.
  
  Artificial Intelligence to identify and mitigate environmental risks in areas affected by illegal mining or deforestation.
  
  It is crucial to develop localized, representative datasets to avoid bias in Artificial Intelligence models.
  
  Agricultural databases that reflect the particularities of Colombian ecosystems.
  
  Health data adapted to the country's genetic and cultural diversity.
  
  AI design must be grounded in an understanding of the local and cultural context.
  
  Adapt models to the specific needs of Indigenous and Afro-descendant communities.
  
  Integrate traditional knowledge into technological solutions, recognizing collective knowledge and ancestral practices.
  
  Education in the ethics of Artificial Intelligence is essential to train professionals who are aware of the social and cultural impacts of what they create. Clear guidelines must also be established for implementing ethical principles in technology development, encouraging inclusive and non-extractive practices.
  
  Traducción-data Inteligencia artificial-data Corporalidades-data
Visit annotations in context

Tags

Inteligencia artificial-data

Corporalidades-data

Traducción-data

Annotators

Marcoguzman2024

URL

feministai.pubpub.org/pub/leveraging-ai
feministai.pubpub.org feministai.pubpub.org

Data and Indigenous Communities

1
1. Marcoguzman2024 24 Jan 2025
  
  in Public
  
  We are essentially digitizing trees, animals, and plants and rivers, and boundaries, defining those using satellite imagery.
  
  In Colombia, corporealities are deeply tied to cultural, territorial, and spiritual identity. For many Indigenous, Afro-descendant, and rural communities, the body is not only physical but also a bridge to the land and to nature.
  
  These communities understand territory as a vital element of their collective existence, in contrast to Western views that separate the individual from the natural environment.
  
  Digitizing territories, as proposed in the use of Artificial Intelligence for conservation, raises significant ethical challenges. Classifying and defining lands and natural resources through satellite imagery and algorithms can strip these communities of their symbolic and material connection to the territory, perpetuating historical inequalities and violating their cultural and bodily rights.
  
  Translation in Colombia could play a key role by mediating between Indigenous perspectives and Western practices of conservation and territorial digitization.
  
  Translating not only languages but also cultural concepts, such as relationality with nature and collective knowledge, is essential to avoid misunderstandings and to ensure that communities' voices are heard.
  
  For example, when conservation projects based on Artificial Intelligence are developed, translation can help ensure that the principles, uses, and risks of these technologies are understood from within Indigenous worldviews, rather than imposing terminologies and approaches that do not respect their practices and knowledge.
  
  The implementation of Artificial Intelligence in land conservation and digitization in Colombia should ensure that:
  
  Indigenous communities are included as principal actors in the design of technologies that affect their territories. This requires free, prior, and informed consultation processes, in line with international human rights standards.
  
  Rather than imposing a digitization model based on separating land from person, Artificial Intelligence reflects how these communities perceive their spiritual, cultural, and economic connection with nature.
  
  Artificial Intelligence recognizes and respects the collective knowledge of the communities. This includes avoiding appropriation of data that fails to consider the communal character of Indigenous identity and knowledge, and instead promoting ethical principles such as those set out in the Indigenous AI position.
  
  Corporalidades-data Traducción-data Inteligencia artificial-data Traducción-comunidades Cartografías de tecnodiversidades-ética Inteligencia Artificial-cartografía Traducción-cartografías Cartografía
Visit annotations in context

Tags

Traducción-comunidades

Corporalidades-data

Inteligencia artificial-data

Cartografías de tecnodiversidades-ética

Inteligencia Artificial-cartografía

Traducción-data

Traducción-cartografías

Cartografía

Annotators

Marcoguzman2024

URL

feministai.pubpub.org/pub/data-and-indigenous-communities
connect.apollo.roche.com connect.apollo.roche.com

Feasibility for Group Based Trajectory Modeling

1
1. schuldtr 21 Jan 2025
  
  in Public
  
  Tables of Possible Cohorts - MS DX Only with and without washout
  
  Look at who is and is not switching.
  
  #data #methods
Visit annotations in context

Tags

#data #methods

Annotators

schuldtr

URL

connect.apollo.roche.com/e4ati_1046/E4ATI_1046_Report-2024-12-11.html
attachments.are.na attachments.are.na

Mining the Algorithmic Sublime: A Qualitative Analysis of Learning Analytics Discourse

3
1. l3slie 05 Jan 2025
  
  in Public
  
  “The revenue model behind these open platforms is to be found in the user data and the value that data can represent.”
  
  data scallion
2. l3slie 05 Jan 2025
  
  in Public
  
  a more complete learner profile
  
  more complete: for who? by what means? what implications?
  
  data privacy consent scallion
3. l3slie 05 Jan 2025
  
  in Public
  
  “Collect it all” is a phrase used to encapsulate the mission of General Keith Alexander, director of the US National Security Agency
  
  cf matters of disclosure, consent, and differing orientation to/with privacy: MIT Tech Review article on CMU Mites in TCS Hall
  
  data privacy consent scallion
Visit annotations in context

Tags

consent

data

scallion

privacy

Annotators

l3slie

URL

attachments.are.na/33412372/14a37b35d0cee83afe0d2da6d32ee3f3.pdf
mail.cyberneticforests.com mail.cyberneticforests.com

Slop Infrastructures 3 & 4

1
1. l3slie 05 Jan 2025
  
  in Public
  
  Slop – in the sense of the flood of information and the calibration of how information is filtered – is power.
  
  cf Rob Kitchin on big data: capture everything anyway
  
  data —return
Visit annotations in context

Tags

—return

data

Annotators

l3slie

URL

mail.cyberneticforests.com/slop-infrastructures-3-4/
cyberneticforests.substack.com cyberneticforests.substack.com

Artificial Intelligence is a Compost Heap (Ideally)

1
1. l3slie 03 Jan 2025
  
  in Public
  
  object recognition
  
  and cf romanticized (or oft-told) narratives of this in CAPTCHAs
  
  captcha data
Visit annotations in context

Tags

data

captcha

Annotators

l3slie

URL

cyberneticforests.substack.com/p/artificial-intelligence-is-a-compost
www.youtube.com www.youtube.com

The Hidden Autopilot Data That Reveals Why Teslas Crash | WSJ

1
1. stopresetgo 01 Jan 2025
  
  in Public
  
  for - progress trap - Tesla autopilot - YouTube - The hidden data that reveals why Tesla's crash - WSJ - 2024 - Dec
  
  progress trap - Tesla autopilot - YouTube - The hidden data that reveals why Tesla's crash - WSJ - 2024 - Dec
Visit annotations in context

Tags

progress trap - Tesla autopilot - YouTube - The hidden data that reveals why Tesla's crash - WSJ - 2024 - Dec

Annotators

stopresetgo

URL

youtube.com/watch
Dec 2024
forsal.pl forsal.pl

Tajemnica szybkiego chodu a otyłość: wyniki badań prof. Ishii

1
1. pyxelr 26 Dec 2024
  
  in Public
  
  A surprising discovery by scientists: how does fast walking affect metabolic health?
  
  Participants were asked: "Is your walking speed faster than people of your gender and age?" Based on their answers, they were categorized as "fast walkers" or "slow walkers."
  
  The study included:
  
  8,578 obese individuals,
  
  9,626 individuals with a large waist circumference,
  
  6,742 individuals meeting both criteria.
  
  Summary:
  
  Overweight individuals who perceive their walking speed as fast have a 30% lower risk of diabetes, along with reduced risks of hypertension and dyslipidemia.
  
  Subjective assessment of walking speed can serve as a simple and cost-effective tool to identify metabolic health risks.
  
  Fast walking indicates the good condition of muscles, joints, and the cardiovascular and respiratory systems.
  
  Previous research has linked slow walking speed to an increased risk of cardiovascular diseases and higher mortality rates in older adults.
  
  health walking data study polish
Visit annotations in context

Tags

study

polish

data

health

walking

Annotators

pyxelr

URL

forsal.pl/lifestyle/zdrowie/artykuly/9698284,zaskakujace-odkrycie-naukowcow-jak-szybki-chod-dziala-na-zdrowie-meta.html
www.youtube.com www.youtube.com

Ajit Narayanan: A word game to communicate in any language

2
1. stopresetgo 23 Dec 2024
  
  in Public
  
  supposing I was a writer, say, for a newspaper or for a magazine. I could create content in one language, FreeSpeech, and the person who's consuming that content, the person who's reading that particular information could choose any engine, and they could read it in their own mother tongue, in their native language
  
  for - freespeech can be used as an international language translator - data structure of thought - from TED Talk - YouTube - A word game to convey any language - Ajit Narayanan
  
  freespeech can be used as an international language translator - data structure of thought - from TED Talk - YouTube - A word game to convey any language - Ajit Narayanan
2. stopresetgo 23 Dec 2024
  
  in Public
  
  when you want to use Google, you go into Google search, and you type in English, and it matches the English with the English. What if we could do this in FreeSpeech instead? I have a suspicion that if we did this, we'd find that algorithms like searching, like retrieval, all of these things, are much simpler and also more effective, because they don't process the data structure of speech. Instead they're processing the data structure of thought
  
  for - indyweb dev - question - alternative to AI Large Language Models? - Is indyweb functionality the same as Freespeech functionality? - from TED Talk - YouTube - A word game to convey any language - Ajit Narayanan - data structure of thought - from TED Talk - YouTube - A word game to convey any language - Ajit Narayanan
  
  data structure of thought - from TED Talk - YouTube - A word game to convey any language - Ajit Narayanan indyweb dev - question - alternative to AI Large Language Models? - Is indyweb functionality the same as Freespeech functionality? - from TED Talk - YouTube - A word game to convey any language - Ajit Narayanan
Visit annotations in context

Tags

freespeech can be used as an international language translator - data structure of thought - from TED Talk - YouTube - A word game to convey any language - Ajit Narayanan

data structure of thought - from TED Talk - YouTube - A word game to convey any language - Ajit Narayanan

indyweb dev - question - alternative to AI Large Language Models? - Is indyweb functionality the same as Freespeech functionality? - from TED Talk - YouTube - A word game to convey any language - Ajit Narayanan

Annotators

stopresetgo

URL

youtube.com/watch
www.youtube.com www.youtube.com

An Overview Of CHM’s Work On “Well-Being And Tukdam” By Professor Dr. Richard J. Davidson

1
1. stopresetgo 19 Dec 2024
  
  in Public
  
  the earliest we've been able to get to a case of tukdam is 26 hours after a practitioner has died so we've missed the first full day and there is some reason to believe that that first 24-hour period is going to be very very important
  
  for - trivia - measuring tukdam after death - 24 hour period immediately following death is important but to date, no data captured - Youtube - Tukdam talk - An Overview Of CHM’s Work On “Well-Being And Tukdam” - Prof. Richard J. Davidson
  
  trivia - measuring tukdam after death - 24 hour period immediately following death is important but to date, no data captured - Youtube - Tukdam talk - An Overview Of CHM’s Work On “Well-Being And Tukdam” - Prof. Richard J. Davidson
Visit annotations in context

Tags

trivia - measuring tukdam after death - 24 hour period immediately following death is important but to date, no data captured - Youtube - Tukdam talk - An Overview Of CHM’s Work On “Well-Being And Tukdam” - Prof. Richard J. Davidson

Annotators

stopresetgo

URL

youtube.com/watch
Nov 2024
www.youtube.com www.youtube.com

Stanford Seminar - IPFS and the Permanent Web - YouTube

1
1. stopresetgo 15 Nov 2024
  
  in Public
  
  we have all of these huge applications that are gathering all this data uh and it's out there and theoretically is our data sort of but in reality they control it and you can't actually link the data to each other you only link to accessing the data through their application
  
  for - quote - silos - internet limitations - location addressed server architecture limitations - silos - cannot link data from each silo - Juan Benet - IPFS
  
  quote - silos - internet limitations - location addressed server architecture limitations - silos - cannot link data from each silo - Juan Benet - IPFS
Visit annotations in context

Tags

quote - silos - internet limitations - location addressed server architecture limitations - silos - cannot link data from each silo - Juan Benet - IPFS

Annotators

stopresetgo

URL

youtube.com/watch
www.gida-global.org www.gida-global.org

CARE Principles — Global Indigenous Data Alliance

2
1. WHPrivate 13 Nov 2024
  
  in Public
  
  TRSP Desirable Characteristics Indigenous Peoples have the right to develop cultural governance protocols for Indigenous data and be active leaders in the stewardship of, and access to, Indigenous data especially in the context of Indigenous Knowledge
  
  TRSP CARE FAIR, CARE, TRUST - Adoption, Implementation, and Deployment Authority to Control Governance Data Governance
2. WHPrivate 13 Nov 2024
  
  in Public
  
  TRSP Desirable Characteristics Indigenous Peoples have the right to data that are relevant to their world views and empower self-determination and effective self-governance. Indigenous data must be made available and accessible to Indigenous nations and communities in order to support Indigenous governance.
  
  TRSP CARE FAIR, CARE, TRUST - Adoption, Implementation, and Deployment Authority to Control Data for Governance
Visit annotations in context

Tags

Data Governance

TRSP

CARE

FAIR, CARE, TRUST - Adoption, Implementation, and Deployment

Data for Governance

Governance

Authority to Control

Annotators

WHPrivate

URL

gida-global.org/care
www.youtube.com www.youtube.com

"Can we create new senses for humans?" by David Eagleman

1
1. stopresetgo 11 Nov 2024
  
  in Public
  
  you can feel that as you're walking around you can feel that data on your wrist
  
  for - sensory substitution - like a new interoception - new exterocepation - feel the data
  
  sensory substitution - like a new interoception - new exterocepation - feel the data
Visit annotations in context

Tags

sensory substitution - like a new interoception - new exterocepation - feel the data

Annotators

stopresetgo

URL

youtube.com/watch
osf.io osf.io

Let's be clear

1
1. mlenc 11 Nov 2024
  
  in Public
  
  research transparency open science nonprofit data nonprofit sector academic research
Visit annotations in context

Tags

academic research

open science

nonprofit sector

nonprofit data

research transparency

Annotators

mlenc

URL

osf.io/3mfvq/
www.statcan.gc.ca www.statcan.gc.ca

Canadian Statistics Advisory Council 2024 Annual Report - Navigating Social and Technological Change in the National Statistical System

1
1. mlenc 08 Nov 2024
  
  in Public
  
  statscan recommendations admin data data strategy
Visit annotations in context

Tags

recommendations

statscan

data strategy

admin data

Annotators

mlenc

URL

statcan.gc.ca/en/about/relevant/CSAC/report/annual2024
www.canada.ca www.canada.ca

Expert Advisory Group Report 3: Toward a world-class health data system - Canada.ca

2
1. mlenc 08 Nov 2024
  
  in Public
  
  health data health system
2. mlenc 08 Nov 2024
  
  in Public
  
  data strategy statscan data products data linkages admin data recommendations mts homelessness
Visit annotations in context

Tags

data products

data linkages

statscan

health system

admin data

mts

health data

data strategy

homelessness

recommendations

Annotators

mlenc

URL

canada.ca/en/public-health/corporate/mandate/about-agency/external-advisory-bodies/list/pan-canadian-health-data-strategy-reports-summaries/expert-advisory-group-report-03-toward-world-class-health-data-system.html
www.nature.com www.nature.com

AI models collapse when trained on recursively generated data

1
1. chrisaldrich 06 Nov 2024
  
  in Public
  
  AI models collapse when trained on recursively generated data by Ilia Shumailov et al.
  
  ᔥ[[Mathew Lowry]] in AI4Communities post - MyHub Experiments Wiki (accessed:: 2024-11-06 09:43:23)
  
  artificial intelligence models collapse training data
Visit annotations in context

Tags

collapse

artificial intelligence models

training data

Annotators

chrisaldrich

URL

nature.com/articles/s41586-024-07566-y
experiments.myhub.ai experiments.myhub.ai

AI4Communities post - MyHub Experiments Wiki

1
1. chrisaldrich 06 Nov 2024
  
  in Public
  
  the model collapse paper now suggests that the training data created by well-managed communities could be the new currency of collective intelligence.
  
  artificial intelligence training data data ownership collective memory collective intelligence sense making zettelkasten ratchet
Visit annotations in context

Tags

zettelkasten ratchet

collective intelligence

training data

collective memory

data ownership

sense making

artificial intelligence

Annotators

chrisaldrich

URL

experiments.myhub.ai/ai4communities_post
Oct 2024
www.webnerd.me www.webnerd.me

Know and Master Your Social Media Data Flow

1
1. chrisaldrich 23 Oct 2024
  
  in Public
  
  Know and Master Your Social Media Data Flow by [[Louis Gray]]
  
  See commentary at https://boffosocko.com/2017/04/11/a-new-way-to-know-and-master-your-social-media-flow/
  
  social media content distribution reach own your data POSSE examples read Friends of the Link 2024-10-23
Visit annotations in context

Tags

social media

content distribution

own your data

examples

reach

read

POSSE

Friends of the Link 2024-10-23

Annotators

chrisaldrich

URL

webnerd.me/2009/05/know-and-master-your-social-media-data.html
mathewlowry.medium.com mathewlowry.medium.com

A Minimum Viable Ecosystem for collective intelligence

1
1. chrisaldrich 23 Oct 2024
  
  in Public
  
  A Minimum Viable Ecosystem for collective intelligence by [[Mathew Lowry]]
  
  Relation to Louis Gray's 2009 diagram/post: https://boffosocko.com/2017/04/11/a-new-way-to-know-and-master-your-social-media-flow/
  
  social media POSSE own your data content distribution Louis Gray Matthew Lowry Friends of the Link 2024-10-23
Visit annotations in context

Tags

Louis Gray

social media

own your data

content distribution

Matthew Lowry

POSSE

Friends of the Link 2024-10-23

Annotators

chrisaldrich

URL

mathewlowry.medium.com/a-minimum-viable-ecosystem-for-collective-intelligence-7738848ce9c4
en.wikipedia.org en.wikipedia.org

Data Transfer Project - Wikipedia

1
1. chrisaldrich 23 Oct 2024
  
  in Public
  
  https://en.wikipedia.org/wiki/Data_Transfer_Project
  
  see GitHub repo at https://github.com/google/data-transfer-project
  
  Facebook Google Data Liberation Front Microsoft Twitter Apple data portability social media Friends of the Link 2024-10-23 Data Transfer Project
Visit annotations in context

Tags

Apple

Data Transfer Project

social media

Google Data Liberation Front

data portability

Facebook

Twitter

Microsoft

Friends of the Link 2024-10-23

Annotators

chrisaldrich

URL

en.wikipedia.org/wiki/Data_Transfer_Project
www.pulshr.pl www.pulshr.pl

Coś się ruszyło na rynku pracy. Takiego odbicia nie mieliśmy już od dawna | pulshr.pl

1
1. pyxelr 15 Oct 2024
  
  in Public
  
  Marketing and sales also recorded a small increase, of 1 percent, as did the IT sector, of 5 percent.
  
  In September, employers in Poland published 12% more job offers y/y (288 thousand).
  
  The increase has been going on for 4 months, and now it has reached the highest level since March 2022. The largest increase in the number of job offers is in medical professions (24%) and manual workers (13%). Decrease among financiers (-15%), HR specialists (-10%) and lawyers (-7%). In IT, the number of offers increased by 5%, and in marketing and sales by 1%.
  
  work data polish
Visit annotations in context

Tags

polish

data

work

Annotators

pyxelr

URL

pulshr.pl/rekrutacja/cos-sie-ruszylo-na-rynku-pracy-takiego-odbicia-nie-mielismy-juz-od-dawna,108080.html
trailhead.salesforce.com trailhead.salesforce.com

DataRaptor Extract for Type Ahead Block | Salesforce Trailhead

1
1. daraul 11 Oct 2024
  
  in Public
  
  salesforce omnistudio developer omnistudio data mappers tutorial
Visit annotations in context

Tags

tutorial

omnistudio data mappers

salesforce

omnistudio developer

Annotators

daraul

URL

trailhead.salesforce.com/content/learn/modules/omnistudio-data-tools-and-internal-data/build-a-dataraptor-extract-for-a-type-ahead-block
trailhead.salesforce.com trailhead.salesforce.com

DataRaptor Extract Relationship Queries | Salesforce Trailhead

1
1. daraul 11 Oct 2024
  
  in Public
  
  tutorial omnistudio developer omnistudio data mappers
Visit annotations in context

Tags

tutorial

omnistudio data mappers

omnistudio developer

Annotators

daraul

URL

trailhead.salesforce.com/content/learn/modules/omnistudio-data-tools-and-internal-data/extract-data-from-salesforce-objects
Sep 2024
www.researchgate.net www.researchgate.net

(PDF) Weaving a Decentralized Semantic Web of (Personal) Knowledge

1
1. PaulMooney 29 Sep 2024
  
  in Public
  
  Intertwingularity: Linked Data meets Linked Text
  
  QUERY Is there a taxonomy for data, information, insight, knowledge, ontology, wisdom, cosmic knowing, etc.?
  
  Taxonomy for data
Visit annotations in context

Tags

Taxonomy for data

Annotators

PaulMooney

URL

researchgate.net/publication/334126329_Weaving_a_Decentralized_Semantic_Web_of_Personal_Knowledge
www.cnn.com www.cnn.com

What happened to 23andMe? | CNN Business

1
1. tonz 24 Sep 2024
  
  in Public
  
  https://web.archive.org/web/20240924124652/https://edition.cnn.com/2024/09/20/business/23andme-board-resigns-nightcap/index.html
  
  23andMe, the mail-order DNA profiling company, is slowly collapsing. The CEO wants to take it private, and the board has resigned in protest. This is one of the companies that went public through a SPAC to capitalise on the height of the hype. Founded in 2006, SPAC in 2021. Revenue is down, and the money should run out soon. There seems to be no business model beyond the one-time purchase of a DNA test. The key asset is obviously the data, so I think we can expect it to be sold to whoever bids the most.
  
  23andme dna data-assets
Visit annotations in context

Tags

dna

23andme

data-assets

Annotators

tonz

URL

cnn.com/2024/09/20/business/23andme-board-resigns-nightcap/index.html
dl.acm.org dl.acm.org

What Counts as ‘Creative’ Work? Articulating Four Epistemic Positions in Creativity-Oriented HCI Research | Proceedings of the 2024 CHI Conference on Human Factors in Computing Systems

1
1. antonya 21 Sep 2024
  
  in Public
  
  Therefore, similar to Ribes et al. in their study of domain [113], the epistemic positions we propose aim to provide conceptual tools for reasoning about different styles of organizing creativity-oriented research practices in HCI.
  
  David Ribes' work explores the definition of domain in computing and data science; offers insight into how studying domains helps organize computational systems.
  
  previous findings data science computational systems domain definition
Visit annotations in context

Tags

previous findings

domain

definition

computational systems

data science

Annotators

antonya

URL

dl.acm.org/doi/10.1145/3613904.3642854
www.nature.com www.nature.com

Best practices for analysing microbiomes

1
1. pbk1 21 Sep 2024
  
  in Public
  
  poses challenges for many classical methods, such as parametric statistical tests (for example, Student’s t-test and ANOVA) and measures of correlation, including Spearman’s rank correlation, often leading to completely unacceptable false discovery rates above 90%
  
  CoDa (compositional data analysis)
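A minimal sketch of one standard CoDa building block, the centered log-ratio (CLR) transform. The highlighted passage does not name this transform; it is shown here only as an assumed, commonly used way to move relative abundances out of the constrained simplex before applying classical statistics. The function name `clr` and the sample values are illustrative.

```python
import math


def clr(composition):
    """Centered log-ratio: the log of each part minus the mean log of all parts."""
    if any(x <= 0 for x in composition):
        # Zeros need separate handling (for example a pseudocount); not covered here.
        raise ValueError("CLR needs strictly positive parts")
    logs = [math.log(x) for x in composition]
    mean_log = sum(logs) / len(logs)
    return [value - mean_log for value in logs]


# Relative abundances of three hypothetical taxa in one sample.
sample = [0.2, 0.3, 0.5]
transformed = clr(sample)
```

Because only ratios between parts enter the result, rescaling the sample (raw counts instead of proportions, say) leaves the transformed values unchanged, and the transformed values always sum to zero.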
Visit annotations in context

Tags

CoDa (compositional data analysis)

Annotators

pbk1

URL

nature.com/articles/s41579-018-0029-9
pec.ac.uk pec.ac.uk

Creative PEC

1
1. mlenc 19 Sep 2024
  
  in Public
  
  data infrastructure arts data arts infrastructure research infrastructure evidence infrastructure arts research
Visit annotations in context

Tags

research infrastructure

data infrastructure

arts research

arts data

arts infrastructure

evidence infrastructure

Annotators

mlenc

URL

pec.ac.uk/
github.com github.com

ioquatix/synco: A Ruby DSL for running synchronisation and backup tasks.

1
1. TylerRick 11 Sep 2024
  
  in Public
  
  Fingerprint Integration
  
  data fingerprint
Visit annotations in context

Tags

data fingerprint

Annotators

TylerRick

URL

github.com/ioquatix/synco
github.com github.com

ioquatix/fingerprint: Fingerprint is a simple tool that can be used to verify the contents of a directory.

3
1. TylerRick 11 Sep 2024
  
  in Public
  
  Fingerprint is a general purpose data integrity tool that uses cryptographic hashes to detect changes in files and directory trees. The fingerprint command scans a directory tree and generates a fingerprint file containing the names and cryptographic hashes of the files in the tree. This snapshot can be later used to generate a list of files that have been created, deleted or modified. If so much as a single bit in the file data has changed, Fingerprint will detect it.
  
  data integrity tool
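The idea described in the highlight above can be sketched in a few lines. This is not Fingerprint's own code or file format (the tool is written in Ruby); it is a rough Python illustration under assumptions: SHA-256 as the hash, an in-memory dictionary instead of a fingerprint file, and the hypothetical helper names `snapshot` and `compare`.

```python
import hashlib
from pathlib import Path


def snapshot(root):
    """Map each file path (relative to root) to the SHA-256 hex digest of its bytes."""
    root = Path(root)
    digests = {}
    for path in sorted(root.rglob("*")):
        if path.is_file():
            relative = path.relative_to(root).as_posix()
            # Reads the whole file into memory; fine for a sketch, not for huge files.
            digests[relative] = hashlib.sha256(path.read_bytes()).hexdigest()
    return digests


def compare(old, new):
    """Return sorted lists of created, deleted, and modified paths between two snapshots."""
    created = sorted(set(new) - set(old))
    deleted = sorted(set(old) - set(new))
    modified = sorted(p for p in set(old) & set(new) if old[p] != new[p])
    return created, deleted, modified
```

Taking one snapshot before a copy and another on the destination, then calling `compare(before, after)`, yields three empty lists when every file arrived with identical contents; a single changed bit changes the digest and puts the file in the modified list.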
2. TylerRick 11 Sep 2024
  
  in Public
  
  data fingerprint built on: Ruby cryptographic hash function file syncing
3. TylerRick 11 Sep 2024
  
  in Public
  
  In cases where I've been concerned about the migration of data (e.g. copying my entire home directory from one system to another), I've used fingerprint to generate a transcript on the source machine, and then run it on the destination machine, to reassure me that the data was copied correctly and completely.
  
  use case good example backups: verification data fingerprint
Visit annotations in context

Tags

tool

use case

file syncing

built on: Ruby

data integrity

data fingerprint

backups: verification

good example

cryptographic hash function

Annotators

TylerRick

URL

github.com/ioquatix/fingerprint
en.wikipedia.org en.wikipedia.org

Rabin fingerprint - Wikipedia

1
1. TylerRick 11 Sep 2024
  
  in Public
  
  data fingerprint
Visit annotations in context

Tags

data fingerprint

Annotators

TylerRick

URL

en.wikipedia.org/wiki/Rabin_fingerprint
en.wikipedia.org en.wikipedia.org

Fingerprint (computing) - Wikipedia

1
1. TylerRick 11 Sep 2024
  
  in Public
  
  data fingerprint
Visit annotations in context

Tags

data fingerprint

Annotators

TylerRick

URL

en.wikipedia.org/wiki/Fingerprint_(computing)
www.theguardian.com www.theguardian.com

University funding from fossil fuels slowing switch to green energy – report

1
1. HeinzWittenbrink 06 Sep 2024
  
  in Public
  
  The fossil fuel industry has been funding universities for decades, thereby promoting publications that serve its interests, e.g. on false solutions such as #CCS. Background report on the occasion of a new study: https://www.theguardian.com/business/article/2024/sep/05/universities-fossil-fuel-funding-green-energy
  
  Study: https://doi.org/10.1002/wcc.904
  
  Fossilindustrie by: Dharma Noor Jennie Stephens climate obstructionism in.higher education Geoffrey Supran BP negative emission technologies MIT Energy Initiative Princeton University’s Carbon Mitigation Initiative Exxon American Petroleum Institute Data for Progress Emily Eaton Campus Climate Network Jake Lowe Fossil fuel industry influence in higher education: A review and a research agenda Accountable Allies: The Undue Influence of Fossil Fuel Money in Academia disinformation Favourability towards natural gas relates to funding source of university energy centres
Visit annotations in context

Tags

Exxon

by: Dharma Noor

disinformation

Princeton University’s Carbon Mitigation Initiative

Favourability towards natural gas relates to funding source of university energy centres

Geoffrey Supran

BP

Emily Eaton

MIT Energy Initiative

Fossil fuel industry influence in higher education: A review and a research agenda

Campus Climate Network

American Petroleum Institute

climate obstructionism in.higher education

Jennie Stephens

Jake Lowe

negative emission technologies

Data for Progress

Fossilindustrie

Accountable Allies: The Undue Influence of Fossil Fuel Money in Academia

Annotators

HeinzWittenbrink

URL

theguardian.com/business/article/2024/sep/05/universities-fossil-fuel-funding-green-energy
dynamicland.org dynamicland.org

Is Realtalk open source?

1
1. offray 05 Sep 2024
  
  in Public
  
  Realtalk is just one component of a culture, and downloading source code does not download values, norms, practices, and tacit knowledge. We intend the culture to spread in a manner similar to scientific practices, trades and crafts, martial arts, spoken language, and so on — in-person immersion in a community of practice, teachers teaching teachers. This will take time, and it may appear that Realtalk is “exclusive” during that time. But open-source software is also exclusive, to those who find meaning in source code. And those people already seem well-provided for.
  
  There need not be any contradiction between in-person gatherings, which transmit and embody culture, and the digital media through which culture also travels. Our Data Rodas are likewise inspired by a culture of the body, with in-person and virtual gatherings, while also producing code and prose that circulate for those who are not at the face-to-face gatherings.
  
  Data Rodas código abierto
Visit annotations in context

Tags

Data Rodas

código abierto

Annotators

offray

URL

dynamicland.org/2024/Is_Realtalk_open_source/
www.eatingpolicy.com www.eatingpolicy.com

Crime problems are capacity problems

1
1. mlenc 04 Sep 2024
  
  in Public
  
  data architecture social problems social systems social services jennifer pahlka code for america
Visit annotations in context

Tags

social systems

social services

data architecture

code for america

social problems

jennifer pahlka

Annotators

mlenc

URL

eatingpolicy.com/p/crime-problems-are-capacity-problems
Aug 2024
www.mulesoft.com www.mulesoft.com

What is a Single Source of Truth (SSOT) | MuleSoft

1
1. daraul 22 Aug 2024
  
  in Public
  
  I wonder what a source of truth is -- not necessarily an SSOT for the customer/company data, which is what Salesforce could become.
  
  salesforce mulesoft data management single source of truth
Visit annotations in context

Tags

single source of truth

data management

salesforce

mulesoft

Annotators

daraul

URL

mulesoft.com/resources/esb/what-is-single-source-of-truth-ssot
neo4j.com neo4j.com

RDF Triple Stores vs. Labeled Property Graphs: What's the Difference?

2
1. Apiphine 11 Aug 2024
  
  in Public
  
  “Oil and gas is a connected network of processes, people, and infrastructure,” Dalgliesh says. “We are working with a client now modeling saltwater disposal wells. If you turn a valve that decreases the flow in a pipeline, it has a downstream effect on the disposal well. Knowledge graphs built on Neo4j are the perfect abstraction layer to model relationships across this kind of complex network and were a much better fit for reView than triple stores.”
  
  “Oil and gas is a connected network of processes, people, and infrastructure,” ... “We are working with a client now modeling saltwater disposal wells. If you turn a valve that decreases the flow in a pipeline, it has a downstream effect on the disposal well. Knowledge graphs built on Neo4j are the perfect abstraction layer to model relationships across this kind of complex network and were a much better fit for reView than triple stores.” [Jeff Dalgliesh], Chief Technology Officer at Data².
  
  Data² Jeff Dalgliesh CTO NEO NEO4j Data Analysis
2. Apiphine 11 Aug 2024
  
  in Public
  
  “Analysts need to be able to dissect exactly how the AI reached a particular conclusion or recommendation,” says Chief Business Officer Eric Costantini. “Neo4j enables us to enforce robust information security by applying access controls at the subgraph level.”
  
  “Analysts need to be able to dissect exactly how the AI reached a particular conclusion or recommendation,” “Neo4j enables us to enforce robust information security by applying access controls at the subgraph level.” Chief Business Officer Eric Costantini.
  
  neo4j data^2 Triple Stores Evidence AI Data Analysis
Visit annotations in context

Tags

AI

NEO4j

Evidence

Triple Stores

Jeff Dalgliesh

NEO

CTO

data^2

Data²

neo4j

Data Analysis

Annotators

Apiphine

URL

neo4j.com/blog/genai/what-is-graphrag/
www.linkedin.com www.linkedin.com

An Ecology of Communication

1
1. stopresetgo 08 Aug 2024
  
  in Public
  
  Nora reminds us, is be attentive to not what has been said but what the relationship is between what has and has not been said. Life happens in between the stories, not in them.
  
  for - warm data - the silence between words
  
  warm data - the silence between words
Visit annotations in context

Tags

warm data - the silence between words

Annotators

stopresetgo

URL

linkedin.com/pulse/ecology-communication-adam-widawski-zrwtf
www.derstandard.de www.derstandard.de

Kippt der Golfstrom erst in 6000 Jahren?

1
1. HeinzWittenbrink 05 Aug 2024
  
  in Public
  
  https://www.derstandard.at/story/3000000230946/kippt-der-golfstrom-erst-in-6000-jahren
  
  Uncertainties too large to predict tipping times of major Earth system components from historical data
Visit annotations in context

Tags

Uncertainties too large to predict tipping times of major Earth system components from historical data

Annotators

HeinzWittenbrink

URL

derstandard.de/story/3000000230946/kippt-der-golfstrom-erst-in-6000-jahren
www.youtube.com www.youtube.com

Zebra Insights x PODIUM Talks - Keynote speaker Dr. Henning Beck

1
1. stopresetgo 04 Aug 2024
  
  in Public
  
  you can measure data but you cannot measure having an idea you cannot measure Innovation you cannot measure knowledge there's no metric there is no quantifiable scale for knowledge or having an idea you cannot say one meter of knowledge one kilogram of idea
  
  for - comparison - data vs ideas - no metric for ideas
  
  comparison - data vs ideas - no metric for ideas
Visit annotations in context

Tags

comparison - data vs ideas - no metric for ideas

Annotators

stopresetgo

URL

youtube.com/watch
Jul 2024
newamerica.org newamerica.org

What’s Driving the Spread of DPI?

1
1. mrchrisadams 29 Jul 2024
  
  in Public
  
  DPI can be seen as part of a broader effort to reinvent our relationship to the internet—and, more generally, our digital ecosystem. A large part of its normative appeal stems from the “P” in the acronym: the sense that core functionality on the internet (i.e., identity, payments, data exchange) should not merely serve private ends but rather be reimagined as a set of public goods
  
  Who needs Venmo when you have this?
  
  dpi identity payments data exchange
Visit annotations in context

Tags

dpi

identity

payments

data exchange

Annotators

mrchrisadams

URL

newamerica.org/planetary-politics/blog/whats-driving-the-spread-of-dpi/
www.o.team www.o.team

Ending Data Integrations with Solid | O.team - Solid Consulting

1
1. TylerRick 18 Jul 2024
  
  in Public
  
  Today, data is abundant, but for the most part, unusable. Seventy percent of a data scientist’s job is just cleansing data. The modern software architecture encourages data to be hoarded only accessible through proprietary APIs. And, even with proprietary APIs the market for data integrations is expected to grow to a trillion dollars by the end of the decade. When humanity is spending the GDP of Indonesia just so that the data in System X can work with the data in System Y, the field of software engineering has failed us. So much data - data that could be used by new startups and nonprofits that couldn’t exist today - goes unused because it’s so difficult to access.
  
  data hoarding proprietary API free data Solid
Visit annotations in context

Tags

Solid

free data

proprietary API

data hoarding

Annotators

TylerRick

URL

o.team/ending-data-integrations
graphmetrix.com graphmetrix.com

GraphMetrix

1
1. TylerRick 18 Jul 2024
  
  in Public
  
  very organized
  
  linked data Solid knowledge management system
Visit annotations in context

Tags

linked data

knowledge management system

Solid

Annotators

TylerRick

URL

graphmetrix.com/trinpod
github.com github.com

The Computational Democracy Project

1
1. chrisaldrich 17 Jul 2024
  
  in Public
  
  The Computational Democracy Project
  
  We bring data science to deliberative democracy, so that governance may better reflect the multidimensionality of the public's will.
  
  Polis The Computational Democracy Project Democracy data science artificial intelligence apps Friends of the Link 2024-07-17
Visit annotations in context

Tags

artificial intelligence apps

Democracy

Polis

The Computational Democracy Project

data science

Friends of the Link 2024-07-17

Annotators

chrisaldrich

URL

github.com/compdemocracy
www.youtube.com www.youtube.com

Rian Doris & Conor Murphy: Flow — Cultivating optimal experience of life (part 1) - YouTube

1
1. M.AKilic50 09 Jul 2024
  
  in Public
  
  05:00 Conor Murphy is a data strategist at the Flow Research Collective. He also tracks data on himself (quantified self).
  
  data strategy Flow Research Collective quantified self Conor Murphy
Visit annotations in context

Tags

quantified self

Conor Murphy

Flow Research Collective

data strategy

Annotators

M.AKilic50

URL

youtube.com/watch
Jun 2024
docdrop.org docdrop.org

Video: RSPF24 - Session 8 - L’open data de Santé publique France : quelles données pour quels publics ? (DocDrop)

1
1. tanemika 24 Jun 2024
  
  in Public
  
  Video summary [00:00:05] - [00:26:39]:
  
  The video presents a session on Santé publique France's open data, discussing how the data is used by different audiences. It covers the overhaul of the open data strategy, the importance of transparency and collaboration, and the challenges posed by the sensitivity of health data.
  
  Highlights:
  + [00:00:05] Introduction to the session
    * Presentation of the moderators and the open data theme
    * Discussion of how useful the data is to the public
  + [00:01:49] Project to update the open data strategy
    * Background on open data and its principles
    * Challenges specific to health data and its legal protection
  + [00:06:03] Identifying target audiences and methodology
    * Public decision-makers and civil society actors chosen as targets
    * Work organized along four lines to meet their needs
  + [00:10:07] Combined methodological approach
    * Focus groups, surveys, and interviews used to gather information
    * Co-design with stakeholders to build future indicators
  + [00:13:01] Needs of a regional health agency
    * Importance of reliable data for coordination and regulation
    * Use of data for mapping and projecting care needs
  + [00:25:15] Questions and answers
    * Exchange with the audience on the functions of the regional health agency (ARS) and the use of predictive tools
    * Discussion of artificial intelligence and measuring the reliability of predictions
  
  Video summary [00:00:05] - [00:26:39]:
  
  The video presents a session on Santé publique France's open data, discussing how the data is used by different audiences. It covers the overhaul of the open data strategy, the importance of transparency and collaboration, and the challenges posed by the sensitivity of health data.
  
  Highlights:
  + [00:00:05] Introduction to Santé publique France's open data
    * Presentation of the hosts and the session's objectives
    * Discussion of the usefulness of open data indicators
  + [00:01:48] Project to update the open data strategy
    * Context and principles of open data
    * Characteristics of health data and the restrictions that apply to it
  + [00:06:03] Identifying target audiences and methodology
    * Focus on public decision-makers and civil society actors
    * Work organized along four lines to meet their needs
  + [00:12:33] Needs of a regional health agency (Agence Régionale de Santé, ARS)
    * Importance of reliable data for coordination and regulation
    * Projects and data cross-referencing for informed decision-making
  
  Video summary [00:50:28] - [01:15:17]:
  
  The video deals with the use of Santé publique France's open data to improve public health. It covers the challenges of communicating, understanding, and applying the data, particularly below the municipal level, and stresses the importance of choosing relevant indicators for health policy.
  
  Highlights:
  + [00:50:28] Understanding the data
    * Difficulties perceived by residents and associations
    * Importance of training and awareness-raising
  + [00:51:01] Needs and challenges
    * Making the data easier to access and use
    * Security and anonymity when sharing data
  + [00:53:34] Tools and limits
    * Development of tools for accessing the data
    * Examples of tools used in other countries
  
  Video summary [01:15:20] - [01:38:45]:
  
  The video deals with the use of Santé publique France's open data and its importance for various audiences, including health policymakers and journalists. It stresses the need for a political and social approach to environmental health issues, and for collaboration between cities and regions to achieve a coherent health policy.
  
  Highlights:
  + [01:15:20] Local health policies
    * Importance of collaboration between cities and departments
    * Specific municipal actions and broader regional policies
  + [01:17:02] Interregional collaboration
    * Need to work together on shared issues
    * Example of the green corridor network (trame verte) at metropolitan scale
  + [01:20:05] Journalism and health data
    * Impact of COVID-19 on journalists' use of data
    * Importance of the granularity and timeliness of the data
  + [01:35:50] Training journalists
    * Need to diversify the profiles admitted to journalism schools
    * Integrating data management tools into the curriculum
  
  Video summary [01:38:47] - [02:03:42]:
  
  This video presents a session on Santé publique France's open data, discussing the data available to different audiences. The speakers explore the challenges of producing indicators, data mediation, and the balance between publishing quickly and giving users the support they need.
  
  Highlights:
  + [01:38:47] Producing and mediating data
    * Production time cannot be compressed
    * Choice between detailed reports and quickly released aggregate data
    * Dilemma between user support and speed
  + [01:39:47] Data dissemination and expertise
    * Journalists look for regular updates
    * Importance of a brief explanation alongside the data
    * Expert reports for wider dissemination
  + [01:41:11] Training journalists and collaboration
    * Training in the scientific method
    * Collaboration with Santé publique France for accurate information
    * Cities need data below the municipal level
  + [01:47:26] Third wave of open data
    * Working with users around public policy objectives
    * Broadening the audience for data and developing data literacy
    * Importance of knowing the current and potential users of the data
  + [01:58:36] Data access and public health issues
    * Difficulty accessing data below the municipal level
    * Need for partnerships to obtain finer-grained data
    * Sensitive issues around reporting health data back to the public
  
  Video summary [02:03:49] - [02:27:30]:
  
  This part of the video discusses Santé publique France's open data and the identification of audiences that need specific data. It covers the challenges of mediation and of defining data needs for various sectors, including health and the environment.
  
  Highlights:
  + [02:03:49] Identifying data needs
    * Difficulty of dialogue and mediation between data providers and users
    * Importance of clearly defining the data needs of public policy
  + [02:08:59] Examples of inaccessible data
    * Lack of local-level vaccination data during COVID-19
    * Difficulty for cities in obtaining school health data
  + [02:17:01] Creating new data for public policy
    * Need to produce relevant data to address specific problems
    * Example of the cycling-friendly cities barometer (baromètre des villes cyclables) for assessing cyclability
  + [02:22:02] Health data literacy and obstacles to opening data
    * Importance of training for understanding how data is produced and collected
    * Challenges of making the data accessible and useful to the general public
  
  Video summary [02:27:32] - [02:42:03]:
  
  The video addresses the importance of Santé publique France's open data and the challenges of collecting, documenting, and using the data for various audiences. It stresses the need for clear data documentation and for mediation that helps users understand and use the data ethically and effectively.
  
  Highlights:
  + [02:27:32] The ethics of open data
    * Discussion of whether it is ethically appropriate to detail citizens' state of health
    * Importance of transparency and accountability in data collection
  + [02:28:02] Documenting the data
    * Presentation of the "datasheet for datasets" as a standardized form of documentation
    * Importance of documenting the collection process and the context in which the data was produced
  + [02:30:29] Needs of local and regional users
    * Lack of precise knowledge about the health of local populations
    * Example of a city needing data to respond to a health care situation
  + [02:33:48] The City of Paris's approach
    * Creation of health profiles below the municipal level to meet the needs of local actors
    * Participatory process involving elected officials and health partners to identify relevant indicators
  
  Santé publique France conférence webinaire RSPF 2024 open data politique publique données
Visit annotations in context

Tags

RSPF

2024

open data

webinaire

Santé publique France

données

conférence

politique publique

Annotators

tanemika

URL

docdrop.org/video/FlC_8LcjIUE/
www.geeksforgeeks.org www.geeksforgeeks.org

What is Data Structure: Types, Classifications and Applications - GeeksforGeeks

1
1. lissyM 20 Jun 2024
  
  in Public
  
  Data structures are an integral part of computers used for the arrangement of data in memory.
  
  The importance of data structures (related to memory)
  
  Computer Science Self-study School studies Year 12 1.4.2 Data structures
Visit annotations in context

Tags

Computer Science

Year 12

Self-study

1.4.2 Data structures

School studies

Annotators

lissyM

URL

geeksforgeeks.org/what-is-data-structure-types-classifications-and-applications/
investinopen.org investinopen.org

Foreword

1
1. mlenc 14 Jun 2024
  
  in Public
  
  report open infrastructure grantmaking infrastructure open data open science
Visit annotations in context

Tags

open infrastructure

open science

infrastructure

report

grantmaking

open data

Annotators

mlenc

URL

investinopen.org/state-of-open-infrastructure-2024/sooi-foreword-2024/
www.derstandard.de www.derstandard.de

Eines der stärksten Treibhausgase nimmt rasant zu

1
1. HeinzWittenbrink 12 Jun 2024
  
  in Public
  
  https://www.derstandard.de/story/3000000224037/eines-der-staerksten-treibhausgase-nimmt-rasant-zu
  
  Global Carbon Project Hanqin Tian Earth System Science Data: Global Nitrous Oxide Budget 1980-2020." Earth System Science Data: Global Nitrous Oxide Budget 1980-2020.
Visit annotations in context

Tags

Earth System Science Data: Global Nitrous Oxide Budget 1980-2020.

Hanqin Tian

Earth System Science Data: Global Nitrous Oxide Budget 1980-2020."

Global Carbon Project

Annotators

HeinzWittenbrink

URL

derstandard.de/story/3000000224037/eines-der-staerksten-treibhausgase-nimmt-rasant-zu
www.nature.com www.nature.com

The TRUST Principles for digital repositories

1
1. WHPrivate 07 Jun 2024
  
  in Public
  
  TRSP Desirable Characteristics
  
  TRSP-TRUST-Sensitive Data
Visit annotations in context

Tags

TRSP-TRUST-Sensitive Data

Annotators

WHPrivate

URL

nature.com/articles/s41597-020-0486-7
www.belfercenter.org www.belfercenter.org

Seeing Like a Data Structure

2
1. almereyda 02 Jun 2024
  
  in Public
  
  Tension
  
  The ability to see like a data structure afforded us the technology we have today. But it was built for and within a set of societal systems—and stories—that can’t cope with nebulosity. Worse still is the transitional era we’ve entered, in which overwhelming complexity leads more and more people to believe in nothing. That way lies madness. Seeing is a choice, and we need to reclaim that choice. However, we need to see things and do things differently, and build sociotechnical systems that embody this difference.
  
  This is best seen through a small example. In our jobs, many of us deal with interpersonal dynamics that sometimes overwhelm the rules. The rules are still there—those that the company operates by and laws that it follows—meaning there are limits to how those interpersonal dynamics can play out. But those rules are rigid and bureaucratic, and most of the time they are irrelevant to what you’re dealing with. People learn to work with and around the rules rather than follow them to the letter. Some of these might be deliberate hacks, ones that are known, and passed down, by an organization’s workers. A work-to-rule strike, or quiet quitting for that matter, is effective at slowing a company to a halt because work is never as routine as schedules, processes, leadership principles, or any other codified rules might allow management to believe.
  
  The tension we face is that on an everyday basis, we want things to be simple and certain. But that means ignoring the messiness of reality. And when we delegate that simplicity and certainty to systems—either to institutions or increasingly to software—they feel impersonal and oppressive. People used to say that they felt like large institutions were treating them like a number. For decades, we have literally been numbers in government and corporate data structures.
  
  Breakdown
  
  As historian Jill Lepore wrote, we used to be in a world of mystery. Then we began to understand those mysteries and use science to turn them into facts. And then we quantified and operationalized those facts through numbers. We’re currently in a world of data—overwhelming, human-incomprehensible amounts of data—that we use to make predictions even though that data isn’t enough to fully grapple with the complexity of reality.
  
  How do we move past this era of breakdown? It’s not by eschewing technology. We need our complex socio-technical systems. We need mental models to make sense of the complexities of our world. But we also need to understand and accept their inherent imperfections. We need to make sure we’re avoiding static and biased patterns—of the sort that a state functionary or a rigid algorithm might produce—while leaving room for the messiness inherent in human interactions. Chapman calls this balance “fluidity,” where society (and really, the tech we use every day) gives us the disparate things we need to be happy while also enabling the complex global society we have today.
  
  rigidity bureaucracy data structure technology society narratives nebulosity uncertainty socio-technical systems facts quantification operationalisation breakdown complexity fluidity
2. almereyda 02 Jun 2024
  
  in Public
  
  To boost its search engine rankings, Thai Food Near Me, a New York City restaurant, is named after a search term commonly used by potential customers. It’s a data layer on top of reality. And the problems get worse when the relative importance of the data and reality flip. Is it more important to make a restaurant’s food taste better, or just more Instagrammable? People are already working to exploit the data structures and algorithms that govern our world. Amazon drivers hang smartphones in trees to trick the system. Songwriters put their catchy choruses near the beginning to exploit Spotify’s algorithms. And podcasters deliberately mispronounce words because people comment with corrections and those comments count as “engagement” to the algorithms.
  
  These hacks are fundamentally about the breakdown of “the system.” (We’re not suggesting that there’s a single system that governs society but rather a mess of systems that interact and overlap in our lives and are more or less relevant in particular contexts.)
  
  reality intent data structure algorithm systems
Visit annotations in context

Tags

algorithm

rigidity

narratives

operationalisation

nebulosity

facts

complexity

systems

reality

uncertainty

fluidity

bureaucracy

society

technology

quantification

socio-technical

breakdown

intent

data structure

Annotators

almereyda

URL

belfercenter.org/publication/seeing-data-structure
May 2024
www.bloomberg.com www.bloomberg.com

Is Net Zero by 2050 Still Possible? Yes, But It’ll Cost 19% More

1
1. HeinzWittenbrink 28 May 2024
  
  in Public
  
  Governments and companies need to spend an extra $34 trillion on the clean energy transition between now and 2050 to reach net-zero emissions, according to BloombergNEF.
  
  The costs of the energy transition are significantly higher than previously assumed.
  
  data points climate finance BloombergNEF energy transition expert: David Hostert
Visit annotations in context

Tags

BloombergNEF

data points

climate finance

energy transition

expert: David Hostert

Annotators

HeinzWittenbrink

URL

bloomberg.com/news/articles/2024-05-21/key-takeaways-from-bloombergnef-s-new-energy-outlook
media.dltj.org media.dltj.org

Video: Linked Data in Production: Moving Beyond Ontologies by CNI Spring Meeting 2024, annotated

1
1. peter_murray 27 May 2024
  
  in Public
  
  Linked Data in Production: Moving Beyond Ontologies
  
  Spring 2024 Member Meeting: CNI website • YouTube
  
  David Newbury Assistant Director, Software and UX Getty
  
  Over the past six years, Getty has been engaged in a project to transform and unify its complex digital infrastructure for cultural heritage information. One of the project’s core goals was to provide validation of the impact and value of the use of linked data throughout this process. With museum, archival, media, and vocabularies in production and others underway, this session shares some of the practical implications (and pitfalls) of this work—particularly as it relates to interoperability, discovery, staffing, stakeholder engagement, and complexity management. The session will also share examples of how other organizations can streamline their own, similar work going forward.
  
  http://getty.edu/art/collection/ http://getty.edu/research/collections/ http://vocab.getty.edu https://www.getty.edu/projects/remodeling-getty-provenance-index/
  
  linked data
Visit annotations in context

Tags

linked data

Annotators

peter_murray

URL

media.dltj.org/annotated-video/20240527T175053-ApFnMJR4_lM-linked-data-production-moving-beyond-ontologies/index.html
clippings.io clippings.io

Export your Kindle Highlights

1
1. chrisaldrich 22 May 2024
  
  in Public
  
  https://clippings.io/
  
  I've used this before, but never added it to the collection.
  
  Amazon Kindle annotation tools data export highlights Clippings.io Friends of the Link 2024-05-22
Visit annotations in context

Tags

Clippings.io

data export

Amazon Kindle

highlights

Friends of the Link 2024-05-22

annotation tools

Annotators

chrisaldrich

URL

clippings.io/
readwise.io readwise.io

Bookcision: Export/Download Your Kindle Highlights

1
1. chrisaldrich 22 May 2024
  
  in Public
  
  https://readwise.io/bookcision
  
  Links to: https://hypothes.is/a/9bHHZhhsEe-3LgOZEkaQcg
  
  bookcision Readwise Amazon Kindle highlights data export tools annotation tools
Visit annotations in context

Tags

bookcision

data export

annotation tools

Amazon Kindle

Readwise

tools

highlights

Annotators

chrisaldrich

URL

readwise.io/bookcision
github.com github.com

GitHub - TristanH/bookcision

1
1. chrisaldrich 22 May 2024
  
  in Public
  
  An old (6 years) project, maintained by Readwise, for exporting Kindle highlights: https://github.com/tristanh/bookcision (via PK)
  
  see also: https://clippings.io/
  
  Amazon Kindle highlights Readwise data export Friends of the Link 2024-05-22 Clippings.io bookcision
Visit annotations in context

Tags

Readwise

bookcision

Clippings.io

data export

Amazon Kindle

highlights

Friends of the Link 2024-05-22

Annotators

chrisaldrich

URL

github.com/TristanH/bookcision
archive.org archive.org

Stack Exchange Data Dump : Stack Exchange, Inc. : Free Download & Streaming : Internet Archive

1
1. TylerRick 15 May 2024
  
  in Public
  
  archive Creative Commons Attribution-ShareAlike StackExchange data dump
Visit annotations in context

Tags

archive

Creative Commons Attribution-ShareAlike

data dump

StackExchange

Annotators

TylerRick

URL

archive.org/details/stackexchange
www.derstandard.at www.derstandard.at

Billionenkredite für fossile Großprojekte: Wie Banken die Klimakrise mitfinanzieren

1
1. HeinzWittenbrink 14 May 2024
  
  in Public
  
  Since the Paris Agreement, the 60 largest banks have financed 425 major fossil fuel projects (so-called carbon bombs, each expected to emit more than one gigatonne of CO2) with a total of 1.8 trillion dollars. The Standard article stems from a project that analyzes and visualizes data from the Carbon Bombs project, Global Energy Monitor, and Banking on Climate Chaos. https://www.derstandard.at/story/3000000193065/billionenkredite-fuer-fossile-grossprojekte-wie-banken-die-klimakrise-mitfinanzieren
  
  Report/visualization: https://www.carbonbombs.org/
  
  2023-10-31 fossil fuel finance fossil expansion banks by: Alicia Prager by: Philip Prayer by: Anastasia Trenkler Libya El Sharara TotalEnergies OMV Repsol Equinor carbon bombs éclaircies Data for Good Permian basin fracking Marcellus Shale BP USA Saudi Arabia Abdulaziz bin Salman China Stranded fossil-fuel assets translate to major losses for investors in advanced economies stranded fossil fuel assets ICBC JPMorgan Citi BNP Paribas Unicredit Net Zero Banking Alliance Eni Gazprom Deutsche Bank Shell ExxonMobil Bill Farren-Price Oxford Institute for Energy Studies
Visit annotations in context

Tags

Repsol

fossil fuel finance

JPMorgan

stranded fossil fuel assets

BNP Paribas

Data for Good

fossil expansion

by: Alicia Prager

USA

OMV

Citi

Bill Farren-Price

ExxonMobil

Oxford Institute for Energy Studies

Shell

Abdulaziz bin Salman

fracking

Net Zero Banking Alliance

China

El Sharara

Libya

Eni

Saudi Arabia

Stranded fossil-fuel assets translate to major losses for investors in advanced economies

BP

Marcellus Shale

banks

by: Philip Prayer

éclaircies

Deutsche Bank

ICBC

Equinor

Gazprom

2023-10-31

carbon bombs

Unicredit

by: Anastasia Trenkler

Permian basin

TotalEnergies

Annotators

HeinzWittenbrink

URL

derstandard.at/story/3000000193065/billionenkredite-fuer-fossile-grossprojekte-wie-banken-die-klimakrise-mitfinanzieren
spec.matrix.org spec.matrix.org

Matrix Specification

1
1. zhurovandrew 06 May 2024
  
  in Public
  
  Sending and receiving extensible messages
  
  Better: text messages & structured stuff (data).
  
  Matrix enhancement structured data
Visit annotations in context

Tags

Matrix

structured data

enhancement

Annotators

zhurovandrew

URL

spec.matrix.org/latest/
www.theguardian.com www.theguardian.com

Methane emissions from gas flaring being hidden from satellite monitors

1
1. HeinzWittenbrink 02 May 2024
  
  in Public
  
  In the countries that joined an initiative in Paris in 2015 against the burning of unused natural gas (flaring), open-flame burning is often merely replaced by combustion in enclosed facilities, an investigative journalism report found. The volume of emissions does not fall substantially as a result, but these facilities are not externally visible to satellites. https://www.theguardian.com/environment/2024/may/02/methane-emissions-gas-flaring-hidden-satellite-monitors-oil-gas
  
  Resources for researching methane emissions: https://gijn.org/resource/new-tools-investigate-methane-emissions/
  
  2024-05-02 by: Tim Brown by: Christina Last topic: flaring institution: Zero Routine Flaring 2030 initiative institution: World Bank expert: Tim Doty data source: Visible Infrared Imaging Radiometer Suite NGO: Earthworks institution: Carbon Mapper expert: Eric Kort actor: Fulcrum Energy Capital Funds NGO: Carbon Mapper actor: Ineos actor: ArcelorMittal expert: Zubin Bamji institution: Journalismfund Europe NGO: Arena Climate Network process: methane reduction event: Investigative Research about methane emissions April 2024
Visit annotations in context

Tags

topic: flaring

by: Christina Last

2024-05-02

data source: Visible Infrared Imaging Radiometer Suite

actor: Ineos

actor: ArcelorMittal

NGO: Arena Climate Network

institution: World Bank

expert: Eric Kort

NGO: Carbon Mapper

process: methane reduction

institution: Journalismfund Europe

by: Tim Brown

institution: Carbon Mapper

institution: Zero Routine Flaring 2030 initiative

actor: Fulcrum Energy Capital Funds

expert: Zubin Bamji

event: Investigative Research about methane emissions April 2024

NGO: Earthworks

expert: Tim Doty

Annotators

HeinzWittenbrink

URL

theguardian.com/environment/2024/may/02/methane-emissions-gas-flaring-hidden-satellite-monitors-oil-gas
Apr 2024
identity.foundation identity.foundation

DIDComm Messaging Specification v2.1

1
1. zhurovandrew 27 Apr 2024
  
  in Public
  
  web security is provided at the transport level (TLS); it is not an independent attribute of the messages themselves
  
  I.e., on the web, the parties at the two ends of an encrypted channel authorize each other, whereas the data passed between them has no such authorization built in.
  
  Taking the reverse approach, akin to putting locks on the data rather than on the channel, we can have authorization on the data and not the channel.
  
  DIDComm authorization on data
Visit annotations in context

Tags

authorization on data

DIDComm

Annotators

zhurovandrew

URL

identity.foundation/didcomm-messaging/spec/v2.1/
www.oecd-ilibrary.org www.oecd-ilibrary.org

Home

1
1. trianta 25 Apr 2024
  
  in Public
  
  Several digital reforms projects sometimes require the same data to be collected multiple times depending on the authority in charge. Better cooperation between data owners (ministries and their departments) can ensure that data is collected systematically and is fit for future use.
  
  silos gg data digital transformation
Visit annotations in context

Tags

silos

data

digital transformation

gg

Annotators

trianta

URL

oecd-ilibrary.org/sites/gov-2022-668-en/index.html
