MAPPING SOCIAL CHOICE THEORY TO RLHF Jessica Dai and Eve Fleisig ICLR Workshop on Reliable and Responsible Foundation Models 2024
Nice overview of how social choice theory has been connected to RLHF and AI alignment ideas.
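A minimal sketch of the kind of question at the heart of that mapping: the same set of annotator rankings over candidate model responses can produce different "winning" responses depending on which social choice rule aggregates them (plurality vs. Borda count here). This is my illustration, not an example from the paper; the rankings and response labels are hypothetical.

```python
# Sketch: aggregating hypothetical annotator rankings of three candidate
# responses ("A", "B", "C") with two different social choice rules.
from collections import Counter

# Each annotator ranks the responses best-first (hypothetical data).
rankings = [
    ["A", "B", "C"],  # annotators 1-4 put A first
    ["A", "B", "C"],
    ["A", "B", "C"],
    ["A", "B", "C"],
    ["B", "C", "A"],  # annotators 5-7 put B first
    ["B", "C", "A"],
    ["B", "C", "A"],
    ["C", "B", "A"],  # annotators 8-9 put C first
    ["C", "B", "A"],
]

def plurality_winner(rankings):
    """Response ranked first by the most annotators."""
    first_choices = Counter(r[0] for r in rankings)
    return first_choices.most_common(1)[0][0]

def borda_winner(rankings):
    """Response with the highest Borda score (best rank gets n-1 points)."""
    n = len(rankings[0])
    scores = Counter()
    for ranking in rankings:
        for position, response in enumerate(ranking):
            scores[response] += n - 1 - position
    return scores.most_common(1)[0][0]

print("Plurality winner:", plurality_winner(rankings))  # A (most first-place votes)
print("Borda winner:    ", borda_winner(rankings))      # B (broad second-place support)
```

The point of the sketch: before any reward model is trained, the choice of aggregation rule over human preferences already encodes a normative decision.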
Nobody told it what to do. That's the kind of really amazing and frightening thing about these situations. When Facebook gave the algorithm the aim of increasing user engagement, the managers of Facebook did not anticipate that it would do it by spreading hateful conspiracy theories. This is something the algorithm discovered by itself. The same with the CAPTCHA puzzle, and this is the big problem we are facing with AI.
for - AI - progress trap - example - Facebook AI algorithm - target - increase user engagement - by spreading hateful conspiracy theories - AI did this autonomously - no morality - Yuval Noah Harari story
When OpenAI developed GPT-4 and they wanted to test what this new AI can do, they gave it the task of solving CAPTCHA puzzles. It's these puzzles you encounter online when you try to access a website and the website needs to decide whether you're a human or a robot. Now, GPT-4 could not solve the CAPTCHA, but it accessed a website, TaskRabbit, where you can hire people online to do things for you, and it wanted to hire a human worker to solve the CAPTCHA puzzle.
for - AI - progress trap - example - no morality - OpenAI - GPT-4 - could not solve CAPTCHA - so hired human at TaskRabbit to solve - Yuval Noah Harari story
T. Herlau, "Moral Reinforcement Learning Using Actual Causation," 2022 2nd International Conference on Computer, Control and Robotics (ICCCR), Shanghai, China, 2022, pp. 179-185, doi: 10.1109/ICCCR54399.2022.9790262. Keywords: Digital control; Ethics; Costs; Philosophical considerations; Reinforcement learning; Actual causation; Ethical reinforcement learning.
Can model-free reinforcement learning explain deontological moral judgments? Alisabeth Ayars, University of Arizona, Dept. of Psychology, Tucson, AZ, USA.
That's the way computers are learning today. 00:02:35 We basically write algorithms that allow computers to understand those patterns… And then we get them to try and try and try. And through pattern recognition, through billions of observations, they learn. They're learning by observing. And what are they observing? They're observing a world that's full of greed, disregard for other species, violence, ego, 00:03:05 showing off. The only way for them to be not only intelligent but also to have the right value set is that we start to portray that right value set today. THE PROBLEM IS UNHAPPINESS
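A minimal sketch (my illustration, not from the talk) of the trial-and-error learning the speaker describes: the learner improves purely from the feedback signal it observes, so it ends up valuing whatever that feedback rewards. The action names and reward probabilities are hypothetical, loosely echoing the engagement example above.

```python
# Sketch: a simple two-armed bandit learner. It has no notion of values
# beyond the observed reward signal; it just tries, observes, and updates.
import random

random.seed(0)

actions = ["polite_reply", "outrage_bait"]
# Hypothetical environment: the feedback signal (e.g., engagement)
# happens to reward outrage more often than politeness.
reward_prob = {"polite_reply": 0.3, "outrage_bait": 0.7}

value = {a: 0.0 for a in actions}   # learned value estimate per action
counts = {a: 0 for a in actions}

for step in range(10_000):
    # Try: mostly pick the best-looking action, sometimes explore.
    if random.random() < 0.1:
        action = random.choice(actions)
    else:
        action = max(actions, key=value.get)
    # Observe: the only signal is the environment's feedback.
    reward = 1.0 if random.random() < reward_prob[action] else 0.0
    # Learn: incremental average of observed rewards for that action.
    counts[action] += 1
    value[action] += (reward - value[action]) / counts[action]

print(value)  # the learner ends up preferring whatever the feedback rewards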
Even though the existential threats are possible, you're concerned with what humans teach. I'm concerned 00:07:43 with humans with AI.
A nefarious controller of AI presumably could teach it to be immoral.