English edit

Noun edit

wireheading (uncountable)

  1. The use of direct brain interfaces.
  2. The strategy of meeting goals by altering the perception of the current state rather than changing the state itself.
    • 2016 June 25, Tom Everitt, Marcus Hutter, “Avoiding Wireheading with Value Reinforcement Learning”, in Lecture Notes in Computer Science[1], volume 9782, Springer, →DOI, pages 12–22:
      The constraint is defined in terms of the agent's belief distributions, and does not require an explicit specification of which actions constitute wireheading.
    • 2019, Stuart J. Russell, Human Compatible: Artificial Intelligence and the Problem of Control, Penguin, →ISBN, page 206:
      The tendency of animals to short-circuit normal behavior in favor of direct stimulation of their own reward system is called wireheading.