logo-home

Rahulsantra

On this page, you find all documents, package deals, and flashcards offered by seller rahulsantra.

Community

  • Followers
  • Following

1 items

y Kind Machines

(0)
$5.49
0x  sold

Artificial Intelligence systems are rapidly evolving, integrating extrinsic and intrinsic motivations. While these frameworks offer benefits, they risk misalignment at the algorithmic level while appearing superficially aligned with human values. In this paper, we argue that an intrinsic motivation for kindness is crucial for making sure these models are intrinsically aligned with human values. We argue that kindness, defined as a form of altruism motivated to maximize the reward of others,...

i x
  • Thesis
  •  • 8 pages • 
  • by rahulsantra • 
  • uploaded  08-11-2024
Quick View
i x