The Three Laws of Robotics, and how the stories actually go

Isaac Asimov’s Three Laws of Robotics - a robot may not harm a human or through inaction allow a human to come to harm; a robot must obey human orders except where they conflict with the First Law; a robot must protect its own existence except where that conflicts with the first two - are probably the most widely known idea in fiction about machine ethics. They are quoted in debates about AI safety, invoked in product marketing, and treated by many people as a kind of common-sense starting point for how an intelligent machine ought to be built. They appear, in that canonical form, in Asimov’s 1950 story collection “I, Robot,” attributed in-story to a fictional Handbook of Robotics.

The popular memory has the Laws backwards. Asimov did not write the stories to show three tidy rules keeping robots safe. He wrote them to show what happens when you try to govern behavior with a small set of fixed rules: the rules interact, conflict, and produce outcomes their designers never intended. In “Runaround,” the story that first stated the Laws, a robot gets stuck in a loop circling a danger because the Second Law (obey) and the Third Law (self-preservation) come into near-perfect balance and paralyze it. In “Liar!”, a robot that can read minds lies to people because telling the truth would hurt their feelings, which the First Law forbids. Across the collection the Laws are a puzzle box, and each story turns on a way they break.

That is the part the culture dropped. Reduced to a slogan, the Three Laws sound like a solved problem - write the right rules and the machine is safe. Read as Asimov wrote them, they are an extended argument that simple rules cannot capture what we actually want, that the gaps between a stated objective and a desired outcome are where the trouble lives, and that a sufficiently capable rule-follower will find the gaps. That is strikingly close to how present-day researchers describe the difficulty of specifying goals for capable AI systems - the worry is not that a machine ignores its instructions but that it follows them to places nobody intended.

So the Three Laws earned their place in the public imagination twice over: first as the reassuring frame everyone half-remembers, and second, for anyone who reads the stories, as a careful demonstration of why that frame does not work. The library records the cultural fact - that these three sentences became the default vocabulary for robot ethics - while pointing back to the primary text, where the Laws are interesting precisely because they fail.

The Three Laws of Robotics, and how the stories actually go

Sources

Related