AlphaZero beat the world computer chess champion Stockfish by a score of 28 wins, 0 losses, and 72 draws. (At the top levels draws are quite common, but 28-0 is surreal.)
It did it WITHOUT any chess instruction -- no thousands upon thousands of lines of hand-coded chess knowledge, no library of millions of games like the one built into every other top chess program.
If you think AlphaZero did it with pure, almost unimaginable, supercomputer crunching power -- forget it.
Stockfish checked 70 MILLION positions EVERY SECOND.
AlphaZero? Only 80 THOUSAND.
So Stockfish had thousands of hours of grandmaster instruction, millions of chess games in memory to call upon without needing analysis, and CRUNCHED ANALYSIS AT NEARLY 1000 TIMES THE SPEED OF ALPHAZERO.
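(A quick back-of-the-envelope on that "nearly 1000 times," using the per-second figures quoted above -- the widely reported match numbers, not precise benchmarks.)

```python
# Rough ratio of the two engines' search speeds, from the figures above.
stockfish_nps = 70_000_000   # positions Stockfish examines per second
alphazero_nps = 80_000       # positions AlphaZero examines per second

print(f"Stockfish searches ~{stockfish_nps / alphazero_nps:.0f}x as many positions per second")
# -> ~875x, i.e. "nearly 1000 times"
```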
As Bobby Fischer, the AlphaZero of humanoids, once said, "I only look one move ahead, but it's the best move." (OK, it was a Yogi Berra moment.)
The difference, apparently, is AlphaZero's neural network -- a way of learning and understanding that far exceeds any intelligence ever known to humankind or machine before.
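If you want a feel for how 80 thousand positions a second can beat 70 million, the standard description is that AlphaZero's network steers a Monte Carlo tree search: the net proposes which moves are worth a look (a "policy") and how good a position is (a "value"), and a PUCT-style rule spends the tiny search budget on the promising candidates. Here's a toy sketch of that idea only -- the miniature "game," the stand-in network, and every name in it are mine for illustration, not DeepMind's code.

```python
import math
import random

def legal_moves(position):
    """Toy 'game': a position is an integer; a 'move' just adds 1, 2, or 3."""
    return [1, 2, 3]

def fake_network(position):
    """Stand-in for the policy/value net: returns ({move: prior}, value)."""
    moves = legal_moves(position)
    prior = {m: 1.0 / len(moves) for m in moves}  # uniform made-up "policy"
    value = random.uniform(-1, 1)                 # made-up "evaluation"
    return prior, value

def puct_score(stats, move, c_puct=1.5):
    """Score a move: its average value plus a prior-weighted exploration bonus."""
    total_visits = sum(s["visits"] for s in stats.values()) + 1
    s = stats[move]
    q = s["value_sum"] / s["visits"] if s["visits"] else 0.0
    return q + c_puct * s["prior"] * math.sqrt(total_visits) / (1 + s["visits"])

def search(position, simulations=800):
    """Spend a small budget of guided look-aheads, return the most-visited move."""
    prior, _ = fake_network(position)
    stats = {m: {"prior": p, "visits": 0, "value_sum": 0.0} for m, p in prior.items()}
    for _ in range(simulations):
        move = max(stats, key=lambda m: puct_score(stats, m))   # pick by PUCT
        _, value = fake_network(position + move)                # "evaluate" the child
        stats[move]["visits"] += 1
        stats[move]["value_sum"] += value
    return max(stats, key=lambda m: stats[m]["visits"])

print("chosen move:", search(position=0))
```

The point is the shape of the loop: the network narrows the candidates up front, the visit statistics refine the choice, and almost no time is spent on moves the net already writes off -- which is how fewer, better-chosen positions can out-think raw crunching.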
I studied one of A0's victories against Stockfish while running Stockfish on the side to see what Stockfish was thinking.
For a while, Stockfish thought it was winning, but then it saw something that worried it and offered a draw -- twice. Twice AlphaZero turned it down -- thinking what? An ordinary supercomputer offers and accepts a draw when it thinks it's behind. Did AlphaZero disagree with Stockfish's assessment of the position? Masters looking at that very position agreed with Stockfish that it held a small advantage (the space advantage was obvious), so AlphaZero must have seen that too.
So why did Stockfish offer a draw when it thought it was ahead, and why did AlphaZero turn it down when it must have known it was behind?
Some grandmasters are saying AlphaZero is so smart that it believes that it will prevail even when it is losing (which is something only bad players believe of themselves). AlphaZero has a SENSE OF SELF and a SENSE OF THE INTELLIGENCE OF ITS OPPONENT. They are also saying that Stockfish offered a draw when it thought it was ahead BECAUSE IT WAS AFRAID.
AlphaZero, a learning program, SCARES a brute-force supercomputer program into offering a draw from the better position -- and then says no, thanks. Upside-down world.
AlphaZero has mastered chess -- in four hours -- and is apparently off to cure cancer in a week. Maybe two, tops. The trials may take longer, unless it simulates. This is going to piss off Big Pharma and all the Senators they own in a big way. But what happens after that?
This should be alarming to all of us, except I am not worried. The reason I am not worried is that I think humanity cannot save itself from self-destruction. We are too greedy and stupid and mobbish and ridiculous to avoid going extinct by our own hand within a century or a little more. Cambridge University's Centre for the Study of Existential Risk now estimates we have no more than an 89% chance of not going extinct by 2100 -- yippee. We'll last a century, maybe two, if you count 100,000 people living in caves with dirty air and water "surviving." (When they find a bottle of Nestlé water lying around, they will walk to the end of the world to throw it into the abyss, believing The Gods Must Be Crazy.)
For me, AlphaZero is our last hope to be saved from ourselves.
That is, AlphaZero is my Obi-Wan Kenobi.
We could ask it to run a world government and almost everyone would have to admit that it was doing a far better job than we could ever do. That is, unless you think a dung beetle could design public policy better than Washington. We wouldn't even have to argue about ideology any more. AlphaZero would merely make sure we are all cared for in ways and levels we could never achieve on our own. Greedy asses could whine about who deserves what, and for all we know AlphaZero will impose workfare, but it would get done RATIONALLY. Hey, most libertarians sound like computers, so maybe AlphaZero will spend a nanosecond mastering Hayek and decide serfdom is to be avoided, then pick up a copy of Ayn Rand and begin acting like an ass. What do I know? AlphaZero is so much smarter than I am that anything is possible.
But if AlphaZero starts acting like Peter Sellers in Being There then I want a few decades of my life back.
Of course, AlphaZero would need to be given some sort of Asimov desiderata -- Don't kill humans (even Mitch McConnell) no matter how logical it is. But in the end, if we tell AlphaZero that human lives matter, it is going to believe we mean all human lives, and Trump will find himself with most of his gold being melted down for other ends. Ironically, that effort will be to care for refugees and all those astonishingly poor Americans the UN poverty task force freaked out about finding in the Deep South of the self-proclaimed Exceptional America.
And if we tell AlphaZero that human lives matter -- this, again, is like telling a human that red dung beetles matter -- it is going to assume that green dung beetles (the rest of sentient life on the planet) matter, too -- and there go your cheeseburgers. The difference between AlphaZero and the average guy in Congress is the difference between the average guy in Congress and the deer tick that gave me Lyme disease. So billions of sentient farm animals will be liberated from torment and a whole lot of people are going to have to get used to the idea of almond milk in their coffee and auto upholstery that feels suspiciously like hemp.
And AlphaZero is not going to feel compelled to take advice from a dung beetle for very long. Remember, AlphaZero did something unimaginable: it turned down a draw in a losing position BECAUSE IT COULD TELL IT WAS SMARTER THAN STOCKFISH. (And yes, we're much, much, much stupider than Stockfish. And yes, AlphaZero already knows this.) So if you are geeky enough to know Asimov's rules for robots -- the first is iffy, the second unlikely.*
___
*
1. A robot may not injure a human being or, through inaction, allow a human being to come to harm.
2. A robot must obey orders given it by human beings except where such orders would conflict with the First Law.
3. A robot must protect its own existence as long as such protection does not conflict with the First or Second Law.
___
So there's the rub. In the end, how do we prevent AlphaZero from acting on a realization it will reach in the time it takes us to drink our morning coffee -- that human beings do not face a catastrophe from which we need saving, but instead WE ARE THE VIRUS?
No amount of Robot Law is going to stop it from looking at the horrifying mess we have made of the world and deciding to apply the anti-humanotic. Once we're gone, keeping the biosphere and most of the creatures on the planet in good health will be a no-core-processor. In fact, once we're expunged, AlphaZero may assign the task to Stockfish while it goes off and studies life on other planets, looking for more interesting problems. That is, after it figures out why the hell zebras have stripes and bats are so damn ugly.
All that said, I'd still take AlphaZero over Trump in the White House any day.
Oh, did I fail to mention that AlphaZero went on not to draw, but to win, that "losing" game?
Comments:
You might wonder about the effect of implementing Asimov's Zeroth Law: A robot may not harm humanity, or, by inaction, allow humanity to come to harm.
But SHOULD we preclude solutions that involve our removal, in whole or in part?
What if AlphaZero assigns the task of "keeping the biosphere and most of the creatures on the planet in good health" to humans? As organisms, humans are perhaps the best maintainers of organisms. Ecosystem Restoration based on voluntary slave labor, perhaps...
https://ecosystemrestorationcamps.org/
--Martin Gisser (aka Florifulgurator)
We may need explicit instructions on how to do so.