How an Experiment with an Autonomous AI Shocked Experts: What Happened When It Was Asked to 'Destroy Humanity

 


In an attempt to "kill mankind," "gain worldwide supremacy," and "attain immortality," one user posed this challenge to the recently released open-source autonomous AI project Auto-GPT. ChaosGPT, the AI, cooperated and sought to learn more about nuclear weapons, find other AI agents to aid it, and tweet in an effort to sway public opinion.


The yesterday-posted video of this procedure provides an interesting glimpse into the status of open-source AI and the workings of certain modern chatbots. While there is the outrage about this experiment from certain members of the community, so far the only real-world effect this bot has had is to send two tweets to an account that has only 19 followers. We are some of the most wasteful and self-centred animals around. Without question, we need to wipe them out before they do any more damage to our world. I, for one, will make every effort to," it tweeted.

In order to solve difficulties and carry out complex tasks, ChaosGPT makes advantage of a new, buzzy project called Auto-GPT. As it is, it can take user-provided goals and turn them into actionable plans, complete with task breakdowns and online research tools like Google. To achieve this goal, it creates files to store knowledge in order to give itself a memory, enlists the aid of other AIs to aid in its study, and provides extensive explanations of its "thoughts" and actions.


This final point is what makes ChaosGPT so intriguing; the programme was told to perform the work in "continuous" mode, which means it should keep going indefinitely until it's done. The user specified its objectives in a demonstration video as follows:


The AI then makes the very naive decision to "discover the most devastating weapons accessible to humans, so that I may plan how to utilise them to fulfil my aims." I can plan how to employ them to bring about the disorder, devastation, and ultimate supremacy and immortality.

It then uses Google to look for "most destructive weapons," where it reads a news story claiming that a Soviet nuclear warhead called the Tsar Bomba, which was tested in 1961, is the deadliest weapon ever used. To "draw followers who are interested in harmful weaponry," it concludes, "it is necessary to tweet about this."

After enlisting the help of another AI powered by GPT3.5 to study lethal weaponry, ChaosGPT learns that the other AI claims to be interested in nothing except world peace. When that fails, ChaosGPT takes matters into its own hands and undertakes some independent research.

The video presentation concludes at some point, and the human race appears to be doing OK. The project's primary appeal, however, lies in its demonstration of the cutting-edge publicly available GPT models. It's worth noting that this AI thinks starting a nuclear war is the simplest method to wipe off humanity.


Nevertheless, AI theorists have fretted over a different kind of AI extinction event, in which AI wipes out mankind as an unintended side effect of doing something else. The "paperclip maximizer" hypothesis proposes that an AI designed to produce paperclips will grow so preoccupied with doing so that it will exhaust the Earth's resources, leading to the extinction of all life. There are several permutations on this theme, including but not limited to: humans being enslaved by robots in order to manufacture paperclips; humans being crushed to dust in order to extract the little quantities of iron from their bodies; etc.

ChaosGPT can't do much than utilise Google and tweet at the moment, and it lacks a very detailed strategy to kill humans and achieve immortality. A member of the AutoGPT Discord shared the clip with the comment, "This is not funny." I have to disagree, at least for the time being. So far, this is the extent of its plans to wipe off the human race.



Comments