Not Content With Go, DeepMind's AI Becomes a Video Game Ace
Two years ago, DeepMind drew headlines by creating an AI system that defeated the world champion of the game Go. Now another program at the Alphabet subsidiary has learned how to play the popular multiplayer video game Quake.

DeepMind said last Tuesday that it had developed innovations in reinforcement learning that enabled an artificial-intelligence system to achieve human-level performance in Quake III Arena's Capture the Flag, a 3-D first-person multiplayer game.

DeepMind said that learning to play Capture the Flag was intended as an exercise in which several individual agents must act independently while learning to interact and cooperate with each other. "This is an immensely difficult problem, because with co-adapting agents the world is constantly changing," DeepMind said in a blog post.

Quake III Arena is a first-person shooter video game with simple rules (two teams protect their own flag while seizing that of their opponent) but also the potential for complex outcomes. The game requires players (or in AI parlance, agents) to cooperate with team members while competing with others amid a changing variety of maps.

The agents were never instructed about the rules of the game, yet were able to learn the game "to a very high standard," DeepMind said. In a tournament randomly mixing AI agents with 40 human players, the agents quickly learned to exceed the win rate of their flesh-and-blood counterparts. Even scarier, human players rated the agents as more collaborative teammates than other humans.

"Agents in fact learn human-like behaviors, such as following teammates and camping in the opponent's base," DeepMind said on its blog. "In general, we think this work highlights the potential of multi-agent training to advance the development of artificial intelligence."

(Fortune China) Chinese translation: Yan Kuangzheng