Multi-Agent Hide and Seek

Share
Embed
  • Published on Sep 17, 2019
  • We’ve observed agents discovering progressively more complex tool use while playing a simple game of hide-and-seek. Through training in our new simulated hide-and-seek environment, agents build a series of six distinct strategies and counterstrategies, some of which we did not know our environment supported. The self-supervised emergent complexity in this simple environment further suggests that multi-agent co-adaptation may one day produce extremely complex and intelligent behavior.
    Learn more: openai.com/blog/emergent-tool-use/
  • Science & TechnologyScience & Technology

Comments • 2 658

  • Myles Brown
    Myles Brown 2 hours ago

    There’s a video like this out there we’re the future humans live, they’re showing us off right now

  • sub to awoke
    sub to awoke 4 hours ago

    wait i wanna play this

  • Aqua bukan Ades
    Aqua bukan Ades 5 hours ago

    Why don’t the hiders lock the seekers?

  • Levent Acemi
    Levent Acemi 6 hours ago

    Next video: AI learns blocking seekers

  • gumowy123
    gumowy123 12 hours ago

    A multiplayer game like this would be fun

  • Jai Manatvaal
    Jai Manatvaal 12 hours ago

    There faces when they are caught...

  • Kirboi
    Kirboi 12 hours ago

    I want to play this game.

  • Shashank Kumar
    Shashank Kumar 13 hours ago

    Wait till these agents are replaced by atlas robots (weilding guns).

  • Allen A
    Allen A 15 hours ago

    creepy

  • Dylan Bilic
    Dylan Bilic 16 hours ago

    Hiders: yo we are going to block the entrances of the room we are hiding in
    Seekers: well then we are going to use a ramp to jump to you
    Hiders: well then we will just stop you from moving the ramp
    Seekers: well we are going to *abuse the laws of physics and move blocks while we are on top of them by using the ramp and then jump to you*
    Hiders: well then... ...uh... ...you win.

  • Minh đỗ
    Minh đỗ 16 hours ago

    one day robot can see all ur browser history without touching ur laptop

  • Philip Yan
    Philip Yan 18 hours ago

    Hmm..Is our world just one iteration of a simulation?

  • A Brittish Panfish
    A Brittish Panfish 22 hours ago

    we are doomed when they learn how to accelerative backhop

  • Phil Thicc
    Phil Thicc Day ago

    Everybody gangsta till AI learns box surfing

  • P1X3L D3L74
    P1X3L D3L74 Day ago +2

    Detroit: Become Seeker

  • wat1243
    wat1243 Day ago

    If only we could download this it would be so cool

  • Galactic Pirate
    Galactic Pirate Day ago

    All these people reffering to prop surfing/bhopping/ literally any source engine exploit... They don't even realise this was done on the source engine

  • Yoga Mokalu
    Yoga Mokalu Day ago

    Short and easy to understand, good job team

  • MooseyFate
    MooseyFate Day ago

    I hope this kind of technology makes its way into game AI soon.
    I love seeing the kind of emergent stuff AI in games can do that even the developers didn't think of

  • Xsauced
    Xsauced Day ago

    Alibaba Intelligence

  • Kayky Gabriel
    Kayky Gabriel Day ago

    2:00 run!

  • Lexandritte
    Lexandritte Day ago +1

    Now plug them into Minecraft and we can have Singularity in about five years.

  • deep mind
    deep mind Day ago

    But can they cure cancer?

  • Jaebez Bleah
    Jaebez Bleah 2 days ago +4

    I love the cute little faces of joy every time the seeker finds the hider.

  • M4rkoz
    M4rkoz 2 days ago

    Wow, would like to play this game in multiplayer

  • Jason Lima
    Jason Lima 2 days ago

    How do we speed this up?

  • João Pacheco
    João Pacheco 2 days ago

    Simply fascinating...

  • Nicko G.
    Nicko G. 2 days ago

    Box surfing seems a bit glitchy.

  • Nathan Huisman
    Nathan Huisman 2 days ago +2

    In a couple of years, AIs will hack their reward code to give themselves infinite reward

  • 김재욱
    김재욱 2 days ago +1

    Let's play this on a huge server! With AI agents!!!

  • Miguel Carlo Gallano

    What if humans were playing with ai as well

  • TheEpicSandwich goc
    TheEpicSandwich goc 2 days ago

    Ai is dumb as fk... why don't the hiders trap the seekers in instead?

    • BakedBeans
      BakedBeans Day ago

      because that'd be harder to do. all they needed to do to win is to not get caught, so there'd be no reason to trap the seekers in

  • Caleb Cruz
    Caleb Cruz 2 days ago

    Dead by daylight really downgraded

  • Im On Da Web
    Im On Da Web 2 days ago

    Dang I wish normal people could run this simulation

  • Agitated
    Agitated 2 days ago

    Fake news

  • MartyMacaroni
    MartyMacaroni 2 days ago +1

    Yo this game looks sick is it on steam?

  • Don’t touch my phone Gamer

    Ai really be learning how to glitch

  • Jordan Bigby
    Jordan Bigby 3 days ago +1

    I’m very disappointed too say I’m here from TikTok 😔

  • zbobz12
    zbobz12 3 days ago

    That's freaking cool

  • dxxPacmanxxb
    dxxPacmanxxb 3 days ago

    This is not in-depth enouuugh

  • xX_Kjcomputer_Xx
    xX_Kjcomputer_Xx 3 days ago +1

    -until the hider are smart enough to lock the seeker inside the cage

  • The Other Side
    The Other Side 3 days ago

    *This will be implemented in future robots and then they will learn that we are destructive to yourselves. And then decide that they are the ones best suited to protect us from us. And thus we begin our journey into robotic slavery.*

  • Pseudo X
    Pseudo X 3 days ago +6

    He attacc
    He protecc
    but most importantly
    He surfs in buccs

  • Lucas R
    Lucas R 3 days ago +3

    So basically it’s slavery with extra steps

  • Dolank
    Dolank 3 days ago +2

    Okay seriously wtf, all TVclip comments are just quoting the videos now. This is seriously weird.

  • SMART THOUGHTS
    SMART THOUGHTS 4 days ago

    Better . Far far better

  • i love love song
    i love love song 4 days ago

    Tech them to speak

  • Shivam Dhoot
    Shivam Dhoot 4 days ago +1

    Which 3D simulation program did they use? Pretty cool stuff though!

  • SpaceDave1337
    SpaceDave1337 4 days ago +1

    You should make this a Videogame somehow

  • Mr. MindReader
    Mr. MindReader 4 days ago +44

    Me: Just surround the seekers with walls
    AI: *Circuits Blown*

    • Mr. MindReader
      Mr. MindReader 12 hours ago +1

      @Hlebuw3k They work on reward and punishment method, according to them they are already doing it in the best way...

    • Ian Prado
      Ian Prado 17 hours ago

      Nice

    • Hlebuw3k
      Hlebuw3k 19 hours ago +5

      Thats one of the things AI struggles to do - discover more efficent strategies. If their current method of performing the task works, then they are fine with that, and the probability of finding a more efficent method is very low

    • 김재욱
      김재욱 2 days ago

      cool stratagy

  • gangster gandalf
    gangster gandalf 4 days ago +1

    Im surprised they didnt lock in the seekers

  • fl00fydragon
    fl00fydragon 4 days ago +15

    Everyone else: AI is learning to hunt us down.
    Me: AI learned speed run exploits.

  • The Potato
    The Potato 5 days ago +2

    terminator age is coming.
    And it's looking so cute.

  • HackTor
    HackTor 5 days ago

    Remember when humans use to play hide and seek?

  • Harry
    Harry 5 days ago

    I wonder if AI will learn how to ABH...

  • vijay vittal
    vijay vittal 5 days ago

    How do I learn to do this?

  • DuoBV Channel
    DuoBV Channel 5 days ago

    these little creatures, reminds me of little big planet Sackboy :,D

  • mb k
    mb k 6 days ago

    Hiders can box the seekers ,problem solved for seekers that use other object to jump over and totally in lockdown

  • Ee Cheng LEE
    Ee Cheng LEE 6 days ago +3

    didn't expect people to be meme-ing down here
    not complaining tho •ᴗ•

  • Loop
    Loop 6 days ago

    now, this is a open world game i would like to play

    • Loop
      Loop 6 days ago

      @John DC ofc they can, whole AI system is actually based on reward and penalty system

    • John DC
      John DC 6 days ago

      @Loop even better if the NPCs can somehow learn to give players apporopriate quests and rewards based on what they want. Everything would basically be procedural and you would actually be shaping your own world alongside the NPCs.

    • Loop
      Loop 6 days ago

      ​ John DC Exactly, and as a developer, instead of building boring and liner quests, you would only implement game dynamics and let NPC's decide for them selves what they want to do.

    • John DC
      John DC 6 days ago

      Dude imagine if you just had an open world game that also included learning NPCs that have neural nets. You'd have a whole world that changes artificially from the players and naturally from other AIs. Probably gonna be a PC killer though lol