Blog Layout

Researchers from DeepMind, the UK’s juggernaut AI lab, have forsaken the noble games of chess and Go for a more plebeian delight: football.

The Google sister company yesterday published a research paper and accompanying blog post detailing its new neural probabilistic motor primitives (NPMP) — a method by which artificial intelligence agents can learn to operate physical bodies.

Per the blog post :

An NPMP is a general-purpose motor control module that translates short-horizon motor intentions to low-level control signals, and it’s trained offline or via RL by imitating motion capture (MoCap) data, recorded with trackers on humans or animals performing motions of interest.

Up front: Essentially, the DeepMind team created an AI system that can learn how to do things inside of a physics simulator by watching videos of other agents performing those tasks.

And, of course, if you’ve got a giant physics engine and an endless supply of curious robots, the only rational thing to do is to teach it how to dribble and shoot:

<iframe srcdoc="*{padding:0;margin:0;overflow:hidden}html,body{background:#000;height:100%}img{position:absolute;top:0;left:0;width:100%;height:100%;object-fit:cover;transition:opacity .1s cubic-bezier(0.4,0,1,1)}a:hover img+img{opacity:1!important} ” height=”240″ width=”320″ allow=”accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture” allowfullscreen frameborder=”0″>[embedded content]

According to the team’s research paper :

We optimized teams of agents to play simulated football via reinforcement learning, constraining the solution space to that of plausible movements learned using human motion capture data.

Background: In order to train AI to operate and control robots in the world, researchers have to prepare the machines for reality. And, outside of simulations, anything can happen. Agents have to deal with gravity, unexpectedly slippery surfaces, and unplanned interference from other agents.

The point of the exercise isn’t to build a better footballer — Cristiano Ronaldo has nothing to fear from the robots, for now — but instead to help the AI and its developers figure out how to optimize the agents’ ability to predict outcomes.

As the AI starts its training, it’s barely able to move its physics-based humanoid avatar around the field. But, by rewarding an agent every time its team scores a goal, the model is able to get the figures up and running in around 50 hours. After several days of training, the AI begins to predict where the ball will go and how the other agents will react to its movement.

Per the paper:

The result is a team of coordinated humanoid football players that exhibit complex behavior at different scales, quantified by a range of analysis and statistics, including those used in real-world sport analytics. Our work constitutes a complete demonstration of learned integrated decision-making at multiple scales in a multiagent setting.

Quick take: This work is pretty rad. But we’re not so sure it represents a “complete demonstration” of anything. The model is obviously capable of operating an embodied agent. But, based on the apparently cherry-picked GIFs on the blog post, this work is still deeply in the simulation phase.

The bottom line here, is that the AI isn’t “learning” how to play football. It’s brute-forcing movement within the boundaries of its simulation. That may seem like a minor quibble, but the results are quite evident:

The above AI agent looks absolutely terrified. I don’t know what it’s running away from, but I’m certain that it’s the scariest thing ever.

It moves like an alien wearing a human suit for the first time because, unlike humans, AI cannot learn by watching. Systems like the one DeepMind trained parse thousands of hours of video and, essentially, extricate motion data about the subject their trying to “learn” from.

However, it’s almost certain that these models will become more robust as time goes on. We’ve seen what Boston Dynamics can do with machine learning algorithms and pre-programmed choreography.

It’ll be interesting to see how more adaptive models, such as the ones being developed by DeepMind, will fare once they move beyond the laboratory environment and into actual robotics applications.

By Laurence November 21, 2022
Usually, the winners of a pitching competition are bathed with accolades, media attention, and applause. After it’s done and dusted, all they have to think about is what to spend
By Laurence November 19, 2022
Above all else, FTX advertisements wanted you to know two things: that cryptocurrency is a force for good, and that you don’t need to be an expert to buy and
By Laurence November 19, 2022
This article was originally published on .cult by Luis Minvielle. .cult is a Berlin-based community platform for developers. We write about all things career-related, make original documentaries, and share heaps
By Laurence November 18, 2022
Okay, that’s a good question. Red Crew, Blue Crew Had it not been for the heroics of three members of NASA’s specialized “Red Crew,” NASA’s absolutely massive — and incredibly
By Laurence November 18, 2022
Pharmaceutical manufacturing is closely linked to mass production. In order for medicines to be sold cheaply, they often have to be made in huge amounts. But what happens if you
By Laurence November 17, 2022
“I’m in checkmark purgatory.” Checkmate They say “don’t meet your heroes,” but what’s even worse? When your hero buys Twitter, forces you and others to start paying eight dollars per
More Posts
Share by: