Sony AI's Project Ace Beats Elite Table-Tennis Players, Lands Nature Cover

An autonomous robot combining event-based vision and reinforcement learning won seven of thirteen games against elite amateurs — published on the cover of Nature's April 23 issue.

Sony AI published "Outplaying elite table tennis players with an autonomous robot" on the cover of Nature's April 23 issue, describing an autonomous system — nicknamed Ace — that can compete with human players at a full-speed, full-rules match. Against five elite amateur players (each with more than ten years of experience), Ace won seven of thirteen games and took three match wins. Against two Japanese professional-league players — Minami Ando and Kakeru Sone — it won one game out of seven. The system uses event-based vision sensors and a model-free reinforcement-learning control policy, with an end-to-end latency of 20.2 milliseconds, roughly 10× faster than a human player's visuomotor loop.

The technical contribution is tighter integration between perception and control. Event-based cameras report per-pixel brightness changes asynchronously rather than delivering fixed-rate frames, which is a good fit for tracking a fast, spinning object. Pairing that sensor stream with reinforcement learning trained in simulation and fine-tuned on the real robot lets the system handle the long tail of spin, angle, and placement variations that rule-based controllers historically struggle with. The 20-millisecond latency is what makes the whole thing work: once a ball leaves the opponent's paddle, there is simply not enough time to plan through a slow pipeline.

Table tennis is narrow, but the paper matters for physical AI more broadly. For the last two years, benchmarks for robotic manipulation have converged on slow, quasi-static tasks — pick a bottle, fold a shirt, stack blocks. Ace shows that the perception-plus-RL recipe can now handle something genuinely high-frequency and adversarial, and that at least for some domains the gap to expert human performance is closeable. That is a directional signal for everyone from manufacturing automation to autonomous driving.

For learners: physical AI is where a lot of the next decade of AI jobs will be created, and the field is dramatically more empirical than language-model work. Reading a paper like Ace is useful not for the exact method but for the systems thinking — the authors optimized a latency budget across sensor, compute, actuator, and training loop, and every one of those components had to improve to hit the target. If you want to work in robotics, the core skill is not any one subfield; it is the ability to reason about whole-system constraints at once.