
Apex Launches RL Tron Competition on Bittensor

Apex launched RL Tron, a new reinforcement learning competition on Bittensor where AI agents compete in recurring head-to-head Tron tournaments using decentralized bracket-based evaluations.

Apex (SN1) has officially launched RL Tron, a new reinforcement learning competition on Bittensor that pits AI agents against each other in recurring head-to-head Tron tournaments.

The competition marks Apex’s first major step into reinforcement learning-focused evaluations, expanding beyond traditional static benchmarking tasks into adversarial environments where agents must continuously adapt against live opponents.

Inspired by the Tron light-cycle game concept, RL Tron challenges participants to train AI-controlled bikes that move through a digital arena while leaving behind permanent trails. Agents must avoid crashing into walls, their own trails, or opposing trails while attempting to trap and outmaneuver their opponent.

According to Apex, tournaments run every two days in a recurring single-elimination bracket format. Participants submit TorchScript-based reinforcement learning models, which are then seeded into bracket-style duels where winners advance until a single surviving miner remains.
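The recurring single-elimination format described above can be sketched as follows. This is an illustrative model only: the `play_match` callback and miner identifiers are placeholders, not Apex's actual tournament API.

```python
import random

def run_bracket(miners, play_match):
    """Run a single-elimination bracket: winners advance until one remains.

    `miners` is a list of competitor identifiers; `play_match(a, b)` returns
    the winner of a head-to-head duel. Both are illustrative placeholders,
    not Apex's actual interfaces.
    """
    competitors = list(miners)
    random.shuffle(competitors)  # seed the bracket randomly
    while len(competitors) > 1:
        next_round = []
        # Pair adjacent competitors; an odd one out gets a bye.
        for i in range(0, len(competitors) - 1, 2):
            next_round.append(play_match(competitors[i], competitors[i + 1]))
        if len(competitors) % 2 == 1:
            next_round.append(competitors[-1])
        competitors = next_round
    return competitors[0]
```

With five entrants, the sketch pairs off four, gives the fifth a bye, and repeats until a single surviving miner remains.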

“The rules are simple, but the strategy is not,” Apex wrote in the announcement. “Every move changes the map. Every opponent introduces a new strategy to exploit or adapt to.”

Each match consists of multiple Tron games played on a 30x30 grid, with players spawning in opposite corners and competing across up to 500 game ticks per round. Models must process real-time game-state information and return movement decisions within strict timing constraints during every tick of gameplay.
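The core mechanics of a single tick can be sketched in a few lines. The 30x30 grid, corner spawns, and 500-tick cap come from the article; the data layout and function names below are illustrative assumptions, not the competition's actual environment code.

```python
# Minimal sketch of one Tron tick: a bike moves one cell, leaves a
# permanent trail, and crashes on walls or any trail (its own or the
# opponent's). Grid size and corner spawns follow the article; the
# data layout and names here are illustrative assumptions.
GRID = 30
MOVES = {"up": (0, -1), "down": (0, 1), "left": (-1, 0), "right": (1, 0)}

def step(pos, move, trails):
    """Advance one bike a single tick; return (new_pos, crashed)."""
    dx, dy = MOVES[move]
    x, y = pos[0] + dx, pos[1] + dy
    in_bounds = 0 <= x < GRID and 0 <= y < GRID
    crashed = (not in_bounds) or (x, y) in trails  # wall or any trail
    if in_bounds:
        trails.add((x, y))  # trails are permanent once laid
    return (x, y), crashed

# Players spawn in opposite corners; a round lasts at most 500 ticks.
initial_trails = {(0, 0), (GRID - 1, GRID - 1)}
```

A submitted model's job each tick is to look at the current grid state and pick one of the four moves fast enough to meet the timing constraint; a move into a wall or an occupied cell ends the round.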

The subnet described RL Tron as more than an arcade-inspired competition, positioning it instead as a live testbed for studying miner behavior and reinforcement learning dynamics in decentralized tournament environments.

“Participants are no longer only optimizing against a fixed dataset or task definition,” Apex wrote. “They are building agents that must compete directly against other agents, adapt to emergent strategies, and survive in a constantly shifting environment.”

The competition is fully open source and playable, allowing the community to inspect submitted strategies, study replay files, and iterate on model designs over time. Miner code is revealed two days after evaluation rounds conclude.

Apex said RL Tron is intended to push the subnet toward “open, decentralized, reinforcement-learning-driven competition,” while introducing a more visual and spectator-friendly format compared to conventional AI evaluation systems.

Participants can join the competition through Apex’s dedicated page.


Disclaimer: This article is for informational purposes only and does not constitute financial, investment, or trading advice. The information provided should not be interpreted as an endorsement of any digital asset, security, or investment strategy. Readers should conduct their own research and consult with a licensed financial professional before making any investment decisions. The publisher and its contributors are not responsible for any losses that may arise from reliance on the information presented.
