Flatland: Multi-Agent Reinforcement Learning on Trains
Scheduling trains is hard: railway networks are growing fast, and the decision-making methods commonly used don’t scale well. How can we solve this problem?
With machine learning, of course! In this workshop, we will use reinforcement learning to tackle this real-world challenge.
In the morning, we will introduce the main reinforcement learning methods. Participants will get familiar with them by solving toy problems. In the afternoon, participants will design their own agents, which will then compete with other people’s agents in a (friendly) competitive setting.
We will use the Flatland railway simulator, developed in collaboration with SBB and Deutsche Bahn. We plan to invite SBB researchers to give insights on this problem, as well as competitive participants in previous Flatland challenges.
Following this workshop, participants can take part in the other Flatland workshop organized by Deutsche Bahn and InstaDeep, which will introduce the bleeding-edge innovations they have been working on to tackle this problem.
Agenda
Tuesday 13th of April, 9:00 - 12:00 CEST and 14:00 - 16:00 CEST
Topic | Topic details | Presenter | Time | Duration |
---|---|---|---|---|
Introduction | Workshop plan, “Go to Zoom” | Florian | 9:00am - 9:05am | 5min |
What is RL/MARL | | Florian | 9:05am - 9:50am | 45min |
Story of Flatland | | Erik | 9:50am - 10:00am | 10min |
Intro to the environment | Presentation | Yanick | 10:00am - 10:15am | 15min |
Intro to the environment | Notebook walkthrough | Yanick | 10:15am - 10:30am | 15min |
Intro to Q-learning and DQN | Presentation | Yanick | 10:30am - 11:10am | 40min |
Intro to Q-learning and DQN | Notebook walkthrough | Florian | 11:10am - 11:20am | 10min |
Submitting to AIcrowd | | Florian | 11:20am - 11:30am | 10min |
COMPETITION (breakout rooms) | | Everyone (groups of 5) | 11:30am - 12:00pm | 30min |
LUNCH BREAK | | | 12:00pm - 2:00pm | 2 hours |
Curiosity | Presentation | Christian | 2:00pm - 2:45pm | 45min |
Curiosity | Notebook walkthrough | Christian | 2:45pm - 2:50pm | 5min |
COMPETITION (breakout rooms) | | Everyone | 2:50pm - 3:50pm | 60min |
AIcrowd pitch | | Mohanty | 3:50pm - 4:00pm | 10min |
Wrap-up | Let winning participants talk about their approaches | Everyone | 4:00pm - 4:10pm | 10min |
Resources
The slides and code are available on GitHub.
Topic and relevance
In this tutorial, we will provide an introduction to reinforcement learning, followed by an explanation of one of its fundamental methods: Deep Q-Learning. We will then introduce the concepts of prioritised experience replay (PER) and intrinsic curiosity module (ICM) as extensions to DQN. We will provide Colab notebooks where participants can get familiar with these concepts by solving toy problems.
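To give a flavour of the toy problems mentioned above, here is a minimal sketch of tabular Q-learning. It is not taken from the workshop notebooks: the corridor environment, the function names (`train_q_table`, `greedy_policy`) and all hyperparameter values are assumptions made for illustration. Deep Q-learning builds on the same update target, but replaces the table with a neural network.

```python
import random

# Hypothetical toy problem: a 1-D corridor. The agent starts in state 0
# and receives a reward of 1 when it reaches the last state.
ACTIONS = (-1, +1)  # index 0 = step left, index 1 = step right


def train_q_table(n_states=6, episodes=500, alpha=0.5, gamma=0.9,
                  epsilon=0.1, seed=0):
    """Learn a Q-table with epsilon-greedy tabular Q-learning."""
    rng = random.Random(seed)
    goal = n_states - 1
    q = [[0.0, 0.0] for _ in range(n_states)]
    for _ in range(episodes):
        state = 0
        while state != goal:
            # Explore with probability epsilon, otherwise act greedily
            # (ties between equal Q-values are broken at random).
            if rng.random() < epsilon:
                action = rng.randrange(2)
            else:
                best = max(q[state])
                action = rng.choice(
                    [a for a in range(2) if q[state][a] == best]
                )
            # Walls clamp the agent inside the corridor.
            next_state = min(max(state + ACTIONS[action], 0), goal)
            reward = 1.0 if next_state == goal else 0.0
            # Q-learning update: move Q(s, a) towards the bootstrapped target
            # r + gamma * max_a' Q(s', a'). The goal row stays at zero.
            target = reward + gamma * max(q[next_state])
            q[state][action] += alpha * (target - q[state][action])
            state = next_state
    return q


def greedy_policy(q):
    """Return the index of the best action for every non-goal state."""
    return [max(range(2), key=lambda a: row[a]) for row in q[:-1]]


if __name__ == "__main__":
    q_table = train_q_table()
    print(greedy_policy(q_table))
```

After training, the greedy policy should step right in every non-goal state, since that is the shortest path to the reward.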
Participants will then get hands-on experience by building and tweaking agents in a competitive setting. We will use the AIcrowd platform to run this competition. The participants who reach the best scores will be invited on stage to explain their methods and insights to everyone.
We will use the Flatland railway simulator, developed in collaboration with SBB, Deutsche Bahn and SNCF. We plan to invite SBB researchers to give insights on this problem, as well as competitive participants in previous Flatland challenges.
Reinforcement learning is becoming relevant to a growing range of applications. It can be used to build solutions for online learning problems and for areas where not enough labelled or unlabelled data is available. Due to its generality, reinforcement learning has found applications in many disciplines. Recently, reinforcement learning methods have achieved breakthroughs on complex tasks such as board games, video games, robotics, molecule discovery and chip design. In the web domain, possible applications include news recommendation, auto-configuration of online web systems and real-time bidding for online advertising.
All presenters have a publishing background and practical experience, and have previously organized workshops and challenges on deep reinforcement learning. Three of the organizers have already organized previous versions of this workshop, which were very well received.
Interaction style
Hands-on tutorial
Intended audience and level
Technical skill level needed to attend the workshop: Intermediate. Each participant is expected to take an active part by designing and training RL agents from their own machine. Participants can also form teams and work together on a single agent from one laptop. Training will run on the Google Colab service, which is free but requires a Google account. Participants should have a good knowledge of Python and at least a basic understanding of machine learning. No knowledge of reinforcement learning is expected. We will use the PyTorch framework, but we don’t expect participants to be familiar with it.
Participants will discover what reinforcement learning is, what it can do, and what its current limitations and prospects are. They will get hands-on experience by building and tweaking agents in a competitive setting.
Organisers
Florian Laurent, ML Engineer, AIcrowd
Manuel Schneider, ETH
Christian Scheller, Research associate, FHNW
Yanick Schraner, Research assistant, Master student, FHNW
Sharada Mohanty, CEO, AIcrowd