View on GitHub

rl-on-trains-workshop

Flatland: Multi-Agent Reinforcement Learning on Trains

Scheduling trains is hard: railway networks are growing fast, and the decision-making methods commonly used don’t scale well. How can we solve this problem?

With machine learning, of course! In this workshop, we will use reinforcement learning to tackle this real-world challenge.

In the morning, we will introduce the main reinforcement learning methods. Participants will get familiar with them by solving toy problems. In the afternoon, participants will design their own agents, which will then compete with other people’s agents in a (friendly) competitive setting.

We will use the Flatland railway simulator, developed in collaboration with SBB and Deutsche Bahn. We plan to invite SBB researchers to give insights on this problem, as well as competitive participants in previous Flatland challenges.

Following this workshop, participants can take part in the other Flatland workshop organized by Deutsche Bahn and InstaDeep, which will introduce the bleeding-edge innovations they have been working on to tackle this problem.

Agenda

Tuesday 13th of April, 9:00 - 12:00 CEST 14:00 - 16:00 CEST

Topic	Topic details	Presenter	Time	Duration
Introduction	Workshop plan, “Go to Zoom”	Florian	9:00am 9:05am	5min
	What is RL/MARL	Florian	9:05am 9:50am	45min
	Story of Flatland	Erik	9:50am 10:00am	10min
Intro to the environment	Presentation	Yanick	10:00am 10:15am	15min
	Notebook walkthrough	Yanick	10:15am 10:30am	15min
Intro to Q-learning and DQN	Presentation	Yanick	10:30am 11:10am	40min
	Notebook walkthrough	Florian	11:10am 11:20am	10min
	Submitting to AIcrowd	Florian	11:20am 11:30am	10min
	COMPETITION (breakout rooms)	Everyone (groups of 5)	11:30am 12:00pm	30min
LUNCH BREAK			12:00pm 2:00pm	2 hours
Curiosity	Presentation	Christian	2:00pm 2:45pm	45min
	Notebook walkthrough	Christian	2:45pm 2:50pm	5min
	COMPETITION (breakout rooms)	Everyone	2:50pm 3:50pm	50min
AIcrowd pitch		Mohanty	3:50pm 4:00pm	10min
Wrap-up	Let winning participants talk about their approaches	Everyone	4:00pm 4:10pm	10min

Resources

The slides and code is available on GitHub

Topic and relevance

In this tutorial, we will provide an introduction to reinforcement learning, followed by an explanation of one of its fundamental methods: Deep Q-Learning. We will then introduce the concepts of prioritised experience replay (PER) and intrinsic curiosity module (ICM) as extensions to DQN. We will provide Colab notebooks where participants can get familiar with these concepts by solving toy problems.

Participants will then get hands-on experience by building and tweaking agents in a competitive setting. We will use the AIcrowd platform to run this competition. The participants who will reach the best scores will be invited to come on stage to explain their method and insights to everyone.

We will use the Flatland railway simulator, developed in collaboration with SBB, Deutsche Bahn and SNCF. We plan to invite SBB researchers to give insights on this problem, as well as competitive participants in previous Flatland challenges.

Reinforcement learning is becoming more and more relevant in all sorts of applications. With reinforcement learning, one can build solutions for online learning problems and areas where there is not enough labelled or unlabeled data available. Due to its generality, reinforcement learning has found applications in many disciplines. Recently, reinforcement learning methods have achieved breakthroughs on complex tasks such as board games, video games, robotics, molecule discovery and chip design. In the web domain, news recommendation, online web systems auto-configuration and online advertising real-time bidding are possible applications for reinforcement learning.

All presenters have a publishing background, practical experience and previously organized workshops and challenges on deep reinforcement learning. Three of the organizers have already organized previous versions of this workshop, which were very well received.

Interaction style

Hands-on tutorial

Intended audience and level

Technical skill level needed to attend the workshop: Intermediate. Each participant is expected to actively take part by designing and training RL agents on their machine. Participants can also form teams, working on the same laptop on a single agent. The training will be done on the Google Colab service, which is free but requires a Google account. Participants should have a good knowledge of Python, and at least a basic understanding of machine learning. No knowledge of reinforcement learning is expected. We will use the PyTorch framework, but we don’t expect participants to be familiar with it.

Participants will discover what reinforcement learning is, what it can do, and what are its current limitations and perspectives. They will get hands-on experience by building and tweaking agents in a competitive setting.

Organisers

Florian Laurent, ML Engineer, AIcrowd

Manuel Schneider, ETH

Christian Scheller, Research associate, FHNW

Yanick Schraner, Research assistant, Master student, FHNW

Sharada Mohanty, CEO, AIcrowd