lunduniversity.lu.se

Automatic Control

LTH best Master's thesis

Johanna Wilroth has been awarded the prize "LTHs Jubileumsstipendium 2020" for the best master thesis at LTH. In her thesis "Domain Adaptation for Attention Steering," Wilroth studied novel algorithms for improving the performance of hearing aids using EEG signals in combination with auditory input. The master thesis was done at the Department of Automatic Control, with Carolina Bergeling as main supervisor, in collaboration with the research center Eriksholm/Oticon in Denmark. The collaboration also involved the departments of Psychology and Mathematical Statistics at Lund University.

 

M.Sc by Trulsson:    Dynamic Scheduling of Shared Resources using Reinforcement Learning

M.Sc by Trulsson: Dynamic Scheduling of Shared Resources using Reinforcement Learning

Seminarium

From: 2021-06-14 11:00 to 11:45
Place: https://lu-se.zoom.us/j/62273486703.
Contact: karl-erik [dot] arzen [at] control [dot] lth [dot] se
Save event to your calendar


Abstract:

The goal of the thesis is to simulate the Ericsson Many-Core Architecture, EMCA, and implement a dynamic scheduler for the system using reinforcement learning methods. The system contains shared resources that receives and completes jobs. Also, the deadlines and latency definitions can change depending on the job type. Furthermore, the scheduler should aim to avoid missing deadlines as well as aim to reduce the overall latency in the system. A python simulation has been implemented of the Ericsson Many-Core Architecture and two reinforcement learning based schedulers have then been used for different configurations. They are evaluated by comparing their performance to a random and a static scheduler. The first scheduler use Q-learning and the second uses a version of Q-learning with a neural net that approximates the Q-function. The results showed that the second version experienced issues with convergence which caused deadline misses and poor latency. The regular version of Q-learning showed promising results, avoiding deadline misses and was able to reduce the latency below that of the static scheduler for one of the systems. There are still some issues that could be addressed as well as revenues to explore regarding the scheduler. Furthermore, in order to apply the scheduler to the real system some modifications are necessary. However, the simulations show that the reinforcement learning can successfully be used as a scheduler on the EMCA for different configurations.


Student: Patrik Trulsson
Advisors:
     Karl-Erik Årzen, Dept. of Automatic Control, LTH
     William Tidelund, Ericsson
     Jonas Korsell, Ericsson
Examiner: Bo Bernhardsson, Dept. of Automatic Control, LTH


The seminar will be held at https://lu-se.zoom.us/j/62273486703.



Recent Publications