Raul D. Steleac

PhD Student at the University of Edinburgh.

I am a third-year PhD student at the Edinburgh Centre for Robotics, following a Robotics and Autonomous Systems CDT advised by Mohan Sridharan. I am also a member of MARBLE, an interest group with a focus on Reinforcement Learning.

My research interests lie primarily in RL, with a particular emphasis on temporally extended actions (options, skills, macro-actions, you name it), hierarchical RL, and multi-agent systems. My recent work brings these themes together by studying the discovery and reuse of task-agnostic coordinated behaviours in multi-agent RL. I believe coordinated behaviours give agent teams the strongest head start when it comes to finding better solutions in downstream tasks.

Before starting my PhD, I worked as a Machine Learning Engineer for two years in biomedical drug discovery and finance, and previously as a Junior Software Developer for three years (professional experience section). I hold an MSc in Computing from Imperial College London, with a specialisation in Artificial Intelligence and Machine Learning (education section).

profile_pic2.jpg

Publications:

  1. Raul D. Steleac, Mohan Sridharan, and David Abel
    In The Fourteenth International Conference on Learning Representations (ICLR), 2026
  2. Aleksandar Krnjaic, Raul D. Steleac, Jonathan D. Thomas, Georgios Papoudakis, Lukas Schäfer, Andrew Wing Keung To, Kuan-Ho Lao, Murat Cubuktepe, Matthew Haley, Peter Börsting, and Stefano V. Albrecht
    In IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2024

Professional Experience:

Machine Learning Scientist

Developed transformer-based architectures for a Natural-language pipeline that extracts valuable financial information from chats between investment banking officials and clients aiming to assist traders in their daily transactions leading to more efficient and precise deals.

Jan. 2023 – Aug. 2023
London, UK
Machine Learning Engineer

Developed NLP methods to construct biomedical knowledge graphs for drug discovery in rare diseases. Designed and implemented a Contextual Entity Linking transformer-based architecture that successfully disambiguates and maps in-sentence entities to internal biomedical ontologies.

Nov. 2021 – Jan. 2023
Cambridge, UK
Software Development Intern

Contributed to the development of two versions of the Intel Movidius Visual Processing Unit chip, used to accelerate computations inside neural networks for real-time applications like drones and robots.

Dec. 2018 – Aug. 2020
Timișoara, Romania
Junior Software Developer

Investigated and resolved software issues in C++ within the Fault Detection and Alarm Raising department, applying object-oriented methodologies.

Oct. 2017 – Dec. 2018
Timișoara, Romania

Education:

University of Edinburgh
PhD in Robotics and Autonomous Systems
Sept. 2023 – Present
Imperial College London
MSc in Computing (Artificial Intelligence and Machine Learning)

Grade: Distinction.

Relevant courses: Reinforcement Learning, Deep Learning, Probabilistic Inference, Computer Vision, Natural Language Processing.

Thesis: Curriculum Reinforcement Learning in Tabular Methods.

Oct. 2020 – Oct. 2021
Polytechnic University Timisoara
BEng in Computers and Information Technology

Merit scholarships in 7 out of the 8 semesters.

Thesis: End-to-end Speech Emotion Recognition using BLSTMs with Attention layer and Multi-domain training.

Sept. 2016 – July 2020