Raul D. Steleac
PhD Student at the University of Edinburgh.
I am a third-year PhD student at the Edinburgh Centre for Robotics, following a Robotics and Autonomous Systems CDT advised by Mohan Sridharan. I am also a member of MARBLE, an interest group with a focus on Reinforcement Learning.
My research interests lie primarily in RL, with a particular emphasis on temporally extended actions (options, skills, macro-actions, you name it), hierarchical RL, and multi-agent systems. My recent work brings these themes together by studying the discovery and reuse of task-agnostic coordinated behaviours in multi-agent RL. I believe coordinated behaviours give agent teams the strongest head start when it comes to finding better solutions in downstream tasks.
Before starting my PhD, I worked as a Machine Learning Engineer for two years in biomedical drug discovery and finance, and previously as a Junior Software Developer for three years (professional experience section). I hold an MSc in Computing from Imperial College London, with a specialisation in Artificial Intelligence and Machine Learning (education section).
Publications:
Professional Experience:
Developed transformer-based architectures for a Natural-language pipeline that extracts valuable financial information from chats between investment banking officials and clients aiming to assist traders in their daily transactions leading to more efficient and precise deals.
London, UK
Developed NLP methods to construct biomedical knowledge graphs for drug discovery in rare diseases. Designed and implemented a Contextual Entity Linking transformer-based architecture that successfully disambiguates and maps in-sentence entities to internal biomedical ontologies.
Cambridge, UK
Contributed to the development of two versions of the Intel Movidius Visual Processing Unit chip, used to accelerate computations inside neural networks for real-time applications like drones and robots.
Timișoara, Romania
Investigated and resolved software issues in C++ within the Fault Detection and Alarm Raising department, applying object-oriented methodologies.
Timișoara, Romania
Education:
Grade: Distinction.
Relevant courses: Reinforcement Learning, Deep Learning, Probabilistic Inference, Computer Vision, Natural Language Processing.
Thesis: Curriculum Reinforcement Learning in Tabular Methods.
Merit scholarships in 7 out of the 8 semesters.
Thesis: End-to-end Speech Emotion Recognition using BLSTMs with Attention layer and Multi-domain training.