Dr Matthias Rolf
PhD (Dr.-Eng.), MSc, PGCHE
Reader in Computer Science
School of Engineering, Computing and Mathematics
Role
I am teaching and researching in robotics and machine learning.
Areas of expertise
- Robotics
- Machine Learning
- Software Engineering
- Cognitive Science
- Developmental Robotics
Teaching and supervision
Courses
- Artificial Intelligence (BSc (Hons), MSci)
- Artificial Intelligence (MSc, PGDip, PGCert)
- Computer Science (BSc (Hons))
Modules taught
- Machine Learning
- Mobile Robotics
- Machine Learning
- Autonomous Intelligent Systems
- Advanced Robotic Control
- Robotics Systems Engineering
Subject coordinator for:
- BSc Robotics
- BSc/MSci Artificial Intelligence
- MSc Artificial Intelligence
Supervision
I am a postgraduate research tutor for Computing.
I welcome enquiries from people considering doing a PhD or Master by Research on my subject of expertise.
Research
Key recent research focus included:
Multi-Objective Reinforcement Learning
- Rolf M (2020). 'The Need for MORE: Need Systems as Non-Linear Multi-Objective Reinforcement Learning'
Ethical AI
- Rolf M, Crook NT, Steil JJ (2018). 'From social interaction to ethical AI: a developmental roadmap'
- Rolf M, Crook NT (2016). 'What if: robots create novel goals? Ethics based on social value systems'
Robotics
- Gama F, Shcherban M, Rolf M, Hoffmann M (2020). 'Active exploration for body model learning through self-touch on a humanoid robot with artificial skin'
- Baker T, Rolf M (2020). 'Tactile feedback in a tele-operation pick-and-place task improves perceived workload'
Centres and institutes
- Centre for AI, Culture and Society (CAICS)
- Artificial Intelligence, Data Analysis and Systems (AIDAS) Institute
Groups
Publications
Journal articles
-
Gama F, Shcherban M, Rolf M, Hoffmann M, 'Goal-directed tactile exploration for body model learning through self-touch on a humanoid robot'
IEEE Transactions on Cognitive and Developmental Systems 15 (2) (2021) pp.419-433
ISSN: 2379-8920 eISSN: 2379-8939AbstractPublished here Open Access on RADARAn early integration of tactile sensing into motor coordination is the norm in animals, but still a challenge for robots. Tactile exploration through touches on the body gives rise to first body models and bootstraps further development such as reaching competence. Reaching to one’s own body requires connections of the tactile and motor space only. Still, the problems of high dimensionality and motor redundancy persist. Through an embodied computational model for the learning of self-touch on a simulated humanoid robot with artificial sensitive skin, we demonstrate that this task can be achieved (i) effectively and (ii) efficiently at scale by employing the computational frameworks for the learning of internal models for reaching: intrinsic motivation and goal babbling. We relate our results to infant studies on spontaneous body exploration as well as reaching to vibrotactile targets on the body. We analyze the reaching configurations of one infant followed weekly between 4 and 18 months of age and derive further requirements for the computational model: accounting for (iii) continuous rather than sporadic touch and (iv) consistent redundancy resolution. Results show the general success of the learning models in the touch domain, but also point out limitations in achieving fully continuous touch.
-
Crook N, Nugent S, Rolf M, Baimel A, Raper R, 'Computing morality: Synthetic ethical decision making and behaviour'
Cognitive Computation and Systems 3 (2) (2021) pp.79-82
ISSN: 2517-7567 eISSN: 2517-7567AbstractPublished hereWe find ourselves at a unique point of time in history. Following over two millennia of debate amongst some of the greatest minds that ever existed about the nature of morality, the philosophy of ethics and the attributes of moral agency, and after all that time still not having reached consensus, we are coming to a point where artificial intelligence (AI) technology is enabling the creation of machines that will possess a convincing degree of moral competence. The existence of these machines will undoubtedly have an impact on this age old debate, but we believe that they will have a greater impact on society at large, as AI technology deepens its integration into the social fabric of our world. The purpose of this special issue on Computing Morality is to bring together different perspectives on this technology and its impact on society. The special issue contains four very different and inspiring contributions.
-
Rolf M, Neumann K, Quei{ß}er J, Reinhart R, Nordmann A, Steil JJ, 'A multi-level control architecture for the bionic handling assistant'
Advanced Robotics 29 (13) (2015) pp.847-859
ISSN: 0169-1864AbstractThe bionic handling assistant is one of the largest soft continuum robots and very special in being a pneumatically operated platform that is able to bend, stretch, and grasp in all directions. It nevertheless shares many challenges with smaller continuum and other soft robots such as parallel actuation, complex movement dynamics, slow pneumatic actuation, non-stationary behavior, and a lack of analytic models. To master the control of this challenging robot, we argue for a tight integration of standard analytic tools, simulation, control, and state-of-the-art machine learning into an overall architecture that can serve as blueprint for control design also beyond the BHA. To this aim, we show how to integrate specific modes of operation and different levels of control in a synergistic manner, which is enabled by using modern paradigms of software architecture and middleware. We thereby achieve an architecture with unique overall control abilities for a soft continuum robot that allow for flexible experimentation toward compliant user-interaction, grasping, and online learning of internal models.Published here -
Rolf M, Steil JJ, 'Efficient exploratory learning of inverse kinematics on a bionic elephant trunk'
IEEE Transactions on Neural Networks and Learning Systems 25 (6) (2014) pp.1147-1160
ISSN: 2162-237X -
Miyazaki M, Takahashi H, Rolf M, Okada H, Omori T, 'The image-scratch paradigm: a new paradigm for evaluating infants' motivated gaze control'
Scientific Reports 4 (2014)
ISSN: 2045-2322 -
Rolf M, Asada M, 'Where do goals come from? A generic approach to autonomous goal-system development'
arXiv (2014)
ISSN: 2331-8422 -
Rolf M, Steil JJ, 'Explorative learning of inverse models: a theoretical perspective'
Neurocomputing 131 (2013) pp.2-14
ISSN: 0925-2312 -
Neumann K, Rolf M, Steil JJ, 'Reliable Integration of Continuous Constraints into Extreme Learning Machines'
International Journal of Uncertainty, Fuzziness and Knowledge-Based Systems (2013)
-
Rolf M, Hanheide M, Rohlfing KJ, 'Attention via synchrony: Making use of multimodal cues in social learning'
IEEE Transactions on Autonomous Mental Development 1 (1) (2009) pp.55-67
ISSN: 1943-0604
Conference papers
-
Al-Husaini Y, Rolf M, 'Getting Priorities Right: Intrinsic Motivation with Multi-Objective Reinforcement Learning'
(2022)
AbstractPublished here Open Access on RADARIntrinsic motivation is a common method to facilitate exploration in reinforcement learning agents. Curiosity is thereby supposed to aid the learning of a primary goal. However, indulging in curiosity may also stand in conflict with more urgent or essential objectives such as self-sustenance. This paper addresses the problem of balancing curiosity, and correctly prioritising other needs in a reinforcement learning context. We demonstrate the use of the multi-objective reinforcement learning framework C-MORE to integrate curiosity, and compare results to a standard linear reinforcement learning integration. Results clearly demonstrate that curiosity can be modelled with the priority-objective reinforcement learning paradigm. In particular , C-MORE is found to explore robustly while maintaining self-sustenance objectives, whereas the linear approach is found to over-explore and take unnecessary risks. The findings demonstrate a significant weakness of the common linear integration method for intrinsic motivation, and the need to acknowledge the potential conflicts between curiosity and other objectives in a multi-objective framework.
-
Al-Husaini Y, Rolf M, 'Priority-Objective Reinforcement Learning'
(2021)
AbstractPublished here Open Access on RADARIntelligent agents often have to cope with situations in which their various needs must be prioritised. Efforts have been made, in the fields of cognitive robotics and machine learning, to model need prioritization. Examples of existing frameworks include normative decision theory, the subsumption architecture and reinforcement learning. Reinforcement learning algorithms oriented towards active goal prioritization include the options framework from hierarchical reinforcement learning and the ranking approach as well as the MORE framework from multi-objective reinforcement learning. Previous approaches can be configured to make an agent function optimally in individual environments, but cannot effectively model dynamic and efficient goal selection behaviour in a generalisable framework. Here, we propose an altered version of the MORE framework that includes a threshold constant in order to guide the agent towards making economic decisions in a broad range of ‘priority-objective reinforcement learning’ (PORL) scenarios. The results of our experiments indicate that pre-existing frameworks such as the standard linear scalarization, the ranking approach and the options framework are unable to induce opportunistic objective optimisation in a diverse set of environments. In particular, they display strong dependency on the exact choice of reward values at design time. However, the modified MORE framework appears to deliver adequate performance in all cases tested. From the results of this study, we conclude that employing MORE along with integrated thresholds, can effectively simulate opportunistic objective prioritization in a wide variety of contexts.
-
Gama F, Shcherban M, Rolf M, Hoffmann M, 'Active exploration for body model learning through self-touch on a humanoid robot with artificial skin'
(2020)
ISBN: 978-1-7281-7306-1AbstractPublished hereThe mechanisms of infant development are far from understood. Learning about one's own body is likely a foundation for subsequent development. Here we look specifically at the problem of how spontaneous touches to the body in early infancy may give rise to first body models and bootstrap further development such as reaching competence. Unlike visually elicited reaching, reaching to own body requires connections of the tactile and motor space only, bypassing vision. Still, the problems of high dimensionality and redundancy of the motor system persist. In this work, we present an embodied computational model on a simulated humanoid robot with artificial sensitive skin on large areas of its body. The robot should autonomously develop the capacity to reach for every tactile sensor on its body. To do this efficiently, we employ the computational framework of intrinsic motivations and variants of goal babbling-as opposed to motor babbling-that prove to make the exploration process faster and alleviate the ill-posedness of learning inverse kinematics. Based on our results, we discuss the next steps in relation to infant studies: what information will be necessary to further ground this computational model in behavioral data.
-
Vaswani Bhavnani C, Rolf M, 'Attitudes towards a handheld robot that learns Proxemics'
(2020)
ISBN: 978-1-7281-7306-1AbstractPublished hereRobots that cohabitate in social spaces must abide by the same behavioural cues humans follow, including interpersonal distancing. Proxemics investigates the appropriate distances and the impact of factors affecting it, such as gender and age. This paper investigates people's attitudes towards a robot that can learn Proxemics rules by gauging direct individual feedback from a person, and utilizing it in a reinforcement learning framework. Previous learning attempts have relied on larger robots, for which physical safety is a primary concern. In contrast, our study uses a handheld sized robot that allows us to focus on the impact of distance on engageability in dialogue. General consensus between interviewees was a feeling of ease and safety during interactions, as well as disparity regarding the invasion of personal space, which was influenced by cultural background.
-
Rolf M, 'The Need for MORE: Need Systems as Non-Linear Multi-Objective Reinforcement Learning'
(2020)
ISBN: 978-1-7281-7306-1AbstractPublished hereBoth biological and artificial agents need to coordinate their behavior to suit various needs at the same time. Reconciling conflicts of different needs and contradictory interests such as self-preservation and curiosity is the central difficulty arising in the design and modelling of need and value systems. Current models of multi-objective reinforcement learning do either not provide satisfactory power to describe such conflicts, or lack the power to actually resolve them. This paper aims to promote a clear understanding of these limitations, and to overcome them with a theory-driven approach rather than ad hoc solutions. The first contribution of this paper is the development of an example that demonstrates previous approaches' limitations concisely. The second contribution is a new, non-linear objective function design, MORE, that addresses these and leads to a practical algorithm. Experiments show that standard RL methods fail to grasp the nature of the problem and ad-hoc solutions struggle to describe consistent preferences. MORE consistently learns a highly satisfactory solution that balances contradictory needs based on a consistent notion of optimality.
-
Baker T, Rolf M, 'Tactile feedback in a tele-operation pick-and-place task improves perceived workload'
(2020) pp.261-273
ISBN: 9783030634858 eISBN: 9783030634865AbstractPublished here Open Access on RADARRobotic tele-operation systems have vast potential in areas ranging from surgical robotics and underwater exploration to disposing of toxic, explosive and nuclear materials. While visual camera feeds for the human operator are typically available and well studied, tactile sensory information is often vital for successful and efficient manipulation. Previous studies have largely focused on execution time alone as measure of success of feedback methods on individual tasks. The present study complements this by a comparative analysis of vibration and visual feedback of tactile information across a range of manipulation tasks. Results show a significant reduction in perceived workload with the implementation of vibration feedback and an improvement of error rates for visual feedback. Contrary to expectation, we did not find a reduction in task completion time. The negative finding on completion time challenges the belief that the mere existence of task-relevant feedback aids efficient task completion. The reduced workload, however, clearly points out potential for enhancing performance on more difficult and prolonged tasks with highly skilled operators.
-
Idries MO, Rolf M, olde Scheper TV, 'Exploration: Do We Need a Map?'
(2019)
AbstractPublished hereExploration is one of the fundamental problems in mobile robotics. Efforts to address this problem made over the past two decades divide into two approaches: reactive approaches, that make only instantaneous decisions, and map-based approaches involving e.g. grid, metric, or topological representations. Comparative studies have so far largely focused on comparing different map-based algorithms, while no common framework to compare them to purely reactive approaches currently exists. This paper aims at creating a framework to simulate, evaluate, and compare exploratory algorithms as different as reactive and map-based approaches. Preliminary results are demonstrated for two reactive algorithms, random walk and wall follower, and one map based approach, pheromone potential field, have been implemented. Measurements of navigation success, time to success, as well as computational and memory usage reveal a dominance of simple wall-following over the map-based potential field approach, and a distinct load/efficacy trade off for random walks. These preliminary results challenge the common assumptions that maps are needed for successful and efficient exploration and navigation.
-
Rolf M, Crook N, Steil J, 'From Social Interaction to Ethical AI: A Developmental Roadmap'
(2018)
eISSN: 2161-9484AbstractPublished here Open Access on RADARAI and robot ethics have recently gained a lot of attention because adaptive machines are increasingly involved in ethically sensitive scenarios and cause incidents of public outcry. Much of the debate has been focused on achieving highest moral standards in handling ethical dilemmas on which not even humans can agree, which indicates that the wrong questions are being asked. We suggest to address this ethics debate strictly through the lens of what behavior seems socially acceptable, rather than idealistically ethical. Learning such behavior puts the debate into the very heart of developmental robotics. This paper poses a roadmap of computational and experimental questions to address the development of socially acceptable machines. We emphasize the need for social reward mechanisms and learning architectures that integrate these while reaching beyond limitations of plain reinforcement learning agents. We suggest to use the metaphor of “needs” to bridge rewards and higher level abstractions such as goals for both communication and action generation in a social context. We then suggest a series of experimental questions and possible platforms and paradigms to guide future research in the area.
-
Barker S, Izadi H, Crook NT, Hayatleh K, Rolf M, Hughes P, Fellows N, 'Natural head movement for HRI with a muscular-skeletal head and neck robot'
(2017) pp.587-592
eISSN: 1944-9437 ISBN: 9781538635186AbstractThis paper presents a study of the movements of a humanoid head-and-neck robot called Eddie. Eddie has a musculo-skeletal structure similar to that found in human necks enabling it to perform head movements that are comparable with human head movements. This study compares the movements of Eddie with those of a more conventional robotic neck structure and with those of a human head. Results show that Eddie’s movements are perceived as significantly more natural and by trend more lifelike than the conventional head’s. No differences were found with respect to the impression of humanlikeness, consciousness, and elegance.Published here Open Access on RADAR -
Rolf M, Crook NT, 'What if: robots create novel goals? Ethics based on social value systems'
1668 (2016) pp.20-25
AbstractFuture personal robots might possess the capability to autonomously generate novel goals that exceed their initial programming as well as their past experience. We discuss the ethical challenges involved in such a scenario, ranging from the construction of ethics into such machines to the standard of ethics we could actually demand from such machines. We argue that we might have to accept those machines committing human-like ethical failures if they should ever reach human-level autonomy and intentionality. We base our discussion on recent ideas that novel goals could be originated from agents’ value system that express a subjective goodness of world or internal states. Novel goals could then be generated by extrapolating what future states would be good to achieve. Ethics could be built into such systems not just by simple utilitarian measures but also by constructing a value for the expected social acceptance of a the agent’s conduct.Published here Open Access on RADAR -
Rolf M, Asada M, 'Latent Goal Analysis for Dimension Reduction in Reinforcement Learning'
(2015) pp.26-30
AbstractIn contrast to reinforcement learning, adaptive control formulations [Nguyen-Tuong and Peters, 2011] already come with expressive and typically low-dimensional goal and task representations, which have been generally considered more expressive than the RL setting [Kaelbling et al., 1996]. Goal and actual values in motor control define a relation similar [Rolf and Steil, 2014] to actual and target outputs in classical supervised learning settings by providing “directional information” in contrast to a mere “magnitude of an error” in reinforcement learning [Barto, 1994]. Recent work [Rolf and Asada, 2014] however showed that these two problem formulations can be transformed into each other. Hence, highly descriptive task representations can be extracted out of reinforcement learning problems by transforming them into adaptive control problems. After introducing the method called Latent Goal Analysis, we discuss the possible application of this approach as dimension reduction technique in reinforcement learning. Experimental results in a web recommender scenario confirm the potential of this technique.Published here -
Rolf M, Asada M, 'What are goals? And if so, how many?'
(2015)
ISBN: 978-1-4673-9320-1/15AbstractGoals are concepts used in many different areas of robotics, artificial intelligence, psychology, neuroscience, and also philosophy. Despite the wide usage, there is no common definition of a “goal”. Rather, the term is used in substantially different ways even within disciplines. This paper discusses these notions and potentially unified views on goals, and points out how different perspectives on the same term lead to different arguments and can cause communication difficulties in the interdisciplinary community. We discuss how far goal terminologies can be generally considered as desired end states of action and point out the pivotal aspect of their explicit representation. As a major point we discuss the relation of such goals with reward and value systems from various perspectives.Published here -
Queisser JF, Neumann K, Rolf M, Reinhart RF, Steil JJ, 'An active compliant control mode for interaction with a pneumatic soft robot'
(2014) pp.573-579
Published here
Other publications
-
Rolf M, Asada M, 'Autonomous development of goals: From generic rewards to goal and self detection', (2014)
-
Rolf M, Asada M, 'Latent Goal Analysis: Learning goals and body schema from generic rewards', (2014)
-
Rolf M, Asada M, 'Visual Attention by Audiovisual Signal-Level Synchrony', (2014)
-
Schillingmann L, Rolf M, Kumagaya S, Ayaya S, Nagai Y, 'Assistance for Autistic People by Segmenting and Highlighting Cross-Modal Perceptual Information', (2013)
-
Rolf M, 'Goal babbling with unknown ranges: A direction-sampling approach', (2013)
-
Rolf M, Asada M, 'Learning Inverse Models in High Dimensions with Goal Babbling and Reward-Weighted Averaging', (2013)
-
Reinhart RF, Rolf M, 'Learning versatile sensorimotor coordination with goal babbling and neural associative dynamics', (2013)
-
Rolf M, Asada M, 'Motor synergies are naturally observed during goal babbling', (2013)
-
Rolf M, Gienger M, Steil JJ, 'Robot control with bootstrapping inverse kinematics', (2013)
-
Rolf M, Steil JJ, 'Constant curvature continuum kinematics as fast approximate model for the Bionic Handling Assistant', (2012)
-
Rolf M, Steil JJ, 'Explorative learning of right inverse functions: theoretical implications of redundancy', (2012)
-
Rolf M, Steil JJ, 'Goal Babbling: a New Concept for Early Sensorimotor Exploration', (2012)
-
Rolf M, 'Goal babbling for an efficient bootstrapping of inverse models in high dimensions', (2012)
-
Nordmann A, Rolf M, Wrede S, 'Software Abstractions for Simulation and Control of a Continuum Robot', (2012)
-
Rolf M, Steil JJ, Gienger M, 'Online goal babbling for rapid bootstrapping of inverse models in high dimensions', (2011)
-
Rolf M, Steil JJ, 'Online learning in the loop: fast explorative learning of inverse models in high dimensions', (2011)
-
Rolf M, Steil JJ, Gienger M, 'Bootstrapping inverse kinematics with Goal Babbling', (2010)
-
Ajallooeian M, Gay S, Ijspeert A, Khansari-Zadeh M, Kim S, Billard A, Rückert E, Neumann G, Waegeman T, Schrauwen B, others, 'Comparative evaluation of approaches in T. 4.1-4.3 and working definition of adaptive module', (2010)
-
Rolf M, Steil JJ, Gienger M, 'Learning flexible full body kinematics for humanoid tool use', (2010)
-
Neumann K, Rolf M, Steil JJ, Gienger M, 'Learning inverse kinematics for pose-constraint bi-manual movements', (2010)
-
Rolf M, Steil JJ, Gienger M, 'Mastering Growth while Bootstrapping Sensorimotor Coordination', (2010)
-
Rolf M, Hanheide M, Rohlfing K, 'Attention Manipulation with Multimodal Synchrony', (2009)
-
Rolf M, Steil JJ, Gienger M, 'Efficient exploration and learning of whole body kinematics', (2009)
-
Rolf M, Hanheide M, Rohlfing KJ, 'The use of synchrony in parent-child interaction can be measured on a signal-level', (2009)
-
Rolf M, 'Audiovisual attention via Synchrony', (2008)
Further details
Press, publicity and reviews
- Ulrich Eberl, 2016: Smarte Maschinen: Wie Künstliche Intelligenz unser Leben verändert
- Technology Review, 2015: Das Kind im Roboter - Eingebaute Kindheit macht Maschinen schlauer
- New Scientist, 2014: Robot elephant trunk learns motor skills like a baby
- BI.Research, 2013: Robotern das Lernen beibringen
- IEEE Spectrum Video Friday, 2012: Curiosity Learns to Scoop, Robot Tentacle Learns to Grab, iCub Learns About Rolling