Cybathlon - Legged Mobile Assistance for Quadriplegics
Abstract
Assistance robots are the future for people who need daily care due to limited mobility or being wheelchair-bound. Current solutions that attach robotic arms to motorized wheelchairs provide only limited additional functionality at the cost of increased size. We present a mouth joystick control interface, augmented with voice commands, for an independent quadrupedal assistance robot with an arm. We validate and showcase our system in the Cybathlon [1] Challenges February 2024 Assistance Robot Race, where we solve four everyday tasks in record time, winning first place. Our system remains generic and lays the foundation for a platform that could help and provide independence in the everyday lives of people in wheelchairs.
I INTRODUCTION

People with severe motor impairments (e.g., from muscular disease, spinal cord injury, cerebral palsy, or other neurological conditions) or missing limbs struggle with day-to-day tasks. Currently, they rely on dedicated help from nurses or other caretakers. This limits their independence, which is an important aspect of health and happiness [2]. Automated robotic solutions can provide the necessary tools for people with reduced mobility to perform more tasks on their own [3], and are the future of improved quality of life and healthcare. However, these solutions are still in their early stages [4], and there is a need both to raise awareness and to push towards stable platforms that can be certified.
Cybathlon [1, 5] is a competition dedicated to showcasing and advancing technologies that help people with disabilities, e.g., visual impairment, reliance on a wheelchair, or limb loss requiring a prosthesis. The Robotic Assistance Race (ROB) in Cybathlon focuses on a robotic assistance platform for people who are quadriplegic. The platform can take the form of either an arm attached to the wheelchair or a completely separate mobile robot that the pilot controls remotely. The race consists of a set of everyday tasks that the pilot has to complete on their own, utilizing only the robotic assistant. The tasks in the February Cybathlon Challenge [6] were: collecting a package from a mailbox, brushing the pilot’s teeth, hanging a scarf on a clothesline, and emptying a dishwasher.
Assistive robots will play a key role in people’s lives, as current technologies are being developed for rehabilitation [7], personal assistance [8], and social companionship [9]. Specifically for people with limited mobility who are confined to wheelchairs, most technologies focus on either smart wheelchairs [10] or a robotic arm attached to the wheelchair [11, 12, 13]. With robotic arms directly attached to the wheelchair, the major limitations are reduced mobility of the equipped wheelchair and a shared battery supply. In this case, the power and lifetime of the arm have to be limited to prevent the entire wheelchair from becoming inoperable. For example, the commercial iARM [14, 13] arm attachment from Assistive Innovations can only lift 1.5 kg [13]. Size is also a limiting factor: too big, and the arm becomes cumbersome to move with the wheelchair; too small, and its workspace is very limited.
Our proposed solution is the use of an independent quadrupedal robotic platform, ANYmal [15], with a robotic arm, DynaArm [16], on top. This setup offers more versatility than mounting an arm directly on the wheelchair: it can go places the wheelchair cannot, it has a separate power supply, and it does not limit the wheelchair’s mobility or bloat its size. The independent robot can perform tasks autonomously without the pilot’s direct supervision, e.g., fetch things from another room or empty a dishwasher while the pilot does something else. It also poses less of a safety risk, as any system failure is less likely to affect the wheelchair’s basic mobility, and the pilot neither has to carry a larger battery nor remain in constant proximity to a moving robotic arm. To the best of our knowledge, our system is the first walking robotic personal care assistant proposed for people with limited mobility [17, 18].
We showcased our system at the Cybathlon Challenges February 2024, where the pilot had to perform four predefined tasks in a racetrack scenario. The pilot operates the robot through a mouth joystick interface, which gives them direct control of its movements. Furthermore, we automate some generic actions, such as bringing an object to the mouth, which is crucial in day-to-day life, e.g., for eating or brushing one’s teeth. Finally, to enable unlimited high-level instruction without the need for tedious GUI menus, we integrate a voice command interface. Our system won its category in the challenge, achieving a full score and a best time of 6 minutes and 34 seconds. Although some of the task descriptions in the challenge are very specific, in this work, we present a versatile general-purpose system that can perform a variety of other tasks and actions.
II SYSTEM
The system consists of three main components: (i) an ANYbotics ANYmal D body, which serves as the base of the robot and allows for quick and robust locomotion across the track; (ii) a Dyna-Tech DynaArm, attached to the top of the body and used for precise manipulation; (iii) a Quadstick gaming controller, which is used as the main control input device by the pilot. Our system is operated by Samuel Kunz, a mechanical engineer who became quadriplegic after a swimming accident in 2014. An overview of the system and the main components is presented in Figure 1.
II-A ANYmal with an Arm
The robot is a combination of an ANYbotics ANYmal D body, which serves as the quadrupedal base, and a DynaTech DynaArm [16], for manipulation. The ANYmal body can reach a maximum walking speed of m/s, allowing us to move between competition tasks quickly, and is capable of carrying a kg payload [19]. It is largely unmodified from its original configuration, the main difference being the custom payload, which consists of an additional computer (Jetson Xavier) and the robotic arm. The DynaArm is our robotic arm of choice due to its high payload-to-weight ratio: it weighs eight kilograms and can continuously carry the same weight [16]. It is equipped with a parallel gripper end-effector (Robotiq 2F-140) and an L515 Intel Realsense depth camera, which is mounted at the elbow joint and utilized as the primary RGB-D input for automation tasks.
II-B Moving the System
We have two main ways of actuating the robot and giving it pose commands, following two different control paradigms. The first is Ocs2 [20], a precise MPC-based controller used for stable 6 Degrees of Freedom (DoF) control of the end-effector. Although this controller is not used for locomotion, it has access to the full system, allowing the base to tilt and translate while keeping all feet in contact with the ground. This significantly extends the robot’s workspace. For example, the base can pitch down when retrieving something from the floor or lean forward to grab something far away, reducing the risk of the robot tipping over. Additionally, access to the full system enables it to autonomously avoid self-collisions between the arm and the base, easing control for the pilot. Compared to learning-based full-body controllers [21], its model-based nature allows it to retain stable and high-precision end-effector pose control. However, its locomotion capabilities are limited. Therefore, movement along the race track is achieved through our second control method: a learned policy, trained in simulation with reinforcement learning [22]. The policy operates in 3 DoF, accepting x, y, and yaw targets for the base. At present, the two controllers are separate entities between which the pilot can seamlessly switch to move either the base or the end-effector of the robot. In the future, we aim to unify them into a single learned policy that combines the advantages of both.
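To make this split concrete, the following sketch shows one possible dispatch layer that forwards pilot commands either to a 6-DoF end-effector pose target (for the MPC-based controller) or to a 3-DoF base target (for the learned locomotion policy). All class names and the send_target interfaces are hypothetical placeholders, not the actual APIs of our software stack.

```python
from dataclasses import dataclass
from enum import Enum, auto


class Controller(Enum):
    """Which control entity currently receives pilot input."""
    END_EFFECTOR = auto()  # MPC-based whole-body controller, 6-DoF pose targets
    BASE = auto()          # learned locomotion policy, 3-DoF (x, y, yaw) targets


@dataclass
class EndEffectorTarget:
    x: float
    y: float
    z: float
    roll: float
    pitch: float
    yaw: float


@dataclass
class BaseTarget:
    x: float
    y: float
    yaw: float


class CommandDispatcher:
    """Routes pilot axis commands to whichever controller is currently active."""

    def __init__(self, mpc_interface, locomotion_interface):
        self.mpc = mpc_interface                # hypothetical wrapper around the MPC target input
        self.locomotion = locomotion_interface  # hypothetical wrapper around the policy target input
        self.active = Controller.END_EFFECTOR

    def toggle(self) -> None:
        """Switch between end-effector and base control."""
        self.active = (Controller.BASE if self.active is Controller.END_EFFECTOR
                       else Controller.END_EFFECTOR)

    def send(self, axes: dict) -> None:
        """Forward axis values (missing axes default to zero) to the active controller."""
        if self.active is Controller.END_EFFECTOR:
            self.mpc.send_target(EndEffectorTarget(
                axes.get("x", 0.0), axes.get("y", 0.0), axes.get("z", 0.0),
                axes.get("roll", 0.0), axes.get("pitch", 0.0), axes.get("yaw", 0.0)))
        else:
            self.locomotion.send_target(BaseTarget(
                axes.get("x", 0.0), axes.get("y", 0.0), axes.get("yaw", 0.0)))
```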
II-C Interfacing with the Pilot
The pilot maneuvers himself along the race track using his own motorized wheelchair, to which we attach both a display laptop and a Quadstick gaming controller for the duration of the competition. The Quadstick controls are discussed in depth in Section II-D. The display laptop is used to relay important information about the system to the pilot, such as the joint states of the arm, and to receive high-level input from the pilot, such as activating specific autonomy tasks via voice commands.
II-D Quadstick Interface
One of our core contributions is the pilot’s control interface via the Quadstick. The nature of Cybathlon as a race with extremely limited time for the completion of tasks demands a control scheme that is precise and robust while remaining intuitive for the pilot. The physical interface of the Quadstick is fairly limited and consists of a mouthpiece and a push button jointly mounted on a two-axis joystick. The mouthpiece itself contains four breath-based input channels, each of which can register no input, blowing, or sucking in a discrete manner.
A difficulty here lies in reducing the complexity of controls without compromising performance. The system possesses nine degrees of freedom, three in the base and six in the arm. These nine degrees of freedom, as well as the switching between base and end-effector controllers, need to be mapped down to the seven available input channels and the primary and secondary axis mappings of the Quadstick.
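As a concrete illustration of this input space, the sketch below models the raw Quadstick state described above and discretizes each breath channel into suck, neutral, or blow. The field names and the pressure threshold are illustrative assumptions; the real device resolves the discrete states in its own firmware.

```python
from dataclasses import dataclass
from enum import IntEnum


class Breath(IntEnum):
    """Discrete state of one breath-based (sip-and-puff) channel."""
    SUCK = -1
    NEUTRAL = 0
    BLOW = 1


@dataclass
class QuadstickState:
    """Raw pilot input: four breath channels, one push button, two joystick axes."""
    pressures: tuple   # raw pressure per channel, e.g. normalized to [-1, 1]
    button: bool       # push button mounted together with the mouthpiece on the joystick
    joy_x: float       # horizontal joystick axis in [-1, 1]
    joy_y: float       # vertical joystick axis in [-1, 1]


def discretize(pressure: float, threshold: float = 0.3) -> Breath:
    """Map a raw pressure reading to the three discrete breath states.

    The dead-band threshold is a hypothetical value chosen for illustration.
    """
    if pressure > threshold:
        return Breath.BLOW
    if pressure < -threshold:
        return Breath.SUCK
    return Breath.NEUTRAL
```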


To make the end-effector control more intuitive, we implemented two different control modes using Ocs2, which the pilot can toggle between. These differ in two ways: first, in their initial arm configuration, shown in Figure 2; second, in their secondary joystick axis mapping, which is elaborated upon in Section II-D2. The first control mode is referred to in Table I as EE Control Mode Front, the second as EE Control Mode Top. The third control mode we implement is referred to as Base Control Mode, in which the pilot can move the base of the robot around using the learned locomotion policy while the arm remains frozen.
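The three control modes can be viewed as a small state machine. The sketch below simply names them and records which low-level controller each mode drives; the string identifiers are illustrative, and the initial arm configurations of Figure 2 are referenced by name only.

```python
from enum import Enum, auto


class ControlMode(Enum):
    """Pilot-selectable control modes (cf. Table I and Figure 2)."""
    EE_FRONT = auto()  # end-effector control, arm starting in the front configuration
    EE_TOP = auto()    # end-effector control, arm starting in the top configuration
    BASE = auto()      # base control via the learned locomotion policy, arm frozen


# Which low-level controller each mode drives (illustrative string identifiers).
ACTIVE_CONTROLLER = {
    ControlMode.EE_FRONT: "whole_body_mpc",
    ControlMode.EE_TOP: "whole_body_mpc",
    ControlMode.BASE: "learned_locomotion",
}
```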
The following explains the control scheme in two parts: first, we describe the layout of the Quadstick and the mode-independent input assignments; subsequently, we explain the mode-specific channel mappings and the primary and secondary axis assignments.
II-D1 Mode-Independent Inputs
The left-most breath-based input channel (referred to as Channel 0 in Figure 3) and the separate push button are used to switch between control modes. The middle-right input channel (Channel 2 in Figure 3) controls the opening and closing of the gripper. The right-most input channel (labeled Mapping Mode Switch) cannot be used directly for control, as it is used internally by the Quadstick firmware to switch between the primary and secondary mappings of the joystick axes.
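A minimal sketch of how these mode-independent channels could be interpreted, reusing the Breath, QuadstickState, and ControlMode types from the sketches above; the mode-cycling order and the gripper interface are assumptions made for illustration only.

```python
# Ordered list used to cycle through modes; the cycling order is an assumption.
MODE_CYCLE = [ControlMode.EE_FRONT, ControlMode.EE_TOP, ControlMode.BASE]


def handle_mode_independent(state: QuadstickState, mode: ControlMode, gripper) -> ControlMode:
    """Interpret Channel 0, the push button, and Channel 2 (gripper).

    Channel 3 (the mapping-mode switch) is consumed by the Quadstick firmware
    itself and therefore never reaches this layer.
    """
    ch0 = discretize(state.pressures[0])  # mode-switch channel
    ch2 = discretize(state.pressures[2])  # gripper channel

    # Channel 0 (blow) or the push button advances to the next control mode.
    if ch0 is Breath.BLOW or state.button:
        mode = MODE_CYCLE[(MODE_CYCLE.index(mode) + 1) % len(MODE_CYCLE)]

    # Channel 2 opens or closes the parallel gripper (hypothetical gripper interface).
    if ch2 is Breath.BLOW:
        gripper.open()
    elif ch2 is Breath.SUCK:
        gripper.close()

    return mode
```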

II-D2 Mode-Specific Inputs
Although the Quadstick only has one physical joystick, it is possible to switch between two different axis assignments, denoted here as primary and secondary axis mapping modes, by using the rightmost breath input channel. Repeated mode switching would substantially slow down control as this switch takes a few seconds to complete. Therefore, the mappings are designed so that the majority of movement can be completed using the primary axis mapping, and the secondary is only needed for final adjustments to the manipulation angle.
The primary mapping is shared across all modes and intuitively maps the horizontal and vertical joystick axes to x and y movement of the base or end-effector. The middle-left channel (referred to as Channel 1 in Figure 3) is mapped in a controller-specific manner: in all modes, it controls the third axis of the currently active controller, the first two being covered by the joystick. For the base control mode, this corresponds to the yaw axis, as the x and y axes are mapped to the joystick. For end-effector control, we map this channel to the z-axis, so that all translational axes are available in the primary mapping mode.
The design of the secondary axis mappings is less straightforward, and the final mapping, shown in Table I, is the result of an iterative testing process with the pilot. Both the front and top configurations share a mapping of roll onto the horizontal axis. However, the vertical axis maps to yaw in EE Control Mode Front and to pitch in EE Control Mode Top, as these are the most useful axes to actuate in the respective configuration. For example, in EE Control Mode Front, controlling yaw allows for grasping handles or objects that are not vertical. The secondary mapping remains unused in Base Control Mode.
TABLE I: Quadstick joystick axis mappings for each control mode.

| Mapping | Axis | Base Control Mode | EE Control Mode Front | EE Control Mode Top |
|---|---|---|---|---|
| Primary | horizontal | x | x | x |
| Primary | vertical | y | y | y |
| Secondary | horizontal | - | roll | roll |
| Secondary | vertical | - | yaw | pitch |
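Continuing the running sketch from above, the mapping of Table I, together with the Channel 1 assignment of the primary mode, can be written down as a simple lookup table:

```python
# Joystick axis mapping per control mode, following Table I.
# Keys: (mapping mode, joystick axis) -> controlled robot axis (None = unmapped).
AXIS_MAP = {
    ControlMode.BASE: {
        ("primary", "horizontal"): "x",
        ("primary", "vertical"): "y",
        ("secondary", "horizontal"): None,
        ("secondary", "vertical"): None,
    },
    ControlMode.EE_FRONT: {
        ("primary", "horizontal"): "x",
        ("primary", "vertical"): "y",
        ("secondary", "horizontal"): "roll",
        ("secondary", "vertical"): "yaw",
    },
    ControlMode.EE_TOP: {
        ("primary", "horizontal"): "x",
        ("primary", "vertical"): "y",
        ("secondary", "horizontal"): "roll",
        ("secondary", "vertical"): "pitch",
    },
}

# In the primary mapping, Channel 1 supplies the third axis of the active
# controller: yaw for the base, z for both end-effector modes.
CHANNEL_1_AXIS = {
    ControlMode.BASE: "yaw",
    ControlMode.EE_FRONT: "z",
    ControlMode.EE_TOP: "z",
}
```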

II-E Autonomous Face Touching
During the February Challenge, one of the tasks included touching the pilot’s mouth with a toothbrush without the pilot moving his head toward the brush. As the Quadstick interface must be situated at the pilot’s mouth, using it to teleoperate the brush into contact with the mouth is not a viable option. We therefore automate this task, keeping in mind that it is also generically applicable to other sorts of daily tasks, such as eating food.
We detect the operator’s face and mouth using SCRFD [23]. The distance to the face is calculated using the depth measurements of the L515 RGB-D camera mounted on the DynaArm. As operating in close proximity to the pilot is extremely safety-critical, additional safety measures are implemented. Firstly, the system is equipped with a force-torque sensor situated between the last link of the DynaArm and the end-effector. When in close proximity to the pilot, the trajectory-following module actively listens for potential collision events reported by the sensor, and if one is detected, it halts the approach and retracts the arm in a safe manner. Additionally, if the pilot feels the approach is unsafe, he can use voice commands to prematurely elicit a safe retraction of the arm.
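The sketch below outlines the structure of such a guarded approach: detect the mouth in the RGB image, recover its 3-D position from the aligned depth image, and step towards it while aborting on a force spike or a voice stop command. The helper names (detect_mouth_pixel, plan_linear_approach, retract_safely) and the force threshold are hypothetical placeholders and do not correspond to our actual pipeline code.

```python
import numpy as np

FORCE_ABORT_N = 8.0  # hypothetical contact threshold on the wrist force-torque sensor [N]


def pixel_to_point(u, v, depth_m, fx, fy, cx, cy):
    """Back-project a pixel with known depth into the camera frame (pinhole model)."""
    x = (u - cx) * depth_m / fx
    y = (v - cy) * depth_m / fy
    return np.array([x, y, depth_m])


def guarded_mouth_approach(camera, detector, arm, ft_sensor, stop_requested):
    """Approach the pilot's mouth, aborting on contact or a voice stop command."""
    rgb, depth, intrinsics = camera.capture()    # aligned RGB-D frame, intrinsics = (fx, fy, cx, cy)
    u, v = detector.detect_mouth_pixel(rgb)      # e.g. SCRFD-based face/landmark detection
    target = pixel_to_point(u, v, depth[v, u], *intrinsics)

    for waypoint in arm.plan_linear_approach(target):  # small Cartesian steps towards the mouth
        # Abort if the force-torque sensor reports a collision or the pilot says "stop".
        if np.linalg.norm(ft_sensor.force()) > FORCE_ABORT_N or stop_requested():
            arm.retract_safely()
            return False
        arm.move_to(waypoint)

    arm.retract_safely()  # withdraw to a safe pose after the touch
    return True
```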
II-F Voice Command Interface
Using a computer interface can be difficult with limited mobility, and additional physical buttons are difficult to add and can become confusing. To make our system more user-friendly and responsive, we use a voice command interface for activating commands that are rarely used but necessary. For this, we first translate human speech to text using Whisper [24]. Then, we filter the text, searching for keywords that are mapped in a predefined manner to possible commands. For the February Challenge, the only exposed commands were: starting the face-touching pipeline, interrupting face touching with a safe retreat protocol, and a general “stop” command to interrupt any current execution. For real-world deployment, using a Large Language Model (LLM) to translate natural speech into commands would provide a more intuitive solution. However, in the limited context of the Cybathlon competition, where time is a critical aspect, LLM execution is too slow.
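A minimal sketch of this pipeline, assuming the open-source whisper Python package and an utterance already recorded to an audio file; the keyword-to-command table is illustrative, and the actual trigger words may differ.

```python
import whisper

# Illustrative keyword -> high-level command table (actual trigger words may differ).
COMMANDS = {
    "brush": "start_face_touching_pipeline",
    "retreat": "safe_arm_retraction",
    "stop": "interrupt_current_execution",
}

model = whisper.load_model("base")  # model size chosen for speed; an illustrative choice


def voice_to_commands(audio_path: str) -> list:
    """Transcribe an utterance and return the commands whose keywords appear in it."""
    text = model.transcribe(audio_path)["text"].lower()
    return [command for keyword, command in COMMANDS.items() if keyword in text]


# Example: if the pilot says "stop", voice_to_commands("utterance.wav")
# returns ["interrupt_current_execution"].
```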
III RESULTS
As the system is mainly teleoperated by the pilot, their proficiency is integral to its performance. How much training time is needed depends largely on how intuitive the control scheme of the system is [14]. During training, the pilot practiced executing the challenge tasks on a mock race track and gave feedback for the refinement of the control scheme and the GUI visualization of the system. Our pilot, Samuel Kunz, trained for roughly 15 hours over five sessions to reach the level of proficiency exhibited during the February Challenge. This included a minimal failure rate when executing the race tasks, collision-free movement of the robotic base across the race track, and synchronized movement of the robotic base and the motorized wheelchair.
The February Challenge consisted of two runs, and for each team, the run with the best score (or, on equal scores, the best time) was counted as the final result. The track setup is presented in Figure 4. We achieved a full score on both runs, with times of 6 minutes 34 seconds and 6 minutes 51 seconds. Additionally, we were the only team to achieve a full final score. Throughout our runs, we spent, on average, 23 percent of the time moving along the track and 77 percent of the time manipulating. The motion on the track includes both the robot and the pilot moving into position. While most of the time is spent on manipulation, automating many of these tasks is difficult, as making perception pipelines as reactive as direct human control is still an open challenge in the robotics community.
Although Cybathlon is a fairly pre-defined competition scenario, our aim was to develop a generic control method that also generalizes to previously unseen tasks. The success of this approach can be seen in our performance on the scarf task, which the pilot had not trained for during practice on the mock course, yet nonetheless completed with a full score and a best time of 64 seconds.
IV CONCLUSIONS
We have presented the first design for controlling and operating an independent quadrupedal assistance robot for people with reduced mobility (e.g., quadriplegia). Our platform is highly versatile, as it can reach places that remain inaccessible by wheelchair and does not force the pilot to permanently carry a robotic arm attachment. This design won the Cybathlon Challenges February 2024 Assistance Robot Race, where it completed a series of everyday tasks in record time. In the future, when assistance robots are medically certified and can perform more tasks autonomously, they will have a huge impact on the lives of many people who need daily care.
References
- [1] L. Jaeger, R. d. S. Baptista, C. Basla, P. Capsi-Morales, Y. K. Kim, S. Nakajima, C. Piazza, M. Sommerhalder, L. Tonin, G. Valle, et al., “How the CYBATHLON Competition Has Advanced Assistive Technologies,” Annual Review of Control, Robotics, and Autonomous Systems, vol. 6, pp. 447–476, 2023.
- [2] A. Lau and K. McKenna, “Perception of Quality of Life by Chinese elderly persons with stroke,” Disability and Rehabilitation, vol. 24, no. 4, pp. 203–218, 2002.
- [3] A. J. Pearce, B. Adair, K. Miller, E. Ozanne, C. Said, N. Santamaria, and M. E. Morris, “Robotics to Enable Older Adults to Remain Living at Home,” Journal of Aging Research, 2012.
- [4] L. Penteridis, G. D’Onofrio, D. Sancarlo, F. Giuliani, F. Ricciardi, F. Cavallo, A. Greco, I. Trochidis, and A. Gkiokas, “Robotic and Sensor Technologies for Mobility in Older People,” Rejuvenation Research, vol. 20, no. 5, pp. 401–410, 2017.
- [5] R. Riener, “The Cybathlon promotes the development of assistive technology for people with physical disabilities,” Journal of NeuroEngineering and Rehabilitation, 2016.
- [6] CYBATHLON Challenges 2024. Accessed: 2024-04-25. [Online]. Available: https://cybathlon.ethz.ch/en/events/challenges/challenges-2024
- [7] B. Li, G. Li, Y. Sun, G. Jiang, J. Kong, and D. Jiang, “A review of rehabilitation robot,” in Youth Academic Annual Conference of Chinese Association of Automation (YAC), 2017, pp. 907–911.
- [8] S. Cooper, A. Di Fava, C. Vivas, L. Marchionni, and F. Ferro, “ARI: the Social Assistive Robot and Companion,” in IEEE International Conference on Robot and Human Interactive Communication (RO-MAN), 2020, pp. 745–751.
- [9] A. Ruggiero, D. Mahr, G. Odekerken-Schröder, T. R. Spena, and C. Mele, Chapter 21: Companion robots for well-being: a review and relational framework. Edward Elgar Publishing, 2022, pp. 309–330.
- [10] Y. Kim, B. Velamala, Y. Choi, Y. Kim, H. Kim, N. Kulkarni, and E.-J. Lee, “A Literature Review on the Smart Wheelchair Systems,” 2023.
- [11] J. Bourassa, J. Faieta, and F. Routhier, “Wheelchair-mounted robotic arms: a survey of occupational therapists’ practices and perspectives,” Disability and Rehabilitation: Assistive Technology, vol. 18, no. 8, pp. 1421–1430, 2023.
- [12] A. Sivaprakasam, H. Wang, R. A. Cooper, and A. M. Koontz, “Innovation in Transfer Assist Technologies for Persons with Severe Disabilities and Their Caregivers,” IEEE Potentials, vol. 36, no. 1, pp. 34–41, 2017.
- [13] Assistive Innovations iARM. [Online]. Available: https://www.assistive-innovations.com/en/robotic-arms/iarm
- [14] C.-S. Chung, H. Wang, M. Hannan, D. Ding, A. Kelleher, and R. Cooper, “Task-Oriented Performance Evaluation for Assistive Robotic Manipulators: A Pilot Study,” American Journal of Physical Medicine & Rehabilitation, vol. 96, p. 1, 2016.
- [15] M. Hutter, C. Gehring, D. Jud, A. Lauber, C. D. Bellicoso, V. Tsounis, J. Hwangbo, K. Bodie, P. Fankhauser, M. Bloesch, R. Diethelm, S. Bachmann, A. Melzer, and M. Hoepflinger, “ANYmal - a highly mobile and dynamic quadrupedal robot,” in IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2016, pp. 38–44.
- [16] Dyna-Tech, “DynaArm.” [Online]. Available: https://dyna-tech.ch/robotic-arm/
- [17] J. Mišeikis, P. Caroni, P. Duchamp, A. Gasser, R. Marko, N. Mišeikienė, F. Zwilling, C. De Castelbajac, L. Eicher, M. Früh, et al., “Lio-A Personal Robot Assistant for Human-Robot Interaction and Care Applications,” IEEE Robotics and Automation Letters, vol. 5, no. 4, pp. 5339–5346, 2020.
- [18] A. Bilyea, N. Seth, S. Nesathurai, and H. Abdullah, “Robotic assistants in personal care: A scoping review,” Medical engineering & physics, vol. 49, pp. 1–6, 2017.
- [19] ANYbotics, “Generation D ANYmal Technical Specifications,” May 2022, Accessed: 2024-04-25. [Online]. Available: https://www.anybotics.com/anymal-technical-specifications.pdf
- [20] J.-P. Sleiman, F. Farshidian, M. V. Minniti, and M. Hutter, “A Unified MPC Framework for Whole-Body Dynamic Locomotion and Manipulation,” IEEE Robotics and Automation Letters, vol. 6, no. 3, pp. 4688–4695, 2021.
- [21] Z. Fu, X. Cheng, and D. Pathak, “Deep whole-body control: Learning a unified policy for manipulation and locomotion,” in Conference on Robot Learning (CoRL), 2023, pp. 138–149.
- [22] T. Miki, J. Lee, J. Hwangbo, L. Wellhausen, V. Koltun, and M. Hutter, “Learning robust perceptive locomotion for quadrupedal robots in the wild,” Science Robotics, vol. 7, no. 62, p. eabk2822, 2022.
- [23] J. Guo, J. Deng, A. Lattas, and S. Zafeiriou, “Sample and Computation Redistribution for Efficient Face Detection,” arXiv preprint arXiv:2105.04714, 2021.
- [24] OpenAI, “Introducing Whisper,” Accessed: 2024-04-26. [Online]. Available: https://openai.com/research/whisper