Development of an end-to-end differentiable robot motion control architecture

Le descriptif de l’offre ci-dessous est en Anglais

Type de contrat : CDD

Contrat renouvelable : Oui

Niveau de diplôme exigé : Bac + 5 ou équivalent

Fonction : Ingénieur scientifique contractuel

Niveau d'expérience souhaité : De 3 à 5 ans

Mission confiée

This assignment is part of the OSS4EAI (open source software for embodied AI) project. OSS4EAI explores the design of end-to-end differentiable learning architectures. It revisits fundamental algorithms in robotics, particularly for physical simulation, and extends them to automate the transfer to real robots where data is sparse. This approach will be implemented in maintained open-source software to accelerate the learning of robotic behaviors and propose new forms of data and model sharing.

Principales activités

Your goal will be to develop a motion learning pipeline for manipulation and locomotion using the Simple differentiable simulator developed in OSS4EAI, comparing the resulting pipeline with standard reinforcement learning. Here is a more detailed breakdown of the tasks involved:

  • Learn how to use the existing reinforcement learning baseline on Upkie wheeled-biped robots
  • Interface Simple as a forward-dynamics simulator in the Upkie software, and evaluate its performance
    • Implement a rolling-without-slipping constraint in C++, in close collaboration with simulator developers
    • Document the performance comparison on the project website
  • Implement a policy learning pipeline using Simple as a differentiable simulator
    • The algorithmic layout will be specified with researchers actively working in the project
  • Compare the performance of learned policies on the real robot

Based on the results of this first phase, the project can be extended to a second year where we will extend the pipeline to a broader range of robot tasks (e.g. humanoid locomotion).

Compétences

Candidates should hold either a PhD degree in robotics or an MSc degree with more than five years of experience in the robotics industry.

Required skills:

  • Experience in running live code on real robots
  • Knowledge in robotics modeling: kinematics, dynamics, Jacobian matrices, …
  • Programming skills in C++ and Python
  • Spoken and written technical English

Other skills that will be appreciated:

  • Experience in model predictive control or reinforcement learning
  • Programming skills in PyTorch

Although the project requires a strong background in robotics and software engineering, prior knowledge of reinforcement learning is not a pre-requisite as things can be learnt on the go with existing software and support from the team.

Avantages

  • Subsidized meals
  • Partial reimbursement of public transport costs
  • Leave: 7 weeks of annual leave + 10 extra days off due to RTT (statutory reduction in working hours) + possibility of exceptional leave (sick children, moving home, etc.)
  • Possibility of teleworking (after 6 months of employment) and flexible organization of working hours
  • Professional equipment available (videoconferencing, loan of computer equipment, etc.)
  • Social, cultural and sports events and activities
  • Access to vocational training
  • Social security coverage