8 comments

  • whiplash451 8 minutes ago
    How does Lerobot prevent « full control » and « understanding »? I thought this was an open source library.

    I am not an official supporter of the library but am asking out of curiosity.

  • NalNezumi 1 hour ago
    Cool stuff. At my previous (startup / research) job I had to set up similar system (but with franka arm and multi view camera) alone because I was the only one with robotics background.

    >I do not intend to calibrate the camera’s extrinsics or intrinsics for now.

    Sensible choice, although I suggest it's good in the long run to do at early stage in your setup, especially if you intend to collect data for policy learning.

    Debugging trained policies for visual manipulation task can be a headache and having as much context on variables that can change is a good practice.

    My previous setup was in Japan, a earthquake prone place and I wasted some time after realizing the camera got misaligned due to earthquake. A simple solution is just to place an Aruco marker on the table that tracks the relative extrinsic position of camera, and add it as metadata to collected teleoperation dataset.

  • dlt713705 1 hour ago
    As impressive as this setup may be, I'm still amazed at how slow this type of robot is, whether amateur or professional grade. I have no expertise in this field, but as an observer, the apparent progress in this area seems very limited. I guess my expectations are too high and my understanding of the problems to solve is too low.
    • mplappert 20 minutes ago
      It’s partially my fault I currently clip the max speed _and_ I only input soft control changes when teleoperating to avoid crashing into things. The robot itself could definitely move more quickly than what you see in the video.

      It would be interesting to explore how RL can be applied on top of my (flawed) human demos to optimize beyond what I’m able to do.

  • avilay 1 hour ago
    Hey this is cool! I am doing something similar myself with the SO101 arm robot from Robot Studio using a patchwork of my own code and LeRobot. Would love to collaborate with you if you are open to it. You can find me on Discord as `.avilay`. https://www.linkedin.com/posts/avilay_lerobot-huggingface-ro...

    Would like to know your reasoning on not going with LeRobot.

    • mplappert 13 minutes ago
      Looks very cool! I’m not a huge discord user but how about you shoot me an email and we can figure out how to share notes? (I don’t want to post it directly here but it’s easy to find on my personal website, just google my name)

      Re why not SO-101: the article has a footnote about this; I actually bought the SO-101 as well! I want to integrate it into the same setup so I can switch depending on task.

      Somewhat surprisingly the xarm was actually much faster to arrive; I got it within 2 days of ordering. I don’t have a 3D printer and getting the SO-101 from the vendor I ordered it at took almost 4 weeks. So partially it just came down to what I had access to more quickly.

      Second point is reliability: I think the SO-101 is cool but I’d be surprised if it doesn’t break more quickly than the xarm. I wanted something that’s going to last a long time without headaches. And these industrial arms are really mature hardware wise now.

      Hope this helps!

    • nl 39 minutes ago
      I'm very interested in the SO101. I've never done any robotics and that seems a palatable entry level thing to try things out.

      How have you found it?

      (The author does explain his reasons for not using LeRobot in the post - although "I also use LeRobot for training and running baseline policies, and the vendor SDKs for the hardware.")

    • Huangjinliu 28 minutes ago
      [dead]
  • wxw 2 hours ago
    Great article. I'll be following along. Would like to learn more about the robotics space.

    - I've heard the advantage of ROS besides the architecture is the ecosystem (driver integrations, etc). Is that not an issue because the arm supports a Python SDK OOTB?

    - Any issues you've been running into with this setup?

    - How do you determine if a session recording is good enough for training? Is 50/100 samples really all you need?

    • mplappert 6 minutes ago
      Glad you like it!

      Re your questions:

      - The driver situation turned out totally fine; I intentionally picked HW with good python sdk support so that was very painless.

      - The static camera (the C920) is not super great; it drops frames and sometimes cuts out. We’ll see how that goes but it’s probably the clostest thing I want to swap right now. Another issue is reach of the arm when forcing the worst to be axis parallel with the table; you cannot get very far away. The chess setup demo in the video gives an example: I can just reach the row of pawns and beyond that it’s out of reach.

      - I don’t know yet! The 50-100 figure comes from the ACT and diffusion policy papers but it depends on the type of task. For fine tuning my sense is that you only need a few hours worth of demos to get good results with pi0.5 etc. a big reason I’m doing this project is that I want to try all of this myself, so the next posts definitely will talk about that

  • timsuchanek 1 hour ago
    This is really exciting. Incredible that you can do this for this budget at home. Unthinkable a couple years ago.
    • mplappert 12 minutes ago
      Thanks Tim and fun seeing you here :)
  • gonzogato 3 hours ago
    [dead]