This near-realtime segmentation, copy, and paste is wild:
Inside XR writes:
In a Twitter thread, Diagne said the secret is BASNet, a boundary-aware architecture for salient object detection. (Paper here).
The delay is about 2.5 seconds to cut and 4 seconds to paste, though Diagne notes there are ways to speed that up.
The GitHub page for the project is available here.
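To make the "cut" and "paste" steps a little more concrete, here is a minimal sketch of how a saliency map (the kind of per-pixel foreground score a model such as BASNet outputs) could be turned into a cutout and composited onto another image. This is not Diagne's code: the function names `cut_object` and `paste_object`, the plain-list image representation, and the 0.5 threshold are all assumptions chosen for illustration, and the model inference itself is left out.

```python
def cut_object(image, saliency, threshold=0.5):
    """Turn an RGB image plus a saliency map into RGBA pixels.

    image:    rows of (r, g, b) tuples
    saliency: rows of floats in [0, 1], same shape as image
    Pixels scoring at or above the (assumed) threshold become opaque;
    everything else becomes fully transparent.
    """
    cutout = []
    for img_row, sal_row in zip(image, saliency):
        cutout.append([
            (r, g, b, 255 if s >= threshold else 0)
            for (r, g, b), s in zip(img_row, sal_row)
        ])
    return cutout


def paste_object(background, cutout, top, left):
    """Alpha-composite an RGBA cutout onto a copy of an RGB background."""
    result = [list(row) for row in background]  # leave the input untouched
    for dy, row in enumerate(cutout):
        for dx, (r, g, b, a) in enumerate(row):
            y, x = top + dy, left + dx
            if not (0 <= y < len(result) and 0 <= x < len(result[y])):
                continue  # skip pixels that fall outside the background
            alpha = a / 255
            br, bg, bb = result[y][x]
            result[y][x] = (
                round(alpha * r + (1 - alpha) * br),
                round(alpha * g + (1 - alpha) * bg),
                round(alpha * b + (1 - alpha) * bb),
            )
    return result
```

In a real pipeline the saliency map would come from the network and the images from the phone camera and the desktop canvas; the hard threshold here could be swapped for a soft alpha (using the saliency score directly) to get smoother edges.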