Stream3D-VLM is an online 3D vision-language model that supports real-time spatial understanding and interaction directly from streaming video. By incrementally integrating geometry priors and ...
This repository is the official implementation of FancyVideo. Video demos can be found in the webpage. Some of them are contributed by the community. You can customize your own videos using the ...