LoGeR – 3D reconstruction from extremely long videos (DeepMind, UC Berkeley)
Posted by helloplanets 22 hours ago
Comments
Comment by tmilard 20 hours ago
Comment by overfeed 3 hours ago
Waymo recently announced[1] a World Model that does exactly this: using footage from a single-camera dashcam, it can predict/simulate multiple inputs that would have been sensed by a Waymo vehicle on the same travel path (i.e. multiple camera angles, lidar point cloud, etc.). On top of this, the model can be prompted to customize the scenario (adding an elephant or a tornado were the examples given).
1. https://waymo.com/blog/2026/02/the-waymo-world-model-a-new-f...
Comment by wumms 18 hours ago
[0] https://arstechnica.com/gadgets/2017/09/googles-street-view-...
Comment by IshKebab 20 hours ago
> This is a reimplementation of LoGeR; complete code and models will be released upon approval.
I don't understand why it's a reimplementation either?
I would guess it's "research" code anyway so not really usable unless you are an expert.
Comment by priowise 12 hours ago
Comment by quadrature 11 hours ago
In areas you have visited previously, you have two estimates of your position: one from your frame-to-frame estimates and another from the map you built of the area the first time. You can then solve an optimization problem to bring those two estimates closer together.
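To make the "bring those two estimates closer together" step concrete, here is a minimal 1D toy sketch. It is not LoGeR's method: the function name `close_loop`, the `loop_weight` parameter, and the numbers are all made up for illustration. It minimises the squared disagreement with the frame-to-frame estimates plus a weighted squared disagreement with the map-based estimate, which in 1D has a closed form.

```python
def close_loop(odometry, loop_offset, loop_weight=1.0):
    """Toy 1D pose chain with a single loop-closure constraint.

    odometry:    frame-to-frame displacement estimates u_0 .. u_{n-1}
    loop_offset: displacement from first to last pose according to the map
    loop_weight: how much the map estimate is trusted relative to odometry
    """
    n = len(odometry)
    drift = sum(odometry) - loop_offset
    # Minimising sum_i (d_i - u_i)^2 + w * (sum_i d_i - loop_offset)^2
    # spreads the same correction over every edge of the chain.
    correction = loop_weight * drift / (1.0 + n * loop_weight)
    poses = [0.0]
    for u in odometry:
        poses.append(poses[-1] + u - correction)
    return poses


# Walk out two steps and back two steps; the true end point is the start,
# but the (hypothetical) odometry has drifted by 0.3.
odometry = [1.0, 1.0, -1.0, -0.7]
uncorrected = close_loop(odometry, loop_offset=0.0, loop_weight=0.0)
corrected = close_loop(odometry, loop_offset=0.0, loop_weight=100.0)
```

With `loop_weight=0.0` the drift is left in place; with a large weight the last pose is pulled almost back onto the first. Real systems do this over 6-DoF poses with a nonlinear solver, but the idea is the same.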
In order to find out if you've already visited an area, you store a description of the locations in a DB and search through them. The paper says they use a compressed representation of the "maps" and use test-time training to optimize the global consistency between their submaps.
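The "store a description and search through them" part can be sketched as a nearest-neighbour lookup over descriptor vectors. This is a generic illustration, not what the paper does: `PlaceDatabase`, the cosine-similarity threshold, and the three-element descriptors are all hypothetical.

```python
import math


def cosine_similarity(a, b):
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


class PlaceDatabase:
    """Brute-force store of (place_id, descriptor) pairs."""

    def __init__(self, threshold=0.9):
        self.threshold = threshold  # minimum similarity to count as a revisit
        self.entries = []

    def add(self, place_id, descriptor):
        self.entries.append((place_id, list(descriptor)))

    def query(self, descriptor):
        """Return the best-matching place_id, or None if nothing is close enough."""
        best_id, best_score = None, self.threshold
        for place_id, stored in self.entries:
            score = cosine_similarity(descriptor, stored)
            if score >= best_score:
                best_id, best_score = place_id, score
        return best_id


db = PlaceDatabase(threshold=0.9)
db.add("kitchen", [0.9, 0.1, 0.0])
db.add("hallway", [0.0, 0.2, 0.95])
revisit = db.query([0.88, 0.15, 0.02])  # close to the stored kitchen descriptor
unseen = db.query([0.5, 0.5, 0.5])      # not close to anything stored
```

A match like `revisit` is what would trigger the loop-closure optimization; a `None` result means the location is treated as new and added to the DB.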
Comment by _fw 19 hours ago
Comment by Karliss 17 hours ago
Comment by realberkeaslan 17 hours ago
Comment by raphaelmolly8 11 hours ago
Comment by msuniverse2026 20 hours ago
Comment by KeplerBoy 20 hours ago
Comment by haritha-j 20 hours ago
Comment by endymion-light 18 hours ago
If you want reconstruction and training of robotic movement, this is far more appropriate. I believe we're going to see robots being able to "dream" in terms of analysing historical video information on spaces and improving movement and navigation.
So not mass surveillance, but probably there's a future of mass subjugation using robot enforcement.
Comment by KaiserPro 14 hours ago
Relocalisation is the bit that's surveillance-y. But it's also crucial for accurate visual-only navigation.
Comment by imtringued 19 hours ago
Comment by Dead_Lemon 20 hours ago
Comment by alpine01 18 hours ago
I can imagine future iterations of this which bring together other stills of the same space at that time to augment the dataset. Then perhaps another pass to fill in gaps with likely missing content, based on probability or on data from, say, the same street 10 years later.
It won't be 100% real, but I think it'd be very cool to be able to have a Google Street View-style experience of areas before Google Street View existed.
Comment by phrotoma 17 hours ago
Now do Kowloon Walled City.
Comment by voidUpdate 19 hours ago
Comment by washadjeffmad 15 hours ago
Comment by KaiserPro 14 hours ago
Lidars are great, and getting smaller, but they still eat a lot of power. (The Quest 3 had a lidar on the front [well, structured light] and it was mostly not used.)
For machines to understand the 3D world, first they need to extract geometry, then isolate those geometries into objects. This method is _a_ way to do the first step: extracting 3D points.
The problem with this model is that the points are not actually that well aligned frame to frame. This is why it looks a bit blurry. I assume this is to avoid running out of memory, as you're not quite sure about which points are relevant and need to be kept in memory.
Once you have those points, you need to replace them with simplified geometry, so that you can work out intersections and junk.
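As a rough sketch of what "replace them with simplified geometry" can mean, the snippet below fits a single plane to a handful of noisy points and then intersects a ray with it. Everything here is an assumption for illustration: the helper names, the sample points, and the choice of the plane form z = a*x + b*y + c (which cannot represent vertical surfaces).

```python
def det3(m):
    return (m[0][0] * (m[1][1] * m[2][2] - m[1][2] * m[2][1])
            - m[0][1] * (m[1][0] * m[2][2] - m[1][2] * m[2][0])
            + m[0][2] * (m[1][0] * m[2][1] - m[1][1] * m[2][0]))


def fit_plane(points):
    """Least-squares fit of z = a*x + b*y + c to a list of (x, y, z) points."""
    n = float(len(points))
    sx = sum(x for x, _, _ in points)
    sy = sum(y for _, y, _ in points)
    sz = sum(z for _, _, z in points)
    sxx = sum(x * x for x, _, _ in points)
    syy = sum(y * y for _, y, _ in points)
    sxy = sum(x * y for x, y, _ in points)
    sxz = sum(x * z for x, _, z in points)
    syz = sum(y * z for _, y, z in points)
    # Normal equations of the least-squares problem, solved with Cramer's rule.
    lhs = [[sxx, sxy, sx], [sxy, syy, sy], [sx, sy, n]]
    rhs = [sxz, syz, sz]
    d = det3(lhs)
    if abs(d) < 1e-12:
        raise ValueError("points are degenerate (e.g. collinear in x/y)")
    coeffs = []
    for col in range(3):
        m = [row[:] for row in lhs]
        for r in range(3):
            m[r][col] = rhs[r]
        coeffs.append(det3(m) / d)
    return tuple(coeffs)


def intersect_ray(plane, origin, direction):
    """Point where the ray hits the plane, or None if it is parallel or behind."""
    a, b, c = plane
    ox, oy, oz = origin
    dx, dy, dz = direction
    denom = a * dx + b * dy - dz
    if abs(denom) < 1e-12:
        return None
    t = -(a * ox + b * oy - oz + c) / denom
    if t < 0:
        return None
    return (ox + t * dx, oy + t * dy, oz + t * dz)


# Hypothetical slightly tilted "floor" points.
floor_points = [(0, 0, 0.01), (1, 0, -0.01), (0, 1, 0.01), (1, 1, -0.01), (0.5, 0.5, 0.0)]
plane = fit_plane(floor_points)
hit = intersect_ray(plane, origin=(0.5, 0.5, 1.0), direction=(0.0, 0.0, -1.0))
```

The intersection test is one division against three plane coefficients instead of a search through every raw point, which is the practical reason for doing the simplification.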
Comment by ekjhgkejhgk 15 hours ago
Comment by flipbrad 19 hours ago
Comment by petargyurov 19 hours ago
Also, I am not sure how heavy LIDAR units are, but remember that the heavier the payload, the shorter the flight time. Some drones can only carry a single payload, so if you also want to capture (high-res) video/images you need to fly again.
It all depends on the use-case.
Comment by Daub 19 hours ago
Comment by taneq 17 hours ago
Comment by _diyar 16 hours ago