I've made a SatAst, a small collection of hand-annotated satellite to astronaut image correspondences, public on github: github.com/georg-bn/sat.... This benchmark is part of the RoMa v2 paper, see Johan's thread below. bsky.app/profile/pars...
I've made a SatAst, a small collection of hand-annotated satellite to astronaut image correspondences, public on github: github.com/georg-bn/sat.... This benchmark is part of the RoMa v2 paper, see Johan's thread below. bsky.app/profile/pars...
"Authors should not use negative v-spaces to change the template layout."
The template layout:
yes
Utonia: Toward One Encoder for All Point Clouds
Yujia Zhang, Xiaoyang Wu, Yunhan Yang, Xianzhe Fan, Han Li, Yuechen Zhang, Zehao Huang, Naiyan Wang, Hengshuang Zhao
tl;dr: PointTransformerv3 pretrained on tons of different data
arxiv.org/abs/2603.03283
ZipMap: Linear-Time Stateful 3D Reconstruction with Test-Time Training
@haian-jin.bsky.social Rundi Wu, Tianyuan Zhang, Ruiqi Gao, @jonbarron.bsky.social @snavely.bsky.social @holynski.bsky.social
tl;dr: more test-time-training for getting scene latent.
arxiv.org/abs/2603.04385
DAGE: Dual-Stream Architecture for Efficient and Fine-Grained Geometry Estimation
Tuan Duc Ngo, Jiahui Huang, Seoung Wug Oh, Kevin Blackburn-Matzen, Evangelos Kalogerakis, Chuang Gan, Joon-Young Lee
tl;dr: low-res multivew (Pi3-distilled) + highres single view( MoGe2 ft)
arxiv.org/abs/2603.03744
NOVA3R: Non-pixel-aligned Visual Transformer for Amodal 3D Reconstruction
Weirong Chen, Chuanxia Zheng, Ganlin Zhang, Andrea Vedaldi, Daniel Cremers
tl;dr: let VGGT output latents -> decode point cloud.
arxiv.org/abs/2603.04179
Dark3R: Learning Structure from Motion in the Dark
Andrew Y Guo, Anagh Malik, SaiKiran Tedla, Yutong Dai, Yiqian Qin, Zach Salehe, Benjamin Attal, Sotiris Nousias, Kyros Kutulakos, David B. Lindell
tl;dr: LoRa for MASt3R to make it work on low-light.
arxiv.org/abs/2603.05330
#CVPR2026
NOVA3R: Non-pixel-aligned Visual Transformer for Amodal 3D Reconstruction
Weirong Chen, @chuanxiaz.bsky.social, @ganlinzhang.xyz, Andrea Vedaldi, @dcremers.bsky.social
tl;dr: TripoSG+VGGT
layout issue with tables?
arxiv.org/abs/2603.04179
Submit your paper of structured reconstruction -- CAD, semantic, wireframe, city monitoring, etc., to USM3D 2026!
cmt3.research.microsoft.com/USM2026
Deadline: March 24, 2026.
@cvprconference.bsky.social
#CVPR2026
#USM3D2026 #USM3D
Wow
#CVPR2026 One more week to submit your work to the Embedded Vision Workshop @ CVPR! @cvprconference.bsky.social (new deadline: March 11)
Info at: embeddedvisionworkshop.wordpress.com
AI writing is like store-bought cake. It might be perfectly fine, maybe even as good as something you could make yourself, but itβs weird to give it to someone and say itβs homemade
Image Matching Challenge 2026.
It is named "IMC 2025 On-going".
- It will be living for longer than "until next CVPR" - multiyear leaderboard.
- No prize, but invite to talk about solution at CVPR.
- Dataset+metrics same as 2025.
www.kaggle.com/competitions...
#CVPR2026
@cvprconference.bsky.social
You still have 2 weeks to submit your paper to Image Matching Workshop at #CVPR2026
Deadline: March 16.
Topics: anything related to image matching and 3D reconstruction.
cmt3.research.microsoft.com/IMW2026
@cvprconference.bsky.social
Clarification about dual submissions to @eccv.bsky.social and @cvprconference.bsky.social Findings track.
If you submit the same work to ECCV, please do not opt in to CVPR 2026 Findingsβopting in would make it a dual submission. Opt-in instructions will be sent once the logistics are finalized.
Exploring the AI Obedience: Why is Generating a Pure Color Image Harder than CyberPunk?
Hongyu Li, Kuan Liu, Yuan Chen, Juntao Hu, Huimin Lu, Guanjie Chen, Xue Liu, Guangming Lu, Hong Huang
tl;dr: Flux and NanoBanana fail at precise color filling.
arxiv.org/abs/2603.00166
I am delighted (that pun will make sense in a second) that @alistairfoggin.bsky.social's first paper, CroCoDiLight, has been accepted to ICLR. The idea came from a group discussion on the CroCo paper from @naverlabseurope.bsky.social and realising it might implicitly already understand relighting.
Rethinking Camera Choice: An Empirical Study on Fisheye Camera Properties in Robotic Manipulation
Han Xue, Nan Min, Xiaotong Liu, Wendi Chen, Yuan Fang, Jun Lv, Cewu Lu, Chuan Wen
tl;dr: fisheye cameras are great for robotics, when they can see non-textureless environment.
arxiv.org/abs/2603.02139
You still have 2 weeks to submit your paper to Image Matching Workshop at #CVPR2026
Deadline: March 16.
Topics: anything related to image matching and 3D reconstruction.
cmt3.research.microsoft.com/IMW2026
@cvprconference.bsky.social
Image Matching Challenge 2026.
It is named "IMC 2025 On-going".
- It will be living for longer than "until next CVPR" - multiyear leaderboard.
- No prize, but invite to talk about solution at CVPR.
- Dataset+metrics same as 2025.
www.kaggle.com/competitions...
#CVPR2026
@cvprconference.bsky.social
He knows who is the good boy :)
Bluesky needs polls
repost if you agree, like if you disagree
FLIGHT: Fibonacci Lattice-based Inference for Geometric Heading in real-Time
David Dirnfeld, Fabien Delattre @pedro-miraldo.bsky.social Erik Learned-Miller
tl;dr: Hough transform is back -- now for camera translation direction.
arxiv.org/abs/2602.23115
Global-Aware Edge Prioritization for Pose Graph Initialization
@weitong8591.bsky.social @gtolias.bsky.social Jiri Matas, @danielbarath.bsky.social
tl;dr: another global desc->GNN->MST. Supervision:# triangulated points/pair. +heuristic MST postprocessing. Eval on IMC-PT.
arxiv.org/abs/2602.21963
Retrieve and Segment: Are a Few Examples Enough to Bridge the Supervision Gap in Open-Vocabulary Segmentation?
Tilemachos Aravanis @stojnicv.xyz @billpsomas.bsky.social Nikos Komodakis @gtolias.bsky.social
tl;dr: almost yes if use 1-3 images, no if more(fig 6)
arxiv.org/abs/2602.23339
#CVPR2026
Excited to share that our paper "Global-Aware Edge Prioritization for Pose Graph Initialization" has been accepted to CVPR 2026! #CVPR2026 See you soon in Denver!π₯³π₯³ Code is coming soonπ§
βHow would you do an accurate and efficient pose graph initialization in a global manner? arxiv.org/abs/2602.21963
I did it year ago,
ducha-aiki.github.io/wide-baselin...
So no, future post is mostly not about LLMs
No