Dmytro Mishkin

@ducha-aiki

Marrying classical CV and Deep Learning. I do things that work, rather than things that are novel but do not work. http://dmytro.ai

2,594 Followers · 156 Following · 1,335 Posts · Joined 06.12.2023

Latest posts by Dmytro Mishkin @ducha-aiki

I'm Not a Robot: Prove your humanity once and for all

neal.fun/not-a-robot/

06.03.2026 15:41 👍 0 🔁 0 💬 0 📌 0
Post image

"Authors should not use negative v-spaces to change the template layout."

The template layout:

06.03.2026 07:56 👍 26 🔁 5 💬 0 📌 0

yes

06.03.2026 15:34 👍 0 🔁 0 💬 0 📌 0

Utonia: Toward One Encoder for All Point Clouds

Yujia Zhang, Xiaoyang Wu, Yunhan Yang, Xianzhe Fan, Han Li, Yuechen Zhang, Zehao Huang, Naiyan Wang, Hengshuang Zhao
tl;dr: PointTransformerv3 pretrained on tons of different data
arxiv.org/abs/2603.03283

06.03.2026 14:31 👍 11 🔁 1 💬 0 📌 0

ZipMap: Linear-Time Stateful 3D Reconstruction with Test-Time Training

@haian-jin.bsky.social, Rundi Wu, Tianyuan Zhang, Ruiqi Gao, @jonbarron.bsky.social, @snavely.bsky.social, @holynski.bsky.social

tl;dr: more test-time training to get a scene latent.
arxiv.org/abs/2603.04385

06.03.2026 14:09 👍 8 🔁 0 💬 0 📌 0

DAGE: Dual-Stream Architecture for Efficient and Fine-Grained Geometry Estimation

Tuan Duc Ngo, Jiahui Huang, Seoung Wug Oh, Kevin Blackburn-Matzen, Evangelos Kalogerakis, Chuang Gan, Joon-Young Lee
tl;dr: low-res multiview (Pi3-distilled) + high-res single view (MoGe2 fine-tuned)
arxiv.org/abs/2603.03744

06.03.2026 14:02 👍 3 🔁 0 💬 0 📌 0

NOVA3R: Non-pixel-aligned Visual Transformer for Amodal 3D Reconstruction

Weirong Chen, Chuanxia Zheng, Ganlin Zhang, Andrea Vedaldi, Daniel Cremers

tl;dr: let VGGT output latents -> decode point cloud.
arxiv.org/abs/2603.04179

06.03.2026 13:54 👍 4 🔁 0 💬 0 📌 1

Dark3R: Learning Structure from Motion in the Dark

Andrew Y Guo, Anagh Malik, SaiKiran Tedla, Yutong Dai, Yiqian Qin, Zach Salehe, Benjamin Attal, Sotiris Nousias, Kyros Kutulakos, David B. Lindell

tl;dr: LoRA for MASt3R to make it work in low light.
arxiv.org/abs/2603.05330
#CVPR2026

06.03.2026 13:29 👍 3 🔁 0 💬 0 📌 0

NOVA3R: Non-pixel-aligned Visual Transformer for Amodal 3D Reconstruction

Weirong Chen, @chuanxiaz.bsky.social, @ganlinzhang.xyz, Andrea Vedaldi, @dcremers.bsky.social

tl;dr: TripoSG+VGGT

layout issue with tables?
arxiv.org/abs/2603.04179

05.03.2026 17:06 👍 1 🔁 1 💬 0 📌 0

Submit your paper on structured reconstruction (CAD, semantic, wireframe, city monitoring, etc.) to USM3D 2026!

cmt3.research.microsoft.com/USM2026
Deadline: March 24, 2026.
@cvprconference.bsky.social
#CVPR2026
#USM3D2026 #USM3D

04.03.2026 13:19 👍 5 🔁 3 💬 0 📌 0

Wow

04.03.2026 13:13 👍 0 🔁 0 💬 0 📌 0

#CVPR2026 One more week to submit your work to the Embedded Vision Workshop @ CVPR! @cvprconference.bsky.social (new deadline: March 11)

Info at: embeddedvisionworkshop.wordpress.com

04.03.2026 12:02 👍 3 🔁 2 💬 0 📌 0

AI writing is like store-bought cake. It might be perfectly fine, maybe even as good as something you could make yourself, but it's weird to give it to someone and say it's homemade

03.03.2026 19:03 👍 522 🔁 66 💬 16 📌 8
Image Matching Challenge 2025 Ongoing: Ongoing leaderboard for Image Matching Challenge 2025.

Image Matching Challenge 2026.
It is named "IMC 2025 On-going".
- It will live longer than "until next CVPR": it is a multiyear leaderboard.
- No prize, but an invitation to talk about your solution at CVPR.
- Dataset and metrics are the same as in 2025.
www.kaggle.com/competitions...
#CVPR2026
@cvprconference.bsky.social

02.03.2026 13:37 👍 7 🔁 5 💬 0 📌 0

You still have 2 weeks to submit your paper to Image Matching Workshop at #CVPR2026

Deadline: March 16.
Topics: anything related to image matching and 3D reconstruction.
cmt3.research.microsoft.com/IMW2026
@cvprconference.bsky.social

02.03.2026 13:42 👍 7 🔁 5 💬 0 📌 0

Clarification about dual submissions to @eccv.bsky.social and @cvprconference.bsky.social Findings track.

If you submit the same work to ECCV, please do not opt in to CVPR 2026 Findings; opting in would make it a dual submission. Opt-in instructions will be sent once the logistics are finalized.

03.03.2026 13:40 👍 10 🔁 4 💬 0 📌 0

Exploring the AI Obedience: Why is Generating a Pure Color Image Harder than CyberPunk?

Hongyu Li, Kuan Liu, Yuan Chen, Juntao Hu, Huimin Lu, Guanjie Chen, Xue Liu, Guangming Lu, Hong Huang

tl;dr: Flux and NanoBanana fail at precise color filling.
arxiv.org/abs/2603.00166

03.03.2026 12:22 👍 6 🔁 2 💬 0 📌 0

I am delighted (that pun will make sense in a second) that @alistairfoggin.bsky.social's first paper, CroCoDiLight, has been accepted to ICLR. The idea came from a group discussion on the CroCo paper from @naverlabseurope.bsky.social and realising it might implicitly already understand relighting.

03.03.2026 10:40 👍 9 🔁 1 💬 1 📌 0

Rethinking Camera Choice: An Empirical Study on Fisheye Camera Properties in Robotic Manipulation

Han Xue, Nan Min, Xiaotong Liu, Wendi Chen, Yuan Fang, Jun Lv, Cewu Lu, Chuan Wen

tl;dr: fisheye cameras are great for robotics when the environment they see is not textureless.
arxiv.org/abs/2603.02139

03.03.2026 11:04 👍 7 🔁 0 💬 0 📌 0

He knows who is the good boy :)

01.03.2026 11:12 👍 1 🔁 0 💬 1 📌 0

Bluesky needs polls

repost if you agree, like if you disagree

27.02.2026 16:54 👍 18 🔁 24 💬 13 📌 3

FLIGHT: Fibonacci Lattice-based Inference for Geometric Heading in real-Time

David Dirnfeld, Fabien Delattre, @pedro-miraldo.bsky.social, Erik Learned-Miller

tl;dr: Hough transform is back -- now for camera translation direction.
arxiv.org/abs/2602.23115

27.02.2026 16:44 👍 7 🔁 1 💬 1 📌 0

Global-Aware Edge Prioritization for Pose Graph Initialization

@weitong8591.bsky.social, @gtolias.bsky.social, Jiri Matas, @danielbarath.bsky.social

tl;dr: another global descriptor -> GNN -> MST pipeline. Supervision: number of triangulated points per pair, plus heuristic MST postprocessing. Evaluated on IMC-PT.
arxiv.org/abs/2602.21963

27.02.2026 16:30 👍 7 🔁 2 💬 0 📌 0

Retrieve and Segment: Are a Few Examples Enough to Bridge the Supervision Gap in Open-Vocabulary Segmentation?

Tilemachos Aravanis, @stojnicv.xyz, @billpsomas.bsky.social, Nikos Komodakis, @gtolias.bsky.social

tl;dr: almost yes if you use 1-3 images, no if more (Fig. 6)
arxiv.org/abs/2602.23339
#CVPR2026

27.02.2026 16:17 👍 7 🔁 3 💬 0 📌 0

Excited to share that our paper "Global-Aware Edge Prioritization for Pose Graph Initialization" has been accepted to CVPR 2026! #CVPR2026 See you soon in Denver! 🥳🥳 Code is coming soon 🚧
❓ How would you do an accurate and efficient pose graph initialization in a global manner? arxiv.org/abs/2602.21963

26.02.2026 15:54 👍 10 🔁 3 💬 1 📌 0
ChatGPT and Image Matching – Wide baseline stereo meets deep learning. Are we done yet?

I did it a year ago:
ducha-aiki.github.io/wide-baselin...

So no, the future post is mostly not about LLMs.

26.02.2026 11:31 👍 2 🔁 0 💬 0 📌 0

No

26.02.2026 09:37 👍 1 🔁 0 💬 0 📌 0

No

26.02.2026 09:37 👍 0 🔁 0 💬 1 📌 0