Our work received an invited talk at the Imageomics-AAAI-25 workshop of #AAAI25. @vtaohu.bsky.social will be representing us there. Although I won't be there myself, I'd still like to share our poster with you :D
We also have another oral presentation for DepthFM on March 1, 2:30 pm-3:45 pm.
28.02.2025 17:03
typos
01.03.2025 00:42
Our method pipeline
When combining vision-language models (VLMs) with large language models (LLMs), do VLMs benefit from additional genuine semantics or artificial augmentations of the text for downstream tasks?
🤨 Interested? Check out our latest work at #AAAI25:
Code and paper at: github.com/CompVis/DisCLIP
🧵
08.01.2025 15:54
Did you know you can distill the capabilities of a large diffusion model into a small ViT?
We showed exactly that for a fundamental task:
semantic correspondence
A thread 🧵
06.12.2024 14:35
Your diffusion model is secretly an implicit timestep model, whether discrete or continuous~
04.12.2024 23:42
27.11.2024 11:11
Introducing “MAGiC-SLAM: Multi-Agent Gaussian Globally Consistent SLAM”! We do SLAM with novel view synthesis capabilities on multiple simultaneously operating agents!
vladimiryugay.github.io/magic_slam/i...
1/7
27.11.2024 05:34