Vincent Tao Hu

@vtaohu

LMU postdoc from Ommer-Lab, MCML junior member. UvA PhD, PKU

935 followers, 173 following, 3 posts, joined 19.11.2024

Latest posts by Vincent Tao Hu @vtaohu

Our work received an invited talk at the Imageomics-AAAI-25 workshop of #AAAI25. @vtaohu.bsky.social will be representing us there. Although I won't be there myself, I'd still like to share our poster with you :D

We also have another oral presentation for DepthFM on March 1, 2:30 pm-3:45 pm.

28.02.2025 17:03 👍 3 🔁 1 💬 0 📌 0
typos

01.03.2025 00:42 👍 1 🔁 0 💬 1 📌 0
Our method pipeline

🤔 When combining vision-language models (VLMs) with large language models (LLMs), do VLMs benefit from additional genuine semantics or artificial augmentations of the text for downstream tasks?

🤨 Interested? Check out our latest work at #AAAI25:

💻 Code and 📝 Paper at: github.com/CompVis/DisCLIP

🧵👇

08.01.2025 15:54 👍 15 🔁 8 💬 1 📌 0

Did you know you can distill the capabilities of a large diffusion model into a small ViT? ⚗️
We showed exactly that for a fundamental task:
semantic correspondence 📍

A thread 🧵👇

06.12.2024 14:35 👍 4 🔁 2 💬 1 📌 2

Your diffusion model is secretly an implicit timestep model, whether discrete or continuous~

04.12.2024 23:42 👍 6 🔁 0 💬 0 📌 0

👍

27.11.2024 11:11 👍 1 🔁 0 💬 0 📌 0
Introducing “MAGiC-SLAM: Multi-Agent Gaussian Globally Consistent SLAM”! We do SLAM with novel view synthesis capabilities on multiple simultaneously operating agents!

vladimiryugay.github.io/magic_slam/i...
1/7

27.11.2024 05:34 👍 51 🔁 17 💬 3 📌 1