Thiemo Alldieck's Avatar

Thiemo Alldieck

@thiemoall

Research Scientist @ Google DeepMind | 3D Computer Vision & Machine Learning

1,277
Followers
215
Following
15
Posts
17.11.2024
Joined
Posts Following

Latest posts by Thiemo Alldieck @thiemoall

(5) The success of scaling text, images, video should be an argument *for* scaling, not *against* other modalities.

(6) Efficiency matters. Hoping models become as efficient as existing alternatives without exploring to improve those alternatives is blindfolding us.

6/6

18.02.2026 10:20 πŸ‘ 1 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0

(4) As pointed out already by Aleks Holynski on Twitter, text tokens, and pixels are equally handcrafted. If we accept those as valid, singling out 3D as "too handcrafted" is logically inconsistent.

5/6

18.02.2026 10:20 πŸ‘ 1 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0

(3) We humans build spatial memory through physical interaction. I don't see how models can develop true spatial understanding without building a spatial memory themselves. 3D representations seem way more helpful here than observing 2D pixel streams.

4/6

18.02.2026 10:20 πŸ‘ 1 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0

(2) 3D is more than its representation. While specific data structures will evolve or disappear, 3D is the fundamental concept our world is grounded in. It will always be worth studying, even if models learn it implicitly (which we currently just hope for).

3/6

18.02.2026 10:20 πŸ‘ 1 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0

(1) Computer vision was developed to solve "real" problems like measuring, quality control, medical imaging, or mapping. These aren't just "fake tasks" waiting for an embodied agent.

2/6

18.02.2026 10:20 πŸ‘ 1 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0

Great read! Here are my 2 cents: I agree with the push toward end-to-end learning, however, the conclusion that CV will simply "go away" feels too dramatic and overly simplified. Here is what I believe was overlooked: 🧡

(cross posting from Twitter)

1/6

18.02.2026 10:20 πŸ‘ 1 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0

Project page
*links to*
Huggingface paper page
*links to*
arXiv abstract
*links to*
PDF

🫠🫠🫠

28.10.2025 12:25 πŸ‘ 0 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0

We are looking for Student Researchers to work with us in ZurichπŸ‡¨πŸ‡­ next year!

If you work on depth and/or 3D reconstruction, please reach out!

Europe-based position:
www.google.com/about/career...

US-based position:
www.google.com/about/career...

28.10.2025 08:49 πŸ‘ 3 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0

Find me today at 4:30pm at the Google booth - let's chat! #CVPR2025

13.06.2025 18:46 πŸ‘ 0 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0

On my way to #CVPR2025 πŸ›«

Looking forward to connect!

10.06.2025 06:54 πŸ‘ 4 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0

If you expect a service (paper published), pay a price (review others). Isn't it that simple?

20.01.2025 07:37 πŸ‘ 4 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0
Post image

Excited to share that today our paper recommender platform www.scholar-inbox.com has reached 20k users! We hope to reach 100k by the end of the year.. Lots of new features are being worked on currently and rolled out soon.

15.01.2025 22:03 πŸ‘ 190 πŸ” 26 πŸ’¬ 12 πŸ“Œ 8
Post image

My group is looking for motivated PhD students that want to work on the future of digital humans.
Within the ERC project 'LeMo: Learning Digital Humans in Motion' there are two open positions:

www.career.tu-darmstadt.de/HPv3.Jobs/TU...

www.career.tu-darmstadt.de/HPv3.Jobs/TU...

14.01.2025 19:07 πŸ‘ 19 πŸ” 7 πŸ’¬ 0 πŸ“Œ 0

hey everyone - I am now also active here and excited about computer vision and machine learning stuff. πŸŽ‰

08.01.2025 14:07 πŸ‘ 47 πŸ” 5 πŸ’¬ 3 πŸ“Œ 0
Preview
a cartoon character named charlie brown is putting a letter in a mailbox ALT: a cartoon character named charlie brown is putting a letter in a mailbox

πŸ˜•

25.12.2024 07:22 πŸ‘ 1 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0

Scroll Reverser is another one...

20.12.2024 10:33 πŸ‘ 1 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0

Come and work with us πŸ’ͺ

26.11.2024 06:59 πŸ‘ 1 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0

☝️

20.11.2024 05:51 πŸ‘ 1 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0