Happy new year to everyone...
Introducing Gemini 2.5, our most intelligent model with impressive capabilities in advanced reasoning and coding.
Now integrating thinking capabilities, 2.5 Pro Experimental is our most performant Gemini model yet. It's #1 on the LM Arena leaderboard.
folks working on one or more of the following
Image Descriptions to improve Image-Text alignment
AND/OR
Multi/Cross Lingual image-text understanding/generation
AND/OR
Geo-Cultural representation and learning
Please DM if you are willing to discuss the current state/challenges/future-work.
New starter pack! go.bsky.app/GZ4hZzu
Too soon but
Could I be added? Thanks :)
We had a great experience presenting our work on ImageInWords to the community at #EMNLP2024. Thank you everyone for stopping by! Looking forward to future work and seeing image descriptions as a foundational multi-modal task! @emnlpmeeting.bsky.social @deep-mind.bsky.social #NLProc #Multimodal
All the ACL chapters are here now: @aaclmeeting.bsky.social @emnlpmeeting.bsky.social @eaclmeeting.bsky.social @naaclmeeting.bsky.social #NLProc
hello new followers! we're actively hiring on our generative media team in Mountain View: boards.greenhouse.io/deepmind/job...
we work on image, video, audio, etc… come work with us if you're interested! apply asap :)
Excited to unveil our latest research, ImageInWords (IIW)! We're pushing the boundaries of image descriptions with a new seeded, sequential, human-in-the-loop approach producing SoTA, articulate, hyper-detailed descriptions.
arXiv: arxiv.org/abs/2405.02793
#NLProc #ComputerVision #Multimodal