@veraneplenbroek.bsky.social, Sandro Pezelle, @barbaraplank.bsky.social, @davidschlangen.bsky.social, Alessandro Suglia, @akskuchi.bsky.social, @ecekt.bsky.social, and @alberto-testoni.bsky.social.
Poster Session 2, Hall 4/5, 11:00–12:30, Monday, July 28.
#MaiNLP #MCML #NLProc
18.07.2025 10:19
Special thanks to @annabavaresco.bsky.social, @raffagbernardi.bsky.social, @leobertolazzi.bsky.social, @delliott.bsky.social, Raquel Fernández, Albert Gatt, @esamghaleb.bsky.social, Mario Giulianelli, @michaelwhanna.bsky.social, @akoller.bsky.social, @andre-t-martins.bsky.social
18.07.2025 10:19
This work is the result of a wonderful collaboration involving 20 researchers from 11 different universities.
18.07.2025 10:19
Based on evaluations across 11 recent LLMs, we find that model judgments should be used with care, as they exhibit notable variability depending on the task and samples being evaluated. We argue that LLMs should be carefully validated against human judgments before being used as evaluators.
18.07.2025 10:19
In this work, we study whether LLM judgments can be reliably used as proxies for human judgments. We introduce JUDGE-BENCH, an extensive collection of 20 datasets with human annotations covering a variety of NLP tasks.
18.07.2025 10:19
Huge thanks to my collaborators and co-authors, Sondre Wold and @barbaraplank.bsky.social
Poster Session 7, Hall 4/5, 10:30–12:00, Tuesday, July 29.
18.07.2025 10:19
Moreover, we show that these circuits can be reused and combined through set operations to represent more complex functional capabilities of the model. For more information, check out the paper!
18.07.2025 10:19
In this work, we study the relationship between transformer circuits identified for highly compositional and functionally related tasks. We find that functionally similar circuits exhibit both notable node overlap and cross-task faithfulness.
18.07.2025 10:19
I am happy to share that I'll be attending #ACL2025 in Vienna 🇦🇹, where I'll be presenting two papers (more information below)!
18.07.2025 10:19
The hand-drawn sign from three years ago.
MaiNLP is turning 3 today! 🥳 We've grown a lot since @barbaraplank.bsky.social started this group with nothing but three aspiring researchers and a hand-drawn sign on the door. Huge thanks to all the amazing people who have joined or visited us since. Here's to many more years of exciting research!
01.04.2025 10:40