's Avatar

@sarapapi

(she/her) Postdoc at @fbk-mt.bsky.social | Working on speech translation

111
Followers
102
Following
16
Posts
02.12.2024
Joined
Posts Following

Latest posts by @sarapapi

Jobs | Science and Technology Hub - Trento | A Researcher in Responsible and Trustworthy NLP

๐Ÿš€ We're hiring a Researcher in Responsible & Trustworthy NLP! Join our research group @fbk-mt.bsky.social at Fondazione Bruno Kessler to work on fairness and trustworthiness in multilingual technologies.

๐Ÿ“… Deadline: Dec 10, 2025
๐Ÿ”— Apply: jobs.fbk.eu/Annunci/Offe...

25.11.2025 09:07 ๐Ÿ‘ 8 ๐Ÿ” 8 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 0
Post image

Our next presentation is by @sarapapi.bsky.social: "How real is your real-time simultaneous speech-to-text translation system?"

Look for the answer in her TACL paper: direct.mit.edu/tacl/article...

#lt2025fbk

28.10.2025 13:08 ๐Ÿ‘ 2 ๐Ÿ” 1 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 0
Post image

Thanks to all the participants! #clicit2025

26.09.2025 17:28 ๐Ÿ‘ 11 ๐Ÿ” 5 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 0
Post image

Our very own @sarapapi.bsky.social presenting FAMA at #clicit2025:

๐Ÿ“—Paper: clic2025.unica.it/wp-content/u...
๐Ÿ”— Models: hf.co/collections/...
๐Ÿ“Š Data: hf.co/datasets/FBK...
๐Ÿ’ป Code: github.com/hlt-mt/FBK-f...

Joint work with @speechtekfbk.bsky.social

25.09.2025 14:49 ๐Ÿ‘ 5 ๐Ÿ” 2 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 0
Post image

Last oral session of the first #clicit2025 day! See you all at the welcome drink!

24.09.2025 16:41 ๐Ÿ‘ 2 ๐Ÿ” 1 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 0

Iโ€™m the guest ๐Ÿ™‹๐Ÿปโ€โ™€๏ธ

24.09.2025 16:13 ๐Ÿ‘ 2 ๐Ÿ” 0 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 0
Post image

๐Ÿš€ Excited to present FAMA, the first large-scale #OpenScience #Speech foundation model for ๐Ÿ‡ฎ๐Ÿ‡น Italian & ๐Ÿ‡ฌ๐Ÿ‡ง English, at #clicit2025 (17:30โ€“18:45 oral session)!

๐Ÿ”— Models: hf.co/collections/...
๐Ÿ“Š Data: hf.co/datasets/FBK...
๐Ÿ’ป Code: github.com/hlt-mt/FBK-f...
๐Ÿ“„ Preprint: arxiv.org/pdf/2505.22759

24.09.2025 13:20 ๐Ÿ‘ 7 ๐Ÿ” 2 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 0

An interesting survey about #RAG and its interplay with #multimodality: Retrieval-Augmented Generation for AI-Generated Content: A Survey

arxiv.org/pdf/2402.19473

@fbk-mt.bsky.social

18.09.2025 16:26 ๐Ÿ‘ 0 ๐Ÿ” 0 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 1

MCIF: Multimodal Crosslingual Instruction-Following Benchmark from Scientific Talks
Read more: https://arxiv.org/html/2507.19634v1

04.08.2025 08:42 ๐Ÿ‘ 1 ๐Ÿ” 1 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 0

Sara Papi, Maike Z\"ufle, Marco Gaido, Beatrice Savoldi, Danni Liu, Ioannis Douros, Luisa Bentivogli, Jan Niehues
MCIF: Multimodal Crosslingual Instruction-Following Benchmark from Scientific Talks
https://arxiv.org/abs/2507.19634

29.07.2025 09:12 ๐Ÿ‘ 4 ๐Ÿ” 4 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 0
Post image

@sarapapi.bsky.social presented her TACL paper: โ€œHow real is your real-time simultaneous speech-to-text translation system?โ€

๐Ÿ‘‰ aclanthology.org/2025.tacl-1.14/
(2/6)

02.08.2025 16:31 ๐Ÿ‘ 3 ๐Ÿ” 1 ๐Ÿ’ฌ 1 ๐Ÿ“Œ 0
Preview
How โ€œRealโ€ is Your Real-Time Simultaneous Speech-to-Text Translation System?

Official paper: direct.mit.edu/tacl/article...

27.07.2025 13:23 ๐Ÿ‘ 1 ๐Ÿ” 0 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 0
Post image

๐Ÿ”ฅ Is your real-time SimulST system REAL?

Our TACL paper analyzes 110 works and reveals:
๐Ÿšซ Overreliance on short-form speech
๐ŸŒ€ Terminology chaos
๐Ÿ“‰ Real-world deployment gaps
We bring order-New taxonomy, trends & recommendations!

๐Ÿ“#ACL2025 Poster: Monday 11-12:30, Hall 4/5

#Speech #SpeechTech

27.07.2025 13:17 ๐Ÿ‘ 5 ๐Ÿ” 0 ๐Ÿ’ฌ 1 ๐Ÿ“Œ 0
Qualtrics Survey | Qualtrics Experience Management The most powerful, simple and trusted way to gather experience data. Start your journey to experience management and try a free account today.

๐Ÿ” Stiamo studiando come l'AI viene usata in Italia e per farlo abbiamo costruito un sondaggio!

๐Ÿ‘‰ bit.ly/sondaggio_ai...

(รจ anonimo, richiede ~10 minuti, e se partecipi o lo fai girare ci aiuti un sacco๐Ÿ™)

Ci interessa anche raggiungere persone che non si occupano e non sono esperte di AI!

03.06.2025 10:24 ๐Ÿ‘ 16 ๐Ÿ” 18 ๐Ÿ’ฌ 1 ๐Ÿ“Œ 0
Preview
FAMA - a FBK-MT Collection The First Large-Scale Open-Science Speech Foundation Model for English and Italian

๐Ÿš€ New tech report out! Meet FAMA, our open-science speech foundation model family for both ASR and ST in ๐Ÿ‡ฌ๐Ÿ‡ง English and ๐Ÿ‡ฎ๐Ÿ‡น Italian.

The models are live and ready to try on @hf.co:
๐Ÿ”— huggingface.co/collections/...

๐Ÿ“„ Preprint: arxiv.org/abs/2505.22759

#ASR #ST #OpenScience #MultilingualAI

30.05.2025 15:35 ๐Ÿ‘ 7 ๐Ÿ” 3 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 0
Inline citations with only first author name, or first two co-first author names.

Inline citations with only first author name, or first two co-first author names.

If you're finishing your camera-ready for ACL or ICML and want to cite co-first authors more fairly, I just made a simple fix to do this! Just add $^*$ to the authors' names in your bibtex, and the citations should change :)

github.com/tpimentelms/...

29.05.2025 08:53 ๐Ÿ‘ 85 ๐Ÿ” 23 ๐Ÿ’ฌ 4 ๐Ÿ“Œ 0

I am honored to have received this award today! ๐ŸŽŠ

09.05.2025 16:17 ๐Ÿ‘ 4 ๐Ÿ” 0 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 0

The evaluation period has begun for our shared tasks!

The test data is now available on our website, and submissions are due Tuesday April 15! โฐ

Please email task organizers or the google group with any questions ๐Ÿฅณ

03.04.2025 15:27 ๐Ÿ‘ 6 ๐Ÿ” 4 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 0

๐Ÿ“ข The evaluation period of the Instruction Following task at
@iwslt.bsky.social just started!

๐Ÿ–ฅ๏ธ Consider submitting your speech-to-text system!

The outputs can be easily uploaded on the SPEECHM platform developed in the Meetween project (www.meetween.eu)!
โžก๏ธ iwslt2025.speechm.cloud.cyfronet.pl

01.04.2025 12:39 ๐Ÿ‘ 9 ๐Ÿ” 5 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 0

Thanks a lot! @deboranozza.bsky.social already added me to the channel ๐Ÿ˜Š

25.03.2025 15:50 ๐Ÿ‘ 2 ๐Ÿ” 0 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 0

I'm thrilled to be one of the speakers at the next MT Marathon in Helsinki ๐Ÿš€

I look forward to sharing insights on automatic translation and related topics with our community!

19.03.2025 23:04 ๐Ÿ‘ 2 ๐Ÿ” 0 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 0

โค๏ธ

17.03.2025 23:54 ๐Ÿ‘ 1 ๐Ÿ” 0 ๐Ÿ’ฌ 1 ๐Ÿ“Œ 0
Preview
Step-Audio: Unified Understanding and Generation in Intelligent Speech Interaction Real-time speech interaction, serving as a fundamental interface for human-machine collaboration, holds immense potential. However, current open-source models face limitations such as high costs in vo...

Glad to see that the model weights of the new Step-Audio, a speech foundation model + large language model (+ speech decoder) architecture, are published under open licenses! ๐Ÿ†“

arxiv.org/abs/2502.11946

20.02.2025 15:11 ๐Ÿ‘ 4 ๐Ÿ” 0 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 1
Post image

As if you needed more reasons to submit to #GITT2025:
๐Ÿ”‘๐ŸŽต Cristina Anselmi, video game #localization & #AI expert with a focus on #inclusive #language will be our keynote speaker!
๐Ÿ’ธRegistration fees are on the MTSummit website and you can register just for GITT if you so choose ๐Ÿ˜Ž
๐Ÿ‘€ See you there! ๐Ÿ‘€

13.02.2025 15:10 ๐Ÿ‘ 13 ๐Ÿ” 9 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 1
Preview
Prepending or Cross-Attention for Speech-to-Text? An Empirical Comparison Following the remarkable success of Large Language Models (LLMs) in NLP tasks, there is increasing interest in extending their capabilities to speech -- the most common form in communication. To integ...

I'm happy to share that our paper "Prepending or Cross-Attention for Speech-to-Text? An Empirical Comparison" has been accepted at @naaclmeeting.bsky.social 2025! #NAACL2025

@mgaido91.bsky.social ๐Ÿ‘

๐Ÿ“ƒ Preprint: arxiv.org/abs/2501.02370
โฐ Code will be released soon

#NLProc #Speech

23.01.2025 08:44 ๐Ÿ‘ 10 ๐Ÿ” 3 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 0
Preview
a polar bear cub is laying in a pile of branches . ALT: a polar bear cub is laying in a pile of branches .

Hello world! ๐Ÿ‘‹ We're coming out of hibernation to bring you this happy news:
1) We're organising the 3rd edition of GITT at #MTSummit! Working on #gender & #translation #technology? We'll see you there!
2) We're moving away from Twitter, so share the news and help us find old and new GITT friends!

22.01.2025 12:17 ๐Ÿ‘ 26 ๐Ÿ” 15 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 1

๐Ÿ™Œ All members of our group are now on Bluesky! ๐Ÿ™Œ

You can find all of us in this starter pack ๐Ÿ‘‡

16.01.2025 09:51 ๐Ÿ‘ 6 ๐Ÿ” 5 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 0
Instruction-following Speech Processing track Home of the IWSLT conference and SIGSLT.

Exciting news: IWSLT will be co-located with @aclmeeting.bsky.social 2025 again this year! ๐ŸŽ‰

Interested in speech processing? Check out the new task on instruction following โ€” any model can participate! ๐Ÿš€

๐Ÿ“… Data release: April 1
โณ Submission deadline: April 15

๐Ÿ’ฌ iwslt.org/2025/instruc...

15.01.2025 18:36 ๐Ÿ‘ 11 ๐Ÿ” 5 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 0

Iโ€™m glad to announce that our work โ€œHow "Real" is Your Real-Time Simultaneous Speech-to-Text Translation System?โ€ has been accepted at the Transactions of @aclmeeting.bsky.social (TACL)! ๐ŸŽ‰

The preprint is available here:
arxiv.org/pdf/2412.18495

27.12.2024 14:07 ๐Ÿ‘ 7 ๐Ÿ” 2 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 0
Post image Post image

Our @apierg.bsky.social presenting our #calamita challenges at #CLiCit2024: machine translation and gender-fair generation.

Poster session upcoming, see you there!

For more details:
๐Ÿ‘‰ MagneT: clic2024.ilc.cnr.it/wp-content/u...
๐Ÿ‘‰ GFG: clic2024.ilc.cnr.it/wp-content/u...

06.12.2024 16:22 ๐Ÿ‘ 9 ๐Ÿ” 2 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 0