The Gemini 2.5 Technical Report is out: storage.googleapis.com/deepmind-med...
Introducing Gemini 2.5, our most intelligent model, with impressive capabilities in advanced reasoning and coding.
Now integrating thinking capabilities, 2.5 Pro Experimental is our most performant Gemini model yet. It's #1 on the LM Arena leaderboard.
We've been teaching Gemini to think.
Try it here: aistudio.google.com/prompts/new_...
Happy birthday Gemini!
We release Tülu 3, a family of fully open, state-of-the-art post-trained models, alongside their data, code, and training recipes, serving as a comprehensive guide to modern post-training techniques!
Good software is an enabler for good science!
Inspired by the post below, I like to point people at libraries like github.com/patrick-kidg... as a template for what a modern Python library looks like: `pre-commit`, ruff, pyright, pyproject.toml, an open-source license, etc.
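As a rough sketch of how those pieces fit together (the package name, version, and tool settings below are illustrative placeholders, not taken from the linked repository), a minimal `pyproject.toml` might look like:

```toml
# Illustrative pyproject.toml for a modern Python library.
# Names and values here are placeholders, not the linked repo's config.

[build-system]
requires = ["hatchling"]
build-backend = "hatchling.build"

[project]
name = "mylib"                      # hypothetical package name
version = "0.1.0"
requires-python = ">=3.9"
license = { file = "LICENSE" }      # the open-source license mentioned above

[tool.ruff]
line-length = 88                    # ruff lints/formats from the same file

[tool.pyright]
typeCheckingMode = "strict"         # pyright type-checks from here too
```

With a matching `.pre-commit-config.yaml`, running `pre-commit run --all-files` then applies ruff and the other hooks before every commit, so the whole toolchain is driven from two checked-in config files.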
Fun, insightful, useful, cheap: Thinking Like A Large Language Model: Become an AI manager a.co/d/7xMTtJM
A comparison of the LLMs' mean ratings along presentational and epistemic dimensions.
We compared notable LLMs such as InstructGPT, ChatGPT, GPT-4, PaLM 2 (text-bison), and Falcon-180B. They excel at presenting climate information, but there's room for improvement in the epistemic qualities of their answers.
This is a tough task for human raters. Our study finds that AI can effectively assist human raters, offering promising avenues for scalable oversight on difficult problems like this.
Excited to share our latest paper: We explore how large language models tackle questions on climate change, introducing an evaluation framework grounded in #SciComm research.
Read the preprint: arxiv.org/abs/2310.02932