Nima Nooshiri's Avatar

Nima Nooshiri

@nimanzik

ML Engineer at BDiM GmbH | PhD in Seismology | Pythoneer | Digital Signal Processing | Applied Deep Learning | Prev.: GFZ Potsdam and DIAS Dublin

103
Followers
235
Following
1
Posts
21.10.2023
Joined
Posts Following

Latest posts by Nima Nooshiri @nimanzik

Post image

The Illustrated DeepSeek-R1

Spent the weekend reading the paper and sorting through the intuitions. Here's a visual guide and the main intuitions to understand the model and the process that created it.

newsletter.languagemodels.co/p/the-illust...

27.01.2025 20:22 ๐Ÿ‘ 76 ๐Ÿ” 23 ๐Ÿ’ฌ 1 ๐Ÿ“Œ 4
Table Representation Learning Workshop TRL Workshop ---

At NeurIPS, IMHO:
- openreview.net/forum?id=LN5...
- openreview.net/forum?id=WH5...
- openreview.net/forum?id=Vbm...
- openreview.net/forum?id=FmN...
Before (self-plug):
- arxiv.org/abs/2402.16785
- arxiv.org/abs/2207.01848
Beyond:
- arxiv.org/abs/2410.18164
- table-representation-learning.github.io

30.12.2024 21:02 ๐Ÿ‘ 26 ๐Ÿ” 2 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 0
Post image

Improved TableReport:
โ—ผ tighter layout
โ—ผ support any script (any alphabet ุญุจ เคฎเคพเคฏเคพ) in the plots
โ—ผ robust to outliers

It works without dependencies, in any html-based environment (Jupyter notebooks, @vscode.dev, a simple web page...)

Check it out on skrub-data.org
4/5

27.11.2024 20:46 ๐Ÿ‘ 1 ๐Ÿ” 2 ๐Ÿ’ฌ 1 ๐Ÿ“Œ 0
A high-level summary diagram taken from the slides linked below. It shows the interplay of two main components: a probabilistic model and decision maker or planner.

A high-level summary diagram taken from the slides linked below. It shows the interplay of two main components: a probabilistic model and decision maker or planner.

Probabilistic predictions of an underfitting polynomial classifier on a noisy XOR task and the corresponding under-confident calibration curve.

Probabilistic predictions of an underfitting polynomial classifier on a noisy XOR task and the corresponding under-confident calibration curve.

Probabilistic predictions of an overfitting polynomial classifier and the resulting overconfident calibration curve on the same noisy XOR problem.

Probabilistic predictions of an overfitting polynomial classifier and the resulting overconfident calibration curve on the same noisy XOR problem.

Simulation study to show the relative lack of stability of hyperparameter tuning when using hard metrics such as Accuracy or soft yet not probabilistic metrics such as ROC AUC compared to a strictly proper scoring rule such as the log-loss.

Simulation study to show the relative lack of stability of hyperparameter tuning when using hard metrics such as Accuracy or soft yet not probabilistic metrics such as ROC AUC compared to a strictly proper scoring rule such as the log-loss.

I recently shared some of my reflections on how to use probabilistic classifiers for optimal decision-making under uncertainty at @pydataparis.bsky.social 2024.

Here is the recording of the presentation:

www.youtube.com/watch?v=-gYn...

27.11.2024 14:17 ๐Ÿ‘ 49 ๐Ÿ” 19 ๐Ÿ’ฌ 1 ๐Ÿ“Œ 1

We're always updating the pydata & scipy project starter pack:
go.bsky.app/6HkrMcp

Hello @scikit-learn.bsky.social , @networkx.bsky.social , @scipyconf.bsky.social

22.11.2024 17:46 ๐Ÿ‘ 53 ๐Ÿ” 20 ๐Ÿ’ฌ 6 ๐Ÿ“Œ 1
Preview
Hype, Sustainability, and the Price of the Bigger-is-Better Paradigm in AI With the growing attention and investment in recent AI approaches such as large language models, the narrative that the larger the AI system the more valuable, powerful and interesting it is is increa...

Me: writes a paper (with @sashamtl.bsky.social and @meredithmeredith.bsky.social) on how hype and choice of goals and benchmarks leads to an arm race in AI arxiv.org/abs/2409.14160

Them: here's my proposal to solve the problem by a better AI algorithm.

๐Ÿคจ The problem is social: it's norm setting..

05.11.2024 13:29 ๐Ÿ‘ 46 ๐Ÿ” 12 ๐Ÿ’ฌ 3 ๐Ÿ“Œ 0

I guess you are looking for this GeoGinger ๐Ÿ‘‰๐Ÿป @geoginger.bsky.social

16.11.2024 23:40 ๐Ÿ‘ 1 ๐Ÿ” 0 ๐Ÿ’ฌ 1 ๐Ÿ“Œ 0
Preview
Optuna: a hyperparameter optimization framework Scikit-learn allows you to perform hyperparameter search but a lot of it happens in memory. Sometimes you want to have a storage layer for these hyperparamet...

Will be going live in a bit to talk about Optuna. Feel free to join live at lunch if you have any questions you'd like me to answer live.

15.11.2024 09:21 ๐Ÿ‘ 5 ๐Ÿ” 1 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 0
Map of โ€œvolcanoes that changed the worldโ€

Map of โ€œvolcanoes that changed the worldโ€

Probably not the upbeat, diverting story most of us are looking for these days, but an interesting read nevertheless.
And hey, whatโ€™s another future cataclysm among friends?

โ€œThe next massive volcano eruption will cause climate chaos โ€” and we are unpreparedโ€
โš’๏ธ๐Ÿงช๐ŸŒ‹
www.nature.com/articles/d41...

13.11.2024 16:27 ๐Ÿ‘ 43 ๐Ÿ” 14 ๐Ÿ’ฌ 9 ๐Ÿ“Œ 3

boom

13.11.2024 16:19 ๐Ÿ‘ 81 ๐Ÿ” 7 ๐Ÿ’ฌ 2 ๐Ÿ“Œ 0