Hugging Face's Avatar

Hugging Face

@hf.co

The AI community building the future!

16,007
Followers
53
Following
3
Posts
17.05.2023
Joined
Posts Following

Latest posts by Hugging Face @hf.co

Screenshot of a simple app showing bounding boxes for photographs detected in historic newspaper images.

Screenshot of a simple app showing bounding boxes for photographs detected in historic newspaper images.

hf jobs uv run \
  --flavor a100-large \
  -s HF_TOKEN=HF_TOKEN \
  https://huggingface.co/datasets/uv-scripts/sam3/raw/main/detect-objects.py \
  -- davanstrien/newspapers-with-images-after-photography-big \
  davanstrien/newspapers-photo-predictions \
  --class-name "photograph" \
  --confidence-threshold 0.4

hf jobs uv run \ --flavor a100-large \ -s HF_TOKEN=HF_TOKEN \ https://huggingface.co/datasets/uv-scripts/sam3/raw/main/detect-objects.py \ -- davanstrien/newspapers-with-images-after-photography-big \ davanstrien/newspapers-photo-predictions \ --class-name "photograph" \ --confidence-threshold 0.4

Building datasets to train smaller, task-focused models used to be incredibly time-consuming.

Very excited to see SAM3 massively lower that barrier. Describe the class you want to detect and get annotated datasets automatically!

Try it yourself: huggingface.co/datasets/uv-...!

21.11.2025 13:30 πŸ‘ 51 πŸ” 12 πŸ’¬ 1 πŸ“Œ 0
Post image

Training LLMs end to end is hard. But way more people should, and will, be doing it in the future.

The @hf.co Research team is excited to share their new e-book that covers the full pipeline:
Β· pre-training,
Β· post-training,
Β· infra.

200+ pages of what worked and what didn’t. ‡️

02.11.2025 15:17 πŸ‘ 141 πŸ” 25 πŸ’¬ 4 πŸ“Œ 1
Preview
The Pharmome Map: a comprehensive public dataset for drug-target interaction modeling A Blog post by Hugging Science on Hugging Face

πŸ’» Our pharmome mapping data is now accessible to ML developers on @hf.co, making our purpose-built drug-target interaction data easily accessible for model development.

huggingface.co/blog/hugging...

18.11.2025 19:48 πŸ‘ 7 πŸ” 2 πŸ’¬ 1 πŸ“Œ 0
Preview
JuliaHealthOrg (The JuliaHealth Organization) Org profile for The JuliaHealth Organization on Hugging Face, the AI community building the future.

JuliaHealth is on @hf.co! πŸ€—

If you are interested in #julialang, #llm or #agentic workflows, and how #GenAI can be used within public health, medical informatics, and survey-based research, drop us a line or a follow! πŸ€“

How are you using GenAI in your medical research?

#opensource #medsky #data

20.10.2025 20:14 πŸ‘ 12 πŸ” 5 πŸ’¬ 0 πŸ“Œ 0
Preview
Showcasing Your Work on HuggingFace Spaces | Towards Data Science Building an app is exciting - but sharing it is where the real value kicks in. Back when Heroku offered a free tier, deploying demos was effortless. Those days are gone, and finding a simple, free…

When we build an app, it’s only natural to want to share it. Ivo Bernardo walks you through a short tutorial on how to deploy your own @hf.co Space. If you want to highlight your work and applications, this is a strong option.

15.10.2025 16:23 πŸ‘ 3 πŸ” 1 πŸ’¬ 0 πŸ“Œ 0
Video thumbnail

Small models work great for GLAM but there aren't enough examples!

With @wjbmattingly.bsky.social I'm launching small-models-for-glam on @hf.co to create/curate models that run on modest hardware and address GLAM use cases.

Follow the org to keep up-to-date!
huggingface.co/small-models...

16.10.2025 13:22 πŸ‘ 12 πŸ” 7 πŸ’¬ 0 πŸ“Œ 0
Preview
comma-project/comma-jsonl Β· Datasets at Hugging Face We’re on a journey to advance and democratize artificial intelligence through open source and open science.

(10/🧡)

The corpus isn’t just readable πŸ‘οΈ β€” it’s also fully downloadable!
Now hosted on @hf.co :

🧾 JSONL dataset β†’ huggingface.co/datasets/com...
πŸ“‚ More formats (ALTO, TEI, etc.) coming soon β€” we’re uploading the GBs as we speak.

15.10.2025 14:51 πŸ‘ 4 πŸ” 1 πŸ’¬ 1 πŸ“Œ 0
Preview
ValiantLabs/Qwen3-4B-Thinking-2507-Esper3.1 Β· Hugging Face We’re on a journey to advance and democratize artificial intelligence through open source and open science.

Esper 3.1 is here on @hf.co - our DevOps, coding, and architecture specialist is back, trained on higher difficulty data! For everyone to use: huggingface.co/ValiantLabs/...

06.10.2025 16:28 πŸ‘ 11 πŸ” 1 πŸ’¬ 1 πŸ“Œ 0
Screenshot of the dataset viewer showing a column of marc data + the first few pages of an open access monograph

Screenshot of the dataset viewer showing a column of marc data + the first few pages of an open access monograph

New @hf.co BigLAM dataset: 9,363 OA books with page images + rich MARC metadata for evaluating (and training) VLMs on metadata extraction.

Libraries are starting to explore AI-assisted cataloguing, but we lack public evaluation data. Hoping this helps fill that gap.

huggingface.co/datasets/big...

02.10.2025 18:50 πŸ‘ 32 πŸ” 9 πŸ’¬ 2 πŸ“Œ 2
Post image

We have Nvidia B200s ready to go for you in Hugging Face Inference Endpoints πŸ”₯

I tried them out myself and the performance is amazing.

On top of that we just got a fresh batch of H100s as well. At $4.5/hour it's a clear winner in terms of price/perf compared to the A100.

06.10.2025 08:44 πŸ‘ 6 πŸ” 1 πŸ’¬ 0 πŸ“Œ 0
Post image Post image Post image

Wonder how LLMs learn over long time horizons & how hate-checks deal with time?

Look at our new work "Chronoberg": an open-source dataset spanning 250 years of books with analysis of shifts in meaning & continual learning of LLMs:

arxiv.org/pdf/2509.22360

huggingface.co/datasets/spa...

06.10.2025 09:42 πŸ‘ 5 πŸ” 1 πŸ’¬ 0 πŸ“Œ 0
Graph showing the conversion of Hugging Face repositories from LFS storage to Xet storage.

Graph showing the conversion of Hugging Face repositories from LFS storage to Xet storage.

The Hub is on 100% on Xet. πŸš€

A little over a year ago, @hf.co acquired XetHub to unlock the next phase of growth in models and datasets. huggingface.co/blog/xethub-...

In April, there were 1,000 Hugging Face repos on Xet. Now every repo (over 6M) on the Hub is on Xet.

03.10.2025 15:16 πŸ‘ 12 πŸ” 5 πŸ’¬ 2 πŸ“Œ 0
Post image

One of the hardest challenges in AI safety is finding the right balance: how do we protect people from harm without undermining their agency? This tension is especially visible in conversational systems, where safeguards can sometimes feel more paternalistic than supportive.

29.09.2025 12:06 πŸ‘ 11 πŸ” 1 πŸ’¬ 1 πŸ“Œ 2
Preview
Introducing RTEB: A New Standard for Retrieval Evaluation We’re on a journey to advance and democratize artificial intelligence through open source and open science.

The @hf.co community is awesome. Real work that moves everyone forward: huggingface.co/blog/rteb

01.10.2025 16:22 πŸ‘ 3 πŸ” 1 πŸ’¬ 0 πŸ“Œ 0
Video thumbnail

πŸ‹πŸ¦πŸΈ We just launched an interactive demo of NatureLM-audio on @hf.co!

πŸ‘‰ Try the demo with your audio or ours, share your feedback, and help us shape the future of decoding animal communication: huggingface.co/blog/EarthSp...

04.09.2025 16:17 πŸ‘ 11 πŸ” 3 πŸ’¬ 0 πŸ“Œ 1
Post image

You only need one line of code to start exploring the new @OpenAI models!

gr.load("models/openai/gpt-oss-120b", provider="fireworks-ai").launch()

05.08.2025 18:13 πŸ‘ 11 πŸ” 2 πŸ’¬ 1 πŸ“Œ 0
Post image

Well, it took just 2 hours for GPT-OSS to hit #1 on @hf.co. Don’t remember seeing anything rise that fast!

05.08.2025 19:33 πŸ‘ 11 πŸ” 2 πŸ’¬ 0 πŸ“Œ 0
Preview
openai/gpt-oss-120b Β· Hugging Face We’re on a journey to advance and democratize artificial intelligence through open source and open science.

OpenAI have released their new open source models!

One thing I really like about this release is that while they are only open weight, the model is not gated in any way (anyone can download it) and it has a permissive OSS license (apache 2).

Very refreshing.

huggingface.co/openai/gpt-o...

06.08.2025 02:10 πŸ‘ 13 πŸ” 5 πŸ’¬ 2 πŸ“Œ 0
Preview
uv-scripts/openai-oss Β· Datasets at Hugging Face We’re on a journey to advance and democratize artificial intelligence through open source and open science.

You can now generate synthetic data using OpenAIs GPT OSS models on @hf.co Jobs!

One command, no setup:

hf jobs uv run --flavor l4x4 [script-url] \
--input-dataset your/dataset \
--output-dataset your/output

Works on L4 GPUs ⚑

huggingface.co/datasets/uv-...

06.08.2025 07:38 πŸ‘ 11 πŸ” 1 πŸ’¬ 0 πŸ“Œ 0
Screenshot of the app showing a page from a book + different views of existing and new ocr.

Screenshot of the app showing a page from a book + different views of existing and new ocr.

Many VLM-based OCR models have been released recently. Are they useful for libraries and archives?

I made a quick Space to compare VLM OCR with "traditional" OCR using 11k Scottish exam papers from @natlibscot.bsky.social

huggingface.co/spaces/davanstrien/ocr-time-capsule

01.08.2025 15:09 πŸ‘ 47 πŸ” 15 πŸ’¬ 4 πŸ“Œ 1
Preview
Ready Xet Go - a Hugging Face Space by jsulz This app helps you monitor the progress of migrating repositories to Xet, showing you stats and charts on migration status and file types.

We just crossed 1 million repositories backed by Xet storage on @hf.co

I celebrated by reviving the early 2000s web design aesthetics that I love so much. Here's our dashboard showing our progress converting the Hub from Git LFS to Xet (and demonstrating my questionable design sensibilities).

30.07.2025 18:41 πŸ‘ 7 πŸ” 1 πŸ’¬ 1 πŸ“Œ 0
Screenshot of running HF News Aggregator.

Screenshot of running HF News Aggregator.

Built my 1st app exclusively using @hf.co Hub features! It helps me keep track/summarize the latest HF News. Uses Datasets, Inference Endpoints, the newly announced `hf jobs`, and Spaces to visualize results. Check it out here: huggingface.co/spaces/rajat...

30.07.2025 02:08 πŸ‘ 6 πŸ” 1 πŸ’¬ 0 πŸ“Œ 0
Preview
Run and manage Jobs We’re on a journey to advance and democratize artificial intelligence through open source and open science.

`hf jobs` looks super interesting. Send off workloads easily to remote infra.

And with inline metadata uv scripts
`hf jobs uv run my_script.py` all dependencies can be defined in a single file. So simple and so useful.

huggingface.co/docs/hugging...

28.07.2025 19:29 πŸ‘ 13 πŸ” 3 πŸ’¬ 0 πŸ“Œ 0
Post image

Working to port VLaMy to an entirely free mode where you can just cache all your data in the browser for a project. Slowly adding all the features from the full version to this user-free version. Available now on @hf.co @danielvanstrien.bsky.social
Link: huggingface.co/spaces/wjbma...

28.07.2025 17:11 πŸ‘ 20 πŸ” 6 πŸ’¬ 2 πŸ“Œ 0
Video thumbnail

465 people. 122 languages. 58,185 annotations!

FineWeb-C v1 is complete! Communities worldwide have built their own educational quality datasets, proving that we don't need to wait for big tech to support languages.

Huge thanks to all who contributed!

huggingface.co/blog/davanst...

08.07.2025 12:07 πŸ‘ 33 πŸ” 11 πŸ’¬ 2 πŸ“Œ 0
Video thumbnail

Last week, we concluded the @gradio-hf.bsky.social‬ MCP hackathon with @hf.co‬. The project that one the @llamaindex.bsky.social prize was the "Nasa Space Explorer" πŸ”­πŸͺ
3 servers that provide live data on:
β˜„οΈ Asteroids
πŸ€– the Mars Rover
🌌 Astronomy

Here's the space: huggingface.co/spaces/Agen...

26.06.2025 18:13 πŸ‘ 5 πŸ” 2 πŸ’¬ 0 πŸ“Œ 0
introducing LeRobot.js - interact with your robot in the browser
introducing LeRobot.js - interact with your robot in the browser YouTube video by Tim Pietrusky

i love the simplicity of LeRobot from @hf.co to interact with robots, especially for beginners like me

there is one very huge problem: it's written in python, but i love js

introducing: LeRobot.js

interact with your robot directly in the browser

www.youtube.com/watch?v=H1iU...

04.07.2025 23:55 πŸ‘ 12 πŸ” 3 πŸ’¬ 1 πŸ“Œ 0
Post image

🧬 super psyched to announce a new collaboration between @hf.co and Ginkgo Datapoints to open up high-quality biological datasets for the machine learning community!

Just dropped the GDPx and GDPa dataset series on the Hub (x1000 boost to AI for drug development)

πŸ”— huggingface.co/ginkgo-datap...

06.07.2025 08:56 πŸ‘ 6 πŸ” 1 πŸ’¬ 0 πŸ“Œ 0

If you are interested in a unified collection of common misinformation detection benchmarks, check out our recent repo @hf.co

19.06.2025 18:23 πŸ‘ 15 πŸ” 6 πŸ’¬ 0 πŸ“Œ 0