Hi @ShandaAI 🤗
I'm Niels and work as part of the open-source team at Hugging Face. I discovered your work through Hugging Face's daily papers as yours got featured: https://huggingface.co/papers/2603.23497.
The paper page lets people discuss about your paper and lets them find artifacts about it (your dataset for instance), you can also claim the paper as yours which will show up on your public profile at HF, and add GitHub/project page URLs.
I saw on your GitHub and project page that the WildWorld dataset and WildBench are "Coming Soon". It looks like an incredible contribution to the world modeling and generative ARPG community!
Would you like to host the dataset (and any pre-trained baseline models you plan to release) on https://huggingface.co/datasets? Hosting on Hugging Face will give your work significantly more visibility and enable better discoverability through metadata tags.
For a dataset of this scale (108M+ frames), hosting on 🤗 would allow people to do:
from datasets import load_dataset
dataset = load_dataset("ShandaAI/WildWorld")
If you're interested, you can find a guide here: https://huggingface.co/docs/datasets/loading. We also support Webdataset, which is particularly useful for efficiently streaming large-scale video/multimodal datasets: https://huggingface.co/docs/datasets/en/loading#webdataset.
Additionally, if you decide to release your pre-trained models, hosting them on the Hub allows users to easily download and use them via hf_hub_download or direct library integrations.
Let me know if you're interested or need any guidance!
Kind regards,
Niels
ML Engineer @ HF 🤗
Hi @ShandaAI 🤗
I'm Niels and work as part of the open-source team at Hugging Face. I discovered your work through Hugging Face's daily papers as yours got featured: https://huggingface.co/papers/2603.23497.
The paper page lets people discuss about your paper and lets them find artifacts about it (your dataset for instance), you can also claim the paper as yours which will show up on your public profile at HF, and add GitHub/project page URLs.
I saw on your GitHub and project page that the WildWorld dataset and WildBench are "Coming Soon". It looks like an incredible contribution to the world modeling and generative ARPG community!
Would you like to host the dataset (and any pre-trained baseline models you plan to release) on https://huggingface.co/datasets? Hosting on Hugging Face will give your work significantly more visibility and enable better discoverability through metadata tags.
For a dataset of this scale (108M+ frames), hosting on 🤗 would allow people to do:
If you're interested, you can find a guide here: https://huggingface.co/docs/datasets/loading. We also support Webdataset, which is particularly useful for efficiently streaming large-scale video/multimodal datasets: https://huggingface.co/docs/datasets/en/loading#webdataset.
Additionally, if you decide to release your pre-trained models, hosting them on the Hub allows users to easily download and use them via
hf_hub_downloador direct library integrations.Let me know if you're interested or need any guidance!
Kind regards,
Niels
ML Engineer @ HF 🤗