Skip to content

Release WildWorld dataset and baseline models on Hugging Face #2

@NielsRogge

Description

@NielsRogge

Hi @ShandaAI 🤗

I'm Niels and work as part of the open-source team at Hugging Face. I discovered your work through Hugging Face's daily papers as yours got featured: https://huggingface.co/papers/2603.23497.

The paper page lets people discuss about your paper and lets them find artifacts about it (your dataset for instance), you can also claim the paper as yours which will show up on your public profile at HF, and add GitHub/project page URLs.

I saw on your GitHub and project page that the WildWorld dataset and WildBench are "Coming Soon". It looks like an incredible contribution to the world modeling and generative ARPG community!

Would you like to host the dataset (and any pre-trained baseline models you plan to release) on https://huggingface.co/datasets? Hosting on Hugging Face will give your work significantly more visibility and enable better discoverability through metadata tags.

For a dataset of this scale (108M+ frames), hosting on 🤗 would allow people to do:

from datasets import load_dataset

dataset = load_dataset("ShandaAI/WildWorld")

If you're interested, you can find a guide here: https://huggingface.co/docs/datasets/loading. We also support Webdataset, which is particularly useful for efficiently streaming large-scale video/multimodal datasets: https://huggingface.co/docs/datasets/en/loading#webdataset.

Additionally, if you decide to release your pre-trained models, hosting them on the Hub allows users to easily download and use them via hf_hub_download or direct library integrations.

Let me know if you're interested or need any guidance!

Kind regards,

Niels
ML Engineer @ HF 🤗

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions