OpenSearch-AI/Youku-Dense-Caption

Youku Dense Caption Dataset 🎥


📊 Dataset Overview

A comprehensive collection of Chinese video captions from Youku (优酷), featuring:

  • 📹 Videos: 31,466 complete short videos
  • ✍️ Captions: 311,921 Chinese captions
  • 🈺 Language: Chinese
  • 📱 Source: Youku Platform (优酷)

🚀 Usage

The dataset is available for download from ModelScope.

1. Dataset Download ⬇️

# Install Git LFS
git lfs install

# Clone the dataset (replace your_git_token with your ModelScope access token)
git lfs clone https://oauth2:your_git_token@www.modelscope.cn/datasets/os_ai/Youku_Dense_Caption.git

🔑 Get Token: Visit https://modelscope.cn/my/myaccesstoken
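
As a minimal sketch, the clone URL above can be assembled from a variable so that a script does not have to embed the token literally. The function name build_clone_url and the variable MODELSCOPE_TOKEN are hypothetical names chosen for this illustration; they are not defined by ModelScope or by this repository.

```shell
#!/bin/sh
# Build the clone URL from a token passed as the first argument.
# build_clone_url and MODELSCOPE_TOKEN are illustrative names only.
build_clone_url() {
    # $1: ModelScope access token
    printf 'https://oauth2:%s@www.modelscope.cn/datasets/os_ai/Youku_Dense_Caption.git\n' "$1"
}

# Usage (commented out because it needs network access and a valid token):
# export MODELSCOPE_TOKEN=your_git_token
# git lfs install
# git lfs clone "$(build_clone_url "$MODELSCOPE_TOKEN")"
```

The URL format is the one given in the download step; only the token placeholder is substituted.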

2. Dataset Structure 📁

📦 ROOT
├── 📁 benchmark_files/
│   ├── 📊 generation.json     # Test set for caption generation
│   └── 📊 grounding.json      # Test set for video moment retrieval
│
├── 📁 meta_files/
│   ├── 📋 Agriculture.csv     # Video file paths and complete captions in the agriculture category
│   ├── 📋 Children.csv
│   └── 📋 [Other Categories].csv
│
└── 📁 data_files/
    ├── 📁 Agriculture/         # Agriculture videos
    │   ├── 📦 train/           # Training set (zipped)
    │   ├── 📦 val/             # Validation set (zipped)
    │   └── 📁 test/            # Test set (preview ready)
    │
    ├── 📁 Children/            # Children videos
    │   ├── 📦 train/
    │   ├── 📦 val/
    │   └── 📁 test/
    │
    └── 📁 [Other Categories]/  # Other categories
        ├── 📦 train/
        ├── 📦 val/
        └── 📁 test/
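
After cloning, it can be useful to confirm that the local copy matches the layout above. The following is a rough sketch, assuming only the directory names shown in the tree; the function names list_categories and check_splits are hypothetical helpers written for this example.

```shell
#!/bin/sh
# List the category names found under data_files/, one per line.
# $1: dataset root, i.e. the directory that contains data_files/
list_categories() {
    for dir in "$1"/data_files/*/; do
        [ -d "$dir" ] || continue    # the glob matched nothing
        basename "$dir"
    done
}

# Report every category that lacks a train/, val/, or test/ folder.
# $1: dataset root
check_splits() {
    list_categories "$1" | while IFS= read -r category; do
        for split in train val test; do
            [ -d "$1/data_files/$category/$split" ] ||
                echo "missing: $category/$split"
        done
    done
}

# Usage: check_splits Youku_Dense_Caption
```

An empty result from check_splits means every category directory contains all three split folders.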

3. Usage Guide 📖

  1. After Download:

    • Navigate to the target category folder
    • Example: cd data_files/Agriculture
  2. Data Preparation:

    • Unzip the files in the train/ and val/ directories
    • Files in the test/ directory are ready to use

⚠️ Important Notes:

  • train and val data are stored in compressed format, requiring extraction
  • test data is directly accessible for preview and testing
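
The preparation steps above can be sketched as a small script. It assumes the compressed files are .zip archives located inside each category's train/ and val/ folders, which the README implies but does not spell out; find_archives and extract_archives are hypothetical helper names.

```shell
#!/bin/sh
# Print every .zip archive under the train/ and val/ folders of one category.
# $1: category directory, e.g. data_files/Agriculture
find_archives() {
    for split in train val; do
        [ -d "$1/$split" ] || continue
        find "$1/$split" -type f -name '*.zip'
    done
}

# Extract each archive into the folder that contains it; test/ is not touched.
# unzip -n skips files that already exist, so the function can be re-run safely.
extract_archives() {
    find_archives "$1" | while IFS= read -r archive; do
        unzip -q -n "$archive" -d "$(dirname "$archive")"
    done
}

# Usage: extract_archives data_files/Agriculture
```

Running find_archives first shows which files would be extracted, which is a cheap check before unpacking a large category.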

💡 For questions, please refer to the project documentation or submit an Issue

📚 Citation

If you use this dataset in your research, please cite:

@inproceedings{xiong2025youku,
    title={Youku Dense Caption: A Large-scale Chinese Video Dense Caption Dataset and Benchmarks},
    author={Zixuan Xiong and Guangwei Xu and Wenkai Zhang and Yuan Miao and Xuan Wu and LinHai and Ruijie Guo and Hai-Tao Zheng},
    booktitle={The Thirteenth International Conference on Learning Representations},
    year={2025},
    url={https://openreview.net/forum?id=vvi5OjPhbu}
}

📄 License

This dataset is released under the CC BY-NC-SA 4.0 license.


⭐ Star us on GitHub if you find this dataset useful! ⭐
