Region_Learner

The Pytorch implementation for "Video-Text Pre-training with Learned Regions" (arxiv)

We are still cleaning up the code further and preparing for pre-training weights.

Preparation

Overall, this code is built on PyTorch with DistributedDataParallel (DDP).

Create conda env and install required packages via sh setup_myEnv.sh
Create some important folders
1. mkdir data (you can symlink huge datasets to this folder)
2. mkdir meta_data (put meta data of each dataset here)
3. mkdir results
Download Pre-training data
1. Download WebVid-2M (see https://github.com/m-bain/webvid)
2. Download CC3M (see https://ai.google.com/research/ConceptualCaptions/download)

PS: Not all videos are avaible so that you need to modify the metadata depend on your case. We also provide our metadata in here.

Pre-training

Run sh pre-training.sh (Commands with different settings are listed in this script.)

Finetuning (on MSR-VTT)

Download data (see https://github.com/m-bain/frozen-in-time#-finetuning-benchmarks-msr-vtt)
Run sh fine-tune.sh.

Pre-trained Weights

WebVid2M + CC3M

Acknowledgements

This code is based off Frozen in Time

Citation

@article{yan2021video,
  title={Video-Text Pre-training with Learned Regions},
  author={Yan, Rui and Shou, Mike Zheng and Ge, Yixiao and Wang, Alex Jinpeng and Lin, Xudong and Cai, Guanyu and Tang, Jinhui},
  journal={arXiv preprint arXiv:2112.01194},
  year={2021}
}

Name		Name	Last commit message	Last commit date
Latest commit History 18 Commits
base		base
configs		configs
data_loader		data_loader
logger		logger
model		model
trainer		trainer
utils		utils
.DS_Store		.DS_Store
README.md		README.md
args.py		args.py
fine-tuning.sh		fine-tuning.sh
parse_config.py		parse_config.py
pre-training.sh		pre-training.sh
setup_myEnv.sh		setup_myEnv.sh
train.py		train.py

showlab/Region_Learner

Folders and files

Latest commit

History

Repository files navigation

Region_Learner

Preparation

Pre-training

Finetuning (on MSR-VTT)

Pre-trained Weights

Acknowledgements

Citation

About

Topics

Resources

Stars

Watchers

Forks

Languages