[ACM MM'25] Towards Blind Bitstream-corrupted Video Recovery: A Visual Foundation Model-driven Framework

Tianyi Liu¹, Kejun Wu², Chen Cai¹, Yi Wang³, Kim-Hui Yap¹, and Lap-Pui Chau³
¹School of Electrical and Electronic Engineering, Nanyang Technological University
²School of Electronic Information and Communications, Huazhong University of Science and Technology
³Department of Electrical and Electronic Engineering, The Hong Kong Polytechnic University

Installation

git clone https://github.com/LIUTIGHE/B2SCVR.git
conda create -n b2scvr python=3.10
conda activate b2scvr

# build mmcv first according to the official documents (can ignore the torch mismatch)
pip install mmcv==2.2.0 -f https://download.openmmlab.com/mmcv/dist/cu121/torch2.4/index.html

# install torch according to the official documents
conda install pytorch==2.5.1 torchvision==0.20.1 torchaudio==2.5.1 pytorch-cuda=12.1 -c pytorch -c nvidia  

# install DAC developed based on SAM2.1
cd ../model/modules/sam2
pip install -e .

# other requirements
cd ../../..
pip install -r requirements.txt

If ModuleNotFoundError: No module named 'torchvision.transforms.functional_tensor occurs, one possible solution is to manually modify the 8th row in degradations.py mentioned in the Error, from from torchvision.transforms.functional_tensor import rgb_to_grayscale to from torchvision.transforms.functional import rgb_to_grayscale
If you meet mmcv-related error, please modify the reported line mmcv.cnn -> mmengine.model / mmcv.runner -> mmengine.runner.

Quick Test

Prepare inputs and model checkpoints: a corrupted video bitstream and the first corruption indication (e.g., the first corruption mask in frame 9 of inputs/trucks-race_2.h264). Then download the model checkpoints via this link, and put them into checkpoints/ folder.
Extract the corrupted frames and motion vector (mv) and prediction mode (pm) for each frame from the input corrupted video bitstream (e.g., inputs/trucks-race_2.h264)
```
python inputs.py --input inputs/trucks-race_2.h264
```

Stage 1: Use DAC to detect and localize video corruption:

cd model/modules/sam2
bash run.sh  # if there is a loading error, mostly related to vos_inference.py line 277-278, which sets a fixed suffix

Stage 2: Use the CFC-based recovery model to perform restoration

cd ../../..
python test.py --ckpt checkpoints/B2SCVR.pth --input_video inputs/bsc_imgs/trucks-race --dac_mask inputs/results/trucks-race --width 432 --height 240  # set 240P test if OOM occurs

The recovered frames sequence and GIF video will be saved in outputs/ folder.

Citation

If you find the code useful, please kindly consider citing our paper

@article{liu2025towards,
  title={Towards Blind Bitstream-corrupted Video Recovery via a Visual Foundation Model-driven Framework},
  author={Liu, Tianyi and Wu, Kejun and Cai, Chen and Wang, Yi and Yap, Kim-Hui and Chau, Lap-Pui},
  journal={arXiv preprint arXiv:2507.22481},
  year={2025}
}

Acknowledgements

This work is built upon BSCV, SAM-2, and ATD.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

[ACM MM'25] Towards Blind Bitstream-corrupted Video Recovery: A Visual Foundation Model-driven Framework

Installation

Quick Test

Citation

Acknowledgements

About

Uh oh!

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 27 Commits
checkpoints		checkpoints
core		core
model		model
README.md		README.md
inputs.py		inputs.py
requirements.txt		requirements.txt
test.py		test.py

LIUTIGHE/B2SCVR

Folders and files

Latest commit

History

Repository files navigation

[ACM MM'25] Towards Blind Bitstream-corrupted Video Recovery: A Visual Foundation Model-driven Framework

Installation

Quick Test

Citation

Acknowledgements

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages