Official code and dataset for VQA² series models and dataset
Install dependencies:
cd VQA_main
conda create -n VQA python=3.10 -y
conda activate VQA
pip install --upgrade pip # Enable PEP 660 support.
pip install -e ".[train]"
pip install pytorchvideo #For slowfast base model download
pip install transformers==4.44.0 #Change the transformers version
Notice! Replace the VQA/lib/python3.10/site-packages/transformers/models/qwen2/modeling_qwen2.py to VQA_main/modeling_qwen2.py (we set some customized parameters in it).
cd VQA_main
For UGC-Video Scoring:
python ./llava/eval/model_score_UGC.py
For Streaming-Video Scoring:
python ./llava/eval/model_score_streaming.py
cd VQA_benchmark_test
For Q-bench-video Evaluation:
python ./llava/eval/model_vqa_q_bench_video.py
For Simple Q&A:
python ./llava/eval/model_conv.py
Gradio demo:
python ./app.py #Note that the minimum GPU requirement is 3090(24G)*1.
cd VQA_main
chmod +x ./scripts/train/finetune_VQA².sh
Then directly execute this .sh file. Note that we only support training with per_device_train_batch_size=1.
We temporarily provide the huggingface weight of VQA²-UGC-Scorer(7B) ,VQA²-Streaming-Scorer(7B), and VQA²-Assistant(7B); more versions will be released later.
HF-PATH:
VQA²-UGC-Scorer(7B): https://huggingface.co/q-future/VQA-UGC-Scorer (q-future/VQA-UGC-Scorer)
VQA²-Streaming-Scorer(7B): https://huggingface.co/q-future/VQA-Streaming-Scorer (q-future/VQA-Streaming-Scorer)
VQA²-Assistant(7B): https://huggingface.co/q-future/VQA-Assistant (q-future/VQA-Assistant)
If you consider this work interesting, please feel free to cite it in your work!
@article{jia2024vqa,
title={VQA $\^{} 2$: Visual Question Answering for Video Quality Assessment},
author={Jia, Ziheng and Zhang, Zicheng and Qian, Jiaying and Wu, Haoning and Sun, Wei and Li, Chunyi and Liu, Xiaohong and Lin, Weisi and Zhai, Guangtao and Min, Xiongkuo},
journal={arXiv preprint arXiv:2411.03795},
year={2024}
}
}
@article{zhang2024q,
title={Q-Bench-Video: Benchmarking the Video Quality Understanding of LMMs},
author={Zhang, Zicheng and Jia, Ziheng and Wu, Haoning and Li, Chunyi and Chen, Zijian and Zhou, Yingjie and Sun, Wei and Liu, Xiaohong and Min, Xiongkuo and Lin, Weisi and others},
journal={arXiv preprint arXiv:2409.20063},
year={2024}
}