Keywords Contrastive learning Data imbalance NR-VQA Self-supervised learning ViViT Video classification