Skip to content

Official Repository of the paper "On the Consistency of Video Large Language Models in Temporal Comprehension"

Notifications You must be signed in to change notification settings

minjoong507/Consistency-of-Video-LLM

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

6 Commits
 
 

Repository files navigation

On the Consistency of Video Large Language Models in Temporal Comprehension

arXiv

News

Introduction

image

  • We study the model’s consistency in temporal comprehension by assessing whether its responses align with the initial grounding, using dedicated probes and datasets. We specifically focus on video temporal grounding, where the task involves identifying timestamps in a video that correspond to language queries.

Citation

If you find our paper useful, please consider citing our paper.

@article{jung2024consistency,
  title={On the Consistency of Video Large Language Models in Temporal Comprehension},
  author={Jung, Minjoon and Xiao, Junbin and Zhang, Byoung-Tak and Yao, Angela},
  journal={arXiv preprint arXiv:2411.12951},
  year={2024}
}

Acknowledgement

We appreciate for the following awesome Video-LLMs:

About

Official Repository of the paper "On the Consistency of Video Large Language Models in Temporal Comprehension"

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published