李远宁, 刘汀, 蒋树强, 等. 基于“bag of words”的视频匹配方法[J]. 通信学报, 2007,(12):147-151.
LI Yuan-ning1, LIU Ting1, JIANG Shu-qiang1, et al. Video matching method based on "bag of words"[J]. 2007, (12): 147-151.
李远宁, 刘汀, 蒋树强, 等. 基于“bag of words”的视频匹配方法[J]. 通信学报, 2007,(12):147-151.DOI:
LI Yuan-ning1, LIU Ting1, JIANG Shu-qiang1, et al. Video matching method based on "bag of words"[J]. 2007, (12): 147-151.DOI:
基于“bag of words”的视频匹配方法
摘要
提出了一种利用"bag of words"模型对视频内容进行建模和匹配的方法。通过量化视频帧的局部特征构建视觉关键词(visual words)辞典
将视频的子镜头表示成若干视觉关键词的集合。在此基础上构建基于子镜头的视觉关键词词组的倒排索引
用于视频片段的匹配和检索。这种方法保留了局部特征的显著性及其相对位置关系
而且有效地压缩了视频的表达
加速的视频的匹配和检索过程。实验结果表明
和已有方法相比
基于"bag of words"的视频匹配方法在大视频样本库上获得了更高的检索精度和检索速度。
Abstract
A "bag of words" was presented based method for video representation and matching.First
all local features of all video frames were quantized into a dictionary of visual words.Then each sub-shot of the video was represented by a set of visual words.Finally
a revered index of visual words was created to speed the matching process of video clips.This method not only takes local appearance and spatial information into account
but also compresses the representation of video content.Highly competitive experimental results show that our proposed method is more effective and efficient than former methods for video matching in large video dataset.