by Alshuth, Peter, Hermes, Thorsten, Voigt, Lutz and Herzog, Otthein
Abstract:
In this paper videos are analyzed to get a content-based decription of the video. The structure of a given video is useful to index long videos efficiently and automatically. A comparison between shots gives an overview about cut frequency, cut pattern, and scene bounds. After a shot detection the shots are grouped into clusters based on their visual similarity. A time-constraint clustering procedure is used to compare only those shots that are positioned inside a time range. Shots from different areas of the video (e.g., begin/end) are not compared. With this cluster information that contains a list about shots and their clusters it is possible to calculate scene bounds. A labeling of all clusters gives a declaration about the cut pattern. It is easy now to distinguish a dialogue from an action scene. The final content analysis is done by the ImageMiner 1 system. The ImageMiner system developed at the University of Bremen of the Image Processing Department of the Center for Computing Technology realizes content-based image retrieval for still images through a novel combination of methods and techniques of computer vision and artifical intelligence. The ImageMiner system consists of three analysis modules for computer vision, namely for color, texture, and contour analysis. Additionally exists a module for object recognition. The output of the object recognition module can be indexed by a text retrieval system. Thus, concepts like forestscene may be searched for. We combine the still image analysis with the results of the video analysis in order to retrieve shots or scenes. 1 ImageMiner is a trademark of IBM Cooperation
Reference:
Alshuth, Peter, Hermes, Thorsten, Voigt, Lutz and Herzog, Otthein, "On Video Retrieval: Content Analysis by ImageMiner", In IS&T/SPIE Symposium on Electronical Imaging Sciene & Technology (Storage and Retrieval for Images and Video Databases), vol. 3312, pp. 236–247, 1998.
Bibtex Entry:
@INPROCEEDINGS{Alshuthc,
author = {Alshuth, Peter and Hermes, Thorsten and Voigt, Lutz and Herzog, Otthein},
title = {On Video Retrieval: Content Analysis by ImageMiner},
booktitle = {IS{\&}T/SPIE Symposium on Electronical Imaging Sciene {\&} Technology
(Storage and Retrieval for Images and Video Databases)},
year = {1998},
volume = {3312},
pages = {236--247},
month = {January},
abstract = {In this paper videos are analyzed to get a content-based decription
of the video. The structure of a given video is useful to index long
videos efficiently and automatically. A comparison between shots
gives an overview about cut frequency, cut pattern, and scene bounds.
After a shot detection the shots are grouped into clusters based
on their visual similarity. A time-constraint clustering procedure
is used to compare only those shots that are positioned inside a
time range. Shots from different areas of the video (e.g., begin/end)
are not compared. With this cluster information that contains a list
about shots and their clusters it is possible to calculate scene
bounds. A labeling of all clusters gives a declaration about the
cut pattern. It is easy now to distinguish a dialogue from an action
scene. The final content analysis is done by the ImageMiner 1 system.
The ImageMiner system developed at the University of Bremen of the
Image Processing Department of the Center for Computing Technology
realizes content-based image retrieval for still images through a
novel combination of methods and techniques of computer vision and
artifical intelligence. The ImageMiner system consists of three analysis
modules for computer vision, namely for color, texture, and contour
analysis. Additionally exists a module for object recognition. The
output of the object recognition module can be indexed by a text
retrieval system. Thus, concepts like forestscene may be searched
for. We combine the still image analysis with the results of the
video analysis in order to retrieve shots or scenes. 1 ImageMiner
is a trademark of IBM Cooperation},
owner = {pmania},
timestamp = {2012.11.06},
url = {http://www-agki.tzi.de/grp/ag-ki/download/1998/alshuthetal98.pdf}
}