On Video Retrieval: Content Analysis by ImageMiner (bibtex)
by Alshuth, Peter, Hermes, Thorsten, Voigt, Lutz and Herzog, Otthein
Abstract:
In this paper videos are analyzed to get a content-based decription of the video. The structure of a given video is useful to index long videos efficiently and automatically. A comparison between shots gives an overview about cut frequency, cut pattern, and scene bounds. After a shot detection the shots are grouped into clusters based on their visual similarity. A time-constraint clustering procedure is used to compare only those shots that are positioned inside a time range. Shots from different areas of the video (e.g., begin/end) are not compared. With this cluster information that contains a list about shots and their clusters it is possible to calculate scene bounds. A labeling of all clusters gives a declaration about the cut pattern. It is easy now to distinguish a dialogue from an action scene. The final content analysis is done by the ImageMiner 1 system. The ImageMiner system developed at the University of Bremen of the Image Processing Department of the Center for Computing Technology realizes content-based image retrieval for still images through a novel combination of methods and techniques of computer vision and artifical intelligence. The ImageMiner system consists of three analysis modules for computer vision, namely for color, texture, and contour analysis. Additionally exists a module for object recognition. The output of the object recognition module can be indexed by a text retrieval system. Thus, concepts like forestscene may be searched for. We combine the still image analysis with the results of the video analysis in order to retrieve shots or scenes. 1 ImageMiner is a trademark of IBM Cooperation
Reference:
Alshuth, Peter, Hermes, Thorsten, Voigt, Lutz and Herzog, Otthein, "On Video Retrieval: Content Analysis by ImageMiner", In IS&T/SPIE Symposium on Electronical Imaging Sciene & Technology (Storage and Retrieval for Images and Video Databases), vol. 3312, pp. 236–247, 1998.
Bibtex Entry:
@INPROCEEDINGS{Alshuthc,
  author = {Alshuth, Peter and Hermes, Thorsten and Voigt, Lutz and Herzog, Otthein},
  title = {On Video Retrieval: Content Analysis by ImageMiner},
  booktitle = {IS{\&}T/SPIE Symposium on Electronical Imaging Sciene {\&} Technology
	(Storage and Retrieval for Images and Video Databases)},
  year = {1998},
  volume = {3312},
  pages = {236--247},
  month = {January},
  abstract = {In this paper videos are analyzed to get a content-based decription
	of the video. The structure of a given video is useful to index long
	videos efficiently and automatically. A comparison between shots
	gives an overview about cut frequency, cut pattern, and scene bounds.
	After a shot detection the shots are grouped into clusters based
	on their visual similarity. A time-constraint clustering procedure
	is used to compare only those shots that are positioned inside a
	time range. Shots from different areas of the video (e.g., begin/end)
	are not compared. With this cluster information that contains a list
	about shots and their clusters it is possible to calculate scene
	bounds. A labeling of all clusters gives a declaration about the
	cut pattern. It is easy now to distinguish a dialogue from an action
	scene. The final content analysis is done by the ImageMiner 1 system.
	The ImageMiner system developed at the University of Bremen of the
	Image Processing Department of the Center for Computing Technology
	realizes content-based image retrieval for still images through a
	novel combination of methods and techniques of computer vision and
	artifical intelligence. The ImageMiner system consists of three analysis
	modules for computer vision, namely for color, texture, and contour
	analysis. Additionally exists a module for object recognition. The
	output of the object recognition module can be indexed by a text
	retrieval system. Thus, concepts like forestscene may be searched
	for. We combine the still image analysis with the results of the
	video analysis in order to retrieve shots or scenes. 1 ImageMiner
	is a trademark of IBM Cooperation},
  owner = {pmania},
  timestamp = {2012.11.06},
  url = {http://www-agki.tzi.de/grp/ag-ki/download/1998/alshuthetal98.pdf}
}
Powered by bibtexbrowser