Video imprint (computer vision)

From Wikipedia, the free encyclopedia
Jump to navigation Jump to search

Proposed as an extension of image epitomes in the field of video content analysis, video imprint is obtained by recasting video contents into a fixed-sized tensor representation[1][2] regardless of video resolution or duration. Specifically, statistical characteristics are retained to some degrees so that common video recognition tasks can be carried out directly on such imprints, e.g., event retrieval, temporal action localization.[2] It is claimed that both spatio-temporal interdependences are accounted for and redundancies are mitigated during the computation of video imprints.

The option of computing video imprints exploiting the epitome model[3] has the advantage of more flexible input feature formats and more efficient training stage for video content analysis.

See also

[edit | edit source]

References

[edit | edit source]
  1. ^ Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).
  2. ^ a b Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).
  3. ^ Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).