Mathematicians define shape as an equivalence class under a group of transformations. This tells us when two shapes are exactly the same; a theory of shape similarity/shape distance needs more than that.
Statisticians define shape in a way that addresses the problem of shape distance, but this assumes that correspondences between points are known. Other statistical approaches to shape comparison do not require correspondences – e.g., one can compare feature vectors of descriptors such as area or moments – but these discard detailed shape information.
Broadly speaking, there are two approaches:
- feature-based: use spatial arrangements of extracted features, e.g., edge elements/junctions
- Boundaries of silhouette image:
- Silhouettes do not have holes or internal markings, so the associated boundary is a single closed curve that can be parametrized by arclength.
- Fourier descriptors
- medial axis transform – captures the part structure of the shape in the graph structure of the skeleton
- methods exploiting the 1D nature of silhouette curves
- silhouette comparison in the MPEG-7 standard
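Since a silhouette boundary is a single closed curve, Fourier descriptors fall out almost directly from the FFT of the complex boundary coordinates. A minimal sketch (the function name and normalization choices are illustrative, not from the notes): zeroing the DC term gives translation invariance, taking magnitudes gives rotation and start-point invariance, and dividing by the first harmonic gives scale invariance.

```python
import numpy as np

def fourier_descriptors(points, k=8):
    """Sketch of a Fourier descriptor for a closed 2D contour.
    points: (N, 2) array of boundary samples, ordered along the curve."""
    z = points[:, 0] + 1j * points[:, 1]  # boundary as a complex signal
    F = np.fft.fft(z)
    F[0] = 0.0                  # drop DC term -> translation invariance
    mags = np.abs(F)            # magnitudes -> rotation/start-point invariance
    mags = mags / mags[1]       # normalize by first harmonic -> scale invariance
                                # (assumes mags[1] != 0, true for generic contours)
    # keep a few low-frequency harmonics from each end of the spectrum
    return np.concatenate([mags[2:2 + k], mags[-k:]])
```

Mirror reflections are not handled by this normalization; descriptors of two similarity-transformed copies of the same contour come out (numerically) identical.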
- Silhouettes ignore internal contours and are difficult to extract from real images, so instead treat the shape as a set of points in the 2D image (e.g., the output of an edge detector).
- The Hausdorff distance has been extended to deal with partial matching and clutter, but it returns no correspondences.
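The point-set view above can be made concrete with a brute-force Hausdorff distance. A minimal sketch, assuming small 2D point sets; the quantile-based "partial" variant (function names are mine) shows the standard way the max is relaxed to tolerate clutter and occlusion:

```python
import numpy as np

def hausdorff(A, B):
    """Symmetric Hausdorff distance between two 2D point sets (brute force)."""
    # pairwise Euclidean distances, shape (len(A), len(B))
    D = np.linalg.norm(A[:, None, :] - B[None, :, :], axis=2)
    h_ab = D.min(axis=1).max()  # farthest A-point from its nearest B-point
    h_ba = D.min(axis=0).max()
    return max(h_ab, h_ba)

def partial_hausdorff(A, B, q=0.8):
    """Directed partial variant: replace the max with the q-th quantile of
    nearest-neighbor distances, so outlier/clutter points are ignored."""
    D = np.linalg.norm(A[:, None, :] - B[None, :, :], axis=2)
    return np.quantile(D.min(axis=1), q)
```

Note that neither function says *which* point matches which: the distance is computed without ever committing to correspondences, which is exactly the limitation the notes point out.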
- Several approaches to shape recognition are based on spatial configurations of a small number of keypoints or landmarks:
- Geometric hashing votes for a model without explicitly solving for correspondences
- Train decision trees for recognition by learning discriminative spatial configurations of keypoints
- Gray-level information at the keypoints provides greater discriminative power
- Not all objects have distinguished keypoints (e.g., a circle), and using keypoints alone sacrifices the shape information available in smooth portions of object contours
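Geometric hashing, mentioned above, can be sketched in a few lines. This is a simplified similarity-invariant 2D version (function names, the quantization step, and the single scene basis are my assumptions): offline, every ordered pair of model points defines a basis, and the remaining points' coordinates in that basis are quantized and stored; online, one scene basis is tried and matching bins vote for (model, basis) entries, so a model is identified without explicitly solving for correspondences.

```python
import numpy as np
from collections import defaultdict
from itertools import permutations

def _coords(points, b0, b1):
    """Express points in the similarity frame mapping (b0, b1) to (0,0)-(1,0)."""
    e = b1 - b0
    c = e[0] + 1j * e[1]                       # basis vector as a complex number
    z = (points[:, 0] - b0[0]) + 1j * (points[:, 1] - b0[1])
    w = z / c                                  # rotate + scale into the frame
    return np.stack([w.real, w.imag], axis=1)

def build_table(models, step=0.1):
    """Hash quantized invariant coordinates -> list of (model_id, basis)."""
    table = defaultdict(list)
    for mid, pts in enumerate(models):
        for i, j in permutations(range(len(pts)), 2):
            for q in np.round(_coords(pts, pts[i], pts[j]) / step).astype(int):
                table[tuple(q)].append((mid, (i, j)))
    return table

def recognize(table, scene, step=0.1):
    """Vote with one scene basis; returns the best-supported model id."""
    votes = defaultdict(int)
    b0, b1 = scene[0], scene[1]    # assume this scene pair is some model basis
    for q in np.round(_coords(scene, b0, b1) / step).astype(int):
        for entry in table.get(tuple(q), []):
            votes[entry] += 1
    (mid, _), _ = max(votes.items(), key=lambda kv: kv[1])
    return mid
```

A real system would try many scene bases and verify the winning hypothesis; this sketch only shows the voting mechanism that avoids explicit correspondence search.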
- brightness (appearance)-based: make more direct use of pixel brightness values
- complement to feature-based methods
- Make direct use of gray values within the visible portion of the object, instead of focusing on the shape of the occluding contour or other extracted features.
- Build classifiers without explicitly finding correspondences; this relies on the learning algorithm having enough examples to acquire the appropriate invariances
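The appearance-based idea in its simplest form is nearest-neighbor classification on raw gray values. A minimal sketch (function name and setup are mine): no correspondences are ever computed, and any invariance must already be represented in the stored examples.

```python
import numpy as np

def nn_classify(query, examples, labels):
    """Appearance-based recognition sketch: nearest neighbor on raw
    gray values.  query: 2D image patch; examples: list of same-size
    patches; labels: class label per example."""
    X = np.stack([e.ravel() for e in examples]).astype(float)
    d = np.linalg.norm(X - query.ravel().astype(float), axis=1)
    return labels[int(np.argmin(d))]
```

This is why such methods need many training examples: a rotated or shifted object only matches if a similarly transformed example is in the training set.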