shape matching

Mathematicians define shape as an equivalence class under a group of transformations. This tells when two shapes are exactly the same, we need more than that for a theory of shape similarity/shape distance.

The statisticians define shape to address the problem of shape distance, but assumes that correspondences are known. Other statistical approaches to shape comparison do not require correspondences – e.g., one could compare feature vectors containing descriptors, e.g., area/moments, but they discard detailed shape information.

Broadly speaking, there are two approaches:

  1. feature-based: use spatial arrangements of extracted feature, e.g., edge elements/junctions
    • Boundaries of silhouette image:
    • Silhouettes do not have holes or internal markings, the associated boundaries are represented by a single-closed curve which can be parametrized by arclength.
    • Fourier descriptors
    • medial aixs transform – capture the part structure of the shape in the graph structure of the skeleton
    • 1D nature of silhouette curves
    • comparing silhouette in MPEG-7 standard
    1. Silhouette ignore internal contours, difficult to extract from real images, so treat the shape as a set of points in 2D image (e.g., edge detector).
    2. Hausdorff distance is extended to deal with parital matching and clutter, but no return correspondences.
    3. Several approaches to shape recognition based on spatial configurations of small number of keypoints or landmarks:
      • Geometric hashing votes for a model without explicitly solving for correspondences
      • Train decision trees for recognition by learning discriminative spatial configurations of keypoints
      • Gray-level info at the keypoints provides greateer discriminative power
    • Not all objects have distinguished keypoints(e.g., circle), using keypoints alone sacrifices the shape info available in smooth portions of object ontours

  2. brightness (Appearance) -based: make more direct use of pixel brightness
    • complement to feature-base methods
    • Make direct use of gray values within the visible portion of the object, instead of focusing on the shape of the occluding contour or other extracted features.
    • Build classifiers without explicitly finding correspondences, but relies on a learning algorithm having enough examples to acquire the appropriate invariances

Historyof ideas in 【recognition】

1960s – early early 1990s:  the geometric era 
1990s: appearance-based models

Limitation:

    • Requires Requires global registration of patterns global registration of patterns
    • Not robust to clutter, occlusion, geometric transformations

mid-1990s – present: sliding window approaches

Late 1990s: local features

Large-scale image search

    • Combining local local features, indexing, and spatial constraints.
    • Philbin et al. ‘07

Early 2000s: parts-and shape models

Model:
– Object as a set of parts
– Relative locations between parts
– Appearance of part

Mid-2000s: bags of features

Objects Objects as texture

    • All of these are treated as being the same
    • No distinction between foreground and background: scene recognition?

Present trends: combination of local and global methods, data-driven methods, context

Global scene descriptors

    • The “gist” of of a scene: Oliva & Torralba (2001)
    • J. Tighe and S. Lazebnik, ECCV 2010
    • J. Hays and A. Efros, Scene Completion using Millions of Photographs, SIGGRAPH 2007
    • D Hoiem A Efros and M Herbert . Putting Putting Objects in Objects in Perspective. CVPR 2006.
    • Discriminatively trained part-based models
BlueAsteroid

Just another WordPress.com site

Jing's Blog

Just another WordPress.com site

Start from here......

我的心情魔方

天才遠私廚

希望能做一個分享各種資訊的好地方

语义噪声

西瓜大丸子汤的博客

笑对人生,傲立寰宇

Just another WordPress.com site

Where On Earth Is Waldo?

A Project By Melanie Coles

The Serious Computer Vision Blog

A blog about computer vision, robotics, and artificial intelligence.

Cauthy's Blog

paper review...

Cornell Computer Vision Seminar Blog

Blog for CS 7670 - Special Topics in Computer Vision

datarazzi

Life through nerd-colored glasses

Luciana Haill

Brainwaves Augmenting Consciousness

槑烎

1,2,∞

Dr Paul Tennent

and the university of nottingham

turn off the lights, please

A bunch of random, thinned and stateless thoughts around the Web

John's Weblog

Just another WordPress.com weblog

I Am That I Am

Chasing tomorrow's sunrise.