BU CLA CS 835: Seminar in Image and Video Computing --- Class commentary on articles

BU CLA CS 835: Seminar on Image and Video Computing

Class commentary on articles: Segmentation

William Klippgen

"Region Competition: Unifying Snakes, Region Growing, Energy/Bayes/MDL
for Multiband Image Segmentation" by Zhu, Lee and Yuille.
----------------------------------------------------------------------

By using a method called "region competition", this paper suggests
that existing techniques such as snake/ballon models, region growing
and Bayes/MDl can be unified within a statistical framework that makes
use of each method's advantages.

Simply explained, the method lets pixels in a number of regions
compete for ownership of pixels along their boundaries. By using
statistical properties ranging from mean pixel values up to complex
probability distributions of pixels, the likelihood for membership in
regions can be decided. In case more complicated statistical analysis
has to be performed, the paper suggest to use a window of m pixels
around each boundary point. The window size must be kept small,
however, to avoid loosing to much precision.

Initially, the "seeds" that are points in the image that should lie
within a certain probability distribution family, will sometime be
hypothezised as being on the border of two distributions. The
proposed method works well, even if such "bad" seeds are given at the
start.

As long as all the probability models assumed to exist for all region
of the image is known to the region competition algorithm, the
homogenous regions within a given picture will be found.

I am fascinated by the generalization proposed in this article,
especially as the probablility models are being used for color and
texture. The authors are currently working on combining cues from the
different models to gain better detection.

"Markov Random Field Models for Unsupervised Segmentation of Textured
Colour Images" by Panjwani and Healey.
---------------------------------------------------------------------

By using Markov random field models for color textures, this approach
looks at the spatial interaction within the color planes and
interaction between them. The stepwise final merging is based on
optimizing a conditional pseudolikelihood of the image. The
conditional pseudolikelihood is a sum of all conditional probability
densities for each pixel in the current image. This probability
density is measured by how a given pixel's value is likely, given its
neighbours, based on the various probability functions for the various
regions. By optimizing the sum of all likelihoods, a steady progress
towards a correct segmentations is assumed.

Lars Liden

"Markov Random Field Models for Unsupervised Segmentation
of Textured Color Images"

Panjwani & Healey

Goal:
Look at developing an unsupervised segmentation process for textured
color images with no a priori knowledge of the textures.

The Model:
Gaussian Markov Random Field Model
Model color textures by looking at spatial interaction:
Within Plane.
each of three color planes (RGB)
Between color planes.

Each component of the RGB vector at a location is represented as a
linear combination of the color components of the neighbors and
additive noise.

Parameter Estimation:
Use maximum likelihood method

The Segmentation algorithm (3 parts):
Region Splitting Phase
- ensures initial images is small enough to fall
within a single textured region in the image
Conservative Merging Process
- Locally merges similar adjacent regions
Stepwise Optimal Merging
- Iteratively processes optimal merges until a
stopping criteria is met

Region Splitting:
Input image is first divided into a number of square blocks
Each block is tested for uniformity of non-uniformity
- First broken into four sub-blocks
- The block is assumed to be uniform if the estimated
color mean and estimated covariance matrix differs
from each of the sub-blocks by some threshold
Uniform block are left as-is, non-uniform block are broken up
The process is repeated until a minimum block size is reached
Result is the image is divided into uniform segments

Agglomerative Hierarchical Clustering
The ratio of the log of the pseudolikelihood of an block
before and after a merge is used to decide whether to
perform a merge

Note that the merging of regions corresponding to different
textures results in a large decrease in the likelyhood
of an image and a corresponding large value for the
pseudolikelyhood ratio

(see P. 942 for equation for pseudolikelihood)

Conservative Merging
To criteria are used:
- The color mean vector and covariance matrix for
adjacent region must be below a threshold vector
- The pseudolikelihood ratio for the merge must be
less than a threshold
The process is repeated until every possible new merge does not
satisfy the merging criteria.

Stepwise Optimal Merging
At each step, find the pair of adjacent cluster whose merger
would decrease the conditional pseudolikelihood as little as
possible
Assures that at each iteration the algorithm selects the best
possible merge

Stopping Criterion:
Stop merging if the pseudolikelyhood ratio corresponding to
a new merge is large compared to the pseudolikelihood ratio
for the last merge.
This is likely to happen when dissimilar textures start to
be merged

Comments:
(+)
Particularly well suited for natural scenes (a particularly
hard problem)

(-)
Seems to be very computationally expensive involving estimation
of a large number of parameters
(12 within-plane parameters and 24 across-plane parameters for
EACH textured region)

Although segmentation is "unsupervised" it doesn't seem like
there is an automatic way to choose good thresholds.
It seems like the optimal threshold will vary depending on the
texture density within an image.

Authors point out that if an image consists of many small
regions with different textures the algorithm is more sensitive to the
choice of threshold

Seems to require a priori knowledge of image contents, and
possibility even several tries as segmentation to find the best
thresholds

"Region Competition: Unifying Snakes, Region Growing, Energy/Bayes/MDL
for Multi-ban Image Segmentation."
Zhu, Lee & Yuille

This paper was a bit frustrating in that it assumed prior
knowledge of the various image segmentation methods captured by this
common statistical framework. It wouldn't have been so bad if the
authors had bothered to define all their variables, but in many cases,
they left the reader hanging, making it difficult to work through the
computations.
I have two main comments. First, in the examples, the authors
mentioned that after initial convergence the regions created by bad
seeds are removed by merging them with adjacent regions after which
computation proceeds until it once again converges. What they don't
explain is how this procedure is determined. They vaguely state that
pairs which cause the energy function to decrease the most are merged,
but what determines how many regions will be merged? This would seem
to require some knowledge of the "right segmentation" so to speak. Are
they using and energy threshold to determine when to stop merging?
Note in the example of the human figure (6f), the author's also vaguely
mention showing results after "spreading additional seeds". I was a
little bothered by the fact that they didn't explain this in more
detail. Did they selectively decide where to add more seeds in the
image to improve segmentation? If so this seems rather a bogus final
result.
Secondly, I wanted to comment that this method seems
vulnerable to different scales within the image. As pointed out by the
authors, the eyes and mouth in the image are merged into the face
region. This would seem to be true of any smaller scale feature in an
image - that it will be treated as noise. As long as regions share
similar scale of features, this method seems to work fairly well.
However, unlike GMRF, it doesn't seem to be able adjust for the level
of detail within an area.

Gregory Ganarz

In "Region Competition", S.C. Zhu et al. present an algorithm for image segmentation which combines the snake/balloon and region growing approaches to segmentation. This approach suffers from a dependence on initial seed choice. Also, by only partitioning the image into subregions with homogeneous properties, the algorithm is unable to link similar regions which have dissimilar regions interposed between them, or regions which are dissimilar, but are part of the same object. As an example, consider:

OO
**********OO******
*********OO*******
********OO********
OO

The above image would be segmented into three regions by the "region competition" algorithm, yet humans generally perceive only two objects (a rectangle composed of *'s, with an occluding bar composed of O's on top of it. Though the algorithm might "know" that the left and right regions of *'s are the same based on their similar texture (*'s), it would introduce two new borders, which should be assigned to the O bar, and not the * regions.

-----------------------------------------------------------------------------
In "Markov Random Field Models for Unsupervised Segmentation of Textured Color Images", Panjwani and Healey present an unsupervised segmentation algorithm for color textures. Based on Markov models, the algorithm describes "color texture in terms of the statistical dependence of the RGB vector measured at a pixel on the RGB values measured at neighboring pixels." p. 939. This algorithm suffers from the assumption that pixels are the relevant units to base a segmentation on. In some situtations, high level groupings may provide the basis for segmentation. By choosing the smallest scale availible for units, the Panjwani algorithms is computationally very expensive.

Shrenik Daftary

Throughout this course, many of the papers that we have covered have
assumed some type of segmentation algorithm or have given an inadequate
method to segment images. The two papers this week provide methods to
actually segment images with seemingly adequate results.

Synopsis of "Markov Random Field Models for Unsupervised Segmentation of
Textured Color Images" by Panjwani and Healey

In this paper an automated segmentation technique is presented that relies
on numerous properties of the image, but concentrates on the red, green,
and blue vectors of each pixel. Part of the problem for the technique
presented in this paper are its reliance on color for appropriate
segmentation, because many of the objects that may need to be found could
be multicolored, and these variations in color may arbitrarily be
segmented apart from each other. This issue could be addressed by an
additional level of processing that rejoins parts of an image that are
from the same object.

The method presented also relied on models that are based on spatial
interaction between the three color panes. A method to represent the
spatial interaction of pixel is presented based on the mean values for the
R,G,B panes, and the actual value at the current pixel point. Next
parameter estimation techniques is presented in terms of maximum
likelihood methods.

The segmentation algorithm involves the simple process of creating a
conservative merge. Next regions that are similar to each other are
joined. Next a process is applied to the image to form uniform image
segments, which can be divided into 4x4 pixels. This allows differently
textured portions of the image to be separated from each other in most
cases.

The actual clustering begins by using a distance function based on the
pseudolikelihood of the image with the merging of the segments and before
the merging. Other merging techniques are presented which rely on the
distance between the two segments, in which if the distance is below a
threshold the images are joined. Finally stepwise optimal merging
completes the segmentation process. The pseudolikelihood ratio is
calculated for each region and if the difference is greater than a
threshold the clustering is stopped otherwise the regions are joined.

Although the images were difficult to see, since they were apparently in
color, the technique seemed to function well in the cases that were
presented. However the color was fairly constant for each of the segmented
regions, and the technique would probably fail in cases where wide stripes
run across the object. Another problem is that the system is not scale
invariant, since the initial segmentation relies on the division of the
object into minimally 4x4 segments.

Synopsis of "Region Competition: Unifying Snakes, Region Growing,
Energy/Bayes/MDL for Multi-band Image Segmentation" by Zhu, Lee and Yuille

This paper attempts to present the best of different image segmenting
techniques to obtain better segmentation. The first technique that they
present is the snake technique, which attempts to form a closed boundary
of a region. This is improved by the forming of a balloon contour, which
pushes the initial contour out and smooths the contour. Techniques to
region merge, and grow are presented based on the Fisher distance being
less than a threshold. The other technique that this paper relies on is
the use of Bayes, and minimum description length (MDL).

The technique is first described for grey scale images divided into
homogenous regions. Homogenous means that the distribution created by the
image is consistent with a prespecified pdf (Gaussian is considered) that
is different for each region in the image. Region segmentation occurs by
the competition among separate regions for pixels adjacent to them.
Several examples are presented with different seeds and the growth of the
seeds to either bigger regions or to nothing after several iterations.

This method worked well in the cases presented, but in many of the cases
relied on several iterations. It functioned well with a textured object.
Problems with this technique may include functioning in situations where
the object is partially occluded completely in one direction for perhaps
two pixels. This paper may also have difficulty dealing with multicolored
objects; and objects that have very similar colors. Its functioning may
be modified though by using a different function with skewed
probabilities though.

John Petry

REGION COMPETITION: UNIFYING SNAKES, REGION GROWING, ENERGY/BAYES/MDL FOR
_________________________________________________________________________
MULTI-BAND IMAGE SEGMENTATION, by Zhu, Lee and Yuille
_____________________________

Zhu et al present a good method for combining two types of segmentation
measures -- local connectivity measures for use near boundary points, and
general measures within regions.

Theirs is an iterative approach, in which regions compete for pixels and can
lose them to neighboring regions if the pixel better fits the profile of the
neighboring region. The test can be done with a single equation which
considers both the probability measure for boundary pixels and grey-level
tests for interior pixels. The boundary tests include use of a small window
around the pixel to average out noise effects.

The authors show some very good demonstrations, particularly the fact that the
initial region seeds need not be well positioned in order for the code to
converge on a good solution.

I had several problems with the approach, though, and I'm not sure if they're
minor details which are easily handled, or more major problems which the
authors' glanced over:

(1) They do not discuss region splitting, only merging. If a region contains
no seeds to begin with, it's not clear to me that it will be located as a
separate entity. (I assume this is easy to handle, but I may be wrong).
This is related to the problem that they seem to require at least as many
seeds to begin as there are final regions. This is a poor assumption in an
unsupervised system. The alternative is to plant many seeds, but that may
cause the code to slow down (consider the limit of one seed per pixel, or
more likely one centered in every 3x3 block, for instance).

(2) The number of iterations required is large -- about 100 in some of their
examples. How is this number chosen? Is there any guarantee that the system
will converge on a good solution, and that each pass of the algorithm
produces a result that is closer to the optimum than the previous result was
(ie., is it monotonically increasing in "accuracy").

And are there cases where regions will chase each other around the image,
for instance, or live in some form of alternating size like some stable
blinking structures in Conway's "Life"? I don't expect this to happen,
but I don't know that the monotonic part is guaranteed to occur.

MARKOV RANDOM FIELD MODELS FOR UNSUPERVISED SEGMENTATION OF TEXTURED
____________________________________________________________________
COLOR IMAGES, by Panjwani and Healy
____________

This approach is an iterative method of clustering and splitting textured
color regions for segmentation.

The basic merge and split approach is reasonable, though it results in blocky
objects and is thus best for large objects rather than detecting fine borders.
The measure on which the clustering is done tests textures both within each
RGB (or equivalent) plane, and between planes.

I don't have a lot of comments on this paper, though I'm curious how well
it handles changes in resolution; i.e., at what point texture features become
regions of their own. There also seems to be an issue of how finely to
threshold the different regions.

It's chief advantages appear to be the fact that it is almost totally
independent of operator interaction, and that it works well on natural
scenes, which may the the hardest to handle (though I'm curious how it handles
really high-frequency natural features).



Stan Sclaroff

Created:  Oct 23, 1995

Last Modified: