|
Back to the Gradient Vector Flow Page
List of Citations from Science Citation Index for
T. F. Cootes, C. J. Taylor, D. H. Cooper, et al., "
Active Shape Models - Their Training and Application,"
Computer Vision and Image Understanding, 61(1): 38-59,
January 1995.
1995: 1 1996: 10 1997: 32 1998: 19 1999: 23 2000: 34 2001: 21 2002: 1
Total citations: 141
As of 28 Jan 2002
By Year - By Citations - By Year with Abstract
|
| |
| 1995 |
- SOZOU, PD, COOTES, TF, TAYLOR, CJ, and DIMAURO, EC, "NONLINEAR GENERALIZATION OF POINT DISTRIBUTION MODELS USING POLYNOMIAL REGRESSION," IMAGE AND VISION COMPUTING, vol. 13, pp. 451-457, 1995.
Abstract:
We have previously described how to model shape variability by
means of point distribution models (PDM) in which there is a
linear relationship between a set of shape parameters and the
positions of points on the shape. This linear formulation can
fail for shapes which articulate or bend. We show examples of
such failure for both real and synthetic classes of shape. A
new, more general formulation for PDMs, based on polynomial
regression, is presented. The resulting polynomial regression
PDMs (PRPDM) perform well on the data for which the linear
method failed.
|
| |
| 1996 |
- Denzler, J, and Niemann, H, "3D data driven prediction for active contour models based on geometric bounding volumes," PATTERN RECOGNITION LETTERS, vol. 17, pp. 1171-1178, 1996.
Abstract:
Active contour models have proven to be a promising approach
for data driven object tracking without knowledge about the
problem domain and the object. Problems arise in case of fast
moving objects and in natural scenes with heterogeneous
background. In these cases, a prediction step is an essential
part of the tracking mechanism. In this paper we describe a new
approach for modelling the contour of moving objects in the 3D
world. The key point is the description of moving objects by a
simplified geometric model, the sc-called bounding volume. The
contour of moving objects is predicted by estimating the
movement and the shape of the bounding volume in the 3D world
and by projecting its contour to the image plane. Stochastic
optimization algorithms are used to estimate shape parameters
of the bounding volume. The 2D contour of the bounding volume
is used to initialize the active contour, which then extracts
the contour of the moving object. Thus, an update of the motion
and model parameters of the bounding volume is possible. No
task specific knowledge and no a priori knowledge about the
object is necessary. We will show that in the case of convex
polyhedral bounding volumes, this method can be applied to
real-time closed-loop object tracking on standard Unix
workstations. Furthermore, we present experiments which prove
that the robustness for tracking moving objects in front of a
heterogeneous background can be improved.
- Girard, S, Dinten, JM, and Chalmond, B, "Building and training radiographic models for flexible object identification from incomplete data," IEE PROCEEDINGS-VISION IMAGE AND SIGNAL PROCESSING, vol. 143, pp. 257-264, 1996.
Abstract:
The authors address the problem of identifying the projection
of an object from incomplete data extracted from its
radiographic image. They assume that the unknown object is a
particular sample of a flexible object. Their approach consists
first in designing a deformation model able to represent and to
simulate a great variety of samples of the flexible object
radiographic projection. This modellisation is achieved using a
training set of complete data. Then, given the incomplete data,
the identification task consists in estimating the observed
object using the deformation model. The proposed modelling
extracts from the training set, not only the deformation modes,
but also other relevant information (such as probability
distributions on the deformations, relations between
deformations) to use it to regularise the identification step.
- Abrantes, AJ, and Marques, JS, "Class of constrained clustering algorithms for object boundary extraction," IEEE TRANSACTIONS ON IMAGE PROCESSING, vol. 5, pp. 1507-1521, 1996.
Abstract:
Boundary extraction is a key task in many image analysis
operations. This paper describes a class of constrained
clustering algorithms for object boundary extraction that
includes several well-known algorithms proposed in different
fields (deformable models, constrained clustering, data
ordering, and traveling salesman problems), The algorithms
belonging to this class are obtained by the minimization of a
cost function with two terms: a quadratic regularization term
and an image-dependent term defined by a set of weighting
functions, The minimization of the cost function is achieved by
lowpass filtering the previous model shape and by attracting
the model units toward the centroids of their attraction
regions, To define a new algorithm belonging to this class, the
user has to specify a regularization matrix and a set of
weighting functions that control the attraction of the model
units toward the data, The usefulness of this approach is
twofold: It provides a unified framework for many existing
algorithms in pattern recognition and deformable models, and
allows the design of new recursive schemes.
- Nastar, C, and Ayache, N, "Frequency-based nonrigid motion analysis: Application to four dimensional medical images," IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, vol. 18, pp. 1067-1079, 1996.
Abstract:
We present a method for nonrigid motion analysis in time
sequences of volume images (4D data). In this method, nonrigid
motion of the deforming object contour is dynamically
approximated by a physically-based deformable surface. In order
to reduce the number of parameters describing the deformation,
we make use of a modal analysis which provides a spatial
smoothing of the surface. The deformation spectrum, which
outlines the main excited modes, can be efficiently used for
deformation comparison. Fourier analysis on time signals of the
main deformation spectrum components provides a temporal
smoothing of the data. Thus a complex nonrigid deformation is
described by only a few parameters: the main excited modes and
the main Fourier harmonics. Therefore, 4D data can be analyzed
in a very concise manner. The power and robustness of the
approach is illustrated by various results on medical data. We
believe that our method has important applications in automatic
diagnosis of heart diseases and in motion compression.
- Chakraborty, A, Staib, LH, and Duncan, JS, "Deformable boundary finding in medical images by integrating gradient and region information," IEEE TRANSACTIONS ON MEDICAL IMAGING, vol. 15, pp. 859-870, 1996.
Abstract:
Accurately segmenting and quantifying structures is a key issue
in biomedical image analysis. The two conventional methods of
image segmentation, region-based segmentation, and boundary
finding, often suffer from a variety of limitations. Here we
propose a method which endeavors to integrate the two
approaches in an effort to form a unified approach that is
robust to noise and poor initialization. Our approach uses
Green's theorem to derive the boundary of a homogeneous region-
classified area in the image and integrates this with a gray
level gradient-based boundary finder. This combines the
perceptual notions of edge/shape information with gray level
homogeneity. A number of experiments were performed both on
synthetic and real medical images of the brain and heart to
evaluate the new approach, and it is shown that the integrated
method typically performs better when compared to conventional
gradient-based deformable boundary finding. Further, this
method yields these improvements with little increase in
computational overhead, an advantage derived from the
application of the Green's theorem.
- Mitiche, A, and Bouthemy, P, "Computation and analysis of image motion: A synopsis of current problems and methods," INTERNATIONAL JOURNAL OF COMPUTER VISION, vol. 19, pp. 29-55, 1996.
Abstract:
The goal of this paper is to offer a structured synopsis of the
problems in image motion computation and analysis, and of the
methods proposed, exposing the underlying models and supporting
assumptions. A sufficient number of pointers to the literature
will be given, concentrating mostly on recent contributions.
Emphasis will be on the detection, measurement and segmentation
of image motion. Tracking, and deformable motion isssues will
be also addressed. Finally, a number of related questions which
could require more investigations will be presented.
- Subsol, G, Thirion, JP, and Ayache, N, "Application of an automatically built 3D morphometric brain atlas: Study of cerebral ventricle shape," VISUALIZATION IN BIOMEDICAL COMPUTING, LECTURE NOTES IN COMPUTER SCIENCE, vol. 1131, pp. 373-382, 1996.
Abstract:
In this paper we present new results on the automatic building
of a 3D morphometric brain atlas from volumetric MRI images and
its application to the study of the shape of cerebral
structures. In particular, we show how it is possible to define
''abnormal'' deformations of the cerebral ventricles with a
small set of parameters.
- Hill, A, Cootes, TF, and Taylor, CJ, "Active shape models and the shape approximation problem," IMAGE AND VISION COMPUTING, vol. 14, pp. 601-607, 1996.
Abstract:
Active Shape Models (ASM) use an iterative algorithm to match
statistically defined models of known but variable objects to
instances in images. Each iteration of ASM search involves two
steps: image data interrogation and shape approximation. Here
we consider the shape approximation step in detail. We present
a new method of shape approximation which uses directional
constraints. We show how the error term for the shape
approximation problem can be extended to cope with directional
constraints, and present iterative solutions to the 2D and 3D
problems. We also present an efficient algorithm for the 2D
problem in which a modification of the error term permits a
closed-form approximate solution which can be used to produce
starting estimates for the iterative solution.
- Cootes, TF, DiMauro, EC, Taylor, CJ, and Lanitis, A, "Flexible 3D models from uncalibrated cameras," IMAGE AND VISION COMPUTING, vol. 14, pp. 581-587, 1996.
Abstract:
We describe how to build statistically-based flexible models of
the 3D structure of variable objects, given a training set of
uncalibrated images. We assume that for each example object
there are two labelled images taken from different viewpoints.
From each image pair a 3D structure can be reconstructed, up to
either an affine or projective transformation, depending on
which camera model is used. The reconstructions are aligned by
choosing the transformations which minimise the distances
between matched points across the training set. A statistical
analysis results in an estimate of the mean structure of the
training examples and a compact parameterised model of the
variability in shape across the training set. Experiments have
been performed using pinhole and affine camera models. Results
are presented for both synthetic data and real images.
- Christensen, GE, Rabbitt, RD, and Miller, MI, "Deformable templates using large deformation kinematics," IEEE TRANSACTIONS ON IMAGE PROCESSING, vol. 5, pp. 1435-1447, 1996.
Abstract:
A general automatic approach is presented for accommodating
local shape variation when mapping a two-dimensional (2-D) or
three-dimensional (3-D) template image into alignment with a
topologically similar target image, Local shape variability is
accommodated by applying a vector-field transformation to the
underlying material coordinate system of the template while
constraining the transformation to be smooth (globally positive
definite Jacobian), Smoothness is guaranteed without
specifically penalizing large-magnitude deformations of small
subvolumes by constraining the transformation on the basis of a
Stokesian limit of the fluid-dynamical Navier-Stokes equations,
This differs fundamentally from quadratic penalty methods, such
as those based on linearized elasticity or thin-plate splines,
in that stress restraining the motion relaxes over time
allowing large-magnitude deformations, Kinematic nonlinearities
are inherently necessary to maintain continuity of structures
during large-magnitude deformations, and are included in all
results, After initial global registration, final mappings are
obtained by numerically solving a set of nonlinear partial
differential equations associated with the constrained
optimization problem, Automatic regridding is performed by
propagating templates as the nonlinear transformations
evaluated on a finite lattice become singular, Application of
the method to intersubject registration of neuroanatomical
structures illustrates the ability to account for local
anatomical variability.
|
| |
| 1997 |
- Luettin, J, and Thacker, NA, "Speechreading using probabilistic models," COMPUTER VISION AND IMAGE UNDERSTANDING, vol. 65, pp. 163-178, 1997.
Abstract:
We describe a robust method for locating and tracking lips in
gray-level image sequences. Our approach learns patterns of
shape variability from a training set which constrains the
model during image search to only deform in ways similar to the
training examples, Image search is guided by a learned gray-
level model which is used to describe the large appearance
variability of lips, Such variability might be due to different
individuals, illumination, mouth opening, specularity, or
visibility of teeth and tongue, Visual speech features are
recovered from the tracking results and represent both shape
and intensity information, We describe a speechreading (lip-
reading) system, where the extracted features are modeled by
Gaussian distributions and their temporal dependencies by
hidden Markov models. Experimental results are presented for
locating lips, tracking lips, and speechreading. The database
used consists of a broad variety of speakers and was recorded
in a natural environment with no special lighting or lip
markers used, For a speaker independent digit recognition task
using visual information only, the system achieved an accuracy
about equivalent to that of untrained humans. (C) 1997 Academic
Press.
- Solloway, S, Hutchinson, CE, Waterton, JC, and Taylor, CJ, "The use of active shape models for making thickness measurements of articular cartilage from MR images," MAGNETIC RESONANCE IN MEDICINE, vol. 37, pp. 943-952, 1997.
Abstract:
Previously reported studies to quantify articular cartilage
have used labor-intensive manual or semi-automatic data-driven
techniques, demonstrating high accuracy and precision. However,
none has been able to automate the segmentation process. This
paper describes a fast, automatic, model-based approach to
segmentation and thickness measurement of the femoral cartilage
in 3D T-1-weighted images using active shape models (ASMs).
Systematic experiments were performed to assess the accuracy
and precision of the technique with in vivo images of both
normal and abnormal knees. Segmentation accuracy was determined
by comparing the results of the segmentation with the
boundaries delineated by a radiologist, The mean error in
locating the boundary was 0.57 pixels. To assess the precision
of the measurement technique, the mean thickness of the femoral
cartilage was calculated for repeated scans of five healthy
volunteers. A mean coefficient of variation (CV) of 2.8% was
obtained for the thickness measurements.
- Redhead, AL, Kotcheff, ACW, Taylor, CJ, Porter, ML, and Hukins, DWL, "An automated method for assessing routine radiographs of patients with total hip replacements," PROCEEDINGS OF THE INSTITUTION OF MECHANICAL ENGINEERS PART H- JOURNAL OF ENGINEERING IN MEDICINE, vol. 211, pp. 145-154, 1997.
Abstract:
This paper describes a new, fully automated method of locating
objects on radiographs of patients with total joint
replacements (TJRs). A statistical computer model, known as an
active shape model, was trained to identify the position of the
femur, pelvis, stem and cup marker wire on radiographs of
patients with Charnley total hip prostheses. Once trained, the
model was able to locate these objects through a process of
automatic image searching, despite their appearance depending
on the orientation and anatomy of the patient. Experiments were
carried out to test the accuracy with which the model was able
to fit to previously unseen data and with which reference
points could be calculated from the model points. The model was
able to locate the femur and stem with a mean error of
approximately 0.8 mm and a 95 per cent confidence limit of 1.7
mn. Once the model had successfully located these objects, the
midpoint of the stem head could be calculated with a mean error
of approximately 0.2 mm. Although the model has been trained on
Charnley total hip replacements, the method is generic and so
can be applied to radiographs of patients with any TJR. This
paper shows that computer models can form the basis of a quick,
automatic method of taking measurements from standard clinical
radiographs.
- Delibasis, K, Undrill, PE, and Cameron, GG, "Designing Fourier descriptor-based geometric models for object interpretation in medical images using genetic algorithms," COMPUTER VISION AND IMAGE UNDERSTANDING, vol. 66, pp. 286-300, 1997.
Abstract:
In previous work we have modeled simple 3D anatomical objects
using deformed superquadrics and established their optimal
position with the aid of genetic algorithms (GAs). Here we
extend the complexity of the search object using 3D Fourier
descriptor (FD) representations and allow GAs once again to
optimize the object's shape and position. Using magnetic
resonance image data, we perform an approximate segmentation on
one lateral ventricle in the brain and use the FDs from this as
seeding values for the GAs to search for the left and right
lateral ventricles in seven 3D data sets. We show that the
method is capable of coping with normal biological variation.
Finally, we compare FD/GA-guided segmentation with a manually
guided interactive region growing method and find an agreement
of 78 +/- 10% in voxel classification with a corresponding
average edge placement error of 2.2 +/- 0.4 mm. (C) 1997
Academic Press.
- Smyth, PP, Taylor, CJ, and Adams, JE, "Automatic measurement of vertebral shape using Active Shape Models," INFORMATION PROCESSING IN MEDICAL IMAGING, LECTURE NOTES IN COMPUTER SCIENCE, vol. 1230, pp. 441-446, 1997.
Abstract:
In this paper, we describe how Active Shape Models (ASMs) have
been used to accurately and robustly locate vertebrae in noisy
lateral Dual Energy X-ray Absorptiometry (DXA) images of the
spine. Vertebrae were located using either separate models for
each vertebra, or a combined model of the whole spine. The
combined model was found to be more robust. We show that ASMs
allow normal vertebrae to be located as accurately as by human
operators. We measure the performance of ASMs in locating
fractured vertebrae of osteoporotic patients. We also describe
how model parameters may be used to estimate how accurately a
vertebra had been located, in order to detect vertebrae for
which search had failed.
- Joshi, SC, Banerjee, A, Christensen, GE, Csernansky, JG, Haller, JW, Miller, MI, and Wang, L, "Gaussian random fields on sub-manifolds for characterizing brain surfaces," INFORMATION PROCESSING IN MEDICAL IMAGING, LECTURE NOTES IN COMPUTER SCIENCE, vol. 1230, pp. 381-386, 1997.
Abstract:
This paper provides analytical methods for characterizing the
variation of the shape of neuroanatomically significant
substructures of the brain in an ensemble of brain images. The
focus of this paper is on the neuro-anatomical variation of the
"shape" of 2-dimensional surfaces in the brain. Brain surfaces
are studied by building templates that are smooth sub-manifolds
of the underlying coordinate system of the brain. Variation of
the shape in populations is quantified via defining Gaussian
random vector fields on these sub-manifolds Methods for the
empirical construction of Gaussian random vector fields for
representing the variations of the substructures are presented.
As an example, using these methods we characterize the shape of
the hippocampus in a population of normal controls and
schizophrenic brains. Results from a recently completed study
comparing shapes of the hippocampus in a group of matched
schizophrenic and normal control subjects are presented.
Bayesian hypothesis test is formulated to cluster the normal
and schizophrenic hippocampi in the population of 20
individuals.
- Duta, N, and Sonka, M, "Segmentation and interpretation of MR brain images using an improved knowledge-based active shape model," INFORMATION PROCESSING IN MEDICAL IMAGING, LECTURE NOTES IN COMPUTER SCIENCE, vol. 1230, pp. 375-380, 1997.
Abstract:
An improvement of the Active Shape procedure introduced by
Cootes and Taylor is presented. The new automated brain
segmentation and interpretation approach incorporates a priori
knowledge about neuroanatomic structures and their specific
structural relationships to provide robust segmentation and
labeling. The method was trained in 8 MR brain images and
tested in 19 brain images by comparison to observer-defined
independent standards. Neuroanatomic structures in all images
from the test set were successfully identified. The presented
method is applicable to virtually any task involving deformable
shape analysis.
- Ahmad, T, Taylor, CJ, Lanitis, A, and Cootes, TF, "Tracking and recognising hand gestures, using statistical shape models," IMAGE AND VISION COMPUTING, vol. 15, pp. 345-352, 1997.
Abstract:
Hand gesture recognition from video images is of considerable
interest as a means of providing simple and intuitive man-
machine interfaces. Possible applications range from replacing
the mouse as a pointing device to virtual reality and
communication with the deaf. We describe an approach to
tracking a hand in an image sequence and recognising, in each
video frame. which of five gestures it has adopted. A
statistically based Point Distribution Model (PDM) is used to
provide a compact parametrised description of the shape of the
hand for any of the gestures or the transitions between them.
The values of the resulting shape parameters are used in a
statistical classifier to identify gestures. The model can be
used as a deformable template to track a hand through a video
sequence but this proves unreliable. We describe how a set of
models, one for each of the five gestures, can be used for
tracking with the appropriate model selected automatically. We
show that this results in reliable tracking and gesture
recognition for two 'unseen' video sequences in which all the
gestures are used.
- Sozou, PD, Cootes, TF, Taylor, CJ, DiMauro, EC, and Lanitis, A, "Non-linear point distribution modelling using a multi-layer perceptron," IMAGE AND VISION COMPUTING, vol. 15, pp. 457-463, 1997.
Abstract:
Objects of the same class sometimes exhibit variation in shape.
This shape variation has previously been modelled by means of
point distribution models (PDMs) in which there is a linear
relationship between a set of shape parameters and the
positions of points on the shape. A polynomial regression
generalization of PDMs, which succeeds in capturing certain
forms of non-linear shape variability, has also been described.
Here we present a new form of PDM, which uses a multi-layer
perceptron to carry out non-linear principal component
analysis. We compare the performance of the new model with that
of the existing models on two classes of variable shape: one
exhibits bending, and the other exhibits complete rotation. The
linear PDM fails on both classes of shape; the polynomial
regression model succeeds for the first class of shapes but
fails for the second; the new multi-layer perceptron model
performs well for both classes of shape. The new model is the
most general formulation for PDMs which has been proposed to
date. (C) 1997 Elsevier Science B.V.
- Smyth, PP, Taylor, CJ, and Adams, JE, "Automatic measurement of vertebral shape using active shape models," IMAGE AND VISION COMPUTING, vol. 15, pp. 575-581, 1997.
Abstract:
In this paper, we describe how Active Shape Models (ASMs) have
been used to accurately and robustly locate vertebrae in
lateral Dual Energy X-ray Absorptiometry (DXA) images of the
spine. DXA images are of low spatial resolution, and contain
significant random and structural noise, providing a difficult
challenge for object location methods. All vertebrae in the
image were searched for simultaneously, improving robustness in
location of individual vertebrae by making use of constraints
on shape provided by the position of other vertebrae. We show
that the use of ASMs with minimal user interaction allows
accuracy to be obtained which is as good as that achievable by
human operators, as well as high precision. Having located each
vertebra, it is desirable to evaluate whether it has been
located sufficiently accurately for shape measurements to be
useful. We determined this on the basis of grey-level model
fit, which was shown to usefully detect poorly located
vertebrae, which should enable accuracy to be improved by
rejecting proposed search solutions whose grey-level fit was
poorer than a threshold. (C) 1997 Elsevier Science B.V.
- Kotcheff, ACW, and Taylor, CJ, "Automatic construction of eigenshape models by genetic algorithm," INFORMATION PROCESSING IN MEDICAL IMAGING, LECTURE NOTES IN COMPUTER SCIENCE, vol. 1230, pp. 1-14, 1997.
Abstract:
A new approach to the problem of automatic construction of
eigenshape models is presented. Eigenshape models have proved
to be successful in a variety of medical image analysis
problems. However, automatic model construction is a difficult
problem, and in many applications the models are built by hand
- a painstaking process. Previous attempts to produce models
automatically have been applicable only in specific cases or
under certain assumptions. We show that the problem can be
understood very simply in terms of shape symmetries. The pose
and parameterisation of each shape must be chosen so as to
produce a model that is compact and specific. Mie define an
objective function that measures these properties. The problem
of automatic model construction is thus reduced to an
optimisation problem. We show that the objective function we
define can be optimised by a Genetic Algorithm, and produces
models that are better than hand built ones.
- Rueckert, D, and Burger, P, "Geometrically deformable templates for shape-based segmentation and tracking in cardiac MR images," ENERGY MINIMIZATION METHODS IN COMPUTER VISION AND PATTERN RECOGNITION, PROCEEDINGS, LECTURE NOTES IN COMPUTER SCIENCE, vol. 1223, pp. 83-98, 1997.
Abstract:
We present a new approach to shape-based segmentation and
tracking of multiple, deformable anatomical structures in
cardiac MR images. We propose to use an energy-minimizing
geometrically deformable template (GDT) which can deform into
similar shapes under the influence of image forces. The degree
of deformation of the template from its equilibrium shape is
measured by a penalty function associated with mapping between
the two shapes. In 2D, this term corresponds to the bending
energy of an idealized thin-plate of metal. By minimizing this
term along with the image energy terms of the classic
deformable model, the deformable template is attracted towards
objects in the image whose shape is similar to its equilibrium
shape. This framework allows for the simultaneous segmentation
of multiple deformable objects using intra- as well as inter-
shape information. The energy minimization problem of the
deformable template is formulated in a Bayesian framework and
solved using relaxation techniques: Simulated Annealing (SA), a
stochastic relaxation technique is used for segmentation while
Iterated Conditional Modes (ICM), a deterministic relaxation
technique is used for tracking. We present results of the
algorithm applied to the reconstruction of the left and right
ventricle of the human heart in 4D MR images.
- Liu, TL, and Geiger, D, "Visual deconstruction: Recognizing articulated objects," ENERGY MINIMIZATION METHODS IN COMPUTER VISION AND PATTERN RECOGNITION, PROCEEDINGS, LECTURE NOTES IN COMPUTER SCIENCE, vol. 1223, pp. 295-309, 1997.
Abstract:
We propose a deconstruction framework to recognize and find
articulated objects. In particular we axe interested in human
arm and leg articulations. The deconstruction view of
recognition naturally decomposes the problem of finding an
object in an image, into the one of (i) extracting key features
in an image, (ii) detecting key points in the models, (iii)
segmenting an image, and (iv) comparing shapes. All of these
subproblems can not be resolved independently. Together, they
reconstruct the object in the image. We briefly address (i) and
(ii) to focus on solving together shape similarity and
segmentation, combining top-down & bottom-up algorithms. We
show that the visual deconstruction approach is derived as an
optimization for a Bayesian-Information theory, and that the
whole process is naturally generated by the guaranteed Dijkstra
optimization algorithm.
- Lanitis, A, Taylor, CJ, and Cootes, TF, "Automatic interpretation and coding of face images using flexible models," IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, vol. 19, pp. 743-756, 1997.
Abstract:
Face images are difficult to interpret because they are highly
variable. Sources of variability include individual appearance,
3D pose, facial expression, and lighting. We describe a compact
parametrized model of facial appearance which takes into
account all these sources of variability. The model represents
both shape and gray-level appearance, and is created by
performing a statistical analysis over a training set of face
images. A robust multiresolution search algorithm is used to
fit the model to faces in new images. This allows the main
facial features to be located, and a set of shape, and gray-
level appearance parameters to be recovered. A good
approximation to a given face can be reconstructed using less
than 100 of these parameters. This representation can be used
for tasks such as image coding, person identification, 3D pose
recovery, gender recognition, and expression recognition.
Experimental results are presented for a database of 690 face
images obtained under widely varying conditions of 3D pose,
lighting, and facial expression. The system performs well on
ail the tasks listed above.
- Vetter, T, and Poggio, T, "Linear object classes and image synthesis from a single example image," IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, vol. 19, pp. 733-742, 1997.
Abstract:
The need to generate new views of a 3D object from a single
real image arises in several fields, including graphics and
object recognition. While the traditional approach relies on
the use of 3D models, we have recently introduced [1], [2], [3]
simpler techniques that are applicable under restricted
conditions. The approach exploits image transformations that
are specific to the relevant object class, and learnable from
example Views of other ''prototypical'' objects df the same
class. In this paper, we introduce such a technique by
extending the notion of linear class proposed by Poggio and
Vetter. For linear object classes, it is shown that linear
transformations can be learned exactly from a basis set of 2D
prototypical views. We demonstrate the approach on artificial
objects and then show preliminary evidence that the technique
can effectively ''rotate'' high-resolution face images from a
single 2D view.
- Pavlovic, VI, Sharma, R, and Huang, TS, "Visual interpretation of hand gestures for human-computer interaction: A review," IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, vol. 19, pp. 677-695, 1997.
Abstract:
The use of hand gestures provides an attractive alternative to
cumbersome interface devices for human-computer interaction
(HCl). In particular, Visual interpretation of hand gestures
can help in achieving the ease and naturalness desired for HCl.
This has motivated a very active research area concerned with
computer vision-based analysis and interpretation of hand
gestures. We survey the literature on visual interpretation of
hand gestures in the context of its role in HCl. This
discussion is organized on the basis of the method used for
modeling, analyzing, and recognizing gestures. Important
differences in the gesture interpretation approaches arise
depending on whether a 3D model of the human hand or an image
appearance model of the human hand is used. 3D hand models
offer a way of more elaborate modeling of hand gestures but
lead to computational hurdles that have not been overcome given
the real-time requirements of HCl. Appearance-based models lead
to computationally efficient ''purposive'' approaches that work
well under constrained situations but seem to lack the
generality desirable for HCl. We also discuss implemented
gestural systems as well as other potential applications of
vision-based gesture recognition. Although the current progress
is encouraging, further theoretical as well as computational
advances are needed before gestures can be widely used for HCl.
We discuss directions of future research in gesture
recognition, including its integration with other natural modes
of human-computer interaction.
- Yow, KC, and Cipolla, R, "Feature-based human face detection," IMAGE AND VISION COMPUTING, vol. 15, pp. 713-735, 1997.
Abstract:
Human face detection has always been an important problem for
face, expression and gesture recognition. Though numerous
attempts have been made to detect and localize faces, these
approaches have made assumptions that restrict their extension
to more general cases. We identify that the key factor in a
generic and robust system is that of using a large amount of
image evidence, related and reinforced by model knowledge
through a probabilistic framework. In this paper, we propose a
feature-based algorithm for detecting faces that is
sufficiently generic and is also easily extensible to cope with
more demanding variations of the imaging conditions. The
algorithm detects feature points from the image using spatial
filters and groups them into face candidates using geometric
and gray level constraints. A probabilistic framework is then
used to reinforce probabilities and to evaluate the likelihood
of the candidate as a face. We provide results to support the
validity of the approach and demonstrate its capability to
detect faces under different scale, orientation and viewpoint.
(C) 1997 Elsevier Science B.V.
- Vetter, T, and Troje, NF, "Separation of texture and shape in images of faces for image coding and synthesis," JOURNAL OF THE OPTICAL SOCIETY OF AMERICA A-OPTICS IMAGE SCIENCE AND VISION, vol. 14, pp. 2152-2161, 1997.
Abstract:
Human faces differ in shape and texture. Image representations
based on this separation of shape and texture information have
been reported by several authors [for a review, see Science
272, 1905 (1996)]. We investigate such a representation of
human faces based on a separation of texture and two-
dimensional shape information. Texture and shape were separated
by use of pixel-by-pixel correspondence among the various
images, which was established through algorithms known from
optical flow computation. We demonstrate the improvement of the
proposed representation over well-established pixel-based
techniques in terms of coding efficiency and in terms of the
ability to generalize to new images of faces. The evaluation is
performed by calculating different distance measures between
the original image and its reconstruction and by measuring the
time that human subjects need to discriminate them. (C) 1997
Optical Society of America.
- Taylor, CJ, Cootes, TF, Lanitis, A, Edwards, G, Smyth, P, and Kotcheff, ACW, "Model-based interpretation of complex and variable images," PHILOSOPHICAL TRANSACTIONS OF THE ROYAL SOCIETY OF LONDON SERIES B-BIOLOGICAL SCIENCES, vol. 352, pp. 1267-1274, 1997.
Abstract:
The ultimate goal of machine vision is image understanding-the
ability not only to recover image structure but also to know
what it represents. By definition, this involves the use of
models which describe and label the expected structure of the
world. Over the past decade, model-based vision has been
applied successfully to images of man-made objects. It has
proved much more difficult to develop model-based approaches to
the interpretation of images of complex and variable structures
such as faces or the internal organs of the human body (as
visualized in medical images). In such cases it has been
problematic even to recover image structure reliably without a
model to organize the often noisy and incomplete image
evidence. The key problem is that of variability. To be useful,
a model needs to be specific-that is, to be capable of
representing only 'legal' examples of the modelled object(s).
It has proved difficult to achieve this whilst allowing for
natural variability. Recent developments have overcome this
problem; it has been shown that specific patterns of
variability in shape and grey-level appearance can be captured
by statistical models that can be used directly in image
interpretation. The details of the approach are outlined and
practical examples from medical image interpretation and face
recognition are used to illustrate how previously intractable
problems can now be tackled successfully. It is also
interesting to ask whether these results provide any possible
insights into natural vision; for example, we show that the
apparent changes in shape which result from viewing three-
dimensional objects from different viewpoints can be modelled
quite well in two dimensions; this may lend some support to the
'characteristic views' model of natural vision.
- Subsol, G, Roberts, N, Doran, M, Thirion, JP, and Whitehouse, GH, "Automatic analysis of cerebral atrophy," MAGNETIC RESONANCE IMAGING, vol. 15, pp. 917-927, 1997.
Abstract:
3D MR data obtained for 10 healthy control subjects have been
used to build a brain atlas, The atlas is built in four stages,
First, a set of features that are unambiguously definable and
anatomically relevant need to be computed for each item in the
database, The chosen features are crest lines along which the
maximal principal curvature of the surface of the brain is
maximal in its associated principal direction, Second, a
nonrigid registration algorithm is used to determine the common
crest lines among the subjects in the database, These crest
lines form the structure of the atlas. Third, a set of crest
lines is taken as a reference set and a modal analysis is
performed to determine the fundamental deformations that are
necessary to bring the individual data in line with the
reference set, The deformations are averaged and the set of
mean crest lines becomes the atlas, Finally, the standard
deviation of the deformations between the atlas and the items
in the database defines the normal variation in the relative
positions of the crest lines in a healthy population, In a
fully automatic procedure, the crest lines on the surface of
the brain adjacent to the cerebral ventricles in a patient with
primary progressive aphasia were compared to the atlas;
confirmation that the brain of this patient demonstrates
atrophy was provided by stereological analysis that showed that
the volume of the left cerebral hemisphere is 48.8 ml (CE 2.8%)
less than the volume of the right cerebral hemisphere in the
region of the temporal and frontal lobes, When the amplitude of
the deformations necessary to register the crest lines obtained
for the patient with the atlas were greater than three standard
deviations beyond the variability inherent in the atlas, the
deformation was considered significant, Four of the five main
deformation modes of the longest crest line of the surface of
the brain adjacent to the cerebral ventricles were
significantly different in the patient with primary progressive
aphasia compared to the atlas, The ventricles are
preferentially enlarged in the left cerebral hemisphere,
Furthermore, they are closer together posteriorly and further
apart anteriorly than in the atlas, These observations may be
indicative of the atrophy of the temporal and frontal lobes of
the left cerebral hemisphere noted in the patient, Ultimately,
the approach may provide a useful screening technique for
identifying brain diseases involving cerebral atrophy, Serial
studies of individual patients may provide insights into the
processes controlling or affected by particular diseases. (C)
1997 Elsevier Science Inc.
- Rueckert, D, and Burger, P, "Shape-based segmentation and tracking in 4D cardiac MR images," CVRMED-MRCAS'97, LECTURE NOTES IN COMPUTER SCIENCE, vol. 1205, pp. 43-52, 1997.
Abstract:
We present a new approach to shape-based segmentation and
tracking of multiple, deformable anatomical structures in
cardiac MR images. We propose to use an energy-minimizing
geometrically deformable template (GDT) which can deform into
similar shapes under the influence of image forces. The degree
of deformation of the template from its equilibrium Shape is
measured by a penalty function associated with mapping between
the two shapes. In 2D, this term corresponds to the bending
energy of an idealized thin-plate of metal. By minimizing this
term along with the image energy terms of the classic
deformable model, the deformable template is attracted towards
objects in the image whose shape is similar to its equilibrium
shape. This framework allows the simultaneous segmentation of
multiple deformable objects using intra-as well as inter-shape
information. The energy minimization problem of the deformable
template is formulated in a Bayesian framework and solved using
relaxation techniques: Simulated Annealing (SA), a stochastic
relaxation technique is used for segmentation while Iterated
Conditional Modes (ICM), a deterministic relaxation technique
is used for tracking. We present results of the algorithm
applied to the reconstruction of the left and right ventricle
of the human heart in 4D MR images.
- Montagnat, J, and Delingette, H, "Volumetric medical images segmentation using shape constrained deformable models," CVRMED-MRCAS'97, LECTURE NOTES IN COMPUTER SCIENCE, vol. 1205, pp. 13-22, 1997.
Abstract:
In this paper we address the problem of extracting geometric
models from lour contrast volumetric images, given a template
or reference shape of that model. We proceed by deforming a
reference model in a volumetric image. This reference
deformable model is represented as a simplex mesh submitted to
regularizing shape constraint. Furthermore, we introduce an
original approach that combines the deformable model framework
with the elastic registration (based on iterative closest point
algorithm) method. This new method increases the robustness of
segmentation while allowing very complex deformation, of the
original template. Examples of segmentation of the liver and
brain ventricles are provided.
- Grzeszczuk, RP, and Levin, DN, "''Brownian strings'': Segmenting images with stochastically deformable contours," IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, vol. 19, pp. 1100-1114, 1997.
Abstract:
This paper describes an image segmentation technique in which
an arbitrarily shaped contour was deformed stochastically until
it fitted around an object of interest. The evolution of the
contour was controlled by a simulated annealing process which
caused the contour to settle into the global minimum of an
image-derived ''energy'' function. The nonparametric energy
function was derived from the statistical properties of
previously segmented images, thereby incorporating prior
experience. Since the method was based on a state space search
for the contour with the best global properties, it was stable
in the presence of image errors which confound segmentation
techniques based on local criteria, such as connectivity.
Unlike ''snakes'' and other active contour approaches, the new
method could handle arbitrarily irregular contours in which
each interpixel crack represented an independent degree of
freedom. Furthermore, since the contour evolved toward the
global minimum of the energy, the method was more suitable for
fully automatic applications than the snake algorithm, which
frequently has to be reinitialized when the contour becomes
trapped in local energy minima. High computational complexity
was avoided by efficiently introducing a random local
perturbation in a time independent of contour length, providing
control over the size of the perturbation, and assuring that
resulting shape changes were unbiased. The method was
illustrated by using it to find the brain surface in magnetic
resonance head images and to track blood vessels in angiograms.
- OToole, AJ, Vetter, T, Volz, H, and Salter, EM, "Three-dimensional caricatures of human heads: distinctiveness and the perception of facial age," PERCEPTION, vol. 26, pp. 719-732, 1997.
Abstract:
A standard facial-caricaturing algorithm was applied to a
three-dimensional representation of human heads. This algorithm
sometimes produced heads that appeared 'caricatured'. More
commonly, however, exaggerating the distinctive three-
dimensional information in a face seemed to produce an increase
in the apparent age of the face-both at a local level, by
exaggerating small facial creases into wrinkles, and at a more
global level via changes that seemed to make the underlying
structure of the skull more evident. Concomitantly, de-emphasis
of the distinctive three-dimensional information in a face made
it appear relatively younger than the veridical and caricatured
faces. More formally, face-age judgments made by human
observers were ordered according to the level of caricature,
with anticaricatures judged younger than veridical faces, and
veridical faces judged younger than caricatured faces. These
results are discussed in terms of the importance of the nature
of the features made more distinct by a caricaturing algorithm
and the nature of human representation(s) of faces.
- Hozumi, T, Yoshida, K, Yoshioka, H, Yagi, T, Akasaka, T, Takagi, T, Nishiura, M, Watanabe, M, and Yoshikawa, J, "Echocardiographic estimation of left ventricular cavity area with a newly developed automated contour tracking method," JOURNAL OF THE AMERICAN SOCIETY OF ECHOCARDIOGRAPHY, vol. 10, pp. 822-829, 1997.
Abstract:
Development of an automated contour tracking method provides
detection and tracking of the endocardial boundary using the
energy minimization method without tracing a region of
interest. The purpose of this study was to compare the
automated contour tracking method and manually drawn methods
for the measurement of left ventricular cavity areas and
fractional area change. Apical four-chamber view was visualized
and recorded for off-line analysis in 11 patients by means of
two-dimensional echocardiography. The automated contour
tracking method automatically traces the endocardial border
from the recorded images and calculates left ventricular cavity
areas (end-diastole and end-systole) and fractional area
change. In the same images selected as end-diastole and end-
systole in the automated contour tracking method, left
ventricular endocardial border was manually traced to calculate
left ventricular cavity areas and fractional area change. Both
methods were compared by Linear regression analysis for the
measurement of cavity areas and fractional area change. Left
ventricular areas measured by the automated contour tracking
method showed an excellent correlation with those by the manual
method (end-diastole: r = 0.99, y = 0.83x + 2.6, standard error
of the estimate = 1.5 cm(2); end-systole: r = 0.99, y = 0.96x -
0.8, standard error of the estimate = 1.2 cm(2)). The mean
differences between the automated contour tracking and manual
methods were -3.1 +/- 5.1 cm(2) and -1.6 +/- 2.4 cm(2) at end-
diastole and end-systole, respectively. Fractional area change
determined by the automated contour tracking method correlated
well with that by the manual method (r = 0.95, y = 1.17x - 6.5,
standard error of the estimate = 3.4%). The mean difference
between the automated contour tracking and manual methods was -
0.8% +/- 7.1%. In conclusion, a newly developed automated
contour tracking method correlates highly with the manual
method for the estimation of left ventricular cavity areas and
fractional area change in high-quality images. This suggests
that this new technique may be useful in the automated
quantitation of left ventricular function in patients with
high-quality images with no dropout and no intercavity artifact
or structure.
- Huang, CL, Chang, WT, Wu, LC, and Wang, JK, "Three-dimensional PET emission scan registration and transmission scan synthesis," IEEE TRANSACTIONS ON MEDICAL IMAGING, vol. 16, pp. 542-561, 1997.
Abstract:
The duration of a positron emission tomography (PET) imaging
scan can be reduced if the transmission scan of one patient
which is used for emission correction can be synthesized by
using the reference transmission scan of another patient, In
this paper, we propose a new intersubjects PET emission scan
registration method and PET transmission synthesis method by
using the boundary information of the body or brain scan of the
PET emission scans. The PET emission scans have poor image
quality and different intensity statistics so that we
preprocess the emission scans to have similar histogram and
then apply the point distribution model (PDM) [15] to extract
the contours of the emission scan, The extracted boundary
contour of every slice is used to reconstruct the three-
dimensional (3-D) surface of the reference set and the target
set, Our registration is 3-D surface-based which uses the
normal flow method [17] to find the correspondence vector field
between two 3-D reconstructed surfaces, Since it is difficult
to analyze internal organ using the PET emission scan imaging
without correction, we assume that the deformation of internal
organ is homogeneous, With the corresponding vector field
between the two emission scans and the transmission scan of the
reference set, we can synthesize the transmission scan of the
target set.
- Huang, CL, and Huang, YM, "Facial expression recognition using model-based feature extraction and action parameters classification," JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, vol. 8, pp. 278-290, 1997.
Abstract:
This paper introduces an automatic facial expression
recognition system which consists of two parts: facial feature
extraction and facial expression recognition. The system
applies the point distribution model and the gray-level model
to find the facial features. Then the position variations of
certain designated points on the facial feature are described
by 10 action parameters (APs). There are two phases in the
recognition process: the training phase and the recognition
phase. In the training phase, given 90 different expressions,
the system classifies the principal components of the APs of
all training expressions into six different clusters. In the
recognition phase, given a facial image sequence, it identifies
the facial expressions by extracting the 10 APs, analyzes the
principal components, and finally calculates the AP profile
correlation for a higher recognition rate. In the experiments,
our system has demonstrated that it can recognize the facial
expression effectively. (C) 1997 Academic Press.
- Choi, KN, Cross, ADJ, and Hancock, ER, "Localising facial features with matched filters," AUDIO- AND VIDEO-BASED BIOMETRIC PERSON AUTHENTICATION, LECTURE NOTES IN COMPUTER SCIENCE, vol. 1206, pp. 11-20, 1997.
Abstract:
This paper describes a study of facial feature recognition
using matched filter techniques. The basic aim is to develop a
set of filters that can be used to characterise each of eight
different facial features. These are left and right eyes, left
and right-eyebrows, hairline, nose, mouth and chin. The matched
filters are extracted from training images using inverse
Fourier analysis. We provide an experimental evaluation of the
method on the University of Berne face data-base.. Here we
explore the most effective choice of training data so that the
filters can be effectively applied when the facial pose varies.
We also evaluate the effectiveness of the method when facial
occlusion due to spectacles is present.
- Bowden, R, Mitchell, TA, and Sarhadi, M, "Cluster based nonlinear principle component analysis," ELECTRONICS LETTERS, vol. 33, pp. 1858-1859, 1997.
Abstract:
In the field of computer vision, principle component analysis
(PCA) is often used to provide statistical models of shape,
deformation or appearance. This simple statistical model
provides a constrained. compact approach to model based vision.
However, as larger problems are considered. high dimensionality
and nonlinearity make linear PCA an unsuitable and unreliable
approach. A nonlinear PCA (NLPCA) technique is proposed which
uses cluster analysis and dimensional reduction to provide a
fast. robust solution. Simulation results on both 2D contour
models and greyscale images are presented.
- Joshi, SC, Miller, MI, and Grenander, U, "On the geometry and shape of brain sub-manifolds," INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, vol. 11, pp. 1317-1343, 1997.
Abstract:
This paper develops mathematical representations for neuro-
anatomically significant substructures of the brain and their
variability in a population. The focus of the paper is an the
neuro-anatomical variation of the geometry and the "shape" of
two-dimensional surfaces in the brain. As examples, we focus on
the cortical and hippocampal surfaces in an ensemble of Macaque
monkeys and human MRI brains. The "shapes" of the substructures
are quantified via the construction of templates; the
variations are represented by defining probabilistic
deformations of the template. Methods for empirically
estimating probability measures on these deformations are
developed by representing the deformations as Gaussian random
vector fields on the embedded sub-manifolds. The Gaussian
random vector fields are constructed as quadratic mean limits
using complete orthonormal bases on the sub-manifolds. The
complete orthonormal bases are generated using modes of
vibrations of the geometries of the brain sub-manifolds. The
covariances are empirically estimated from an ensemble of brain
data. Principal component analysis is presented for
characterizing the "eigen-shape" of the hippocampus in an
ensemble of MRI-MPRAGE whole brain images. Clustering based on
eigen-shape is presented for two sub-populations of normal and
schizophrenic.
- Christensen, GE, Joshi, SC, and Miller, MI, "Volumetric transformation of brain anatomy," IEEE TRANSACTIONS ON MEDICAL IMAGING, vol. 16, pp. 864-877, 1997.
Abstract:
This paper presents diffeomorphic transformations of three-
dimensional (3-D) anatomical image data of the macaque
occipital lobe and whole brain cryosection imagery and of deep
brain structures in human brains as imaged via magnetic
resonance imagery, These transformations are generated in a
hierarchical manner, accommodating both global and local
anatomical detail, The initial low-dimensional registration is
accomplished by constraining the transformation to be in a low-
dimensional basis, The basis is defined by the Green's function
of the elasticity operator placed at predefined locations in
the anatomy and the eigenfunctions of the elasticity operator,
The high-dimensional large deformations are vector fields
generated via the mismatch between the template and target-
image volumes constrained to be the solution of a Navier-Stokes
fluid model. As part of this procedure, the Jacobian of the
transformation is tracked, insuring the generation of
diffeomorphisms. It is shown that transformations constrained
by quadratic regularization methods such as the Laplacian,
biharmonic, and linear elasticity models, do not ensure that
the transformation maintains topology and, therefore, must only
be used for coarse global registration.
- Hill, A, Brett, AD, and Taylor, CJ, "Automatic landmark identification using a new method of non- rigid correspondence," INFORMATION PROCESSING IN MEDICAL IMAGING, LECTURE NOTES IN COMPUTER SCIENCE, vol. 1230, pp. 483-488, 1997.
Abstract:
A method for corresponding the boundaries of two shapes is
presented. The algorithm locates a matching pair of sparse
polygonal approximations, one for each of a pair of boundaries,
by minimising a cost function using a greedy algorithm. The
cost function expresses the dissimilarity in both the shape and
representation error (with respect to the defining boundary) of
the sparse polygons. Results are presented for three classes of
shape which exhibit various types of non-rigid deformation. The
algorithm is also applied to an automatic landmark
identification task for the construction of statistical shape
models.
|
| |
| 1998 |
- Schuhmann, D, Seemann, M, Schoepf, UJ, Haubner, M, Krapichler, C, Gebicke, K, Reiser, M, and Englmeier, KH, "Computerized diagnostic data analysis and 3-D visualization," RADIOLOGE, vol. 38, pp. 799-809, 1998.
Abstract:
Purpose:To survey methods for 3D data visualization and image
analysis which can be used for computer based diagnostics.
Material and methods: The methods available are explained in
short terms and links to the literature are presented. Methods
which allow basic manipulation of 3D data are windowing,
rotation and clipping. More complex methods for visualization
of 3D data are multiplanar reformation, volume projections
(MIP,semi-transparent projections) and surface projections.
Methods for image analysis comprise local data transformation
(e.g. filtering) and definition and application of complex
models (e.g. deformable models). Results: Volume projections
produce an impression of the 3D data set without reducing the
data amount. This supports the interpretation of the 3D data
set and saves time in comparison to any investigation which
requires examination of all slice images. More advanced
techniques for visualization, e.g. surface projections and
hybrid rendering visualize anatomical information to a very
detailed extent, but both techniques require the segmentation
of the structures of interest. Image analysis methods can be
used to extract these structures(e.g. an organ)from the image
data. Discussion:At the present time volume projections are
robust and fast enough to be used routinely. Surface
projections can be used to visualize complex and presegmented
anatomical features.
- Hagenlocker, M, and Fujimura, K, "CFFD: a tool for designing flexible shapes," VISUAL COMPUTER, vol. 14, pp. 271-287, 1998.
Abstract:
This paper describes a solid deformation method, composed free-
form deformation (CFFD), which applies a sequence of uniform
periodic B-spline FFDs over 3D Euclidean space. The
construction of the individual FFDs, which are defined by
unbounded control lattices, is described, concentrating on a
method by which feature point trajectories are employed to
control lattice point displacements. Problems due to mutual
influence of feature point trajectories are discussed, and
frozen points, which inhibit deformation in their proximity,
are introduced. Also, methods for constructing the FFD lattices
are proposed which control mutual influence of feature point
trajectories. The paper addresses computational issues of
constructing and applying CFFDs, and discusses the application
of CFFD to 3D design and animation.
- Montagnat, J, and Delingette, H, "Globally constrained deformable models for 3D object reconstruction," SIGNAL PROCESSING, vol. 71, pp. 173-186, 1998.
Abstract:
To achieve geometric reconstruction from 3D datasets two
complementary approaches have been widely used. On one hand,
the deformable model framework locally applies forces to fit
the data. On the other hand, the non-rigid registration
framework computes a global transformation minimizing the
distance between a template and the data. We first show that
applying a global transformation on a surface template, is
equivalent to applying certain global forces on a deformable
model. Second, we propose a scheme which combines the
registration and free-form deformation. This globally
constrained deformation scheme allows us to control the amount
of deformation from the reference shape with a single
parameter. Finally, we propose a general algorithm for
performing model-based reconstruction in a robust and accurate
manner. Examples on both range data and medical images are used
to illustrate and validate the globally constrained deformation
framework. (C) 1998 Elsevier Science B.V. All rights reserved.
- Jain, AK, Zhong, Y, and Dubuisson-Jolly, MP, "Deformable template models: A review," SIGNAL PROCESSING, vol. 71, pp. 109-129, 1998.
Abstract:
In this paper, we review the recently published work on
deformable models. We have chosen to concentrate on 2D
deformable models and relate the energy minimization approaches
to the Bayesian formulations. We categorize the various active
contour systems according to the definition of the deformable
model. We also present in detail one particular formulation for
deformable templates which combines edge, texture, color and
region information for the external energy and model
deformations using wavelets, splines or Fourier descriptors. We
explain how these models can be used for segmentation, image
retrieval in a large database and object tracking in a video
sequence. (C) 1998 Elsevier Science B.V. All rights reserved.
- Fujimura, K, and Makarov, M, "Foldover-free image warping," GRAPHICAL MODELS AND IMAGE PROCESSING, vol. 60, pp. 100-111, 1998.
Abstract:
An image warping method is presented that deforms an image
continuously without foldover, while observing a given set of
trajectories of feature elements. Any intermediate image during
the morph is homeomorphic to the initial image and the morphing
process is a homotopy. The method permits points, line-
segments, and polygons to be included as features in the image.
Our method is based on time-varying triangulation, that is,
triangulation changes as features move. Accordingly, the
deformation mapping is updated locally for the part for which
the triangulation changes. Experimental results are included to
demonstrate the feasibility of our approach and the complexity
of the algorithm is analyzed. (C) 1998 Academic Press.
- Ip, HHS, and Shen, DG, "An affine-invariant active contour model (AI-snake) for model- based segmentation," IMAGE AND VISION COMPUTING, vol. 16, pp. 135-146, 1998.
Abstract:
In this paper, we show that existing shaped-based active
contour models are not affine-invariant and we addressed the
problem by presenting an affine-invariant snake model (AI-
snake) such that its energy function are defined in terms local
and global affine-invariant features. The main characteristic
of the AI-snake is that, during the process of object
extraction, the pose of the model contour is dynamically
adjusted such that it is in alignment with the current snake
contour by solving the snake-prototype correspondence problem
and determining the required affine transformation. In
addition, we formulate the correspondence matching between the
snake and the object prototype as an error minimization process
between two feature vectors which capture both local and global
deformation information. We show that the technique is robust
against object deformations and complex scenes. (C) 1998
Elsevier Science B.V.
- Edwards, GJ, Lanitis, A, Taylor, CJ, and Cootes, TF, "Statistical models of face images - Improving specificity," IMAGE AND VISION COMPUTING, vol. 16, pp. 203-211, 1998.
Abstract:
Model-based approaches to the interpretation of face images
have proved very successful. We have previously described
statistically based models of face shape and grey-level
appearance and shown bow they can be used to perform various
coding and interpretation tasks. In the paper we describe
improved methods of modelling which couple shape and grey-level
information more directly than our existing methods, isolate
the changes in appearance due to different sources of
variability (person, expression, pose, lighting) and deal with
non-linear shape variation. We show that the new methods are
better suited to interpretation and tracking tasks. (C) 1998
Elsevier Science B.V.
- Glasbey, CA, and Mardia, KV, "A review of image-warping methods," JOURNAL OF APPLIED STATISTICS, vol. 25, pp. 155-171, 1998.
Abstract:
Image warping is a transformation which maps all positions in
one image plane to positions in a second plane. It arises in
many image analysis problems, whether in order to remove
optical distortions introduced by a camera or a particular
viewing perspective, to register art image with a map or
template, or to align two or more images. The choice of warp is
a compromise between a smooth distortion and one which achieves
a good match. Smoothness can be ensured by assuming a
parametric form for the warp or by constraining it using
differential equations. Matching can be specified by points to
be brought into alignment, by local measures of correlation
between images, or by the coincidence of edges. Parametric and
non-parametric approaches to warping, and matching criteria,
are reviewed.
- Garrido, A, and De la Blanca, NP, "Physically-based active shape models: Initialization and optimization," PATTERN RECOGNITION, vol. 31, pp. 1003-1017, 1998.
Abstract:
In this paper we describe a new approach for 2-D object
segmentation using an automatic method applied on images with
problems as partial information, overlapping objects, many
objects in a single scene, severe noise conditions and locating
objects with a very high degree of deformation. We use a
physically-based shape model to obtain a deformable template,
which is defined on a canonical orthogonal coordinate system.
The proposed methodology works starting from the output of an
edge detector, which is processed to automatically obtain an
approximation of the shape. The final estimation of the shapes
is obtained fitting a deformable template model, which is
defined on a learned surface of deformation. Results from
biological images are presented. (C) 1998 Pattern Recognition
Society. Published by Elsevier Science Ltd. All rights
reserved.
- Ivins, JP, and Porrill, J, "A deformable model of the human iris for measuring small three- dimensional eye movements," MACHINE VISION AND APPLICATIONS, vol. 11, pp. 42-51, 1998.
Abstract:
This paper describes a deformable model of the human iris which
forms part of a system for accurate offline measurement of
binocular three-dimensional eye movements, particularly
cyclotorsion (torsion), from video image sequences. At least
two existing systems measure torsion from infrared video images
by pupil tracking followed by cross correlation using arcs of
bandpass-filtered iris texture. Unfortunately, pupil expansion
and contraction reduces the accuracy of this method unless
drugs are used to constrict the pupil, which causes temporary
blurred vision. A five-parameter deformable model of the iris
is therefore developed for analysing images obtained without
the use of drugs. This model can translate (horizontal and
vertical eye motion), rotate (torsion) and scale both uniformly
and radially (pupil changes). Torsion measurements obtained
with the model are repeatable and accurate to within 0.1
degrees; this performance in illustrated by analysing binocular
torsion during fixation on a stationary target.
- Vetter, T, "Synthesis of novel views from a single face image," INTERNATIONAL JOURNAL OF COMPUTER VISION, vol. 28, pp. 103-116, 1998.
Abstract:
Images formed by a human face change with viewpoint. A new
technique is described for synthesizing images of faces from
new viewpoints, when only a single 2D image is available. A
novel 2D image of a face can be computed without explicitly
computing the 3D structure of the head. The technique draws on
a single generic 3D model of a human head and on prior
knowledge of faces based on example images of other faces seen
in different poses. The example images are used to "learn" a
pose-invariant shape and texture description of a new face. The
3D model is used to solve the correspondence problem between
images showing faces in different poses. The proposed method is
interesting for view independent face recognition tasks as well
as for image synthesis problems in areas like teleconferencing
and virtualized reality.
- Gu, C, and Lee, MC, "Semiautomatic segmentation and tracking of semantic video objects," IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, vol. 8, pp. 572-584, 1998.
Abstract:
This paper introduces a novel semantic video object extraction
system using mathematical morphology and a perspective motion
model. Inspired by the results from the study of the human
visual system, we intend to solve the semantic video object
extraction problem in two separate steps: supervised I-frame
segmentation, and unsupervised P-frame tracking. First, the
precise semantic video object boundary can be found using a
combination of human assistance and a morphological
segmentation tool. Second, the semantic video objects in the
remaining frames are obtained using global perspective motion
estimation and compensation of the previous semantic video
object plus boundary refinement as used for I frames.
- Lepsoy, S, and Curinga, S, "Conversion of articulatory parameters into active shape model coefficients for lip motion representation and synthesis," SIGNAL PROCESSING-IMAGE COMMUNICATION, vol. 13, pp. 209-225, 1998.
Abstract:
Speech-driven facial animation combines techniques from
different disciplines such as image analysis, computer
graphics, and speech analysis. Active shape models (ASM) used
in image analysis are excellent tools for characterizing lip
contour shapes and approximating their motion in image
sequences. By controlling the coefficients for an ASM, such a
model can also be used for animation. We design a mapping of
the articulatory parameters used in phonetics into ASM
coefficients that control nonrigid lip motion. The mapping is
designed to minimize the approximation error when articulatory
parameters measured on training lip contours are taken as input
to synthesize the training lip movements. Since articulatory
parameters can also be estimated from speech, the proposed
technique can form an important component of a speech-driven
facial animation system. (C) 1998 Elsevier Science B.V. All
rights reserved.
- Ivins, J, and Porrill, J, "Constrained active region models for fast tracking in color image sequences," COMPUTER VISION AND IMAGE UNDERSTANDING, vol. 72, pp. 54-71, 1998.
Abstract:
Image segmentation is a fundamental problem in computer vision,
for which deformable models offer a partial solution. Most
deformable models work by performing some kind of edge
detection; complementary region growing methods have not often
been used. As a result, deformable models that track regions
rather than edges have yet to be developed to a great extent.
Active region models are a relatively new type of deformable
model driven by a region energy that is a function of the
statistical characteristics of an image. This paper describes
the use of constrained active region models for frame-rate
tracking in color video images on widely available computer
hardware. Two of the many color representations now in use are
reviewed for this purpose: the intensity-based RGB space and
the more intuitive HSV space. Normalized RGB, which is
essentially a measure of hue and saturation, emerges as the
preferred representation because it is invariant to
illumination changes and can be obtained from many frame-
grabbers via a simple fast software transformation. Three types
of motion are examined for constraining deformable models:
rigid models can only translate and rotate to fit image
features; conformal models can also change size; affine models
exhibit two kinds of shearing in addition to the other
components. Two methods are described for producing affine
motion, given the desired unconstrained motion calculated by
searching for local energy minima lying perpendicular to the
model boundary. An existing method, based on iterative gradient
descent, computes translating, rotating, scaling, and shearing
forces which can be combined to produce affine and other types
of motion. A faster, more accurate method uses least-squares
minimization to approximate the desired motion; with this
method it is also possible to derive specific equations for
rigid and conformal motion and to correct for the aperture
problem associated with the perpendicular search method. The
advantages of the new least-squares method are illustrated by
using it to drive an active region model via an affine
transformation which tracks the movements of a robot arm at
frame rate in color video images, (C) 1998 Academic Press.
- Cross, ADJ, and Hancock, ER, "Graph matching with a dual-step EM algorithm," IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, vol. 20, pp. 1236-1253, 1998.
Abstract:
This paper describes a new approach to matching geometric
structure in 2D point-sets. The novel feature is to unify the
tasks of estimating transformation geometry and identifying
point-correspondence matches. Unification is realized by
constructing a mixture model over the bipartite graph
representing the correspondence match and by affecting
optimization using the EM algorithm. According to our EM
framework, the probabilities of structural correspondence gate
contributions to the expected likelihood function used to
estimate maximum likelihood transformation parameters. These
gating probabilities measure the consistency of the matched
neighborhoods in the graphs. The recovery of transformational
geometry and hard correspondence matches are interleaved and
are realized by applying coupled update operations to the
expected log-likelihood function. In this way, the two
processes bootstrap one another. This provides a means of
rejecting structural outliers. We evaluate the technique on two
real-world problems. The first involves the matching of
different perspective views of 3.5-inch floppy discs. The
second example is furnished by the matching of a digital map
against aerial images that are subject to severe barrel
distortion due to a line-scan sampling process. We complement
these experiments with a sensitivity study based on synthetic
data.
- Wang, YM, and Staib, LH, "Elastic model based non-rigid registration incorporating statistical shape information," MEDICAL IMAGE COMPUTING AND COMPUTER-ASSISTED INTERVENTION - MICCAI'98, LECTURE NOTES IN COMPUTER SCIENCE, vol. 1496, pp. 1162-1173, 1998.
Abstract:
This paper describes a new method of non-rigid registration
using the combined power of elastic and statistical shape
models. The transformations are constrained to be consistent
with a physical model of elasticity to maintain smoothness and
continuity. A Bayesian formulation, based on this model, on an
intensity similarity measure, and on statistical shape
information embedded in corresponding boundary points, is
employed to find a more accurate and robust nan-rigid
registration. A dense set of forces arises from the intensity
similarity measure to accommodate complex anatomical details. A
sparse set of forces constrains consistency with statistical
shape models derived from a training set. A number of
experiments were performed on both synthetic and real medical
images of the brain and heart to evaluate the approach. It is
shown that statistical boundary shape information significantly
augments and improves elastic model based non-rigid
registration.
- Fleute, M, and Lavallee, S, "Building a complete surface model from sparse data using statistical shape models: Application to computer assisted knee surgery," MEDICAL IMAGE COMPUTING AND COMPUTER-ASSISTED INTERVENTION - MICCAI'98, LECTURE NOTES IN COMPUTER SCIENCE, vol. 1496, pp. 879-887, 1998.
Abstract:
This paper addresses the problem of extrapolating very few
range data to obtain a complete surface representation of an
antomical structure. A new method that uses statistical shape
models is proposed and its application to modeling a few points
manually digitized on the femoral surface is detailed, in order
to improve visualization of a system developped by TIMC
laboratory for computer assisted anterior cruciate ligament
(ACL) reconstruction. The model is built from a population of
11 femur specimen digitized manually. Data sets are registered
together using an elastic registration method of Szeliski and
Lavallee based on octree-splines. Principal Components Analysis
(PCA) is performed on a field of surface deformation vectors.
Fitting this statistical model to a few points is performed by
non-linear optimisation. Results are presented for both
simulated and real data. The method is very flexible and can be
applied to any structures for which the shape is stable.
- Duta, N, and Sonka, M, "Segmentation and interpretation of MR brain images: An improved active shape model," IEEE TRANSACTIONS ON MEDICAL IMAGING, vol. 17, pp. 1049-1062, 1998.
Abstract:
This paper reports a novel method for fully automated
segmentation that is based on description of shape and its
variation using point distribution models (PDM's). An
improvement of the active shape procedure introduced by Cootes
and Taylor to find new examples of previously learned shapes
using PDM's is presented, The new method for segmentation and
interpretation of deep neuroanatomic structures such as
thalamus, putamen, ventricular system, etc. incorporates a
priori knowledge about shapes of the neuroanatomic structures
to provide their robust segmentation and labeling in magnetic
resonance (MR) brain images. The method was trained in eight MR
brain images and tested in 19 brain images by comparison to
observer-defined independent standards. Neuroanatomic
structures in all testing images were successfully identified.
Computer-identified and observer-defined neuroanatomic
structures agreed well, The average labeling error was 7% +/-
3%, Border positioning errors were quite small, with the
average border positioning error of 0.8 +/- 0.1 pixels in 256 x
256 MR images, The presented method was specifically developed
for segmentation of neuroanatomic structures in MR brain
images. However, it is generally applicable to virtually any
task involving deformable shape analysis.
- Grenander, U, and Miller, MI, "Computational anatomy: An emerging discipline," QUARTERLY OF APPLIED MATHEMATICS, vol. 56, pp. 617-694, 1998.
Abstract:
This paper studies mathematical methods in the emerging new
discipline of Computational Anatomy. Herein we formalize the
Brown/Washington University model of anatomy following the
global pattern theory introduced in [1, 2], in which anatomies
are represented as deformable templates, collections of 0, 1,
2, 3-dimensional manifolds. Typical structure is carried by the
template with the variabilities accommodated via the
application of random transformations to the background
manifolds. The anatomical model is a quadruple (Omega, H, I,
P), the background space Omega = boolean ORalpha M-alpha of 0,
1, 2, 3-dimensional manifolds, the set of diffeomorphic
transformations on the background space H : Omega <-> Omega,
the space of idealized medical imagery I, and P the family of
probability measures on H. The group of diffeomorphic
transformations H is chosen to be rich enough so that a large
family of shapes may be generated with the topologies of the
template maintained. For normal anatomy one deformable template
is studied, with (Omega, H, I) corresponding to a homogeneous
space [3], in that it can be completely generated from one of
its elements, I = HItemp,I-temp is an element of I. For
disease, a family of templates boolean ORalphaItempalpha are
introduced of perhaps varying dimensional transformation
classes. The complete anatomy is a collection of homogeneous
spaces I-total = boolean ORalpha(I-alpha,H-alpha). There are
three principal components to computational anatomy studied
herein. (1) Computation of large deformation maps: Given any
two elements I, I' is an element of I in the same homogeneous
anatomy (Omega, H, I), compute diffeomorphisms h from one
anatomy to the other I (h-1)reversible arrow(h) I'. This is the
principal method by which anatomical structures are understood,
transferring the emphasis from the images I is an element of I
to the structural transformations h is an element of H that
generate them. (2) Computation of empirical probability laws:
Given populations of anatomical imagery and diffeomorphisms
between them I h(n-1)reversible arrow(hn) I-n, n = 1, . . . ,
N, generate probability laws P is an element of P on H that
represent the anatomical variation reflected by the observed
population of diffeomorphisms h(n), n = 1,..., N. (3) Inference
and disease testing: Within the anatomy (Omega, H, I, P),
perform Bayesian classification and testing for disease and
anomaly.
|
| |
| 1999 |
- Luo, B, and Hancock, ER, "Procrustes alignment with the EM algorithm," COMPUTER ANALYSIS OF IMAGES AND PATTERNS, LECTURE NOTES IN COMPUTER SCIENCE, vol. 1689, pp. 623-631, 1999.
Abstract:
This paper casts the problem of point-set alignment via
Procrustes analysis into a maximum likelihood framework using
the EM algorithm. The aim is to improve the robustness of the
Procrustes alignment to noise and clutter. By constructing a
Gaussian mixture model over the missing correspondences between
individual points, we show how alignment can be realised by
applying singular value decomposition to a weighted point
correlation matrix. Moreover, by gauging the relational
consistency of the assigned correspondence matches, we can edit
the point sets to remove clutter. We illustrate the
effectiveness of the method matching stereogram. We also
provide a sensitivity analysis to demonstrate the operational
advantages of the method.
- Chui, H, Rambo, J, Duncan, J, Schultz, R, and Rangarajan, A, "Registration of cortical anatomical structures via robust 3D point matching," INFORMATION PROCESSING IN MEDICAL IMAGING, PROCEEDINGS, LECTURE NOTES IN COMPUTER SCIENCE, vol. 1613, pp. 168-181, 1999.
Abstract:
Inter-subject non-rigid registration of cortical anatomical
structures as seen in MR is a challenging problem. The
variability of the sulcal and gyral patterns across patients
makes the task of registration especially difficult regardless
of whether voxel- or feature-based techniques are used. In this
paper, we present an approach to matching sulcal point features
interactively extracted by neuroanatomical experts. The robust
point matching (RPM) algorithm is used to find the optimal
affine transformations for matching sulcal points. A 3D
linearly interpolated non-rigid warping is then generated for
the original image volume. We present quantitative and visual
comparisons between Talairach, mutual information-based
volumetric matching and RPM on five subjects' MR images.
- Cootes, TF, Beeston, C, Edwards, GJ, and Taylor, CJ, "A unified framework for atlas matching using Active Appearance Models," INFORMATION PROCESSING IN MEDICAL IMAGING, PROCEEDINGS, LECTURE NOTES IN COMPUTER SCIENCE, vol. 1613, pp. 322-333, 1999.
Abstract:
We propose to use statistical models of shape and texture as
deformable anatomical atlases. By training on sets of labelled
examples these can represent both the mean structure and
appearance of anatomy in medical images, and the allowable
modes of deformation. Given enough training examples such a
model should be able synthesise any image of normal anatomy. By
finding the parameters which minimise the difference between
the synthesised model image and the target image we can locate
all the modelled structure. This potentially time consuming
step can be solved rapidly using the Active Appearance Model
(AAM). In this paper we describe the models and the AAM
algorithm and demonstrate the approach on structures in MR
brain cross-sections.
- Brett, AD, and Taylor, CJ, "A framework for automated landmark generation for automated 3D statistical model construction," INFORMATION PROCESSING IN MEDICAL IMAGING, PROCEEDINGS, LECTURE NOTES IN COMPUTER SCIENCE, vol. 1613, pp. 376-381, 1999.
Abstract:
We describe a method of pairwise 3D surface correspondence for
the automated generation of landmarks on a set of examples from
a class of shape. We show how the pairwise corresponder can be
used in an extension of an existing framework for establishing
dense correspondences between a set of training examples to
build a 3D statistical model. The framework relies upon
additional algorithms for the production of surface paths
between vertices on a polyhedral mesh, and these are described.
An example statistical model is shown for the left lateral
ventricle of the brain.
- Velasco, HMG, Aligue, FJL, Orellana, CJG, Macias, MM, and Sotoca, MIA, "Application of ANN techniques to automated identification of bovine livestock," ENGINEERING APPLICATIONS OF BIO-INSPIRED ARTIFICIAL NEURAL NETWORKS, VOL II, LECTURE NOTES IN COMPUTER SCIENCE, vol. 1607, pp. 422-431, 1999.
Abstract:
In this work a classification system is presented that, taking
lateral images of cattle as inputs, is able to identify the
animals and classify them by breed into previously learnt
classes. The system consists of two fundamental parts. In the
first one, a deformable-model-based preprocessing of the image
is made, in which the contour of the animal in the photograph
is sought, extracted, and normalized. Next, a neural classifier
is presented that, supplemented with a decision-maker at its
output, makes the distribution into classes. In the last part,
the results obtained in a real application of this methodology
are presented.
- Germond, L, Dojat, M, Taylor, C, and Garbay, C, "A multi-agent system for MRI brain segmentation," ARTIFICIAL INTELLIGENCE IN MEDICINE, LECTURE NOTES IN ARTIFICIAL INTELLIGENCE, vol. 1620, pp. 423-432, 1999.
Abstract:
In this paper we present an original approach for the
segmentation of MRI brain images which is based on a
cooperation between low-level and high-level approches. MRI
brain images are very difficult to segment mainly due to the
presence of inhomogeneities within tissues and also due to the
high anatomical variability of the brain topology between
individuals. In order to tackle these difficulties, we have
developped a method whose characteristics are : (i) the use of
a priori knowledge essentially anatomical and model-based; (ii)
a multi-agent system (MAS) for low-level region segmentation;
(iii) a cooperation between a priori knowledge and low-level
segmentation to guide and constrain the segmentation processes.
These characteristics allow to produce an automatic detection
of the main tissues of the brain. The method is validated with
phantoms and real images through comparisons with another
widely used approach (SPM).
- Hutton, TJ, Hammond, P, and Davenport, JC, "Active shape models for customised prosthesis design," ARTIFICIAL INTELLIGENCE IN MEDICINE, LECTURE NOTES IN ARTIFICIAL INTELLIGENCE, vol. 1620, pp. 448-452, 1999.
Abstract:
Images and computer graphics play an increasingly important
role in the design and manufacture of medical prostheses and
implants. Images provide guidance on optimal design in terms of
location, preparation and the overall shape and configuration
of subcomponents. Direct manipulation of a graphical
representation provides a natural design environment. RaPiD is
a CAD-like knowledge-based assistant for designing a dental
prosthesis known as a removable partial denture (RPD). The
expertise embedded in RaPiD encourages optimal subcomponent
configuration, but currently supports only minor customisation.
This paper describes how oral images and Active Shape Models
(ASMs) are being used to address this limitation.
- Kelemen, A, Szekely, G, and Gerig, G, "Elastic model-based segmentation of 3-D neuroradiological data sets," IEEE TRANSACTIONS ON MEDICAL IMAGING, vol. 18, pp. 828-839, 1999.
Abstract:
This paper presents a new technique for the automatic model-
based segmentation of three-dimensional (3-D) objects from
volumetric image data. The development closely follows the
seminal work of Taylor and Cootes on active shape models, but
is based on a hierarchical parametric object description rather
than a point distribution model, The segmentation system
includes both the building of statistical models and the
automatic segmentation of new image data sets via a restricted
elastic deformation of shape models, Geometric models are
derived from a sample set of image data which have been
segmented by experts, The surfaces of these binary objects are
converted into parametric surface representations, which are
normalized to get an invariant object-centered coordinate
system, Surface representations are expanded into series of
spherical harmonics which provide parametric descriptions of
object shapes. It is shown that invariant object surface
parametrization provides a good approximation to automatically
determine object homology in terms of sets of corresponding
sets of surface points. Gray-level information near object
boundaries is represented by 1-D intensity profiles normal to
the surface. Considering automatic segmentation of brain
structures as our driving application, our choice of
coordinates for object alignment was the well-accepted
stereotactic coordinate system. Major variation of object
shapes around the mean shape, also referred to as shape
eigenmodes, are calculated in shape parameter space rather than
the feature space of point coordinates, Segmentation makes use
of the object shape statistics by restricting possible elastic
deformations into the range of the training shapes, The mean
shapes are initialized in a new data set by specifying the
landmarks of the stereotactic coordinate system, The model
elastically deforms, driven by the displacement forces across
the object's surface, which are generated by matching local
intensity profiles. Elastical deformations are limited by
setting bounds for the maximum variations in eigenmode space.
The technique has been applied to automatically segment left
and right hippocampus, thalamus, putamen, and globus pallidus
from volumetric magnetic resonance scans taken from
schizophrenia studies. The results have been validated by
comparison of automatic segmentation with the results obtained
by interactive expert segmentation.
- Lotjonen, J, Magnin, IE, Nenonen, J, and Katila, T, "Reconstruction of 3-D geometry using 2-D profiles and a geometric prior model," IEEE TRANSACTIONS ON MEDICAL IMAGING, vol. 18, pp. 992-1002, 1999.
Abstract:
A method has been developed to reconstruct three-dimensional
(3-D) surfaces from two-dimensional (2-D) projection data. It
is used to produce individualized boundary element models,
consisting of thorax and lung surfaces, for electro- and
magnetocardiographic inverse problems. Two orthogonal
projections are utilized, A geometrical prior model, built:
using segmented magnetic resonance images, is deformed
according to profiles segmented from projection images. In our
method, virtual X-ray images of the prior model are first
constructed by simulating real X-ray imaging, The 2-D profiles
of the model are segmented from the projections and elastically
matched with the profiles segmented from patient data. The
displacement vectors produced by the elastic 2-D matching are
back projected onto the 3-D surface of the prior model.
Finally, the model is deformed, using the back-projected
vectors. Two different deformation methods are proposed, The
accuracy of the method is validated by a simulation, The
average reconstruction error of a thorax and lungs was 1.22
voxels, corresponding to about 5 mm.
- Chesnaud, C, Refregier, P, and Boulet, V, "Statistical region snake-based segmentation adapted to different physical noise models," IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, vol. 21, pp. 1145-1157, 1999.
Abstract:
Algorithms for object segmentation are crucial in many image
processing applications. During past years, active contour
models (snakes) have been widely used for finding the contours
of objects. This segmentation strategy is classically edge-
based in the sense that the snake is driven to fit the maximum
of an edge map of the scene. In this paper, we propose a region
snake approach and we determine fast algorithms for the
segmentation of an object in an image. The algorithms developed
in a Maximum Likelihood approach are based on the calculation
of the statistics of the inner and the outer regions (defined
by the snake). It has thus been possible to develop optimal
algorithms adapted to the random fields which describe the gray
levels in the input image if we assume that their probability
density function family are known. We demonstrate that this
approach is still efficient when no boundary's edge exists in
the image. We also show that one can obtain fast algorithms by
transforming the summations over a region, for the calculation
of the statistics, into summations along the boundary of the
region. Finally, we will provide numerical simulation results
for different physical situations in order to illustrate the
efficiency of this approach.
- Stammberger, T, Eckstein, F, Michaelis, M, Englmeier, KH, and Reiser, M, "Interobserver reproducibility of quantitative cartilage measurements: Comparison of B-spline snakes and manual segmentation," MAGNETIC RESONANCE IMAGING, vol. 17, pp. 1033-1042, 1999.
Abstract:
The objective of this work was to develop a segmentation
technique for thickness measurements of the articular cartilage
in MR images and to assess the interobserver reproducibility of
the method in comparison with manual segmentation. The
algorithm is based on a B-spline snakes approach and is able to
delineate the cartilage boundaries in real time and with
minimal user interaction. The interobserver reproducibility of
the method, ranging from 3.3 to 13.6% for various section
orientations and joint surfaces, proved to be significantly
superior to manual segmentation. (C) 1999 Elsevier Science Inc.
- Craw, I, Costen, N, Kato, T, and Akamatsu, S, "How should we represent faces for automatic recognition?," IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, vol. 21, pp. 725-736, 1999.
Abstract:
We describe results obtained from a testbed used to investigate
different codings for automatic face recognition. An eigenface
coding of shape-free faces using manually located landmarks was
more effective than the corresponding coding of correctly
shaped faces. Configuration also proved an effective method of
recognition, with rankings given to incorrect matches
relatively uncorrelated with those from shape-free faces. Both
sets of information combine to improve significantly the
performance of either system. The addition of a system, which
directly correlated the intensity values of shape-free images,
also significantly increased recognition, suggesting extra
information was still available. The recognition advantage for
shape-free faces reflected and depended upon high-quality
representation of the natural facial variation via a disjoint
ensemble of shape-free faces; if the ensemble was comprised of
nonfaces, a shape-free disadvantage was induced. Manipulation
within the shape-free coding to emphasize distinctive features
of the faces, by caricaturing, allowed further increases in
performance; this effect was only noticeable when the
independent shape-free and configuration coding was used. Taken
together, these results strongly support the suggestion that
faces should be considered as lying in a high-dimensional
manifold, which is locally linearly approximated by these
shapes and textures, possibly with a separate system for local
features. Principal Components Analysis is then seen as a
convenient tool in this local approximation.
- Egmont-Petersen, M, and Arts, T, "Recognition of radiopaque markers in X-ray images using a neural network as nonlinear filter," PATTERN RECOGNITION LETTERS, vol. 20, pp. 521-533, 1999.
Abstract:
Neural networks are developed to recognise radiopaque markers
in biplane cineangiographic video-images with a background
composed of different objects. Our connectionist approach is
compared theoretically as well as experimentally with linear
template matching. Theoretically, neural networks are likely to
give better recognition results as they can implement nonlinear
discriminants. Experiments confirm that the networks result in
better marker recognitions than template matching. (C) 1999
Elsevier Science B.V. All rights reserved.
- Cootes, TF, and Taylor, CJ, "A mixture model for representing shape variation," IMAGE AND VISION COMPUTING, vol. 17, pp. 567-573, 1999.
Abstract:
The shape variation displayed by a class of objects can be
represented as probability density function, allowing us to
determine plausible and implausible examples of the class.
Given a training set of example shapes we can align them into a
common co-ordinate frame and use kernel-based density
estimation techniques to represent this distribution. Such an
estimate is complex and expensive, so we generate a simpler
approximation using a mixture of gaussians. We show how to
calculate the distribution, and how it can be used in image
search to locate examples of the modelled object in new images.
(C) 1999 Elsevier Science B.V. All rights reserved.
- Brett, AD, Hill, A, and Taylor, CJ, "A method of 3D surface correspondence and interpolation for merging shape examples," IMAGE AND VISION COMPUTING, vol. 17, pp. 635-642, 1999.
Abstract:
A method for corresponding the triangulated mesh surface
representations of two shapes is presented. It comprises a
method of polyhedral mesh decimation and a symmetric version of
the iterative Closest Point (ICP) algorithm. The method
produces a matching pair of sparse polyhedral approximations,
one for each shape surface, using a global Euclidean measure of
similarity. A method of surface patch parameterisation is
presented which uses minimal paths constructed across the
surface of a polyhedron. We describe the use of this patch
parameterisation in the interpolation of surfaces for the
construction of a merged mean shape with a densely triangulated
surface. Results are presented for the production of a binary
tree of merged biological shapes which may be used as a basis
for the automated landmarking of a set of examples. (C) 1999
Elsevier Science B.V. All rights reserved.
- Lelieveldt, BPF, van der Geest, RJ, Rezaee, MR, Bosch, JG, and Reiber, JHC, "Anatomical model matching with fuzzy implicit surfaces for segmentation of thoracic volume scans," IEEE TRANSACTIONS ON MEDICAL IMAGING, vol. 18, pp. 218-230, 1999.
Abstract:
Many segmentation methods for thoracic volume data require
manual input in the form of a seed point, initial contour,
volume of interest etc. The aim of the work presented here is
to further automate this segmentation initialization step. In
this paper an anatomical modeling and matching method is
proposed to coarsely segment thoracic volume data into
anatomically labeled regions. An anatomical model of the thorax
is constructed in two steps: 1) individual organs are modeled
with blended fuzzy implicit surfaces and 2) the single organ
models are grouped into a tree structure with a solid modeling
technique named constructive solid geometry (CSG), The
combination of CSG with fuzzy implicit surfaces allows a
hierarchical scene description by means of a boundary model,
which characterizes the scene volume as a boundary potential
function. From this boundary potential, an energy function is
defined which is minimal when the model is registered to the
tissue-air transitions in thoracic magnetic resonance imaging
(MRI) data. This allows automatic registration in three steps:
feature detection, initial positioning and energy minimization,
The model matching has been validated in phantom simulations
and on 15 clinical thoracic volume scans from different
subjects. In 13 of these sets the matching method accurately
partitioned the image volumes into a set of volumes of interest
for the heart, lungs, cardiac ventricles, and thorax outlines.
The method is applicable to segmentation of various types of
thoracic MR-images, provided that a large part of the thorax is
contained in the image volume.
- Chalmond, B, and Girard, SC, "Nonlinear modeling of scattered multivariate data and its application to shape change," IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, vol. 21, pp. 422-432, 1999.
Abstract:
We are given a set of points in a space of high dimension. For
instance, this set may represent many visual appearances of an
object, a face, or a hand. We address the problem of
approximating this set by a manifold in order to have a compact
representation of the object appearance. When the scattering of
this set is approximately an ellipsoid, then the problem has a
well-known solution given by Principal Components Analysis
(PCA). However, in some situations like object displacement
learning or face learning, this linear technique may be ill-
adapted and nonlinear approximation has to be introduced. The
method we propose can be seen as a Non Linear PCA (NLPCA), the
main difficulty being that the data are not ordered. We propose
an index which favors the choice of axes preserving the closest
point neighborhoods. These axes determine an order for visiting
all the points when smoothing. Finally, a new criterion, called
"generalization error," is introduced to determine the
smoothing rate, that is, the knot number for the spline
fitting. Experimental results conclude this paper: The method
is tested on artificial data and on two data bases used in
visual learning.
- Cosio, FA, and Davies, BL, "Automated prostate recognition: a key process for clinically effective robotic prostatectomy," MEDICAL & BIOLOGICAL ENGINEERING & COMPUTING, vol. 37, pp. 236-243, 1999.
Abstract:
Clinical trials of PROBOT, a robotic system for prostate
surgery, have shown that robotic surgery of soft tissue can be
successful. Monitoring of the progress of the resection has
shown to be a necessary feature of an effective robotic system
for prostate surgery. It should provide the surgeon with a
reliable method of assessing the cavity during resection. An
automatic system for intraoperative monitoring of the progress
of the resection during robotic prostatectomy consists of two
subsystems: real-time intraoperative imaging of the prostate
and automatic identification of the contour of the gland on
each image. The development of a fully automatic scheme for
prostate recognition on transurethral ultrasound scans is
reported. A genetic algorithm has been developed to
automatically adjust a model of the prostate boundary until an
optimum fit to the prostate in a given image is obtained. An
analysis of ifs performance on 22 different ultrasound images
showed an average error of 6.21 mm. Use of a genetic algorithm
and a constrained prostate model have shown to be a robust way
to automatically identify the prostate in ultrasound images.
The scheme is able to produce approximate prostate boundaries,
without any human intervention, on ultrasound scans of varying
quality. In addition to soft tissue robotic surgery, the
generic algorithm technique is also applicable to a wide range
of computer assisted surgical techniques.
- Gavrila, DM, "The visual analysis of human movement: A survey," COMPUTER VISION AND IMAGE UNDERSTANDING, vol. 73, pp. 82-98, 1999.
Abstract:
The ability to recognize humans and their activities by vision
is key for a machine to interact intelligently and effortlessly
with a human-inhabited environment. Because of many potentially
important applications, "looking at people" is currently one of
the most active application domains in computer vision. This
survey identifies a number of promising applications and
provides an overview of recent developments in this domain. The
scope of this survey is limited to work on whole-body or hand
motion; it does not include work on human faces. The emphasis
is on discussing the various methodologies; they are grouped in
2-D approaches with or without explicit shape models and 3-D
approaches. Where appropriate, systems are reviewed. We
conclude with some thoughts about future directions. (C) 1999
Academic Press.
- Behiels, G, Vandermeulen, D, Maes, F, Suetens, P, and Dewaele, P, "Active Shape Model-based segmentation of digital X-ray images," MEDICAL IMAGE COMPUTING AND COMPUTER-ASSISTED INTERVENTION, MICCAI'99, PROCEEDINGS, LECTURE NOTES IN COMPUTER SCIENCE, vol. 1679, pp. 128-137, 1999.
Abstract:
We propose an improved search procedure for Active Shape Model
(ASM) based delineation of anatomical structures in digital X-
ray images. Whereas the original ASM search method (1)
iteratively improves the current estimate of the location of
boundary points by a limited least squares adjustment of the
pose and shape parameters, our method additionally requires the
subsequent changes in shape during the search to be smooth,
which is achieved by using a minimum cost path search
algorithm. We, compare the two methods oil a database of more
than 400 manual segmentations of digital X-ray images of the
femur, humerus and calcaneus. We evaluate the accuracy and
robustness of both methods using a cross-validation procedure.
- Fleute, M, and Lavallee, S, "Nonrigid 3-D/2-D registration of images using statistical models," MEDICAL IMAGE COMPUTING AND COMPUTER-ASSISTED INTERVENTION, MICCAI'99, PROCEEDINGS, LECTURE NOTES IN COMPUTER SCIENCE, vol. 1679, pp. 138-147, 1999.
Abstract:
This paper presents a new algorithm for reconstruction of 3D
shapes using a few x-ray views and a statistical model. In many
applications of surgery such as orthopedics, it is desirable to
define a surgical planning oil 3-D images and then to execute
the plan using standard registration techniques and image-
guided surgery systems. But the cost, time and x-ray dose
associated with standard pre-operative Computed Tomography
makes it difficult to use this methodology for rather standard
interventions. Instead, we propose to use. a few x-ray images
generated from a C-Arm and to build the 3-D shape of the
patient bones or organs intra-operatively, by deforming a
statistical 3-D model to the contours segmented oil the x-ray
views. In this paper, we concentrate on the application of our
method to bone reconstruction. The algorithm starts from
segmented contours of the bone oil the x-ray images and ail
initial estimate of the pose of the 3-D model in the common
coordinate system of the set of x-ray projections. The
statistical model is made of a few principal modes that are
sufficient to represent the normal anatomy. Those modes are
built by using a generalization of the Cootes and Taylor method
to 3-D surface models, previously published in MICCAI'98 by the
authors. Fitting the model to the contours is achieved by using
a generalization of the Iterative Closest Point Algorithm to
nonrigid 3D/2D registration. For pathological shapes, the
statistical model is not valid and subsequent local refinement
is necessary. First results are presented for a 3-D statistical
model of the distal part of the femur.
- Lotjonen, J, Magnin, IE, Reinhardt, L, Nenonen, J, and Katila, T, "Automatic reconstruction of 3D geometry using projections and a geometric prior model," MEDICAL IMAGE COMPUTING AND COMPUTER-ASSISTED INTERVENTION, MICCAI'99, PROCEEDINGS, LECTURE NOTES IN COMPUTER SCIENCE, vol. 1679, pp. 192-201, 1999.
Abstract:
A method has been developed to reconstruct 3D surfaces from two
orthogonal X-ray projections. A 3D geometrical prior model,
composed of triangulated surfaces, is deformed according to
contours segmented from projection images. The contours are
segmented by a new method based on free-form deformation.
First, virtual X-ray images of the prior model are constructed
by simulating real X-ray imaging, Thereafter, the contours
segmented from the virtual projections are elastically matched
with patient data. Next, the produced 2D vectors are back-
projected onto the surface of the prior model and the prior
model is deformed using the back-projected vectors with shape-
based interpolation. The accuracy of the method is validated by
it data set, containing 20 cases. The method is applied to
reconstruct thorax and lung surfaces. The average matching
error is about 1.2 voxels, corresponding to 5 mm.
- Suri, JS, Haralick, RM, and Sheehan, FH, "Linear vs. quadratic optimization algorithms for bias correction of left ventricle chamber boundaries in low contrast projection ventriculograms produced from xray cardiac catheterization procedure," COMPUTER ANALYSIS OF IMAGES AND PATTERNS, LECTURE NOTES IN COMPUTER SCIENCE, vol. 1689, pp. 108-117, 1999.
Abstract:
Cardiac catheterization procedure produces ventriculogram which
have very low contrast in the apical, anterior and inferior
zones of the left ventricle (LV). Pixel-based classifiers
operating on these images produce boundaries which have
systematic positional and orientation bias and have a mean
error of about 10.5 mm. Using the IV convex information,
comprising of the apex and the aortic valve plane, this pa, per
presents a comparison of the linear and quadratic optimization
algorithms to remove these biases. These algorithms axe named
after the way the coefficients are computed: the identical
coefficient and the independent coefficient. Using the polyline
metric, we show that the quadratic optimization is better than
the linear optimization. We also show that the independent
coefficient method performs better than the identical
coefficient when the training data is large. The overall mean
system error was 2.49 mm while the goal set by the cardiologist
was 2.5 mm.
|
| |
| 2000 |
- Hill, A, Taylor, CJ, and Brett, AD, "A framework for automatic landmark identification using a new method of nonrigid correspondence," IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, vol. 22, pp. 241-251, 2000.
Abstract:
A framework for automatic landmark indentification is presented
based on an algorithm for corresponding the boundaries of two
shapes. The auto-landmarking framework employs a binary tree of
corresponded pairs of shapes to generate landmarks
automatically on each of a set of example shapes. The landmarks
are used to train statistical shape models known as Point
Distribution Models. The correspondence algorithm locates a
matching pair of sparse polygonal approximations, one for each
of a pair of boundaries by minimizing a cost function, using a
greedy algorithm. The cost function expresses the dissimilarity
in both the shape and representation error (with respect to the
defining boundary) of the sparse polygons. Results are
presented for three classes of shape which exhibit various
types of nonrigid deformation.
- Bronkorsta, PJH, Reinders, MJT, Hendriks, EA, Grimbergen, J, Heethaar, RM, and Brankenhoff, GJ, "On-line detection of red blood cell shape using deformable templates," PATTERN RECOGNITION LETTERS, vol. 21, pp. 413-424, 2000.
Abstract:
For the purpose of automating a clinical diagnostic apparatus
to quantify the deformability of human red blood cells, we
present an automated image analysis procedure for on-line
detection of the cell shape based upon the method of parametric
deformable templates. (C) 2000 Elsevier Science B.V. All rights
reserved.
- Brett, AD, and Taylor, CJ, "A method of automated landmark generation for automated 3D PDM construction," IMAGE AND VISION COMPUTING, vol. 18, pp. 739-748, 2000.
Abstract:
A previous publication has described a method of pairwise
three-dimensional (3D) surface correspondence for the automated
generation of landmarks on a set of examples from a class of
shape (A.D. Brett, A. Hill, C.J. Taylor, A method of 3D surface
correspondence for automated landmark generation, in: 8th
British Machine Vision Conference, Essex, England, September
1997, pp 709-718). In this paper we describe a set of improved
algorithms which give more accurate and more robust results. We
show how the pairwise corresponder can be used in an extension
of an existing framework for establishing dense correspondences
between a set of training examples (A. Hill, A.D. Brett, C.J.
Taylor, Automatic landmark identification using a new method of
non-rigid correspondence, in: J. Duncan, G. Gindi, (Eds.), 15th
Conference on Information Processing in Medical Imaging,
Poulteney, VT, Springer, Berlin, 1997, pp. 483-488) to build a
3D Point Distribution Model. The framework relies upon
additional algorithms for the production of surface paths
between vertices on a polyhedral mesh, and these are described.
Example statistical models are shown for both smooth synthetic
data and the left lateral ventricle of the brain, a complex
biological shape which demonstrates considerable variation
between individuals. (C) 2000 Elsevier Science B.V. All rights
reserved.
- Bowden, R, Mitchell, TA, and Sarhadi, M, "Non-linear statistical models for the 3D reconstruction of human pose and motion from monocular image sequences," IMAGE AND VISION COMPUTING, vol. 18, pp. 729-737, 2000.
Abstract:
This paper presents a model based approach to human body
tracking in which the 2D silhouette of a moving human and the
corresponding 3D skeletal structure are encapsulated within a
non-linear point distribution model. This statistical model
allows a direct mapping to be achieved between the external
boundary of a human and the anatomical position. It is shown
how this information, along with the position of landmark
features such as the hands and head can be used to reconstruct
information about the pose and structure of the human body from
a monocular view of a scene. (C) 2000 Elsevier Science B.V. All
rights reserved.
- Egmont-Petersen, M, Schreiner, U, Tromp, SC, Lehmann, TM, Slaaf, DW, and Arts, T, "Detection of leukocytes in contact with the vessel wall from in vivo microscope recordings using a neural network," IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING, vol. 47, pp. 941-951, 2000.
Abstract:
Leukocytes play an important role in the host defense as they
may travel from the blood stream into the tissue in reacting to
inflammatory stimuli. The leukocyte-vessel wall interactions
are studied in post capillary vessels by intraviral video
microscopy during in vivo animal experiments. Sequences of
video images are obtained and digitized with a frame grabber. A
method for automatic detection and characterization of
leukocytes in the video images is developed. Individual
leukocytes are detected using a neural network that is trained
with synthetic leukocyte images generated using a novel
stochastic model. This model makes it feasible to generate
images of leukocytes with different shapes and sizes under
various lighting conditions. Experiments indicate that neural
networks trained with the synthetic leukocyte images perform
better than networks trained with images of manually detected
leukocytes. The best performing neural network trained with
synthetic leukocyte images resulted in an 18% larger area under
the ROC curve than the best performing neural network trained
with manually detected leukocytes.
- Zhong, Y, Jain, AK, and Dubuisson-Jolly, MP, "Object tracking using deformable templates," IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, vol. 22, pp. 544-549, 2000.
Abstract:
We propose a novel method for object tracking using prototype-
based deformable template models. To track an object in an
image sequence, we use a criterion which combines two terms:
the frame-to-frame deviations of the object shape and the
fidelity of the modeled shape to the Input image. The
deformable template model utilizes the prior shape information
which is extracted from the previous frames along with a
systematic shape deformation scheme to model the object shape
in a new frame. The following image information Is used in the
tracking process: 1) edge and gradient information: the object
boundary consists of pixels with large image gradient, 2)
region consistency: the same object region possesses consistent
color and texture throughout the sequence, and 3) interframe
motion: the boundary of a moving object is characterized by
large interframe motion. The tracking proceeds by optimizing an
objective function which combines both the shape deformation
and the fidelity of the modeled shape to the current image (in
terms of gradient, texture, and interframe motion). The
inherent structure in the deformable template. together with
region, motion, and image gradient cues. makes the proposed
algorithm relatively insensitive to the adverse effects of weak
image features and moderate amounts of occlusion.
- Huang, CL, Wu, MS, and Jeng, SH, "Gesture recognition using the multi-PDM method and Hidden Markov Model," IMAGE AND VISION COMPUTING, vol. 18, pp. 865-879, 2000.
Abstract:
This paper introduces a multi-Principal-Distribution-Model
(PDM) method and Hidden Markov Model (I-m IM) for gesture
recognition. To track the hand-shape, it uses the PDM model
which is built by learn |