Machine Learning and Statistical Modeling Approaches to Image Retrieval
:
Yixin Chen, Jia Li, James Z. Wang
:
Machine Learning and Statistical Modeling Approaches to Image Retrieval
:
Springer-Verlag
:
9781402080357
:
1
:
CHF 70.40
:
:
Sonstiges
:
English
:
198
:
DRM
:
PC/MAC/eReader/Tablet
:
PDF
In the early 1990s, the establishment of the Internet brought forth a revolutionary viewpoint of information storage, distribution, and processing: the World Wide Web is becoming an enormous and expanding distributed digital library. Along with the development of the Web, image indexing and retrieval have grown into research areas sharing a vision of intelligent agents. Far beyond Web searching, image indexing and retrieval can potentially be applied to many other areas, including biomedicine, space science, biometric identification, digital libraries, the military, education, commerce, culture and entertainment.
"Machin Learning and Statistical Modeling Approaches to Image Retrieval" describes several approaches of integrating machine learning and statistical modeling into an image retrieval and indexing system that demonstrates promising results. The topics of this book reflect authors' experiences of machine learning and statistical modeling based image indexing and retrieval. This book contains detailed references for further reading and research in this field as well.
2.1 Similarity Comparison
(p.16-17)
Similarity comparison is a key issue in CBIR [Santini and Jain, 1999]. In general, the comparison is performed over imagery features. According to the scope of representation, features fall roughly into two categories: global features and local features. The former category includes texture histogram, color histogram, color layout of the whole image, and features selected from multidimensional discriminant analysis of a collection of images [Faloutsos et al., 1994; Gupta and Jain, 1997; Pentland et al., 1996; Smith and Chang, 1996; Swets and Weng, 1996]. In the latter category are color, texture, and shape features for subimages [Picard and Minka, 1995], segmented regions [Carson et al., 2002; Chen and Wang, 2002; Ma and Manjunath, 1997; Wang et al., 2001b], and interest points [Schmid and Mohr, 1997].
As a relatively mature method, histogram matching has been applied to many general-purpose image retrieval systems such as IBM QBIC [Faloutsos et al., 1994], MIT Photobook [Pentland et al., 1996], Virage System [Gupta and Jain, 1997], and Columbia VisualSEEK and WebSEEK [Smith and Chang, 1996], etc. The Mahalanobis distance [Hafner et al., 1995] and intersection distance [Swain and Ballard, 1991] are commonly used to compute the difference between two histograms with the same number of bins. When the number of bins are different, e.g., when a sparse representation is used, the Earth Mover’s Distance (EMD) [Rubner et al., 1997] applies. The EMD is computed by solving a linear programming problem. A major drawback of the global histogram search lies in its sensitivity to intensity variations, color distortions, and cropping.
Many approaches have been proposed to tackle this problem:
*
The PicToSeek [Gevers and Smeulders, 2000] system uses color models invariant to object geometry, object pose, and illumination.
*
VisualSEEK and Virage systems attempt to reduce the influence of intensity variations and color distortions by employing spatial rela tionships and color layout in addition to those elementary color, texture, and shape features.
* The same idea of color layout indexing is extended in a later system, Stanford WBIIS [Wang et al., 1998], which, instead of averaging, characterizes the color variations over the spatial extent of an image by Daubechies’ wavelet coefficients and their variances.
* Schmid and Mohr [Schmid and Mohr, 1997] proposed a method of indexing images based on local features of automatically detected interest points of images.
* Minka and Picard [Minka and Picard, 1997] described a learning algorithm for selecting and grouping features. The user guides the learning process by providing positive and negative examples.
*
The approach presented in [Swets and Weng, 1996] uses what is called the Most Discriminating Features for image retrieval. These features are extracted from a set of training images by optimal linear projection.
*
The Virage system allows users to adjust weights of implemented features according to their own perceptions. The PicHunter system [Cox et al., 2000] and the UIUC MARS [Mehrotra et al., 1997] system are self-adaptable to different applications and different users based upon user feedbacks.
*
To approximate the human perception of the shapes of the objects in the images, Del Bimbo and Pala [Bimbo and Pala, 1997] introduced a measure of shape similarity using elastic matching.
*
In [Mojsilovic et al., 2000], matching and retrieval are performed along what is referred to as perceptual dimensions which are obtained from subjective experiments and multidimensional scaling based on the model of human perception of color patterns.
*
In [Berretti et al., 2000], two distinct similarity measures, concerning respectively with fitting human perception and with the efficiency of data organization and indexing, are proposed for content-based image retrieval by shape similarity.
Contents
7
Preface
13
Acknowledgments
16
Chapter 1 INTRODUCTION
17
1. Text-Based Image Retrieval
17
2. Content-Based Image Retrieval
19
3. Automatic Linguistic Indexing of Images
20
4. Applications of Image Indexing and Retrieval
21
4.1 Web- Related Applications
21
4.2 Biomedical Applications
23
4.3 Space Science
24
4.4 Other Applications
26
5. Contributions of the Book
26
5.1 A Robust Image Similarity Measure
26
5.2 Clustering- Based Retrieval
27
5.3 Learning and Reasoning with Regions
28
5.4 Automatic Linguistic Indexing
28
5.5 Modeling Ancient Paintings
29
6. The Structure of the Book
30
Chapter 2 IMAGE RETRIEVAL AND LINGUISTIC INDEXING
31
1. Introduction
31
2. Content-Based Image Retrieval
31
2.1 Similarity Comparison
32
2.2 Semantic Gap
34
3. Categorization and Linguistic Indexing
36
4. Summary
39
Chapter 3 MACHINE LEARNING AND STATISTICAL MODELING
41
1. Introduction
41
2. Spectral Graph Clustering
41
3. VC Theory and Support Vector Machines
44
3.1 VC Theory
45
3.2 Support Vector Machines
46
4. Additive Fuzzy Systems
50
5. Support Vector Learning for Fuzzy Rule-Based Classification Systems
52
5.1 Additive Fuzzy Rule- Based Classification Systems
53
5.2 Positive Definite Fuzzy Classifiers
54
5.3 An SVM Approach to Build Positive Definite Fuzzy Classifiers
56
6. 2-D Multi-Resolution Hidden Markov Models
58
7. Summary
62
Chapter 4 A ROBUST REGION-BASEDSIMILARITY MEASURE
63
1. Introduction
63
2. Image Segmentation and Representation
65
2.1 Image Segmentation
65
2.2 Fuzzy Feature Representation of an Image
67
2.3 An Algorithmic View
71
3. Unified Feature Matching
72
3.1 Similarity Between Regions
72
3.2 Fuzzy Feature Matching
74
3.3 The UFM Measure
76
3.4 An Algorithmic View
78
4. An Algorithmic Summarization of the System
79
5. Experiments
80
5.1 Query Examples
80
5.2 Systematic Evaluation
80
5.3 Speed
87
5.4 Comparison of Membership Functions
88
6. Summary
89
Chapter 5 CLUSTER-BASED RETRIEVALBY UNSUPERVISED LEARNING
91
1. Introduction
91
2. Retrieval of Similarity Induced Image Clusters
92
2.1 System Overview
92
2.2 Neighboring Target Images Selection
93
2.3 Spectral Graph Partitioning
94
2.4 Finding a Representative Image for a Cluster
95
3. An Algorithmic View
96
3.1 Outline of Algorithm
96
3.2 Organization of Clusters
98
3.3 Computational Complexity
99
3.4 Parameters Selection
100
<