305x Filetype PDF File size 0.36 MB Source: projet.liris.cnrs.fr
2016 23rd International Conference on Pattern Recognition (ICPR)
Cancún Center, Cancún, México, December 4-8, 2016
Texture Classification with Discrete Fractional
Fourier Transform
Liying Zheng* and Yan Chu*
School of Computer Science and Technology
Harbin Engineering University
Harbin, China
zhengliying@hrbeu.edu.cn
chuyan@hrbeu.edu.cn
Abstract—Based on the fact that the discrete fractional Fourier proposed by Candon in 1937. The idea was then reintroduced
transform (DFrFT) of a signal consists of not only spatial but and re-interpreted by Namias [12] and McBride and Kerr [13],
frequency characteristics, a novel texture classification algorithm respectively. So far, great attention has been paid to the FrFT,
is presented in this paper. A 1-D local DFrFT is proposed and the resulting in many applications of the FrFT on the areas such as
fractional frequency histogram is established to represent a radar signal processing[14-16], medical ultrasound imaging
texture. Moreover, the conceptions such as the between-class [17, 18], and image encryption[19-21]. Recently, some efforts
scatter and the within-class scatter, the coarse-to-fine searching, have been done on applying the FrFT to the field of pattern
2-statistic distance are also adopted to
and the minimum χ recognition. Kong et al. employed the amplitudes of the 2-D
complete the classification. The simulation results on four groups FrFT to represent the emotional state of a person [19]. Jing et
of benchmark texture images validate the performance of the al. proposed a face recognition approach by combining the
proposed texture classification method. FrFT with the discrimination analysis technique [20]. Based on
Keywords- discrete fractional Fourier transform; texture the 2-D discrete FrFT (DFrFT) and the principal component
classification; fractional frequency histogram analysis (PCA), Liu and Wang also proposed a method to
address the problem of face recognition [21]. Wang and Wang
I. INTRODUCTION applied the personal features derived from the FrFT, i.e. mean
energy within critical bands, to identify speakers [22]. Singh et
Image texture which contains important information for al. completed high-resolution SAR images categorization by
object recognition and scene interpretation is referred to as the using the log cumulants of the FrFT coefficients[23].
arrangement of certain basic elements. The ability of Studies have shown that the FrFT is intimately related to
discriminating almost all types of textures of human visual the time frequency representation of the Cohen class.
system attracts the researchers from various fields, especially Specifically, the FrFT of a signal produces a rotation of its WD
from computer vision society, to pay much attention to the task in phase space [24].It can be deduced that many important
of texture classification. local characteristics of a signal can be revealed from its FrFT.
One problem in texture analysis is the description of texture In this sense, the FrFT can potentially be used for analyzing
characteristics. Some popular texture descriptors are based on textures that have obviously spatial/spatial frequency
statistical methods, such as gray level co-occurrence characteristics. In this paper, an efficient texture classification
matrices[1], while others are based on modeling techniques, method based on the DFrFT is presented. Several 1-D local
such as Markov random field [2, 3] and fractal based modeling directional DFrFTs are firstly performed on a locally centered
[4]. Studies on human visual system show that the process of texture image. The DFrFT frequencies are then used to
texture segmentation requires high resolution in both spatial generate a histogram. Next, the scatter matrices of between-
and frequency domains. Thus, to describe a texture, many class and within-class, as well as a coarse-to-fine searching
time-frequency analysis tools have been employed. For technique are adopted to estimate the optimal DFrFT order.
example, Reed and Wechsler proposed a texture analysis 2
Finally, the texture is classified by using the minimum χ -
method by using the 2-D pseudo-Wigner distribution (PWD). statistic distance classifier.
They concluded that the WD-based method enjoys superior The remainder of this paper is organized as follows.
joint resolution [5]. Zhu et al described a framework for the Section 2 briefly introduces the DFrFT. Section 3 describes the
WD-based texture analysis [6]. furthermore, other popular 1-D local directional DFrFT and the fractional frequency
techniques such as Gabor transform[7, 8], wavelet analysis[9, histogram of a texture image in detail. Section 4 presents the
10], and windowed Fourier transform[11] have also been proposed DFrFT-based texture classification method. Section 5
employed to analyze textures. gives the experimental results followed by some conclusions in
As a time-frequency analysis technique, the fractional Section 6.
Fourier transform (FrFT) is a generalization of the
conventional Fourier transform, and its concept was initially
* Corresponding author
978-1-5090-4846-5/16/$31.00 ©2016 IEEE 2032
α
II. DISCRETE FRFT (DFRFT) Since I (x, y, m) is complex, one can use amplitude, phase
d
It is well known that a digital image is in discrete form. angle, or both, as the features to describe a texture. In this
Thus, in this paper, the discrete form of the FrFT has been paper, the amplitudes are employed for simplicity. Specifically,
employed. Let vector x = [ x(0), x(1), …., x(N-1) ] represent an the fractional Fourier frequencies corresponding to the sorted
α α th amplitudes are used. Firstly, |I(x,y,m)| are sorted in descending
N-length discrete time signal, and x and F be the α order d
DFrFT of x and the DFrFT matrix, respectively. Then, we have order. Then fractional Fourier frequencies corresponding to the
sorted |I(x,y,m)| are recoded. In accordance to the results shown
x FxT (1) d
in fig.1, theoretically, the difference of the sorted fractional
By now, several definitions of the DFrFT have been Fourier frequencies of two signals will be more obvious than
proposed [25-28]. In this paper, the DFrFT proposed by the traditional Fourier frequencies.
Ozaktas et al. [28] is employed for its excellent approximation
to the continuous FrFT. The DFrFT matrix of Ozaktas’ B. Fractional Frequency Histogram and Its Similarity
definition is given by To represent a texture, the fractional frequency histogram
F DΛHlpΛJ (2) of the above 1-D local directional DFrFT is constructed. Let
I(x,y) be a texture image with size of W by H. The fractional
where D and J are matrices representing the decimation and frequency histogram of I(x,y) is given by
interpolation operations, respectively. Λ is a diagonal matrix H(I,l,z) {H(I,l,z),H(I,l,z)...H(I,l,z)} (5)
that corresponds to the chirp multiplication, and H 1 2 D
corresponds to the convolution operation [28]. lp
with H (I,l,z) (zu(x,y,l)) (6)
Unlike the discrete Fourier transform (DFT) which d d
represents a signal in pure frequency domain, the DFrFT of a xy
signal with 0 1is an intermediate status between the spatial where D is the number of directions. δ(.) is the Dirac delta
function. z 0,1,2,...,| N | with |N | being the length of
and the frequency domains. d d
neighborhood sets at direction d and in this paper we
III. LOCAL DIRECTIONAL DFRFT AND THE FRACTIONAL let| N0 || N1 || ND |. u(x, y,l) is the fractional Fourier
FREQUENCY HISTOGRAM d
frequency corresponding to the sorted| I(x, y,m)|.
d
A. 1-D Local Directional DFrFT 2
The similarity of two textures is measured by the χ -statistic
Let I (x, y, n) be the nth directional neighbor of an image ~
d distance [29]. Let I and I be two texture images with the same
I(x,y) with direction d. To eliminate the effects due to ~
illumination, I (x, y, n) is firstly locally centered by using size, and H(I, l, z) and H(I ,l,z)be the fractional frequency
d ~
ˆ (3) histograms of I and I , respectively. The similarity of I and
Id (x, y,n) Id (x, y,n) I(x, y) ~
Define the 1-D local directional DFrFT as I is given by
D |I| |Nd| ~ 2
ˆ (4) 2 ~ (Hd(I,l,z)Hd(I,l,z))
Id (x, y,m) F (m,n)Id(x,y,n) (I,I ) ~ (7)
nNd d1l 0 z0 Hd(I,l,z)Hd(I,l,z)
th α
where Id (x,y,m) is the α order DFrFT, F is the DFrFT matrix where | I |W H .
given in (2), and N is the neighborhood indices in direction d.
d
As shown in fig.1, for each pixel P, there are four directional IV. TEXTURE CLASSIFICATION WITH THE DFRFT
0 0 0 0
neighborhood sets which are along 0 , 45 , 90 and 135 ,
α
respectively. According to the property of the DFrFT, I d (x, y, A. Summary of the Proposed Method
m) can reveal both spatial and frequency characteristics of the Fig.2 summarizes the major stages of the proposed texture
neighborhoods of I(x, y). classification method. The neighbors along direction d of each
pixel of a texture image I are firstly obtained with the similar
template shown in Fig.1, resulting in a 1-D series {I(x,y,
n )|d=0, 1, 2, 3 and n N }. Then, the locally centered
d d d
ˆ
series I(x, y,nd ) are obtained by using (3). Next, the optimal
th
α order 1-D local directional DFrFT is performed according
to (1) and (2), getting a 1-D series I(x, y,m) in the fractional
d
Fourier domain. Next, the fractional Fourier amplitudes
| I(x, y,m) | are sorted in descending order followed by
d
recording the corresponding fractional frequencies
Figure1. Illustration of directional neighborhoods. corresponding to the sorted amplitudes. Next, the fractional
frequency histogram is calculated with (5) and (6). Finally, a
2033
type of classifier can be adopted for completing the where S and S are the scatter matrices of between-class and
b w
classification. Here, since one of our targets is that of within-class, respectively. tr(.) means the trace of a matrix.
evaluating the performance of the proposed DFrFT texture Here, S and S are computed according to the fractional
b w
descriptor, the minimum distance classifier is adopted. frequency histogram obtained by (6) and (8). Please refer to
[20] For details.
B. Representation of a Texture Here, to get the optimal transform order, a coarse-to-fine
Assuming that there are C classes of textures in the training searching procedure which is similar to the algorithm in [30] is
set, the average fractional frequency histogram is employed to adopted. Let Δα0, λ, and ε be three positive numbers less than
th
represent the c class, i.e. 1, and αopt denote the optimal transform order .The coarse-to-
Nc fine searching procedure consists of the following five steps.
1 (c,i) Step1, α = 0, α = 0.995, Δα = Δα ;
H(c,l,z) c H(I ,l, z) (8) begin end 0
N i1 Step 2, Let α = {α , Δα+α , 2Δα+α , …, α };
begin begin begin end
c th
where c = 1,2,…C. N is the training samples in the c class. Step 3, Compute J with (9) for each α;
(c,i) th th
I denotes the i sample in the c class. To classify a texture,
2 Step 4, Select the α which maximizes the J as α ;
we compute the χ -statistic distance between its spectral opt
histogram and the average spectral histogram of each class by Setp5, α = max{0, α – 0.5Δα}, α = min{1, α +
using (8). Then, the image is classified into the class with the begin opt end opt
2 0.5Δα}, Δα= Δα×λ, go to Step 2 until Δα≤ε.
minimum χ -distance. Comparing to [30], the range of α is [0,0.995] rather than
C. Optimal Transform Order Estimation [0,1.0], since we find that when α =1.0, J may achieve the
The optimal DFrFT order α is a critical parameter for the extreme, but the performance of the proposed classification
proposed texture classification algorithm. To estimate this algorithm is probably poor.
parameter, the between-class and the within-class scatter V. E
matrices have been adopted [20]. An α which can maximize (9) XPERIMENTAL RESULTS AND ANALYSIS
is selected as the optimal transform order. A. Experimental Data
J tr(Sb)/tr(Sw) (9) The texture classification method discussed in Section 3
and Section 4 has been implemented in Matlab R2007b. The
experimental dataset are consists of four groups of benchmark
texture images which have been shown in Fig.3, being named
as Group1, Group2, Group3, and Group4, respectively.
Group1 whose images are the same with those in fig.1 of [11]
consists of 16 textures with the size of 128×128. The first four
textures in Group1 were generated by Gaussian random fields,
and the other 12 textures were chosen from Brodatz album. 10
Brodatz textures in Group2 are D4, D9, D19, D21, D24, D28,
D29, D36, D37 and D38 with the size of 320×320. These
textures are the same with fig.11(h) of [31]. Group3 is from
MIT Vision Texture database consisting of Fabric.0009,
Fabric.0016, Fabric.0019, Flowers.0005, Food.0005,
Leaves.0003, Misc.0000, Misc.0002, Sand.0000, and
Stone.0004 with the size of 256×256. The textures in Group3
are the same with fig.11(i) of [31]. Group4 is from the fifth test
suite of Outex texture database, consisting of
Outex_TC_00005, whose 24 textures are canvas001, 002, 003,
005, 006, 009, 011, 021, 022, 023, 025, 026, 031, 032, 033,
035, 038, 039, carpet002, carpet004, carpet005, carpet009,
tile005 and tile006. The size of each texture in Group4 is
320×320.
To generate training and test data, each texture in
Group1~Group4 was divided into a number of patches. A total
of 21 patches of each texture in Group1 were randomly
selected as training data, while the other 28 were used for test
data. The randomly selected half patches of each texture in
Group2, Group3 and Group4 were used for training while the
Figure2. Flowchart of the DFrFT-based texture classification. other half part for testing. All experiments were repeated 10
times and the average performance of the proposed method has
been evaluated.
2034
TABLE I. CLASSIFICATION RATES WITH DIFFERENT METHODS (%)
Texture DFrFT Liu Azencott DFT Singh
Groups
Group1 98.21 99.78 95.1 87.1 71.6
Group2 100 83.1 100 90 100
Group3 85.62 79.1 85.94 81.8 87.2
Group4 88.93 -- 55.86 83.2 51.3
The worst result of the proposed DFrFT method is on
Group3. These textures which is the same with fig.11(i) in[31]
is rather difficult to discriminate. In fact, all the heuristic
methods selected in [31] yield the classification rate of about
10% which is much lower than our method.
Moreover, table I illustrates that from the view of the
uniformity of the results, the proposed DFrFT method is better
than other methods. The mean and the standard deviation of the
DFrFT are 0.932 and 0.07, while they are 0.843 and 0.198 for
Figure3. Textures. (a) Group1, (b) Group2, (c) Group3, and (d) group4. Azencott’s method, 0.855 and 0.0373 for the DFT, and 0.775
and 0.3 for Singh’s method.
B. Optimal Transform Order for Each Group Finally, we compared the computational cost of the
The coarse-to-fine searching shown in section 4.3 is proposed method to three of the above mentioned methods.
adopted to estimate the optimal DFrFT order for each group of The average computational cost for classifying a texture with
textures. For Group1 and Group3, Δα = 0.1, λ = 0.1, and ε the proposed DFrFT, Azencott’s method, the DFT, and Singh’s
0 method are 155.2 milliseconds (ms), 6.2ms, 21.3ms, and
=0.01. And for Group2 and Group4, α = 0.1, λ = 0.1, and ε
0 61.5ms, respectively. Clearly, the proposed method is much
| N || N ||N |8
=0.1. Here, we let 0 1 3 .The optimal slower than other methods, since both calculating the 1-D local
DFrFT orders are 0.84, 0.3, 0.91 and 0.4 for Group1, Group2, DFrFT and establishing the fractional frequency histogram are
Group3, and Group4, respectively. time consuming. At this stage, therefore, our method is not fit
C. Comparison to Other Methods for real time applications where the speed of an algorithm is
crucial. However, its time cost for processing a texture with
We compared the proposed DFrFT-based method with size of 320 by 320 is about 155 ms, meaning 39 textures per
Liu’s spectral histogram method [29], Azencott's windowed second, which may still satisfy most applications that don’t
Fourier transform [11], traditional DFT, and Singh’s FrFT[23]. have a strict requirement on the speed.
Here, | N0 || N1 || N3 |8 , the DFrFT orders are 0.84,
VI. C
0.3, 0.91, and 0.4 for Group1, Group2, Group3, and Group4, ONCLUSIONS
respectively. Here, we let the DFrFT order be equal to 1 to get Based on the fact that the DFrFT of a signal reveals both
the results of the traditional DFT. TableI lists the classification spatial and frequency characteristics, a novel texture
rates. Note that the result of Liu’s method for Group4 is missed classification method has been proposed in this paper. A local
since all results of this method are from [29]. Furthermore, the directional DFrFT is proposed and the fractional frequency
result of Azencott for Group1 is from [11]. histogram is generated from the discrete fractional Fourier
For the textures in Group1, the methods in [11] and [29] frequencies corresponding to the sorted amplitudes of the local
give 22 and 2 misclassified patches, i.e., the classification rates directional DFrFTs of a texture. Furthermore, by using the
of 95.1% and 99.78%. The proposed method gives 8 matrices of the between-class scatter and the within-class
misclassified patches with classification rate of 98.21%. The scatter, as well as the coarse-to-fine searching strategy, the
classification rates of other two methods on Group1 are lower optimal DFrFT order has been estimated. The comparison to
than 90%, especially for Singh’s method whose rate is only the four existing methods validates the proposed DFrFT-based
71.6%. The main reason for the poor performance of Singh’s texture classification method. One problem with the proposed
method is that “it is dependent upon the application”[23]. For method is that of its time cost. For the future work, we need to
the textures in Group2 and in Group3, the proposed method study how to optimize the method to improve its efficiency.
outperforms the DFT and Liu’s method, and is competitive to
R
the other two methods, i.e. Azencott’s method and Singh’s EFERENCES
method. For Group4, the performance of the proposed method [1] R. M. Haralick, "Statistical and structural approaches to texture," in
is much better than the DFT, Singh’s or Azencott’s methods. It Proceedings of the IEEE, 1979, pp. 786-804.
is worth to mention that the best result of Group4 submitting to [2] H. Derin and W. S. Cole, "Segmentation of textured images using Gibbs
the Outex website is 86% [32], which is still lower than the random fields," Computer Vision, Graphics, and Image Processing vol.
result of the proposed DFrFT method. 35, pp. 72-98, 1985.
2035
no reviews yet
Please Login to review.