Top Banner
System for Comparing Images Allison Deal April 27, 2004
24
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: System for Comparing Images Allison Deal April 27, 2004.

System for Comparing Images

Allison DealApril 27, 2004

Page 2: System for Comparing Images Allison Deal April 27, 2004.

Problem The human eye can determine

similarities among images easily, but they cannot do this for large numbers of pictures.

Humans can miss subtle patterns among images.

It can be hard to find trends among images without looking through most or all of them.

Page 3: System for Comparing Images Allison Deal April 27, 2004.

Specific Problem Analyze Chinese art images from

three different time periods and 23 different artists.

Find similarities among artists paintings, color choices, and styles.

Uncover any hidden structures in the paintings, that only the computer can find.

Page 4: System for Comparing Images Allison Deal April 27, 2004.

Goal

Design a system that uses automated computations to compare images.

This system can be put to use by people interested in learning about artists’ painting styles.

Page 5: System for Comparing Images Allison Deal April 27, 2004.

Overview

Create models for each picture. Compute the distances between

all pairs of pictures. Use Multi-Dimensional Scaling for visualization purposes.

Analyze the data.

Page 6: System for Comparing Images Allison Deal April 27, 2004.

Mixture Models for Colors

Use K-means to determine the main colors of each image. Algorithm converges once distances

are within a certain small threshold. (1.0e-4)

Each pixel of the image is assigned to a specific cluster based on its color similarity.

Page 7: System for Comparing Images Allison Deal April 27, 2004.

Mixture Models for Colors Determine the background color and

remove it from the analysis because it does not provide useful information.

Use K-means results to compute mean and covariance vectors and weights for each cluster of each picture.

f(x) = N i=1 ai (x | i , i)

Page 8: System for Comparing Images Allison Deal April 27, 2004.

Mixture Models for Colors

The EM algorithm would have been more accurate in this context because of its accuracy, but K-means is faster for computational purposes.

Page 9: System for Comparing Images Allison Deal April 27, 2004.

Mixture Models for Colors

Gao Fenghan 4 clusters 0.097

[54.17 52.74 51.56] 0.474

[159.39 149.95 137.06] 0.263

[129.92 122.64 112.89] 0.166

[98.30 92.53 85.74]

Page 10: System for Comparing Images Allison Deal April 27, 2004.

Mixture Models for Colors

Chen Hongshou 2 clusters 0.295

[106.37 82.09 45.32] 0.705

[138.05 109.24 59.16]

Page 11: System for Comparing Images Allison Deal April 27, 2004.

Mixture Models for Colors

Zhang Daqian 5 clusters 0.065 [85.39 79.46 54.42] 0.290 [165.58 139.72

78.82] 0.247 [140.44 144.21

103.25] 0.176 [115.87 119.18

81.35] 0.222 [181.11 164.36

120.64]

Page 12: System for Comparing Images Allison Deal April 27, 2004.

Distance Calculations Kullback-Leibler distance

measures difference between two probability distributions (f(x) & g(x))

Difficult to calculate because of integration.

D(f ||g) = f(x) * log [ f(x)/g(x) ] dx

Page 13: System for Comparing Images Allison Deal April 27, 2004.

Distance Calculations Use Monte-Carlo Method instead

generate random samples from a distribution defined by f(x), g(x)

Use altered calculation of Kullback-Leibler Distance.

D(f ||g) 1/n * log [ f(xj)/ g(xj) ]

Page 14: System for Comparing Images Allison Deal April 27, 2004.

Distance Calculations This distance is not symmetric, so we

calculated D(f || g) and D(g || f) and took the average.

½ { D(f ||g) + D(g||f) }

By doing this for each picture, we can see where similarities exist and gain more information about the artists.

Page 15: System for Comparing Images Allison Deal April 27, 2004.

Multi-Dimensional Scaling

To visualize the result, we can use Multi-Dimensional Scaling and embed the information into a vector space.

Dr. Li is working more on this part of our project.

Page 16: System for Comparing Images Allison Deal April 27, 2004.

Results 1177 images from 23 artists and 4

different dynasties Around 50 images per artist

Focus on 10 artists

Dimensions of images: (100 to 150 pixels) x (around 200)

Page 17: System for Comparing Images Allison Deal April 27, 2004.

Results(1) Jin Nong: two-16, three-15, four-11, five-3(2) Shen Zhou: three-20, four-18, five-8(3) Gao Fenghan: two-2, three-8, four-23, five-10, six-4(4) Chen Hongshou:one-1, two-17, three-20, four-6, five-2(5) Qi Baishi: two-2, three-19, four-14, five-15, six-10, seven-2, eight-2(6) Zhang Daqian: two-1, three-4, four-16, five-39, six-24, seven-11,

eight-1(7) Chou Ying: two-4, three-10, four-7, five-5, six-5, seven-2, eight-1,

nine-4, ten-2(8) Dong Qichang: two-4, three-23, four-15, five-3, six-1(9) Huang Shen: two-5, three-14, four-11, five-6(10) Yun Shouping: two-1, three-10, four-19, five-6, six-7, eight-1, ten-2

(# of clusters - number of pictures with this many clusters)

Page 18: System for Comparing Images Allison Deal April 27, 2004.

Results

Ming Dynasty all tended to not use many colors

(often used ink)

(2) Shen Zhou: three-20, four-18, five-8(4) Chen Hongshou:one-1, two-17, three-20, four-6,

five-2(8) Dong Qichang: two-4, three-23, four-15, five-3,

six-1

Page 19: System for Comparing Images Allison Deal April 27, 2004.

Results Qing Dynasty

artists began to use more colors

(1) Jin Nong: two-16, three-15, four-11, five-3(3) Gao Fenghan: two-2, three-8, four-23, five-10,

six-4 (7) Chou Ying: two-4, three-10, four-7, five-5, six-

5, seven-2, eight-1, nine-4, ten-2(9) Huang Shen: two-5, three-14, four-11, five-6(10) Yun Shouping: two-1, three-10, four-19, five-

6, six-7, eight-1, ten-2

Page 20: System for Comparing Images Allison Deal April 27, 2004.

Results

After the Qing Dynasty more variation and more colors being

used

(5) Qi Baishi: two-2, three-19, four-14, five-15, six-10, seven-2, eight-2

(6) Zhang Daqian: two-1, three-4, four-16, five-39, six-24, seven-11, eight-1

Page 21: System for Comparing Images Allison Deal April 27, 2004.

Results

Artist Number Mean Variation (1) Jin Nong 45 3.000 0.953 (2) Shen Zhou 46 3.739 0.743 (3) Gao Fenghan 47 4.128 0.947 (4) Chen Hongshou 46 2.804 0.859 (5) Qi Baishi 64 4.406 1.388 (6) Zhang Daqian 96 5.229 1.090 (7) Chou Ying 40 4.975 2.370 (8) Dong Qichang 46 3.435 0.834 (9) Huang Shen 36 3.500 0.941 (10) Yun Shouping 45 4.522 1.656 Number= number of pictures, Mean=mean number of clusters,

Variation= variation in number of clusters

Page 22: System for Comparing Images Allison Deal April 27, 2004.

Results Zhang Daqian used the most colors, but

his variation was not that great compared to other artists.

Chou Ying had the greatest amount of variation and the second highest mean.

Many of the other artists tended not to vary much from painting to painting; they have a style and stick with it.

Page 23: System for Comparing Images Allison Deal April 27, 2004.

Summary

We can learn about artists’ styles without looking through all of their paintings by using automated computations.

Page 24: System for Comparing Images Allison Deal April 27, 2004.

Future Work

Instead of using the RGB colorspace, it might be better to use an LUV colorspace.

The LUV colorspace is easier for the human eye to perceive.