Top Banner
Cosine similarity metric calculation on low power heterogeneous computing platform Michał Karwatowski 1,2 , Sebastian Koryciak 1,2 , Ernest Jamro 1,2 , Agnieszka Dąbrowska-Boruch 1,2 , Kazimierz Wiatr 1 1 AGH University of Science and Technology, al. Mickiewicza 30, 30-059 Kraków, 2 ACK Cyfronet AGH, ul. Nawojki 11, 30-950 Kraków KUKDM 11-13.03.2015 Zakopane
16

Cosine similarity metric calculation on low power ... fileterm 1 sentence 1 Vector Space Model Term Frequency— Inverse Document Frequency (tf — = Cosine similarity cos 9 = x log

Apr 19, 2019

Download

Documents

buinhi
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Cosine similarity metric calculation on low power ... fileterm 1 sentence 1 Vector Space Model Term Frequency— Inverse Document Frequency (tf — = Cosine similarity cos 9 = x log

Cosine similarity metric calculation

on low power heterogeneous

computing platform

Michał Karwatowski1,2, Sebastian Koryciak1,2,

Ernest Jamro1,2, Agnieszka Dąbrowska-Boruch1,2,

Kazimierz Wiatr1

1AGH University of Science and Technology, al. Mickiewicza 30, 30-059 Kraków,2ACK Cyfronet AGH, ul. Nawojki 11, 30-950 Kraków

KUKDM 11-13.03.2015 Zakopane

Page 2: Cosine similarity metric calculation on low power ... fileterm 1 sentence 1 Vector Space Model Term Frequency— Inverse Document Frequency (tf — = Cosine similarity cos 9 = x log

Agenda

FPGA based hardware accelerated

computing

Text similarity analysis

Search algorithm implementation

Results

Future work

2

Page 3: Cosine similarity metric calculation on low power ... fileterm 1 sentence 1 Vector Space Model Term Frequency— Inverse Document Frequency (tf — = Cosine similarity cos 9 = x log

FPGA based hardware accelerated

computing3

Page 4: Cosine similarity metric calculation on low power ... fileterm 1 sentence 1 Vector Space Model Term Frequency— Inverse Document Frequency (tf — = Cosine similarity cos 9 = x log

Text similarity analysis 4

Page 5: Cosine similarity metric calculation on low power ... fileterm 1 sentence 1 Vector Space Model Term Frequency— Inverse Document Frequency (tf — = Cosine similarity cos 9 = x log

Text comparison 5

Page 6: Cosine similarity metric calculation on low power ... fileterm 1 sentence 1 Vector Space Model Term Frequency— Inverse Document Frequency (tf — = Cosine similarity cos 9 = x log

Hardware 6

Page 7: Cosine similarity metric calculation on low power ... fileterm 1 sentence 1 Vector Space Model Term Frequency— Inverse Document Frequency (tf — = Cosine similarity cos 9 = x log

Zynq 7

Page 8: Cosine similarity metric calculation on low power ... fileterm 1 sentence 1 Vector Space Model Term Frequency— Inverse Document Frequency (tf — = Cosine similarity cos 9 = x log

Hardware architecture 8

Page 9: Cosine similarity metric calculation on low power ... fileterm 1 sentence 1 Vector Space Model Term Frequency— Inverse Document Frequency (tf — = Cosine similarity cos 9 = x log

Compare flow 9

Page 10: Cosine similarity metric calculation on low power ... fileterm 1 sentence 1 Vector Space Model Term Frequency— Inverse Document Frequency (tf — = Cosine similarity cos 9 = x log

Compare system 10

Page 11: Cosine similarity metric calculation on low power ... fileterm 1 sentence 1 Vector Space Model Term Frequency— Inverse Document Frequency (tf — = Cosine similarity cos 9 = x log

Tests

100,000 random documents processed

to vector form

Zynq software solution:

One and two cores

ARM Cortex-A9

667 MHz

Zynq PS + PL solution

8 paralel channels

100 MHz

11

Page 12: Cosine similarity metric calculation on low power ... fileterm 1 sentence 1 Vector Space Model Term Frequency— Inverse Document Frequency (tf — = Cosine similarity cos 9 = x log

Runtime comparison 12

0

5

10

15

20

25

30

35

40

45

50

55

60

0 2 4 6 8 10 12 14 16 18 20 22 24 26 28

Pro

cess

ing

tim

e [s

ec]

Number of processed reference vectors

Single core

Dual core

FPGA

Page 13: Cosine similarity metric calculation on low power ... fileterm 1 sentence 1 Vector Space Model Term Frequency— Inverse Document Frequency (tf — = Cosine similarity cos 9 = x log

Future work

Compression

High performance hardware

Higher level language

13

Page 14: Cosine similarity metric calculation on low power ... fileterm 1 sentence 1 Vector Space Model Term Frequency— Inverse Document Frequency (tf — = Cosine similarity cos 9 = x log

Text comparison 14

Page 15: Cosine similarity metric calculation on low power ... fileterm 1 sentence 1 Vector Space Model Term Frequency— Inverse Document Frequency (tf — = Cosine similarity cos 9 = x log

Cluster 15

Page 16: Cosine similarity metric calculation on low power ... fileterm 1 sentence 1 Vector Space Model Term Frequency— Inverse Document Frequency (tf — = Cosine similarity cos 9 = x log

Questions 16