Top Banner
Temple University QUALITY ASSESSMENT OF SEARCH TERMS IN SPOKEN TERM DETECTION Amir Harati and Joseph Picone, PhD Department of Electrical and Computer Engineering Temple University URL:
5

Temple University QUALITY ASSESSMENT OF SEARCH TERMS IN SPOKEN TERM DETECTION Amir Harati and Joseph Picone, PhD Department of Electrical and Computer.

Jan 01, 2016

Download

Documents

Charlotte Greer
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Temple University QUALITY ASSESSMENT OF SEARCH TERMS IN SPOKEN TERM DETECTION Amir Harati and Joseph Picone, PhD Department of Electrical and Computer.

Temple University

QUALITY ASSESSMENT OF SEARCH TERMSIN SPOKEN TERM DETECTION

Amir Harati and Joseph Picone, PhDDepartment of Electrical and Computer Engineering

Temple University

URL:

Page 2: Temple University QUALITY ASSESSMENT OF SEARCH TERMS IN SPOKEN TERM DETECTION Amir Harati and Joseph Picone, PhD Department of Electrical and Computer.

Abstract

• Spoken term detection is an extension of text-based searching that allows users to type keywords and search audio files containing spoken language for their existence.

• Performance is dependent on many external factors such as the acoustic channel, language and the confusability of the search term.

• Unlike text-based searches, the quality of the search term plays a significant role in the overall perception of the usability of the system.

• In this presentation we will review conventional approaches to keyword search.

• Goal: Develop a tool similar to the way password checking tools currently work.

• Approach: develop models that predict the quality of a search term based on its spelling (and underlying phonetic context).

Page 4: Temple University QUALITY ASSESSMENT OF SEARCH TERMS IN SPOKEN TERM DETECTION Amir Harati and Joseph Picone, PhD Department of Electrical and Computer.

Methods

Acoustic distance algorithm. Phonetic distance algorithm. Feature based algorithm.

Page 5: Temple University QUALITY ASSESSMENT OF SEARCH TERMS IN SPOKEN TERM DETECTION Amir Harati and Joseph Picone, PhD Department of Electrical and Computer.

Feature based methods

Using different pattern recognition methods. Different features and different feature selection

methods.

Example results:Results for Neural Network

Machine feature set MSE(train) R(train) R2(train) MSE(eval) R(eval) R2(eval)1 All 0.014 0.724 0.526 0.017 0.624 0.3892 set1 0.012 0.753 0.568 0.015 0.692 0.4793 set2 0.013 0.735 0.541 0.015 0.686 0.474 set3 0.015 0.697 0.486 0.015 0.691 0.4775 set4 0.015 0.697 0.486 0.015 0.689 0.4756 set5 0.016 0.674 0.455 0.016 0.669 0.4487 set6 0.016 0.674 0.455 0.016 0.669 0.4488 set7 0.013 0.734 0.54 0.016 0.675 0.4559* All 0.017 0.742 0.551 0.016 0.67 0.4510* set1 0.016 0.752 0.566 0.014 0.705 0.497