Top Banner
Tabla Gyan Realtime tabla recognition and resynthesis Parag Chordia (GTCMT) Alex Rae (GTCMT)
22

Overview What : Stroke type Transformation: Timbre Rhythm When: Stroke timing Resynthesis.

Dec 17, 2015

Download

Documents

Judith Goodman
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Overview What : Stroke type Transformation: Timbre Rhythm When: Stroke timing Resynthesis.

Tabla GyanRealtime tabla recognition and resynthesis

Parag Chordia (GTCMT)Alex Rae (GTCMT)

Page 2: Overview What : Stroke type Transformation: Timbre Rhythm When: Stroke timing Resynthesis.

Overview

What :Stroke type

Transformation:TimbreRhythm

When:Stroke timing

Resynthesis

Page 3: Overview What : Stroke type Transformation: Timbre Rhythm When: Stroke timing Resynthesis.

Video Demo

Page 4: Overview What : Stroke type Transformation: Timbre Rhythm When: Stroke timing Resynthesis.

The Drum

• Dayan – treble drum

• Bayan – bass drum

Page 5: Overview What : Stroke type Transformation: Timbre Rhythm When: Stroke timing Resynthesis.

Tabla Language

Page 6: Overview What : Stroke type Transformation: Timbre Rhythm When: Stroke timing Resynthesis.

Recognition Architecture

Onset detection

Statistical ModelSVM

BayesianNeural Net

Training data

ke

tun

dhe

gedha

te

Input music

Stroke Label

Rhythm

Page 7: Overview What : Stroke type Transformation: Timbre Rhythm When: Stroke timing Resynthesis.

Build Model: Training Data

• Several Datasets• Professional

musician• Home recording

• Audio recordings manually edited and labeled

Page 8: Overview What : Stroke type Transformation: Timbre Rhythm When: Stroke timing Resynthesis.

Build Model: Target Mapping

• Standardize idiosyncratic traditional naming conventions

• Map timbrally similar (or identical) strokes to the same category

Page 9: Overview What : Stroke type Transformation: Timbre Rhythm When: Stroke timing Resynthesis.

Build Model: Feature Extraction

Spectral Features• MFCCs (24)• Centroid• Variance• Skewness• Kurtosis• Slope• Roll-off

VarianceF1F2F3...

Fn

Spectral centroid

Kurtosis

Feature Vector

Page 10: Overview What : Stroke type Transformation: Timbre Rhythm When: Stroke timing Resynthesis.

Build Model: Trained Model

• WEKA machine learning package• Support Vector Machine• Models trained on different datasets can be

saved for future use

Page 11: Overview What : Stroke type Transformation: Timbre Rhythm When: Stroke timing Resynthesis.

Audio: Input

• Live audio is taken from a close-mic’d tabla

• Stereo signal provides partial separation of drums

Page 12: Overview What : Stroke type Transformation: Timbre Rhythm When: Stroke timing Resynthesis.

Audio: Segmentation

• Onset detection done in Max using bonk~• More recent parallel project uses spectral flux

algorithm in Java• End of stroke marked by next onset (1 sec

buffer size)• Onset times stored

Page 13: Overview What : Stroke type Transformation: Timbre Rhythm When: Stroke timing Resynthesis.

Audio: Feature Extraction

VarianceF1F2F3...

Fn

Spectral centroid

Kurtosis

Feature Vector

Page 14: Overview What : Stroke type Transformation: Timbre Rhythm When: Stroke timing Resynthesis.

Output: Classification

• Feature vector is fed to previously trained model

• Single category label returned

SVM labelfeature vector

Page 15: Overview What : Stroke type Transformation: Timbre Rhythm When: Stroke timing Resynthesis.

Output: Symbolic Score

• Stroke label combined with timing and amplitude information

• Score stored in temporary buffer in Max patch

.3204 .9665 2

.3527 .5715 6

.3031 .3648 6

.3325 .9827 6

.2970 .4762 2

.3865 .5928 1

.3496 .6603 8

.7046 .4621 1

.3144 .5024 6

.7152 .2990 6

.3387 .8891 2

.2902 .7342 6

.3868 .9051 7

.3049 .5727 1

Page 16: Overview What : Stroke type Transformation: Timbre Rhythm When: Stroke timing Resynthesis.

Output: Timbre Remapping

Stroke labels can be flexibly remapped

Page 17: Overview What : Stroke type Transformation: Timbre Rhythm When: Stroke timing Resynthesis.

Output: Conditional Repetition

Page 18: Overview What : Stroke type Transformation: Timbre Rhythm When: Stroke timing Resynthesis.

Output: User Interface

Page 19: Overview What : Stroke type Transformation: Timbre Rhythm When: Stroke timing Resynthesis.

Dangum

Page 20: Overview What : Stroke type Transformation: Timbre Rhythm When: Stroke timing Resynthesis.

Future Directions

• Beat tracking• Modeling specific types of improvisational

forms (e.g. qaida, tihai …)• Automate transformations• Improve interface so it can be “played”• Tracking of expressive parameters (e.g. bayan

pitch modulation)

Page 21: Overview What : Stroke type Transformation: Timbre Rhythm When: Stroke timing Resynthesis.

Conclusions

• Shown a realtime tabla interaction system• Implemented as Max java external using

machine learning to identify strokes• Supports flexible transformations• Foundation for more general improvisation

system

Page 22: Overview What : Stroke type Transformation: Timbre Rhythm When: Stroke timing Resynthesis.