Zhe Chen, Chenxing Wu, Maxin Chen EECS 352 - Machine Perception of Music and Audio, Northwestern University Spectrogram Frame number Frequency (Hz) 50 100 150 200 250 300 350 0 100 200 300 400 500 600 700 800 900 1000 Spectrogram Frequency (Hz) Frame number (a) Spectrogram of speech 50 100 150 200 250 300 350 0 100 200 300 400 500 600 700 800 900 1000 (b) Spectrogram of melody Spectrogram Frequency (Hz) Frame number Spectrogram Frame number Frequency (Hz) 50 100 150 200 250 300 350 0 100 200 300 400 500 600 700 800 900 1000 Spectrogram Frequency (Hz) Frame number (c) Spectrogram of singing-speech Figure 2 Spectrogram comparison of audio files in speech-to-sing function Figure 3 User Interface Frequency (Hz) Frame number 0 100 200 300 400 500 600 700 800 900 0 50 100 150 200 250 300 350 400 450 500 Figure 1 Pitches before and after auto-tuning Before auto-tuning After auto-tuning Melody import Output with 3 function choices Recording Input and display Audio input (Recording) Onset detection Extract syllables Adjust each syllables to determined beats Extract notes of the melody Matching syllables and notes, modifying pitch Modified speech output Pitch detection Modifying to nearest whole tone Tuned singing output Melody input Choose Auto-tuning Speech-to- rap Speech-to- sing Choose Speech modification