Skip to content

kang9366/English-Pronunciation-Classification

Folders and files

NameName
Last commit message
Last commit date

Latest commit

ย 

History

14 Commits
ย 
ย 
ย 
ย 

Repository files navigation

2.1 ์‚ฌ์šฉ ๋ฐ์ดํ„ฐ

Untitled

Train ๋ฐ์ดํ„ฐ

  • ์•„ํ”„๋ฆฌ์นด, ํ˜ธ์ฃผ, ์บ๋‚˜๋‹ค, ์˜๊ตญ, ํ™์ฝฉ, ๋ฏธ๊ตญ์˜ ์ด 6๊ฐœ๊ตญ์˜ ํ™”์ž์˜ ๋ฌธ์žฅ ๋…น์Œ (.wavํŒŒ์ผ) ๋ฐ์ดํ„ฐ์…‹

    Untitled

  • ํด๋ž˜์Šค ๋ณ„ 1000๊ฐœ์˜ ๋ฐ์ดํ„ฐ

    : ๋” ๋งŽ์€ ๋ฐ์ดํ„ฐ ์…‹์ด ์žˆ์—ˆ์ง€๋งŒ, ๋ฉ”๋ชจ๋ฆฌ ๋ฌธ์ œ์™€ class imbalance ๋ฌธ์ œ๋ฅผ ํ”ผํ•˜๊ธฐ ์œ„ํ•˜์—ฌ ๊ฐ class๋ณ„ ๋ฐ์ดํ„ฐ๋ฅผ 1000๊ฐœ๋กœ ํ†ต์ผ์‹œ์ผฐ์Šต๋‹ˆ๋‹ค.

  • X_train = 6๊ฐœ๊ตญ์˜ wav ํŒŒ์ผ

  • y_train = 6๊ฐœ๊ตญ์˜ label (์•„ํ”„๋ฆฌ์นด๋ถ€ํ„ฐ ๋ฏธ๊ตญ ์ˆœ์œผ๋กœ 0,1,2,3,4,5,6) : ์ง์ ‘ label

Test ๋ฐ์ดํ„ฐ

: ์ด 6๊ฐœ๊ตญ์˜ ํ™”์ž์˜ ๋ฌธ์žฅ ๋…น์Œ ๋ฐ์ดํ„ฐ ์…‹ ํŒŒ์ผ 1000๊ฐœ (.wav)

2.3 ๋ฐ์ดํ„ฐ ์ „์ฒ˜๋ฆฌ

์ „์ฒด ๊ณผ์ •

  1. librosa ๋ชจ๋“ˆ ์‚ฌ์šฉํ•˜์—ฌ wavํŒŒ์ผ ๋กœ๋“œ

    • wavํŒŒ์ผ์„ train์‹œ ๋ถˆ๋Ÿฌ์˜ค๋Š” ๊ณผ์ •์ด ์˜ค๋ž˜ ๊ฑธ๋ฆฌ๋ฏ€๋กœ, load๊ฐ€ ๋น ๋ฅธ npyํŒŒ์ผ๋กœ train ๋ฐ์ดํ„ฐ๋ฅผ ์ €์žฅํ•˜์˜€์Šต๋‹ˆ๋‹ค.
    • ๋ฉ”๋ชจ๋ฆฌ ๋ฌธ์ œ๋กœ ์ธํ•˜์—ฌ, load์‹œ ํ˜•ํƒœ๋ฅผ float32๋กœ ์ง€์ •ํ•ด์ฃผ์—ˆ์Šต๋‹ˆ๋‹ค.
    • train set์—์„œ ๊ฐ™์€ ์‚ฌ๋žŒ์ด ์—ฌ๋Ÿฌ ๋ฒˆ (ํ‰๊ท  3๋ฒˆ) ๋…น์Œ ํ•œ ๊ฒƒ์„ ํ™•์ธํ•˜๊ณ , sortํ•˜์—ฌ ๊ฐ™์€์‚ฌ๋žŒ์ด ๋…น์Œํ•œ ๊ฒƒ๋“ค์„ ๋ฒˆํ˜ธ๋ฅผ ๋ถ™์—ฌ์„œ ๋ถˆ๋Ÿฌ์™”์Šต๋‹ˆ๋‹ค. ์ถ”ํ›„ ํ•™์Šต ๊ณผ์ •์—์„œ 3๋ฒˆ์”ฉ ๋…น์Œํ•œ ๊ฒƒ์— ๋Œ€ํ•œ index๋ฅผ ์ฒ˜๋ฆฌํ•ด์ฃผ์—ˆ์Šต๋‹ˆ๋‹ค.
  2. Melspectrogram ๋ณ€ํ™˜ ํ›„ , librosa.power_to_db ๋ผ์ด๋ธŒ๋Ÿฌ๋ฆฌ ์‚ฌ์šฉ

  3. ๋ฐ์ดํ„ฐ ๊ฐ’์˜ ๋ฒ”์œ„๋ฅผ ๊ท ์ผํ•˜๊ฒŒ ๋งŒ๋“ค์–ด ์ฃผ๊ธฐ ์œ„ํ•˜์—ฌ scaling์„ ์ ์šฉํ•˜์˜€๋Š”๋ฐ min-max์™€ standardization ๋‘๊ฐ€์ง€ ๋ฐฉ์‹์œผ๋กœ scalingํ•˜์˜€์Šต๋‹ˆ๋‹ค.

  4. ์ „์ฒ˜๋ฆฌ๋ฅผ ๋งˆ์นœ ํŒŒ์ผ์„ npyํŒŒ์ผ๋กœ ์ €์žฅ

    : ๋งˆ์ฐฌ๊ฐ€์ง€๋กœ, ๋ฉ”๋ชจ๋ฆฌ ๋ฌธ์ œ๋กœ ์ธํ•˜์—ฌ ์ •๊ทœํ™”๊นŒ์ง€ ์ง„ํ–‰ํ•œ data๋ฅผ npyํŒŒ์ผ๋กœ ์ €์žฅํ•ด๋†“๊ณ  ๋ถˆ๋Ÿฌ์™€ ์‚ฌ์šฉํ•˜์˜€์Šต๋‹ˆ๋‹ค.

Mel-Spectrogram

  • ์ž…๋ ฅ ์‹ ํ˜ธ(์Œ์„ฑ ํŒŒ์ผ)์„ ์‹œ๊ฐ„ ๋‹จ์œ„๋กœ ์ชผ๊ฐœ์–ด, ๋‹ค์–‘ํ•œ ์ฃผํŒŒ์ˆ˜๋ฅผ ๊ฐ€์ง€๋Š” ์ฃผ๊ธฐํ•จ์ˆ˜๋กœ ๋ถ„ํ•ดํ•˜๊ณ , ์‚ฌ๋žŒ์ด ๋” ์˜ˆ๋ฏผํ•˜๊ฒŒ ์ธ์‹ํ•˜๋Š” ์ €์ฃผํŒŒ ๋ถ€๋ถ„์˜ ํ•ด์ƒ๋ ฅ์„ ๋†’์ธ mel scale๋กœ ๋ณ€ํ™˜ํ•ด ์ฃผ๋Š” ๊ณผ์ •์ž…๋‹ˆ๋‹ค.

๋ถˆ๋Ÿฌ์˜จ wav๋ฅผ librosa.load๋ฅผ ํ†ตํ•ด ๋ถˆ๋Ÿฌ์˜ค๋ฉด ์œ„์˜ ๊ฒฐ๊ณผ์™€ ๊ฐ™์ด sampling rate(sr) ๋งŒํผ์˜ float ๊ฐ’์„ ๊ฐ€์ง€๊ฒŒ ๋ฉ๋‹ˆ๋‹ค

๋ถˆ๋Ÿฌ์˜จ wav๋ฅผ librosa.load๋ฅผ ํ†ตํ•ด ๋ถˆ๋Ÿฌ์˜ค๋ฉด ์œ„์˜ ๊ฒฐ๊ณผ์™€ ๊ฐ™์ด sampling rate(sr) ๋งŒํผ์˜ float ๊ฐ’์„ ๊ฐ€์ง€๊ฒŒ ๋ฉ๋‹ˆ๋‹ค

Mel spectrogram ๋ณ€ํ™˜ ๊ฒฐ๊ณผ(log scale)

Mel spectrogram ๋ณ€ํ™˜ ๊ฒฐ๊ณผ(log scale)

  • Arguments
    • sr(sampling rate) : ์ดˆ๋‹น sample์˜ ๊ฐœ์ˆ˜. ๋ฐ์ดํ„ฐ์…‹ wavํŒŒ์ผ์˜ ๊ฒฝ์šฐ์—” 16000
    • n_fft(=win_length) : ์Œ์„ฑ์„ ์–ผ๋งˆ๋งŒํผ์˜ ๊ธธ์ด๋กœ ์ž๋ฅผ ๊ฒƒ์ธ์ง€
    • hop_length : ์Œ์„ฑ์˜ magnitude๋ฅผ ์–ผ๋งŒํผ ๊ฒน์นœ ์ƒํƒœ๋กœ ์ž˜๋ผ์„œ ๋ณด์—ฌ์ค„ ๊ฒƒ์ธ์ง€
    • n_mels : mel scale์„ ๋งŒ๋“ค๊ธฐ ์œ„ํ•ด ์ ์šฉํ•˜๋Š” mel filter ์˜ ๊ฐœ์ˆ˜

librosa.power_to_db

๋งŒ๋“ค์–ด์ง„ mel spectrogram์ด power scale์ผ ๊ฒฝ์šฐ ๋ณ€ํ™”๋ฅผ log scale๋กœ ์ธ์‹ํ•  ์ˆ˜ ์žˆ๋„๋ก log ๋ณ€ํ™˜์„ ํ•ด์ฃผ๋Š” ๊ณผ์ •์ž…๋‹ˆ๋‹ค.

Data Augmentation : Random Eraser & Imagegenerator

์ด๋ฏธ์ง€๋ฅผ shiftํ•˜๊ณ , randomํ•˜๊ฒŒ ์ด๋ฏธ์ง€์˜ ์ผ๋ถ€๋ฅผ ์ง€์šฐ๋Š” ๊ณผ์ •์„ ํ†ตํ•ด ๋ฐ์ดํ„ฐ๋ฅผ ์ฆ๊ฐ•ํ•˜์˜€์Šต๋‹ˆ๋‹ค.

Untitled

3. ํ•™์Šต ๋ชจ๋ธ

๋ชจ๋ธ์€ 1๊ฐœ์˜ input layer, 4๊ฐœ์˜ hidden layer, 1๊ฐœ์˜ out layer๋กœ ์ด 7๊ฐœ๋กœ ๊ตฌ์„ฑํ•˜์˜€์Šต๋‹ˆ๋‹ค.

data๊ฐ€ numpy ํ–‰๋ ฌ์ด๋ฏ€๋กœ input์˜ channel์€ 1์ด๊ณ  ํฌ๊ธฐ๋Š” 64*501์ด๊ธฐ ๋•Œ๋ฌธ์— input layer์—์„œ input shape๋กœ ์ด ๊ฐ’์„ ์„ค์ •ํ•ด์ฃผ์—ˆ์Šต๋‹ˆ๋‹ค.

๊ทธ๋ฆฌ๊ณ  ํ•ฉ์„ฑ๊ณฑ ์—ฐ์‚ฐ์„ ํ•œ ๋’ค ํ™œ์„ฑํ™”ํ•จ์ˆ˜๋กœ๋Š” ReLU๋ฅผ ์‚ฌ์šฉํ•˜์˜€๊ณ  batch normalization์„ ํ†ตํ•ด weight๋ฅผ ์„ค์ •ํ•ด์ฃผ์—ˆ์Šต๋‹ˆ๋‹ค. ์ด ๊ณผ์ •์„ ๋ฐ˜๋ณตํ•œ๋’ค Average pooling์„ ์ ์šฉํ•˜๋Š” ๋ฐฉ์‹์œผ๋กœ hidden layer๋ฅผ ๊ตฌ์„ฑํ•˜์˜€์Šต๋‹ˆ๋‹ค.

output layer๋Š” ๋งˆ์ง€๋ง‰์œผ๋กœ output์„ ์ถœ๋ ฅํ•˜๋Š” ์ธต์—์„œ๋Š” softmax ํ•จ์ˆ˜๋ฅผ ์‚ฌ์šฉํ•˜์—ฌ ๊ฐ๊ฐ์˜ class์— ์†ํ•  ํ™•๋ฅ ์„ ๋‚˜ํƒ€๋‚ด์ฃผ๊ณ ๊ณ  class๊ฐ€ 6๊ฐœ์ด๊ธฐ ๋•Œ๋ฌธ์— unit์„ 6์œผ๋กœ ์„ค์ •ํ•˜์˜€์Šต๋‹ˆ๋‹ค.

์•„๋ž˜ ์‚ฌ์ง„์€ SVG ๋ผ์ด๋ธŒ๋Ÿฌ๋ฆฌ๋ฅผ ์ด์šฉํ•˜์—ฌ ๋ชจ๋ธ๊ตฌ์กฐ๋ฅผ ์‹œ๊ฐํ™”ํ•œ ๊ฒฐ๊ณผ์ž…๋‹ˆ๋‹ค.

dotres (2).png

  • model.summary()

Untitled

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 2

  •  
  •