Custom TTS using multi-speaker-tacotron(nanheekim)
2 / 20
[ Existing TTS services ]
3 / 20
[ Jupyter environment ]
Virtual environment using Anaconda3
[ tacotron 覈語  れ れ ]
4 / 20
Audio -> split the audio at silent intervals -> text via the STT API -> alignment -> .npz dataset
[ Dataset  螻殊 ]
[ alignment.json  企 覈 ]
[ STT API in the Google Cloud SDK ]
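The dataset flow above ends in an alignment file that pairs each audio clip with its transcript. A minimal sketch of building such a file follows, assuming a flat JSON object that maps clip path to text (the layout believed to be used by the carpedm20 multi-speaker-tacotron-tensorflow repository referenced in these slides). `build_alignment`, `save_alignment`, and `fake_transcribe` are hypothetical names, and the transcription function is a stub standing in for a real STT request.

```python
import json
from pathlib import Path
from typing import Callable, Dict, Iterable


def build_alignment(clips: Iterable[Path],
                    transcribe: Callable[[Path], str]) -> Dict[str, str]:
    """Map each clip path to its transcript, skipping clips with empty text."""
    alignment: Dict[str, str] = {}
    for clip in sorted(clips):
        text = transcribe(clip).strip()
        if text:  # drop clips the recognizer could not transcribe
            alignment[clip.as_posix()] = text
    return alignment


def save_alignment(alignment: Dict[str, str], out_path: Path) -> None:
    """Write the mapping as JSON; ensure_ascii=False keeps Korean text readable."""
    out_path.write_text(
        json.dumps(alignment, ensure_ascii=False, indent=2),
        encoding="utf-8",
    )


def fake_transcribe(clip: Path) -> str:
    """Hypothetical stand-in for a real speech-to-text call."""
    canned = {"0001.wav": "안녕하세요", "0002.wav": "  "}
    return canned.get(clip.name, "")
```

Passing the transcriber as a function keeps the file-building step independent of whichever STT client is actually used, so recognizer output can be corrected by hand before the file is saved.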
5 / 20
[ 碁 + 貊殊  蟆郁骸 loss graph ]
[  + 蟾語  蟆郁骸 loss graph ]
[ 碁 + 貊殊  蟆郁骸 sample]
[  + 蟾語  蟆郁骸 sample ]
6 / 20
[ 碁 るる + 蟲 貊殊 + 伎狩 + 蟾  蟆郁骸 loss graph ]
[ 蟆郁骸 sample]
碁 るる, 蟲 貊殊
伎狩, 蟾
7 / 20
[ IP れ 覦  れ ]
app.py
main.js
[ Flask 轟  UI ]
Index.html
app.py
web
    audio
        son+hozzi-trainingdate
            synthesizecode1.wav
            synthesizecode1.png
        yuinna+kss-trainingdate
            synthesizecode2.wav
            synthesizecode2.png
    static
        css
            main.css
        js
            main.js
            siriwave.js
    templete
        Index.html
[ Flask Web 蟲譟磯 ]
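The folder listing above stores each synthesis result as a .wav/.png pair inside a per-model folder under web/audio. A minimal sketch of how a request handler might resolve those paths is given here. `AUDIO_ROOT`, `result_paths`, and the `synth_id` parameter are hypothetical names introduced for illustration, and the validation rule is an assumption rather than something the slides describe.

```python
from pathlib import Path
from typing import Dict

# Folder names taken from the listing; the constant itself is an assumption.
AUDIO_ROOT = Path("web") / "audio"


def _check_name(name: str) -> None:
    """Reject empty names and names that could escape the audio folder."""
    if not name or ".." in name or "/" in name or "\\" in name:
        raise ValueError(f"unsafe path component: {name!r}")


def result_paths(model_dir: str, synth_id: str) -> Dict[str, Path]:
    """Return the .wav and .png paths for one synthesis result."""
    _check_name(model_dir)
    _check_name(synth_id)
    folder = AUDIO_ROOT / model_dir
    return {
        "wav": folder / f"{synth_id}.wav",
        "png": folder / f"{synth_id}.png",
    }
```

For example, `result_paths("son+hozzi-trainingdate", "synthesizecode1")` points at the first pair of files in the listing.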
8 / 20
[ UI 豌 覈 ]
[ ろ  ]
[ ロ ろ碁   覦  豢 ]
9 / 20
1. 曙^蟇1:  蟆
旧 伎 蠏碁 豺企 GPU 焔レ 覲企 譬 蟆曙 . Smart-Lab 覯襯 伎  螳 豢貅磯.
2. 曙^蟇2 :  一危 讌 覦 ろ 手骸 襷れ広 
覲企  讌  蟆郁骸襯 豢ロ蠍 伎 一朱 旧 伎  殊  覈襴 語 ′ れ
 殊伎 . 企 襯 豕 蠍  豺覓 蟲螳朱  燕殊 覿襴 ロ. 企蟆 ル
燕殊 蠍磯朱 ろ 手骸 襷れ広, 蟲蠍 SST API襦 焔 ろ 殊 覯渚蟆 覯讌  譟郁 襴
蟆曙郁 譟伎. 企ゼ 願屋蠍  覈 殊 燕殊 誤蟇磯 ろ 殊   豌襴  牛 
一危一 讌   .
3. 曙^蟇3:   螻殊
Custom TTS 蟲螻  れ 蟆曙 覈  伎 螳螳 牛 蟆  覯 旧 襴 蟆
蟆郁骸  覈 譬. 讌襷 4覈  覈 るゴ螻  るゴ蠍 覓語 煙 轟煙 觜訣 朱Μ 覈 旧
讌 覲企 譬 蟆郁骸襯 詞  .
4. 曙^蟇4:  危襴貅伎
れ螳朱 ろ語 燕殊 譯手 覦蠍  UI螳 .   螳覦蟆曙 jupyter襦, 轟 螳覦  
伎願鍵 覓語  觚殊一 伎襯 襷れ 企ゼ 願屋. 蠍一ヾ flask 蠍磯朱   伎襯 .
Project title: Custom TTS using Deep Learning
12 / 8
[   蠍一  覦  襦 ]
https://wowtale.net/2019/11/15/naver-clova-premium-voice/
2) 螳覦 蠍郁
  蠍一
貉 
3
1) 燕 覈 : Multi-Speaker-Tacotron
Baidu Deep Voice
Encoder
Decoder
Attention
Vocoder
Google Tacotron
螳覦 蟆 覦
 蟆 蟲豢
4
 
5~10
 伎 蟲豢
覦 觜ろ
11
https://github.com/carpedm20/multi-speaker-tacotron-tensorflow
Google Tacotron
13 / 8
1)  蟆(Smart-lab) 2) 一危一 讌
1.  ( 伎る8) : 43700譴(11螳)
2. 碁 (碁 るる) : 3670譴(5螳)
3. 貊殊 (蟲 貊殊) : 12800譴(3螳)
4. 伎狩, 蟾 (讌 轟): 2930譴(3螳)
5. 蟾語 (讌 轟): 550譴(1螳)
[ 語]
[ ]
[蠍磯蓋 螳覦 蟆]
[ 蟆]
*1覯  Google SST API襯 伎 襷  一危一
 襷  
*4覯 伎狩, 蟾 讌 轟 覦 一危一  蠍一
一危一  讌襷 襷 螳 襷殊   讌
詞  
14 / 8
一危一 
Deep Voice Tacotron朱 
(200,000 ~ 500,000 Step)
 譟壱 
給 覈 
燕 燕 豢
 譯狩  讌 豢
燕 蟆郁骸 
碁  貊殊 蟾語 蟾 伎狩
一危一 
  覲
 殊 覓旧 蟲螳朱 襯願鍵
煙 襷 ろ (.json )
 ろ 蠍語伎 襷 ろ 覲
覦襯伎   覦 ろ 蟇
 螻殊
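One step in the preprocessing flow above is cutting long recordings at silent intervals. The sketch below shows the idea on a plain list of integer samples. It is not the tool used in the slides; `split_on_silence`, the amplitude threshold, and the minimum silence length are all illustrative assumptions.

```python
from typing import List, Sequence, Tuple


def split_on_silence(samples: Sequence[int],
                     threshold: int,
                     min_silence: int) -> List[Tuple[int, int]]:
    """Return (start, end) index ranges of non-silent segments, end exclusive.

    A run of at least `min_silence` consecutive samples whose absolute
    value is below `threshold` is treated as a cut point.
    """
    segments: List[Tuple[int, int]] = []
    start = None  # start index of the segment currently being collected
    quiet = 0     # length of the current run of quiet samples
    for i, sample in enumerate(samples):
        if abs(sample) >= threshold:
            if start is None:
                start = i
            quiet = 0
        elif start is not None:
            quiet += 1
            if quiet >= min_silence:
                # Close the segment just before the quiet run began.
                segments.append((start, i - quiet + 1))
                start = None
                quiet = 0
    if start is not None:
        # Close a segment still open at the end, trimming trailing quiet samples.
        segments.append((start, len(samples) - quiet))
    return segments
```

Pauses shorter than `min_silence` stay inside a segment, which keeps brief gaps within a sentence from splitting it into fragments.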
15 / 8
旧  豕譬   (.npz )
 觜 蟲
 煙  ろ碁ゼ 蠍語伎 襷蟆 譟一
觜 
蠍磯蓋 / ク讌郁鍵 / 覈貊 / 觚襴 / 殊
Pydub朱   手骸 覦郁化螻 
殊
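The slide above names Pydub for combining audio. Assuming this refers to laying synthesized speech over a background track, the sketch below shows the underlying arithmetic on 16-bit samples without the Pydub dependency: the background is looped to cover the voice, attenuated by a gain in decibels, summed, and clipped. `mix`, `db_to_ratio`, and the default gain of -12 dB are assumptions made for illustration.

```python
from typing import List, Sequence

INT16_MIN, INT16_MAX = -32768, 32767


def db_to_ratio(gain_db: float) -> float:
    """Convert a gain in decibels to a linear amplitude ratio."""
    return 10 ** (gain_db / 20)


def mix(voice: Sequence[int],
        background: Sequence[int],
        background_gain_db: float = -12.0) -> List[int]:
    """Overlay a background track on a voice track, sample by sample.

    The background loops so that it covers the whole voice track, and the
    result is clipped to the signed 16-bit range.
    """
    if not background:
        return list(voice)
    ratio = db_to_ratio(background_gain_db)
    mixed: List[int] = []
    for i, v in enumerate(voice):
        b = background[i % len(background)] * ratio
        mixed.append(max(INT16_MIN, min(INT16_MAX, int(round(v + b)))))
    return mixed
```

Lowering `background_gain_db` makes the background quieter relative to the voice, which is one way to balance two tracks recorded at different volumes.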
[ 伎 蟲]
HOME(覃誤伎)
BASIC(蠍磯蓋 )
LETTER(ク讌 郁鍵)
ALARM()
ABOUT(螳/磯/れ企)
覈貊
殊觚襴
 蠍 譟一
焔  豢 覦 讌 蟆暑 
16 / 8
17 / 8
覃 伎 蠍磯蓋  伎 ク讌 郁鍵 伎  螳 伎
螳/磯/れ企覈貊 伎 殊 觚襴 伎 殊 伎
1. 曙^蟇1 : 一危一  螻殊
 一危一  襷襦  手骸 ろ碁ゼ 襷れ広蠍   朱 蠍一 螳 覓 襷 . 企 
襯 蠍  蟲蠍 STT API襯 伎 一危一 燕 螳 豢貅一. 蟲 貊殊る ろ  一危
伎襷 ろ 殊  襷讌 蠍 覓語 伎 貊襯 燕伎 螻  一危一朱 覦蠑語伎 . 企蟆
API 伎 貊襯 牛伎 一危一  螳 豢.
2. 曙^蟇3 :   螻殊
蟲螻  れ 蟆曙   伎襷 牛 蟆  れ 襯 覯 旧 襴 蟆 蟆郁骸  覈
譬. 讌襷 5覈  覈 るゴ螻  るゴ蠍 覓語 煙 轟煙 觜訣 朱Μ 覈 旧 讌 覲企 譬
蟆郁骸襯 詞  .   一危 讌 豺朱 蠍 り鍵 覓語 牛  Loss 蠏碁  朱れ
れ企慨覃伎  焔 覈語 蟆一.
3. 曙^蟇2 :  煙 襷  蠍磯 觜 螳覦
れ 觜 譴  煙 蟆     觜るゼ 伎 螳覦伎 .   煙 螻  覿朱
蠍磯蓋朱 ろ碁ゼ 蠍磯朱  觜れ. 磯殊 ク讌襯 曙伎手碓, 覈貊, 觚襴 蠏碁Μ螻  豢 螳 ろ 蠍磯
觜るれ  煙朱 蟲伎 覲企 朱 蟲   觜るゼ 蟲.  覈襴 覦郁化 燕   
殊 覲朱エ るゴ蠍 覓語 企ゼ  譟一伎 燕 螳 . 螳語 企佒 觜    危襴貅伎朱
蟆曙 .  觜るゼ   讌  殊 螻 殊 れ企   蟆 .
螻殊覈 : multi-speaker-tacotron 伎 螳   觜
18 / 8
