What is the Presentation?
Why people make it?
Who can we use for reference?
Where must we notice?
When we have contents,
How can we extend it?
Rediscover “Presentation”
with tips everyone heard once.
What is the Presentation?
Why people make it?
Who can we use for reference?
Where must we notice?
When we have contents,
How can we extend it?
Rediscover “Presentation”
with tips everyone heard once.
Soft Actor-Critic is an off-policy maximum entropy deep reinforcement learning algorithm that uses a stochastic actor. It was presented in a 2017 NIPS paper by researchers from OpenAI, UC Berkeley, and DeepMind. Soft Actor-Critic extends the actor-critic framework by incorporating an entropy term into the reward function to encourage exploration. This allows the agent to learn stochastic policies that can operate effectively in environments with complex, sparse rewards. The algorithm was shown to learn robust policies on continuous control tasks using deep neural networks to approximate the policy and action-value functions.
28. 参考文献
[1] Ian J. Goodfellow, Jean Pouget-Abadie, Mehdi Mirza, Bing Xu, David Warde-Farley, Sherjil Ozair, Aaron Courville, Yoshua
Bengio, “Generative Adversarial Networks”, Jun 2014
[2] Jun-Yan Zhu, Taesung Park, Phillip Isola, Alexei A. Efros, ”Unpaired Image-to-Image Translation using Cycle-Consistent
Adversarial Networks”, ICCV 2017.
[3] S M. Ali Eslami, Danilo Jimenez Rezende, et.al. “Neural scene representation and rendering”, Science 15 Jun 2018
[4] Takuhiro Kaneko, Hirokazu Kameoka ,“PARALLEL-DATA-FREE VOICE CONVERSION USING CYCLE-CONSISTENT
ADVERSARIAL NETWORKS”, NTT Corporation
[5] Lifa Sun, Kun Li, Hao Wang, Shiyin Kang and Helen Meng, “PHONETIC POSTERIORGRAMS FOR MANY-TO-ONE VOICE
CONVERSION WITHOUT PARALLEL DATA TRAINING “
[6] 統計的声質変換を行うための知識と手法
https://nico-opendata.jp/ja/casestudy/2stack_voice_conversion/report.html#[4]
[7] 人工知能に関する断創録
http://aidiary.hatenablog.com/
[8]スペクトログラムとメル周波数スペクトログラムの可視化
http://r9y9.github.io/blog/2013/11/16/mel-spectrogram/
30