|
[1] Aron van den Oord, Sander Dieleman, Heiga Zen, Karen Simonyan, Oriol Vinyals,Alexander Graves, Nal Kalchbrenner, Andrew Senior, and Koray Kavukcuoglu.Wavenet: A generative model for raw audio. InArxiv, 2016. URLhttps://arxiv.org/abs/1609.03499. [2] Bob Sturm, Joo Santos, Oded Ben-Tal, and Iryna Korshunova. Music transcriptionmodelling and composition using deep learning.Conference on Computer Simulationof Musical Creativity, 2016. [3] Li-Chia Yang, Szu-Yu Chou, and Yi-Hsuan Yang. Midinet: A convolutional genera-tive adversarial network for symbolic-domain music generation.International Societyof Music Information Retrieval Conference, 2017. [4] Cheng-Zhi Anna Huang, Ashish Vaswani, Jakob Uszkoreit, Noam Shazeer, Ian Si-mon, Curtis Hawthorne, Andrew M. Dai, Matthew D. Hoffman, Monica Dinculescu,and Douglas Eck. Music transformer.International Conference on Learning Repre-sentations, 2019. [5] Gabriel Guimaraes, Benjamin Sanchez, Pedro Farias, and Aln Aspuru-Guzik.Objective-reinforced generative adversarial networks (organ) for sequence generationmodels.ArXiv, 2017. [6] Natasha Jaques, Shixiang Gu, Richard E. Turner, and Douglas Eck. Tuning recurrentneural networks with reinforcement learning.International Conference on LearningRepresentations, 2017. [7] Nikhil Kotecha. Bach2bach: Generating music using a deep reinforcement learning approach.https://arxiv.org/ftp/arxiv/papers/1812/1812.01060.pdf, 2018. [8] Orry Messer and Pravesh Ranchod. The use of apprenticeship learning via inverse reinforcement learning for generating melodies.International Computer Music Con-ference, 2014. [9] Daniel D. Johnson. Composing music with recurrent neural networks. http://www.hexahedria.com/2015/08/03/composing-music-with-recurrent-neural-networks/. Accessed: 2020-06-2. [10] Jeffrey L. Elman. Finding structure in time.Cognitive Science, 1990. [11] Sepp Hochreiter and Jurgen Schmidhuber. Long-short term memory.Neural Computation, 1997. [12] Daniel D. Johnson. Generating polyphonic music using tied parallel networks.Inter-national Conference on Computational Intelligence in Music, Sound, Art and Design,2017.88 [13] Christopher Watkins.Learning From Delayed Rewards. PhD thesis, King’s College,Cambridge, 1989. [14] Volodymyr Mnih, Koray Kavukcuoglu, David Silver, Alex Graves, IoannisAntonoglou, Daan Wierstra, and Martin Riedmiller. Playing atari with deep rein-forcement learning.Conference on Neural Information Processing Systems, 2013. [15] Long-Ji Lin.Reinforcement Learning for Robots Using Neural Networks. PhD thesis,School of Computer Science, Carnegie Mellon University, 1993. [16] Richard S. Sutton and Andrew G Barto.Reinforcement Learning: An Introduction.The MIT Press, second edition, 2018. [17] Pieter Abbeel and Andrew Y. Ng. Apprenticeship learning via inverse reinforcementlearning.International Conference on Machine Learning, 2004. [18] Brian D. Ziebart, Andrew Maas, J. Andrew Bagnell, and Anind K. Dey. Maximumentropy inverse reinforcement learning.AAAI Conference on Artificial Intelligence,2008. [19] Chelsea Finn, Paul Christiano, Pieter Abbeel, and Sergey Levine. A connection be-tween generative adversarial networks, inverse reinforcement learning, and energy-based models.Neural Information Processing Systems Conference, 2016. [20] Ian J. Goodfellow, Jean Pouget-Abadie, Mehdi Mirza, Bing Xu, David Warde-Farley,Sherjil Ozair, Aaron Courville, and Yoshua Bengio. Generative adversarial networks.Neural Information Processing Systems Conference, 2014.89 [21] Justin Fu, Katie Luo, and Sergey Levine. Learning robust rewards with adversarial in-verse reinforcement learning.International Conference on Learning Representations,2018. [22] Matthew D. Zeiler. Adadelta: An adaptive learning method.ArXiv, 2012. [23] Nikhil Kotecha. Generating music.https://github.com/nikhil-kotecha/GeneratingMusic. Accessed: 2020-06-2. [24] Justin Fu. Inverse rl.https://github.com/justinjfu/inverserl.Accessed: 2020-06-2. [25] Google Magenta Team. Magenta: Music and art generation with machine intelli-gence.https://github.com/magenta/magenta. Accessed: 2020-06-2. [26] Hao-Wen Dong, Wen-Yi Hsiao, Li-Chia Yang, and Yi-Hsuan Yang. Musegan: Multi-track sequential generative adversarial networks for symbolic music generation andaccompaniment.AAAI Conference on Artificial Intelligence (AAAI), 2018. [27] Colin Raffel.Learning-Based Methods for Comparing Sequences, with Applicationsto Audio-to-MIDI Alignment and Matching. PhD thesis, Colombia University, 2016. [28] Diederik P. Kingma and Jimmi Lei Ba. Adam: A method for stochastic optimization.International Conference for Learning Representations, 2015. [29] John Schulman, Sergey Levine, Philipp Moritz, Micheal Jordan, and Pieter Abbeel.Trust region policy optimization.International Conference on Machine Learning,2015.90 [30] Hao-Wen Dong and Yi-Hsuan Yang. Convolutional generative adversarial networkswith binary neurons for polyphonic music generation.International Society for MusicInformation Retrieval, 2018. [31] Mohammad Akbari and Jie Liang. Semi-recurrent cnn-based vae-gan for sequentialdata generation.IEEE International Conference on Acoustics, Speech and SignalProcessing, 2018. [32] Stephen Merity, Caiming Xiong, James Bradbury, and Richard Socher. Pointer sen-tinel mixture models, 2016.
|