AAI_2025_Capstone_Chronicles_Combined

24

References

Andén, J., & Mallat, S. (2014). Deep scattering spectrum. IEEE Transactions on Signal Processing, 62 (16), 4114–4128. https://doi.org/10.1109/TSP.2014.2326991

Beauchamp, J. W. (2007). Analysis, synthesis, and perception of musical sounds: The sound of music . Springer.

Engel, J., Resnick, C., Roberts, A., Dieleman, S., Eck, D., Simonyan, K., & Norouzi, M. (2017). Neural audio synthesis of musical notes with WaveNet autoencoders. In Proceedings of the 34th International Conference on Machine Learning (pp. 1068–1077). PMLR. https://proceedings.mlr.press/v70/engel17a/engel17a.pdf

Gong, Y., Chung, Y., & Glass, J. (2021). AST: Audio spectrogram transformer. In Proceedings of Interspeech 2021 (pp. 571–575). https://doi.org/10.21437/Interspeech.2021-698

Humphrey, E. J., Bello, J. P., & LeCun, Y. (2012). Feature learning and deep architectures: New directions for music informatics. In Proceedings of the 13th International Society for Music Information Retrieval Conference (pp. 403–408). https://ismir2012.ismir.net/event/papers/403_ISMIR_2012.pdf

Jensen, K. (2005). A model for timbre similarity . In Proceedings of the International Symposium on Music Information Retrieval (pp. 235–240).

McFee, B., Raffel, C., Liang, D., Ellis, D. P. W., McVicar, M., Battenberg, E., & Nieto, O. (2015). Librosa: Audio and music signal analysis in Python. In Proceedings of the 14th Python in Science Conference (pp. 18–25). https://doi.org/10.25080/Majora-7b98e3ed-003

356

Made with FlippingBook - Share PDF online