Publications

Zhao, W., & Singh, R. (2020). "Speech-based parameter estimation of an asymmetric vocal fold oscillation model and its application in discriminating vocal fold pathologies." Accepted by the 45th International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2020). IEEE.

Y. Wen, R. Singh, B. Raj, "Face reconstruction from voice using generative adversarial networks," in 33rd Conference on Neural Information Processing Systems (NeurIPS), 2019.

Memon, Shahan Ali, Wenbo Zhao, Bhiksha Raj, and Rita Singh, "Neural regression trees," in 2019 International Joint Conference on Neural Networks (IJCNN), pp. 1-8. IEEE, 2019.

Zhao, W., Gao, Y., & Singh, R. (2017). "Speaker identification from the sound of the human breath," in arXiv preprint arXiv:1712.00171.

Y. Wen, T. Zhou, R. Singh, and B. Raj, "A corrective learning approach for text-independent speaker verification," in 43rd IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2018.

Y. Wen, M. AlIsmail, W. Liu, B. Raj, R. Singh, "Disjoint mapping network for cross-modal matching of voices and faces," in 7th International Conference on Learning Representations (ICLR), 2019.

Gao, Y., Singh, R., & Raj, B. (2018, April), "Voice impersonation using generative adversarial networks," in 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (pp. 2506-2510). IEEE.

Zhao, W., Gao, Y., Memon, S. A., Raj, B., & Singh, R. (2019). "Hierarchical Routing Mixture of Experts," in arXiv preprint arXiv:1903.07756.

Gao, Y., Zheng, W., Yang, Z., Kohler, T., Fuegen, C., & He, Q. (2020). "Interactive Text-to-Speech via Semi-supervised Style Transfer Learning," in arXiv preprint arXiv:2002.06758.

Dhamyal, H., Zhou, T., Raj, B., & Singh, R. (2019, December), "Optimizing Neural Network Embeddings Using a Pair-Wise Loss for Text-Independent Speaker Verification," in 2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU) (pp. 742-748). IEEE.

Memon, S. A., Dhamyal, H., Wright, O., Justice, D., Palat, V., Boler, W., ... & Singh, R. (2019). "Detecting gender differences in perception of emotion in crowdsourced data," in arXiv preprint arXiv:1910.11386.

Dhamyal, H., Memon, S. A., Raj, B., & Singh, R. (2019), "The phonetic bases of vocal expressed emotion: natural versus acted," in arXiv preprint arXiv:1911.05733.