Seguir
Xiangang Li
Xiangang Li
Baidu, DiDi, Beike
Nenhum e-mail foi confirmado
Título
Citado por
Citado por
Ano
Deep speech 2: End-to-end speech recognition in english and mandarin
D Amodei, S Ananthanarayanan, R Anubhai, J Bai, E Battenberg, C Case, ...
International conference on machine learning, 173-182, 2016
36012016
Deep speaker: an end-to-end neural speaker embedding system
C Li, X Ma, B Jiang, X Li, X Zhang, X Liu, Y Cao, A Kannan, Z Zhu
arXiv preprint arXiv:1705.02304, 2017
5482017
Constructing long short-term memory based deep recurrent neural networks for large vocabulary speech recognition
X Li, X Wu
2015 ieee international conference on acoustics, speech and signal …, 2015
4962015
Gigaspeech: An evolving, multi-domain asr corpus with 10,000 hours of transcribed audio
G Chen, S Chai, G Wang, J Du, WQ Zhang, C Weng, D Su, D Povey, ...
arXiv preprint arXiv:2106.06909, 2021
1592021
Learning alignment for multimodal emotion recognition from speech
H Xu, H Zhang, K Han, Y Wang, Y Peng, X Li
arXiv preprint arXiv:1909.05645, 2019
1562019
Improving transformer-based speech recognition using unsupervised pre-training
D Jiang, X Lei, W Li, N Luo, Y Hu, W Zou, X Li
arXiv preprint arXiv:1910.09932, 2019
992019
Speech simclr: Combining contrastive and reconstruction objective for self-supervised speech representation learning
D Jiang, W Li, M Cao, W Zou, X Li
arXiv preprint arXiv:2010.13991, 2020
702020
Openchat: Advancing open-source language models with mixed-quality data
G Wang, S Cheng, X Zhan, X Li, S Song, Y Liu
arXiv preprint arXiv:2309.11235, 2023
652023
Gram-CTC: Automatic unit selection and target decomposition for sequence labelling
H Liu, Z Zhu, X Li, S Satheesh
International Conference on Machine Learning, 2188-2197, 2017
642017
Towards end-to-end code-switching speech recognition
N Luo, D Jiang, S Zhao, C Gong, W Zou, X Li
arXiv preprint arXiv:1810.13091, 2018
592018
Exploring the impact of instruction data scaling on large language models: An empirical study on real-world use cases
Y Ji, Y Deng, Y Gong, Y Peng, Q Niu, L Zhang, B Ma, X Li
arXiv preprint arXiv:2303.14742, 2023
582023
Implicit discourse relation recognition using neural tensor network with interactive attention and sparse learning
F Guo, R He, D Jin, J Dang, L Wang, X Li
Proceedings of the 27th International Conference on Computational …, 2018
522018
A comparative study on selecting acoustic modeling units in deep neural networks based large vocabulary Chinese speech recognition
X Li, Y Yang, Z Pang, X Wu
Neurocomputing 170, 251-256, 2015
452015
A further study of unsupervised pretraining for transformer based speech recognition
D Jiang, W Li, R Zhang, M Cao, N Luo, Y Han, W Zou, K Han, X Li
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
412021
On loss functions and recurrency training for GAN-based speech enhancement systems
Z Zhang, C Deng, Y Shen, DS Williamson, Y Sha, Y Zhang, H Song, X Li
arXiv preprint arXiv:2007.14974, 2020
392020
Speech Emotion Recognition by Combining Amplitude and Phase Information Using Convolutional Neural Network.
L Guo, L Wang, J Dang, L Zhang, H Guan, X Li
INTERSPEECH, 1611-1615, 2018
352018
Comparable study of modeling units for end-to-end mandarin speech recognition
W Zou, D Jiang, S Zhao, G Yang, X Li
2018 11th International Symposium on Chinese Spoken Language Processing …, 2018
342018
Transformer based unsupervised pre-training for acoustic representation learning
R Zhang, H Wu, W Li, D Jiang, W Zou, X Li
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
332021
Didispeech: A large scale mandarin speech corpus
T Guo, C Wen, D Jiang, N Luo, R Zhang, S Zhao, W Li, C Gong, W Zou, ...
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
292021
Modeling speaker variability using long short-term memory networks for speech recognition.
X Li, X Wu
Interspeech, 1086-1090, 2015
292015
O sistema não pode executar a operação agora. Tente novamente mais tarde.
Artigos 1–20