Zengwei Yao

Citado por

	Todos	Desde 2019
Citações	327	327
Índice h	7	7
Índice i10	5	5

140

105

2019202020212022202320245 23 47 74 121 57

Acesso público

Ver todos

3 artigos

0 artigo

disponível

não disponível

Com base nas autorizações de financiamento

Coautores

Fangjun KuangXiaomiE-mail confirmado em xiaomi.com
Wei KangSenior engineer, Xiaomi Corp.E-mail confirmado em xiaomi.com
Daniel PoveyChief Speech Scientist, Xiaomi Corp.E-mail confirmado em xiaomi.com
Xiaoyu YangMachine Learning Engineer, Xiaomi Corp.E-mail confirmado em xiaomi.com
Mingshuang LuoICT, UCAS, Peng Cheng LabE-mail confirmado em mails.ucas.ac.cn
Weihuang LiuUniversity of MacauE-mail confirmado em um.edu.mo
Jiahui PanSouth China Normal UniversityE-mail confirmado em m.scnu.edu.cn
Yifan YangMachine Learning Engineer, Xiaomi Corp.E-mail confirmado em xiaomi.com
Piotr ŻelaskoPrincipal Research Scientist @ NvidiaE-mail confirmado em nvidia.com
Wenjie PeiHarbin Institute of Technology, Shenzhen; Delft University of TechnologyE-mail confirmado em hit.edu.cn
Guangming LuHarbin Institute of Technology, ShenzhenE-mail confirmado em hit.edu.cn
Zengrui JinThe Chinese University of Hong KongE-mail confirmado em se.cuhk.edu.hk
Fanglin ChenAssociate Professor, Harbin Institute of Technology, ShenzhenE-mail confirmado em hit.edu.cn
David Zhang, DapengDistinguished Presidential Chair Prof., Chinese Univ of HK (SZ); Emeritus Prof., HK Polytechnic UnivE-mail confirmado em cuhk.edu.cn

Seguir

Zengwei Yao

Machine Learning Engineer, Xiaomi Corp.

E-mail confirmado em xiaomi.com - Página inicial

speech recognition deep learning


Título Ordenar por citações Ordenar por ano Ordenar por título	Citado por Citado por	Ano
Speech emotion recognition using fusion of three multi-task learning-based classifiers: HSF-DNN, MS-CNN and LLD-RNN Z Yao, Z Wang, W Liu, Y Liu, J Pan Speech Communication 120, 11-19, 2020	136	2020
Convolutional two-stream network using multi-facial feature fusion for driver fatigue detection W Liu, J Qian, Z Yao, X Jiao, J Pan Future Internet 11 (5), 115, 2019	91	2019
Pruned RNN-T for fast, memory-efficient ASR training F Kuang, L Guo, W Kang, L Lin, M Luo, Z Yao, D Povey Proc. Interspeech 2022, 2068--2072, 2022	40	2022
Zipformer: A faster and better encoder for automatic speech recognition Z Yao, L Guo, X Yang, W Kang, F Kuang, Y Yang, Z Jin, L Lin, D Povey ICLR 2024, 2023	17	2023
Fingerprint restoration using cubic Bezier curve Y Tu, Z Yao, J Xu, Y Liu, Z Zhang BMC bioinformatics 21, 1-19, 2020	13	2020
Fast and parallel decoding for transducer W Kang, L Guo, F Kuang, L Lin, M Luo, Z Yao, X Yang, P Żelasko, ... ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023	9	2023
Stepwise-refining speech separation network via fine-grained encoding in high-order latent domain Z Yao, W Pei, F Chen, G Lu, D Zhang IEEE/ACM Transactions on Audio, Speech, and Language Processing 30, 378-393, 2022	9	2022
Libriheavy: a 50,000 hours asr corpus with punctuation casing and context W Kang, X Yang, Z Yao, F Kuang, Y Yang, L Guo, L Lin, D Povey ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024	3	2024
Predicting Multi-Codebook Vector Quantization Indexes for Knowledge Distillation L Guo, X Yang, Q Wang, Y Kong, Z Yao, F Cui, F Kuang, W Kang, L Lin, ... ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023	3	2023
Blank-regularized CTC for Frame Skipping in Neural Transducer Y Yang, X Yang, L Guo, Z Yao, W Kang, F Kuang, L Lin, X Chen, D Povey Proc. INTERSPEECH 2023, 4409--4413, 2023	3	2023
Delay-penalized transducer for low-latency streaming ASR W Kang, Z Yao, F Kuang, L Guo, X Yang, L Lin, P Żelasko, D Povey ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023	2	2023
PromptASR for contextualized ASR with controllable style X Yang, W Kang, Z Yao, Y Yang, L Guo, F Kuang, L Lin, D Povey ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024	1	2024
METHOD AND APPARATUS FOR TRAINING NEURAL NETWORK, AND METHOD AND APPARATUS FOR AUDIO PROCESSING W Kang, P Daniel, F Kuang, L Guo, Z Yao, L Lin, M Luo US Patent App. 18/080,713, 2023		2023
METHOD AND APPARATUS FOR AUDIO PROCESSING, ELECTRONIC DEVICE AND STORAGE MEDIUM M Luo, F Kuang, L Guo, L Lin, W Kang, Z Yao, P Daniel US Patent App. 18/078,483, 2023		2023
METHOD OF TRAINING SPEECH RECOGNITION MODEL, ELECTRONIC DEVICE AND STORAGE MEDIUM Z Yao, L Guo, P Daniel, L Lin, F Kuang, W Kang, M Luo, Q Wang, Y Kong US Patent App. 18/078,460, 2023		2023
Delay-penalized CTC implemented based on Finite State Transducer Z Yao, W Kang, F Kuang, L Guo, X Yang, Y Yang, L Lin, D Povey Proc. INTERSPEECH 2023, 1329--1333, 2023		2023
Semantic-Aware Local-Global Vision Transformer J Zhang, Z Yao, F Chen, G Lu, W Pei arXiv preprint arXiv:2211.14705, 2022		2022

O sistema não pode executar a operação agora. Tente novamente mais tarde.

Artigos 1–17

Citações por ano

Citações duplicadas

Citações mescladas

Adicionar coautoresCoautores

Seguir

Citado por

Coautores