Berrak Sisman

Cited by

	All	Since 2019
Citations	1735	1699
h-index	23	23
i10-index	36	34

Citations per year

Year	2017	2018	2019	2020	2021	2022	2023	2024
Citations	5	29	86	193	311	420	519	155

Public access

32 articles available
1 article not available

Based on funding mandates

Co-authors

Haizhou Li - The Chinese University of Hong Kong, Shenzhen (CUHK-Shenzhen), China; NUS, Singapore - Verified email at u.nus.edu
Rui Liu (刘瑞) - Professor, Inner Mongolia University - Verified email at mail.imu.edu.cn
Kun Zhou - Alibaba Group - Verified email at u.nus.edu
Björn Schuller - Professor, Technische Universität München (TUM) / Imperial College London & CSO, audEERING - Verified email at tum.de
Junichi Yamagishi - National Institute of Informatics, Tokyo, Japan - Verified email at nii.ac.jp
Simon King - Professor of Speech Processing, University of Edinburgh - Verified email at ed.ac.uk
Satoshi Nakamura - Nara Institute of Science and Technology - Verified email at is.naist.jp
Andros Tjandra - Facebook AI (research scientist) - Verified email at fb.com
Carlos Busso - Professor of Electrical Engineering, The University of Texas at Dallas - Verified email at utdallas.edu
Dorien Herremans - Singapore University of Technology and Design - Verified email at sutd.edu.sg
Nancy F. Chen - Fellow, Generative AI Group Leader, AI for Education Programme Head at A*STAR - Verified email at csail.mit.edu
Onur Kaya - Professor of Electrical and Electronics Eng, Işık University - Verified email at isikun.edu.tr
Sennur Ulukus - Professor of Electrical and Computer Engineering, University of Maryland - Verified email at umd.edu
Wei Yang - Assistant Professor, Department of Computer Science, University of Texas at Dallas - Verified email at utdallas.edu
David Vandyke - Apple - Verified email at apple.com

Berrak Sisman

Electrical & Computer Engineering Department, The University of Texas at Dallas

Verified email at utdallas.edu

deep learning, speech synthesis, TTS, voice conversion, speech processing


Title	Cited by	Year
An overview of voice conversion and its challenges: From statistical modeling to deep learning. B Sisman, J Yamagishi, S King, H Li. IEEE/ACM Transactions on Audio, Speech, and Language Processing 29, 132-157, 2021	293	2021
Seen and unseen emotional style transfer for voice conversion with a new emotional speech dataset. K Zhou, B Sisman, R Liu, H Li. IEEE ICASSP 2021 International Conference on Acoustics, Speech, and Signal …, 2021	155	2021
Emotional Voice Conversion: Theory, Databases and ESD. K Zhou, B Sisman, R Liu, H Li. Speech Communication, 2022	108	2022
VQVAE Unsupervised Unit Discovery and Multi-scale Code2Spec Inverter for Zerospeech Challenge 2019. A Tjandra, B Sisman, M Zhang, S Sakti, H Li, S Nakamura. Proc. Interspeech 2019, 2019	81	2019
Expressive TTS Training with Frame and Style Reconstruction Loss. R Liu, B Sisman, G Gao, H Li. IEEE/ACM Transactions on Audio, Speech and Language Processing, 2021	78	2021
Transforming Spectrum and Prosody for Emotional Voice Conversion with Non-Parallel Training Data. K Zhou, B Sisman, H Li. Proc. Odyssey 2020, Tokyo, Japan, 2020	73	2020
Teacher-Student Training for Robust Tacotron-based TTS. R Liu, B Sisman, J Li, F Bao, G Gao, H Li. IEEE ICASSP 2020 International Conference on Acoustics, Speech, and Signal …, 2020	64	2020
A voice conversion framework with tandem feature sparse representation and speaker-adapted wavenet vocoder. B Sisman, M Zhang, H Li. Proc. Interspeech, 1978-1982, 2018	59	2018
Converting Anyone's Emotion: Towards Speaker-Independent Emotional Voice Conversion. K Zhou, B Sisman, M Zhang, H Li. Proc. Interspeech 2020, 2020	53	2020
Group sparse representation with wavenet vocoder adaptation for spectrum and prosody conversion. B Sisman, M Zhang, H Li. IEEE/ACM Transactions on Audio, Speech, and Language Processing 27 (6), 1085 …, 2019	50	2019
Sparse representation of phonetic features for voice conversion with and without parallel data. B Sisman, H Li, KC Tan. Automatic Speech Recognition and Understanding Workshop (ASRU), 2017 IEEE …, 2017	48	2017
Adaptive Wavenet Vocoder for Residual Compensation in GAN-based Voice Conversion. B Sisman, M Zhang, S Sakti, H Li, S Nakamura. 2018 IEEE Spoken Language Technology Workshop (SLT), 282-289, 2018	45	2018
SINGAN: Singing voice conversion with generative adversarial networks. B Sisman, K Vijayan, M Dong, H Li. Asia-Pacific Signal and Information Processing Association Annual Summit and …, 2019	44	2019
Transformation of prosody in voice conversion. B Sisman, H Li, KC Tan. Asia-Pacific Signal and Information Processing Association Annual Summit and …, 2017	38	2017
Emotion Intensity and its Control for Emotional Voice Conversion. K Zhou, B Sisman, R Rana, BW Schuller, H Li. IEEE Transactions on Affective Computing, 2023	37	2023
VAW-GAN for Disentanglement and Recomposition of Emotional Elements in Speech. K Zhou, B Sisman, H Li. 2021 IEEE Spoken Language Technology Workshop (SLT 2021), 2021	36	2021
On the study of Generative Adversarial Networks for Cross-lingual Voice Conversion. B Sisman, M Zhang, M Dong, H Li. IEEE Automatic Speech Recognition and Understanding Workshop (ASRU) 2019, 2019	35	2019
GraphSpeech: Syntax-Aware Graph Attention Network For Neural Speech Synthesis. R Liu, B Sisman, H Li. IEEE ICASSP 2021 International Conference on Acoustics, Speech, and Signal …, 2021	34	2021
Reinforcement Learning for Emotional Text-to-Speech Synthesis with Improved Emotion Discriminability. R Liu, B Sisman, H Li. INTERSPEECH 2021, 2021	34	2021
Limited Data Emotional Voice Conversion Leveraging Text-to-Speech: Two-stage Sequence-to-Sequence Training. K Zhou, B Sisman, H Li. INTERSPEECH 2021, 2021	30	2021
