Shruti Palaskar
Title
Cited by
Year
How2: a large-scale dataset for multimodal language understanding
R Sanabria, O Caglayan, S Palaskar, D Elliott, L Barrault, L Specia, ...
arXiv preprint arXiv:1811.00347, 2018
Cited by 104, 2018
Linguistic unit discovery from multi-modal inputs in unwritten languages: Summary of the “Speaking rosetta” JSALT 2017 workshop
O Scharenborg, L Besacier, A Black, M Hasegawa-Johnson, F Metze, ...
2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018
Cited by 32*, 2018
Multimodal abstractive summarization for How2 videos
S Palaskar, J Libovický, S Gella, F Metze
arXiv preprint arXiv:1906.07901, 2019
Cited by 27, 2019
Combining LSTM and latent topic modeling for mortality prediction
Y Jo, L Lee, S Palaskar
arXiv preprint arXiv:1709.02842, 2017
Cited by 27, 2017
End-to-end multimodal speech recognition
S Palaskar, R Sanabria, F Metze
2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018
Cited by 24, 2018
CMU Sinbad’s submission for the DSTC7 AVSD challenge
R Sanabria, S Palaskar, F Metze
DSTC7 at AAAI2019 workshop 6, 2019
Cited by 23, 2019
ASR error correction and domain adaptation using machine translation
A Mani, S Palaskar, NV Meripo, S Konam, F Metze
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
Cited by 21, 2020
Building an ASR system for a low-resource language through the adaptation of a high-resource language ASR system: Preliminary results
O Scharenborg, F Ciannella, S Palaskar, A Black, F Metze, L Ondel, ...
Proceedings of ICNLSSP, Casablanca, Morocco, 2017
Cited by 20, 2017
Acoustic-to-word recognition with sequence-to-sequence models
S Palaskar, F Metze
2018 IEEE Spoken Language Technology Workshop (SLT), 397-404, 2018
Cited by 17, 2018
Multimodal grounding for sequence-to-sequence speech recognition
O Caglayan, R Sanabria, S Palaskar, L Barrault, F Metze
ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019
Cited by 15, 2019
Learned in speech recognition: Contextual acoustic word embeddings
S Palaskar, V Raunak, F Metze
ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019
Cited by 13, 2019
Multimodal abstractive summarization for open-domain videos
J Libovický, S Palaskar, S Gella, F Metze
Proceedings of the Workshop on Visually Grounded Interaction and Language …, 2018
Cited by 13, 2018
Learning from multiview correlations in open-domain videos
N Holzenberger, S Palaskar, P Madhyastha, F Metze, R Arora
ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019
Cited by 11, 2019
Towards understanding ASR error correction for medical conversations
A Mani, S Palaskar, S Konam
Proceedings of the First Workshop on Natural Language Processing for Medical …, 2020
Cited by 9, 2020
How2Sign: a large-scale multimodal dataset for continuous American sign language
A Duarte, S Palaskar, L Ventura, D Ghadiyaram, K DeHaan, F Metze, ...
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2021
Cited by 3, 2021
Grounded Sequence to Sequence Transduction
L Specia, L Barrault, O Caglayan, A Duarte, D Elliott, S Gella, ...
IEEE journal of selected topics in signal processing 14 (3), 577-591, 2020
Cited by 3, 2020
Transfer learning for multimodal dialog
S Palaskar, R Sanabria, F Metze
Computer Speech & Language 64, 101093, 2020
Cited by 2, 2020
Multimodal Learning from Videos
S Palaskar
2021
Multimodal Speech Summarization Through Semantic Concept Learning
S Palaskar, R Salakhutdinov, AW Black, F Metze
Proc. Interspeech 2021, 791-795, 2021
2021
How2Sign: A Large-scale Multimodal Dataset for Continuous American Sign Language Open Website
A Duarte, S Palaskar, L Ventura, D Ghadiyaram, KJ De Haan, F Metze, ...