LearnNLP / nlp_arxiv_daily

arxiv daily for speech translation, legal. Ref: Vincentqyw/cv-arxiv-daily

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Contributors Forks Stargazers Issues

Updated on 2024.08.19

Table of Contents
  1. Speech Translation
  2. Legal
  3. Speech Recognition
  4. Audio Forenisc

Speech Translation

Publish Date Title Authors PDF Code
2024-08-14 CMU's IWSLT 2024 Simultaneous Speech Translation System Xi Xu, Siqi Ouyang, Brian Yan, Patrick Fernandes, William Chen, Lei Li, Graham Neubig, Shinji Watanabe et.al. 2408.07452v1 null
2024-07-31 Towards Achieving Human Parity on End-to-end Simultaneous Speech Translation via LLM Agent Shanbo Cheng, Zhichao Huang, Tom Ko, Hang Li, Ningxin Peng, Lu Xu, Qini Zhang et.al. 2407.21646v1 null
2024-07-31 Contrastive Feedback Mechanism for Simultaneous Speech Translation Haotian Tan, Sakriani Sakti et.al. 2407.20524v2 null
2024-07-08 Analyzing Speech Unit Selection for Textless Speech-to-Speech Translation Jarod Duret, Yannick Estève, Titouan Parcollet et.al. 2407.18332v1 null
2024-07-22 LLaST: Improved End-to-end Speech Translation System Leveraged by Large Language Models Xi Chen, Songyang Zhang, Qibing Bai, Kai Chen, Satoshi Nakamura et.al. 2407.15415v1 link
2024-07-18 Preset-Voice Matching for Privacy Regulated Speech-to-Speech Translation Systems Daniel Platnick, Bishoy Abdelnour, Eamon Earl, Rahul Kumar, Zahra Rezaei, Thomas Tsangaris, Faraj Lagum et.al. 2407.13153v1 null
2024-06-26 Navigating the Minefield of MT Beam Search in Cascaded Streaming Speech Translation Rastislav Rabatin, Frank Seide, Ernie Chang et.al. 2407.11010v1 null
2024-07-01 Cross-Lingual Transfer Learning for Speech Translation Rao Ma, Yassir Fathullah, Mengjie Qian, Siyuan Tang, Mark Gales, Kate Knill et.al. 2407.01130v1 null
2024-06-30 NAIST Simultaneous Speech Translation System for IWSLT 2024 Yuka Ko, Ryo Fukuda, Yuta Nishikawa, Yasumasa Kano, Tomoya Yanagita, Kosuke Doi, Mana Makinae, Haotian Tan, Makoto Sakai, Sakriani Sakti, Katsuhito Sudoh, Satoshi Nakamura et.al. 2407.00826v1 null
2024-06-27 Leveraging Synthetic Audio Data for End-to-End Low-Resource Speech Translation Yasmin Moslem et.al. 2406.17363v2 null
2024-06-24 Blending LLMs into Cascaded Speech Translation: KIT's Offline Speech Translation System for IWSLT 2024 Sai Koneru, Thai-Binh Nguyen, Ngoc-Quan Pham, Danni Liu, Zhaolin Li, Alexander Waibel, Jan Niehues et.al. 2406.16777v1 null
2024-06-20 SimulSeamless: FBK at IWSLT 2024 Simultaneous Speech Translation Sara Papi, Marco Gaido, Matteo Negri, Luisa Bentivogli et.al. 2406.14177v1 link
2024-06-16 CoSTA: Code-Switched Speech Translation using Aligned Speech-Text Interleaving Bhavani Shankar, Preethi Jyothi, Pushpak Bhattacharyya et.al. 2406.10993v1 null
2024-06-15 Lightweight Audio Segmentation for Long-form Speech Translation Jaesong Lee, Soyoon Kim, Hanbyul Kim, Joon Son Chung et.al. 2406.10549v1 null
2024-06-12 Soft Language Identification for Language-Agnostic Many-to-One End-to-End Speech Translation Peidong Wang, Jian Xue, Jinyu Li, Junkun Chen, Aswin Shanmugam Subramanian et.al. 2406.10276v1 null
2024-06-14 Diffusion Synthesizer for Efficient Multilingual Speech to Speech Translation Nameer Hirschkind, Xiao Yu, Mahesh Kumar Nandwana, Joseph Liu, Eloi DuBois, Dao Le, Nicolas Thiebaut, Colin Sinclair, Kyle Spence, Charles Shang, Zoe Abrams, Morgan McGuire et.al. 2406.10223v1 null
2024-06-14 Exploring the Correlation between Human and Machine Evaluation of Simultaneous Speech Translation Xiaoman Wang, Claudio Fantinuoli et.al. 2406.10091v1 null
2024-06-11 CTC-based Non-autoregressive Textless Speech-to-Speech Translation Qingkai Fang, Zhengrui Ma, Yan Zhou, Min Zhang, Yang Feng et.al. 2406.07330v1 link
2024-06-11 Can We Achieve High-quality Direct Speech-to-Speech Translation without Parallel Speech Data? Qingkai Fang, Shaolei Zhang, Zhengrui Ma, Min Zhang, Yang Feng et.al. 2406.07289v1 null
2024-06-06 Label-Synchronous Neural Transducer for E2E Simultaneous Speech Translation Keqi Deng, Philip C. Woodland et.al. 2406.04541v1 link
2024-06-06 Evaluating the IWSLT2023 Speech Translation Tasks: Human Annotations, Automatic Metrics, and Segmentation Matthias Sperber, Ondřej Bojar, Barry Haddow, Dávid Javorský, Xutai Ma, Matteo Negri, Jan Niehues, Peter Polák, Elizabeth Salesky, Katsuhito Sudoh, Marco Turchi et.al. 2406.03881v1 null
2024-06-05 StreamSpeech: Simultaneous Speech-to-Speech Translation with Multi-task Learning Shaolei Zhang, Qingkai Fang, Shoutao Guo, Zhengrui Ma, Min Zhang, Yang Feng et.al. 2406.03049v1 link
2024-06-04 Textless Acoustic Model with Self-Supervised Distillation for Noise-Robust Expressive Speech-to-Speech Translation Min-Jae Hwang, Ilia Kulikov, Benjamin Peloquin, Hongyu Gong, Peng-Jen Chen, Ann Lee et.al. 2406.02733v1 null
2024-06-04 SimulTron: On-Device Simultaneous Speech to Speech Translation Alex Agranovich, Eliya Nachmani, Oleg Rybakov, Yifan Ding, Ye Jia, Nadav Bar, Heiga Zen, Michelle Tadmor Ramanovich et.al. 2406.02133v1 null
2024-06-01 Recent Advances in End-to-End Simultaneous Speech Translation Xiaoqian Liu, Guoqiang Hu, Yangfan Du, Erfeng He, YingFeng Luo, Chen Xu, Tong Xiao, Jingbo Zhu et.al. 2406.00497v1 null
2024-05-30 SeamlessExpressiveLM: Speech Language Model for Expressive Speech-to-Speech Translation with Chain-of-Thought Hongyu Gong, Bandhav Veluri et.al. 2405.20410v1 null
2024-05-28 TransVIP: Speech to Speech Translation System with Voice and Isochrony Preservation Chenyang Le, Yao Qian, Dongmei Wang, Long Zhou, Shujie Liu, Xiaofei Wang, Midia Yousefi, Yanmin Qian, Jinyu Li, Sheng Zhao, Michael Zeng et.al. 2405.17809v1 null
2024-05-22 DiffNorm: Self-Supervised Normalization for Non-autoregressive Speech-to-speech Translation Weiting Tan, Jingyu Zhang, Lingfeng Shen, Daniel Khashabi, Philipp Koehn et.al. 2405.13274v1 link
2024-05-21 MELD-ST: An Emotion-aware Speech Translation Dataset Sirou Chen, Sakiko Yahata, Shuichiro Shimizu, Zhengdong Yang, Yihang Li, Chenhui Chu, Sadao Kurohashi et.al. 2405.13233v1 null
2024-03-25 Advancing Speech Translation: A Corpus of Mandarin-English Conversational Telephone Speech Shannon Wotherspoon, William Hartmann, Matthew Snover et.al. 2404.11619v1 null
2024-03-19 MSLM-S2ST: A Multitask Speech Language Model for Textless Speech-to-Speech Translation with Speaker Style Preservation Yifan Peng, Ilia Kulikov, Yilin Yang, Sravya Popuri, Hui Lu, Changhan Wang, Hongyu Gong et.al. 2403.12408v1 null
2024-03-08 FFSTC: Fongbe to French Speech Translation Corpus D. Fortune Kponou, Frejus A. A. Laleye, Eugene C. Ezin et.al. 2403.05488v1 null
2024-06-26 Compact Speech Translation Models via Discrete Speech Units Pretraining Tsz Kin Lam, Alexandra Birch, Barry Haddow et.al. 2402.19333v2 null
2024-02-25 Direct Punjabi to English speech translation using discrete units Prabhjot Kaur, L. Andrew M. Bush, Weisong Shi et.al. 2402.15967v1 null
2024-05-17 Speech Translation with Speech Foundation Models and Large Language Models: What is There and What is Missing? Marco Gaido, Sara Papi, Matteo Negri, Luisa Bentivogli et.al. 2402.12025v2 null
2024-06-05 Pushing the Limits of Zero-shot End-to-End Speech Translation Ioannis Tsiamas, Gerard I. Gállego, José A. R. Fonollosa, Marta R. Costa-jussà et.al. 2402.10422v2 link
2024-02-02 A Case Study on Filtering for End-to-End Speech Translation Md Mahfuz Ibn Alam, Antonios Anastasopoulos et.al. 2402.01945v1 null
2024-01-17 TranSentence: Speech-to-speech Translation via Language-agnostic Sentence-level Speech Encoding without Language-parallel Data Seung-Bin Kim, Sang-Hoon Lee, Seong-Whan Lee et.al. 2401.12992v1 null
2024-01-11 R-BI: Regularized Batched Inputs enhance Incremental Decoding Framework for Low-Latency Simultaneous Speech Translation Jiaxin Guo, Zhanglin Wu, Zongyao Li, Hengchao Shang, Daimeng Wei, Xiaoyu Chen, Zhiqiang Rao, Shaojun Li, Hao Yang et.al. 2401.05700v1 null
2023-12-21 Speech Translation with Large Language Models: An Industrial Practice Zhichao Huang, Rong Ye, Tom Ko, Qianqian Dong, Shanbo Cheng, Mingxuan Wang, Hang Li et.al. 2312.13585v1 null
2023-12-18 Soft Alignment of Modality Space for End-to-end Speech Translation Yuhao Zhang, Kaiqi Kou, Bei Li, Chen Xu, Chunliang Zhang, Tong Xiao, Jingbo Zhu et.al. 2312.10952v1 null
2023-12-08 Seamless: Multilingual Expressive and Streaming Speech Translation Seamless Communication, Loïc Barrault, Yu-An Chung, Mariano Coria Meglioli, David Dale, Ning Dong, Mark Duppenthaler, Paul-Ambroise Duquenne, Brian Ellis, Hady Elsahar, Justin Haaheim, John Hoffman, Min-Jae Hwang, Hirofumi Inaguma, Christopher Klaiber, Ilia Kulikov, Pengwei Li, Daniel Licht, Jean Maillard, Ruslan Mavlyutov, Alice Rakotoarison, Kaushik Ram Sadagopan, Abinesh Ramakrishnan, Tuan Tran, Guillaume Wenzek, Yilin Yang, Ethan Ye, Ivan Evtimov, Pierre Fernandez, Cynthia Gao, Prangthip Hansanti, Elahe Kalbassi, Amanda Kallet, Artyom Kozhevnikov, Gabriel Mejia Gonzalez, Robin San Roman, Christophe Touret, Corinne Wong, Carleigh Wood, Bokai Yu, Pierre Andrews, Can Balioglu, Peng-Jen Chen, Marta R. Costa-jussà, Maha Elbayad, Hongyu Gong, Francisco Guzmán, Kevin Heffernan, Somya Jain, Justine Kao, Ann Lee, Xutai Ma, Alex Mourachko, Benjamin Peloquin, Juan Pino, Sravya Popuri, Christophe Ropers, Safiyyah Saleem, Holger Schwenk, Anna Sun, Paden Tomasello, Changhan Wang, Jeff Wang, Skyler Wang, Mary Williamson et.al. 2312.05187v1 link
2024-03-26 AV2AV: Direct Audio-Visual Speech to Audio-Visual Speech Translation with Unified Audio-Visual Speech Representation Jeongsoo Choi, Se Jin Park, Minsu Kim, Yong Man Ro et.al. 2312.02512v2 link
2023-11-07 Rethinking and Improving Multi-task Learning for End-to-end Speech Translation Yuhao Zhang, Chen Xu, Bei Li, Hao Chen, Tong Xiao, Chunliang Zhang, Jingbo Zhu et.al. 2311.03810v1 link
2023-11-01 End-to-End Single-Channel Speaker-Turn Aware Conversational Speech Translation Juan Zuluaga-Gomez, Zhaocheng Huang, Xing Niu, Rohit Paturi, Sundararajan Srinivasan, Prashant Mathur, Brian Thompson, Marcello Federico et.al. 2311.00697v1 link
2023-10-31 Towards a Deep Understanding of Multilingual End-to-End Speech Translation Haoran Sun, Xiaohu Zhao, Yikun Lei, Shaolin Zhu, Deyi Xiong et.al. 2310.20456v1 link
2023-10-26 DiffS2UT: A Semantic Preserving Diffusion Model for Textless Direct Speech-to-Speech Translation Yongxin Zhu, Zhujin Gao, Xinyuan Zhou, Zhongyi Ye, Linli Xu et.al. 2310.17570v1 null
2023-10-24 Integrating Language Models into Direct Speech Translation: An Inference-Time Solution to Control Gender Inflection Dennis Fucci, Marco Gaido, Sara Papi, Mauro Cettolo, Matteo Negri, Luisa Bentivogli et.al. 2310.15752v1 link
2023-10-23 How To Build Competitive Multi-gender Speech Translation Models For Controlling Speaker Gender Translation Marco Gaido, Dennis Fucci, Matteo Negri, Luisa Bentivogli et.al. 2310.15114v1 link
2023-10-23 Long-Form Speech Translation through Segmentation with Finite-State Decoding Constraints on Large Language Models Arya D. McCarthy, Hao Zhang, Shankar Kumar, Felix Stahlberg, Ke Wu et.al. 2310.13678v2 null
2023-10-23 Towards Real-World Streaming Speech Translation for Code-Switched Speech Belen Alastruey, Matthias Sperber, Christian Gollan, Dominic Telaar, Tim Ng, Aashish Agarwal et.al. 2310.12648v2 link
2023-10-17 Long-form Simultaneous Speech Translation: Thesis Proposal Peter Polák et.al. 2310.11141v1 null
2023-10-13 Dialect Transfer for Swiss German Speech Translation Claudio Paonessa, Yanick Schraner, Jan Deriu, Manuela Hürlimann, Manfred Vogel, Mark Cieliebak et.al. 2310.09088v1 null
2023-10-11 DASpeech: Directed Acyclic Transformer for Fast and High-quality Speech-to-Speech Translation Qingkai Fang, Yan Zhou, Yang Feng et.al. 2310.07403v1 link
2023-10-11 Enhancing expressivity transfer in textless speech-to-speech translation Jarod Duret, Benjamin O'Brien, Yannick Estève, Titouan Parcollet et.al. 2310.07279v1 null
2023-10-06 Improving Stability in Simultaneous Speech Translation: A Revision-Controllable Decoding Approach Junkun Chen, Jian Xue, Peidong Wang, Jing Pan, Jinyu Li et.al. 2310.04399v1 null
2023-10-03 Tuning Large language model for End-to-end Speech Translation Hao Zhang, Nianwen Si, Yaqi Chen, Wenlin Zhang, Xukui Yang, Dan Qu, Xiaolin Jiao et.al. 2310.02050v1 null
2023-10-07 LAE-ST-MoE: Boosted Language-Aware Encoder Using Speech Translation Auxiliary Task for E2E Code-switching ASR Guodong Ma, Wenxuan Wang, Yuke Li, Yuting Yang, Binbin Du, Haoran Fu et.al. 2309.16178v2 null
2023-09-27 Enhancing End-to-End Conversational Speech Translation Through Target Language Context Utilization Amir Hussein, Brian Yan, Antonios Anastasopoulos, Shinji Watanabe, Sanjeev Khudanpur et.al. 2309.15686v1 null
2023-09-21 Bridging the Gaps of Both Modality and Language: Synchronous Bilingual CTC for Speech Translation and Speech Recognition Chen Xu, Xiaoqian Liu, Erfeng He, Yuhao Zhang, Qianqian Dong, Tong Xiao, Jingbo Zhu, Dapeng Man, Wu Yang et.al. 2309.12234v1 link
2024-04-25 SpeechAlign: a Framework for Speech Translation Alignment Evaluation Belen Alastruey, Aleix Sant, Gerard I. Gállego, David Dale, Marta R. Costa-jussà et.al. 2309.11585v2 null
2023-09-20 Long-Form End-to-End Speech Translation via Latent Alignment Segmentation Peter Polák, Ondřej Bojar et.al. 2309.11384v1 null
2023-09-20 Incremental Blockwise Beam Search for Simultaneous Speech Translation with Controllable Quality-Latency Tradeoff Peter Polák, Brian Yan, Shinji Watanabe, Alex Waibel, Ondřej Bojar et.al. 2309.11379v1 null
2024-01-22 DiariST: Streaming Speech Translation with Speaker Diarization Mu Yang, Naoyuki Kanda, Xiaofei Wang, Junkun Chen, Peidong Wang, Jian Xue, Jinyu Li, Takuya Yoshioka et.al. 2309.08007v2 link
2024-07-19 Speech-to-Speech Translation with Discrete-Unit-Based Style Transfer Yongqi Wang, Jionghao Bai, Rongjie Huang, Ruiqi Li, Zhiqing Hong, Zhou Zhao et.al. 2309.07566v2 null
2023-09-14 Direct Text to Speech Translation System using Acoustic Units Victoria Mingote, Pablo Gimeno, Luis Vicente, Sameer Khurana, Antoine Laurent, Jarod Duret et.al. 2309.07478v1 null
2024-07-17 End-to-End Evaluation for Low-Latency Simultaneous Speech Translation Christian Huber, Tu Anh Dinh, Carlos Mullov, Ngoc Quan Pham, Thai Binh Nguyen, Fabian Retkowski, Stefan Constantin, Enes Yavuz Ugan, Danni Liu, Zhaolin Li, Sai Koneru, Jan Niehues, Alexander Waibel et.al. 2308.03415v3 null
2023-07-17 Multilingual Speech-to-Speech Translation into Multiple Target Languages Hongyu Gong, Ning Dong, Sravya Popuri, Vedanuj Goswami, Ann Lee, Juan Pino et.al. 2307.08655v1 null
2023-07-17 Improving End-to-End Speech Translation by Imitation-Based Knowledge Distillation with Synthetic Transcripts Rebekka Hubert, Artem Sokolov, Stefan Riezler et.al. 2307.08426v1 link
2023-07-10 The NPU-MSXF Speech-to-Speech Translation System for IWSLT 2023 Speech-to-Speech Translation Task Kun Song, Yi lei, Peikun Chen, Yiqing Cao, Kun Wei, Yongmao Zhang, Lei Xie, Ning Jiang, Guoqing Zhao et.al. 2307.04630v1 null
2023-07-03 Implicit Memory Transformer for Computationally Efficient Simultaneous Speech Translation Matthew Raffel, Lizhong Chen et.al. 2307.01381v1 link
2023-07-03 Shiftable Context: Addressing Training-Inference Context Mismatch in Simultaneous Speech Translation Matthew Raffel, Drew Penney, Lizhong Chen et.al. 2307.01377v1 link
2023-06-20 HK-LegiCoST: Leveraging Non-Verbatim Transcripts for Speech Translation Cihan Xiao, Henry Li Xinyuan, Jinyi Yang, Dongji Gao, Matthew Wiesner, Kevin Duh, Sanjeev Khudanpur et.al. 2306.11252v1 link
2023-06-14 Tagged End-to-End Simultaneous Speech Translation Training using Simultaneous Interpretation Data Yuka Ko, Ryo Fukuda, Yuta Nishikawa, Yasumasa Kano, Katsuhito Sudoh, Satoshi Nakamura et.al. 2306.08582v1 null
2023-06-13 NAVER LABS Europe's Multilingual Speech Translation Systems for the IWSLT 2023 Low-Resource Track Edward Gow-Smith, Alexandre Berard, Marcely Zanon Boito, Ioan Calapodescu et.al. 2306.07763v1 null
2023-06-13 Modality Adaption or Regularization? A Case Study on End-to-End Speech Translation Yuchen Han, Chen Xu, Tong Xiao, Jingbo Zhu et.al. 2306.07650v1 link
2023-07-12 KIT's Multilingual Speech Translation System for IWSLT 2023 Danni Liu, Thai Binh Nguyen, Sai Koneru, Enes Yavuz Ugan, Ngoc-Quan Pham, Tuan-Nam Nguyen, Tu Anh Dinh, Carlos Mullov, Alexander Waibel, Jan Niehues et.al. 2306.05320v3 link
2023-06-13 PolyVoice: Language Models for Speech to Speech Translation Qianqian Dong, Zhiying Huang, Qiao Tian, Chen Xu, Tom Ko, Yunlong Zhao, Siyuan Feng, Tang Li, Kexin Wang, Xuxin Cheng, Fengpeng Yue, Ye Bai, Xi Chen, Lu Lu, Zejun Ma, Yuping Wang, Mingxuan Wang, Yuxuan Wang et.al. 2306.02982v2 null
2023-06-02 Speech Translation with Foundation Models and Optimal Transport: UPC at IWSLT23 Ioannis Tsiamas, Gerard I. Gállego, José A. R. Fonollosa, Marta R. Costa-jussà et.al. 2306.01327v1 null
2023-06-01 Learning When to Speak: Latency and Quality Trade-offs for Simultaneous Speech-to-Speech Translation with Offline Models Liam Dugan, Anshul Wadhawan, Kyle Spence, Chris Callison-Burch, Morgan McGuire, Victor Zordan et.al. 2306.01201v1 link
2024-01-25 Improved Cross-Lingual Transfer Learning For Automatic Speech Translation Sameer Khurana, Nauman Dawalatabad, Antoine Laurent, Luis Vicente, Pablo Gimeno, Victoria Mingote, James Glass et.al. 2306.00789v4 null
2023-07-25 StyleS2ST: Zero-shot Style Transfer for Direct Speech-to-speech Translation Kun Song, Yi Ren, Yi Lei, Chunfeng Wang, Kun Wei, Lei Xie, Xiang Yin, Zejun Ma et.al. 2305.17732v4 null
2024-01-16 Translatotron 3: Speech to Speech Translation with Monolingual Data Eliya Nachmani, Alon Levkovitch, Yifan Ding, Chulayuth Asawaroengchai, Heiga Zen, Michelle Tadmor Ramanovich et.al. 2305.17547v3 null
2023-05-27 CTC-based Non-autoregressive Speech Translation Chen Xu, Xiaoqian Liu, Xiaowen Liu, Qingxuan Sun, Yuhao Zhang, Murun Yang, Qianqian Dong, Tom Ko, Mingxuan Wang, Tong Xiao, Anxiang Ma, Jingbo Zhu et.al. 2305.17358v1 link
2023-05-26 Inter-connection: Effective Connection between Pre-trained Encoder and Decoder for Speech Translation Yuta Nishikawa, Satoshi Nakamura et.al. 2305.16897v1 null
2023-06-18 End-to-End Simultaneous Speech Translation with Differentiable Segmentation Shaolei Zhang, Yang Feng et.al. 2305.16093v2 link
2024-02-20 Textless Low-Resource Speech-to-Speech Translation With Unit Language Models Anuj Diwan, Anirudh Srinivasan, David Harwath, Eunsol Choi et.al. 2305.15405v2 link
2023-05-24 AV-TranSpeech: Audio-Visual Robust Speech-to-Speech Translation Rongjie Huang, Huadai Liu, Xize Cheng, Yi Ren, Linjun Li, Zhenhui Ye, Jinzheng He, Lichao Zhang, Jinglin Liu, Xiang Yin, Zhou Zhao et.al. 2305.15403v1 null
2023-05-25 CMOT: Cross-modal Mixup via Optimal Transport for Speech Translation Yan Zhou, Qingkai Fang, Yang Feng et.al. 2305.14635v2 link
2023-05-23 Improving speech translation by fusing speech and text Wenbiao Yin, Zhicheng Liu, Chengqi Zhao, Tao Wang, Jian Tong, Rong Ye et.al. 2305.14042v1 null
2023-05-22 Improving Metrics for Speech Translation Claudio Paonessa, Dominik Frefel, Manfred Vogel et.al. 2305.12918v1 null
2023-05-22 Duplex Diffusion Models Improve Speech-to-Speech Translation Xianchao Wu et.al. 2305.12628v1 null
2023-05-19 DUB: Discrete Unit Back-translation for Speech Translation Dong Zhang, Rong Ye, Tom Ko, Mingxuan Wang, Yaqian Zhou et.al. 2305.11411v1 link
2023-07-20 AlignAtt: Using Attention-based Audio-Translation Alignments as a Guide for Simultaneous Speech Translation Sara Papi, Marco Turchi, Matteo Negri et.al. 2305.11408v2 link
2023-10-17 The Interpreter Understands Your Meaning: End-to-end Spoken Language Understanding Aided by Speech Translation Mutian He, Philip N. Garner et.al. 2305.09652v2 link
2023-05-15 Understanding and Bridging the Modality Gap for Speech Translation Qingkai Fang, Yang Feng et.al. 2305.08706v1 link
2023-05-12 Improving Cascaded Unsupervised Speech Translation with Denoising Back-translation Yu-Kuan Fu, Liang-Hsuan Tseng, Jiatong Shi, Chen-An Li, Tsu-Yuan Hsu, Shinji Watanabe, Hung-yi Lee et.al. 2305.07455v1 null
2023-12-18 Improving Speech Translation Accuracy and Time Efficiency with Fine-tuned wav2vec 2.0-based Speech Segmentation Ryo Fukuda, Katsuhito Sudoh, Satoshi Nakamura et.al. 2304.12659v2 link
2023-04-20 Improving Speech Translation by Cross-Modal Multi-Grained Contrastive Learning Hao Zhang, Nianwen Si, Yaqi Chen, Wenlin Zhang, Xukui Yang, Dan Qu, Wei-Qiang Zhang et.al. 2304.10309v1 null
2023-04-20 Decouple Non-parametric Knowledge Distillation For End-to-end Speech Translation Hao Zhang, Nianwen Si, Yaqi Chen, Wenlin Zhang, Xukui Yang, Dan Qu, Zhen Li et.al. 2304.10295v1 null
2023-04-10 Enhancing Speech-to-Speech Translation with Multiple TTS Targets Jiatong Shi, Yun Tang, Ann Lee, Hirofumi Inaguma, Changhan Wang, Juan Pino, Shinji Watanabe et.al. 2304.04618v1 null
2023-04-25 Selective Data Augmentation for Robust Speech Translation Rajul Acharya, Ashish Panda, Sunil Kumar Kopparapu et.al. 2304.03169v2 null
2023-10-26 Adapting Offline Speech Translation Models for Streaming with Future-Aware Distillation and Inference Biao Fu, Minpeng Liao, Kai Fan, Zhongqiang Huang, Boxing Chen, Yidong Chen, Xiaodong Shi et.al. 2303.07914v2 link
2023-03-09 MixSpeech: Cross-Modality Self-Learning with Audio-Visual Stream Mixup for Visual Speech Translation and Recognition Xize Cheng, Linjun Li, Tao Jin, Rongjie Huang, Wang Lin, Zehan Wang, Huangdai Liu, Ye Wang, Aoxiong Yin, Zhou Zhao et.al. 2303.05309v1 link
2023-02-21 Efficient CTC Regularization via Coarse Labels for End-to-End Speech Translation Biao Zhang, Barry Haddow, Rico Sennrich et.al. 2302.10871v1 link
2023-06-05 Pre-training for Speech Translation: CTC Meets Optimal Transport Phuong-Hang Le, Hongyu Gong, Changhan Wang, Juan Pino, Benjamin Lecouteux, Didier Schwab et.al. 2301.11716v3 link
2023-01-25 A Holistic Cascade System, benchmark, and Human Evaluation Protocol for Expressive Speech-to-Speech Translation Wen-Chin Huang, Benjamin Peloquin, Justine Kao, Changhan Wang, Hongyu Gong, Elizabeth Salesky, Yossi Adi, Ann Lee, Peng-Jen Chen et.al. 2301.10606v1 null
2023-11-01 SegAugment: Maximizing the Utility of Speech Translation Data with Segmentation-based Augmentations Ioannis Tsiamas, José A. R. Fonollosa, Marta R. Costa-jussà et.al. 2212.09699v3 link
2023-07-07 WACO: Word-Aligned Contrastive Learning for Speech Translation Siqi Ouyang, Rong Ye, Lei Li et.al. 2212.09359v3 link
2022-12-17 AdaTranS: Adapting with Boundary-based Shrinking for End-to-End Speech Translation Xingshan Zeng, Liangyou Li, Qun Liu et.al. 2212.08911v1 null
2022-12-16 BLASER: A Text-Free Speech-to-Speech Translation Evaluation Metric Mingda Chen, Paul-Ambroise Duquenne, Pierre Andrews, Justine Kao, Alexandre Mourachko, Holger Schwenk, Marta R. Costa-jussà et.al. 2212.08486v1 link
2023-05-26 UnitY: Two-pass Direct Speech-to-speech Translation with Discrete Units Hirofumi Inaguma, Sravya Popuri, Ilia Kulikov, Peng-Jen Chen, Changhan Wang, Yu-An Chung, Yun Tang, Ann Lee, Shinji Watanabe, Juan Pino et.al. 2212.08055v2 link
2023-05-11 Attention as a Guide for Simultaneous Speech Translation Sara Papi, Matteo Negri, Marco Turchi et.al. 2212.07850v2 link
2022-12-12 Direct Speech-to-speech Translation without Textual Annotation using Bottleneck Features Junhui Zhang, Junjie Pan, Xiang Yin, Zejun Ma et.al. 2212.05805v1 null
2022-12-11 End-to-End Speech Translation of Arabic to English Broadcast News Fethi Bougares, Salim Jouili et.al. 2212.05479v1 null
2022-12-07 M3ST: Mix at Three Levels for Speech Translation Xuxin Cheng, Qianqian Dong, Fengpeng Yue, Tom Ko, Mingxuan Wang, Yuexian Zou et.al. 2212.03657v1 null
2022-12-04 Improving End-to-end Speech Translation by Leveraging Auxiliary Speech and Text Data Yuhao Zhang, Chen Xu, Bojie Hu, Chunliang Zhang, Tong Xiao, Jingbo Zhu et.al. 2212.01778v1 null
2022-11-22 ArzEn-ST: A Three-way Speech Translation Corpus for Code-Switched Egyptian Arabic - English Injy Hamed, Nizar Habash, Slim Abdennadher, Ngoc Thang Vu et.al. 2211.12000v1 null
2023-06-01 MT Metrics Correlate with Human Ratings of Simultaneous Speech Translation Dominik Macháček, Ondřej Bojar, Raj Dabre et.al. 2211.08633v2 link
2022-11-11 Speech-to-Speech Translation For A Real-world Unwritten Language Peng-Jen Chen, Kevin Tran, Yilin Yang, Jingfei Du, Justine Kao, Yu-An Chung, Paden Tomasello, Paul-Ambroise Duquenne, Holger Schwenk, Hongyu Gong, Hirofumi Inaguma, Sravya Popuri, Changhan Wang, Juan Pino, Wei-Ning Hsu, Ann Lee et.al. 2211.06474v1 null
2022-11-11 Align, Write, Re-order: Explainable End-to-End Speech Translation via Operation Sequence Generation Motoi Omachi, Brian Yan, Siddharth Dalmia, Yuya Fujita, Shinji Watanabe et.al. 2211.05967v1 null
2022-11-09 Efficient Speech Translation with Pre-trained Models Zhaolin Li, Jan Niehues et.al. 2211.04939v1 null
2022-11-08 SpeechMatrix: A Large-Scale Mined Corpus of Multilingual Speech-to-Speech Translations Paul-Ambroise Duquenne, Hongyu Gong, Ning Dong, Jingfei Du, Ann Lee, Vedanuj Goswani, Changhan Wang, Juan Pino, Benoît Sagot, Holger Schwenk et.al. 2211.04508v1 null
2022-10-31 Textless Direct Speech-to-Speech Translation with Discrete Speech Representation Xinjian Li, Ye Jia, Chung-Cheng Chiu et.al. 2211.00115v1 null
2022-10-31 Joint Pre-Training with Speech and Bilingual Text for Direct Speech to Speech Translation Kun Wei, Long Zhou, Ziqiang Zhang, Liping Chen, Shujie Liu, Lei He, Jinyu Li, Furu Wei et.al. 2210.17027v1 link
2023-03-14 Efficient Speech Translation with Dynamic Latent Perceivers Ioannis Tsiamas, Gerard I. Gállego, José A. R. Fonollosa, Marta R. Costa-jussà et.al. 2210.16264v2 link
2022-10-26 Improving Speech-to-Speech Translation Through Unlabeled Text Xuan-Phi Nguyen, Sravya Popuri, Changhan Wang, Yun Tang, Ilia Kulikov, Hongyu Gong et.al. 2210.14514v1 null
2022-11-24 Does Joint Training Really Help Cascaded Speech Translation? Viet Anh Khoa Tran, David Thulke, Yingbo Gao, Christian Herold, Hermann Ney et.al. 2210.13700v2 link
2023-05-20 Joint Speech Translation and Named Entity Recognition Marco Gaido, Sara Papi, Matteo Negri, Marco Turchi et.al. 2210.11987v2 link
2023-03-11 Named Entity Detection and Injection for Direct Speech Translation Marco Gaido, Yun Tang, Ilia Kulikov, Rongqing Huang, Hongyu Gong, Hirofumi Inaguma et.al. 2210.11981v2 null
2022-10-18 Simple and Effective Unsupervised Speech Translation Changhan Wang, Hirofumi Inaguma, Peng-Jen Chen, Ilia Kulikov, Yun Tang, Wei-Ning Hsu, Michael Auli, Juan Pino et.al. 2210.10191v1 null
2022-10-18 Discrete Cross-Modal Alignment Enables Zero-Shot Speech Translation Chen Wang, Yuchen Liu, Boxing Chen, Jiajun Zhang, Wei Luo, Zhongqiang Huang, Chengqing Zong et.al. 2210.09556v1 link
2022-10-16 RedApt: An Adaptor for wav2vec 2 Encoding \ Faster and Smaller Speech Translation without Quality Compromise Jinming Zhao, Hao Yang, Gholamreza Haffari, Ehsan Shareghi et.al. 2210.08475v1 null
2023-02-08 Generating Synthetic Speech from SpokenVocab for Speech Translation Jinming Zhao, Gholamreza Haffar, Ehsan Shareghi et.al. 2210.08174v2 link
2022-11-09 Code-Switching without Switching: Language Agnostic End-to-End Speech Translation Christian Huber, Enes Yavuz Ugan, Alexander Waibel et.al. 2210.01512v2 null
2023-07-25 Direct Speech Translation for Automatic Subtitling Sara Papi, Marco Gaido, Alina Karakanta, Mauro Cettolo, Matteo Negri, Marco Turchi et.al. 2209.13192v2 link
2022-08-08 A High-Quality and Large-Scale Dataset for English-Vietnamese Speech Translation Linh The Nguyen, Nguyen Luong Tran, Long Doan, Manh Luong, Dat Quoc Nguyen et.al. 2208.04243v1 link
2022-07-01 On the Impact of Noises in Crowd-Sourced Data for Speech Translation Siqi Ouyang, Rong Ye, Lei Li et.al. 2206.13756v2 link
2022-06-20 Over-Generation Cannot Be Rewarded: Length-Adaptive Average Lagging for Simultaneous Speech Translation Sara Papi, Marco Gaido, Matteo Negri, Marco Turchi et.al. 2206.05807v3 link
2022-06-14 The YiTrans End-to-End Speech Translation System for IWSLT 2022 Offline Shared Task Ziqiang Zhang, Junyi Ao, Long Zhou, Shujie Liu, Furu Wei, Jinyu Li et.al. 2206.05777v2 link
2023-03-02 TranSpeech: Speech-to-Speech Translation With Bilateral Perturbation Rongjie Huang, Jinglin Liu, Huadai Liu, Yi Ren, Lichao Zhang, Jinzheng He, Zhou Zhao et.al. 2205.12523v2 null
2022-11-04 Non-Parametric Domain Adaptation for End-to-End Speech Translation Yichao Du, Weizhi Wang, Zhirui Zhang, Boxing Chen, Tong Xu, Jun Xie, Enhong Chen et.al. 2205.11211v6 link
2022-05-18 Leveraging Pseudo-labeled Data to Improve Direct Speech-to-Speech Translation Qianqian Dong, Fengpeng Yue, Tom Ko, Mingxuan Wang, Qibing Bai, Yu Zhang et.al. 2205.08993v1 link
2022-05-14 Multiformer: A Head-Configurable Transformer-Based Model for Direct Speech Translation Gerard Sant, Gerard I. Gállego, Belen Alastruey, Marta R. Costa-Jussà et.al. 2205.07100v1 null
2022-05-13 Who Are We Talking About? Handling Person Names in Speech Translation Marco Gaido, Matteo Negri, Marco Turchi et.al. 2205.06755v1 link
2022-05-05 Efficient yet Competitive Speech Translation: FBK@IWSLT2022 Marco Gaido, Sara Papi, Dennis Fucci, Giuseppe Fiameni, Matteo Negri, Marco Turchi et.al. 2205.02629v1 link
2022-05-05 Cross-modal Contrastive Learning for Speech Translation Rong Ye, Mingxuan Wang, Lei Li et.al. 2205.02444v1 link
2022-05-04 ON-TRAC Consortium Systems for the IWSLT 2022 Dialect and Low-resource Speech Translation Tasks Marcely Zanon Boito, John Ortega, Hugo Riguidel, Antoine Laurent, Loïc Barrault, Fethi Bougares, Firas Chaabani, Ha Nguyen, Florentin Barbier, Souhir Gahbiche, Yannick Estève et.al. 2205.01987v1 null
2022-04-22 LibriS2S: A German-English Speech-to-Speech Translation Corpus Pedro Jeuris, Jan Niehues et.al. 2204.10593v1 link
2022-10-03 Exploring Continuous Integrate-and-Fire for Adaptive Simultaneous Speech Translation Chih-Chiang Chang, Hung-yi Lee et.al. 2204.09595v3 link
2022-04-19 On the Locality of Attention in Direct Speech Translation Belen Alastruey, Javier Ferrando, Gerard I. Gállego, Marta R. Costa-jussà et.al. 2204.09028v1 null
2022-04-19 Blockwise Streaming Transformer for Spoken Language Understanding and Simultaneous Speech Translation Keqi Deng, Shinji Watanabe, Jiatong Shi, Siddhant Arora et.al. 2204.08920v1 null
2022-05-11 CUNI-KIT System for Simultaneous Speech Translation Task at IWSLT 2022 Peter Polák, Ngoc-Quan Ngoc, Tuan-Nam Nguyen, Danni Liu, Carlos Mullov, Jan Niehues, Ondřej Bojar, Alexander Waibel et.al. 2204.06028v2 null
2022-04-11 Unified Speech-Text Pre-training for Speech Translation and Recognition Yun Tang, Hongyu Gong, Ning Dong, Changhan Wang, Wei-Ning Hsu, Jiatao Gu, Alexei Baevski, Xian Li, Abdelrahman Mohamed, Michael Auli, Juan Pino et.al. 2204.05409v1 null
2022-07-01 Large-Scale Streaming End-to-End Speech Translation with Neural Transducers Jian Xue, Peidong Wang, Jinyu Li, Matt Post, Yashesh Gaur et.al. 2204.05352v2 null
2022-04-11 End-to-End Speech Translation for Code Switched Speech Orion Weller, Matthias Sperber, Telmo Pires, Hendra Setiawan, Christian Gollan, Dominic Telaar, Matthias Paulik et.al. 2204.05076v1 link
2023-06-06 GigaST: A 10,000-hour Pseudo Speech Translation Corpus Rong Ye, Chengqi Zhao, Tom Ko, Chutong Meng, Tao Wang, Mingxuan Wang, Jun Cao et.al. 2204.03939v2 null
2022-11-16 Does Simultaneous Speech Translation need Simultaneous Models? Sara Papi, Marco Gaido, Matteo Negri, Marco Turchi et.al. 2204.03783v3 link
2022-09-13 Enhanced Direct Speech-to-Speech Translation Using Self-supervised Pre-training and Data Augmentation Sravya Popuri, Peng-Jen Chen, Changhan Wang, Juan Pino, Yossi Adi, Jiatao Gu, Wei-Ning Hsu, Ann Lee et.al. 2204.02967v3 null
2022-07-13 Speech Segmentation Optimization using Segmented Bilingual Speech Corpus for End-to-end Speech Translation Ryo Fukuda, Katsuhito Sudoh, Satoshi Nakamura et.al. 2203.15479v2 link
2022-03-29 Multilingual Simultaneous Speech Translation Shashank Subramanya, Jan Niehues et.al. 2203.14835v2 null
2022-06-27 Leveraging unsupervised and weakly-supervised data to improve direct speech-to-speech translation Ye Jia, Yifan Ding, Ankur Bapna, Colin Cherry, Yu Zhang, Alexis Conneau, Nobuyuki Morioka et.al. 2203.13339v2 null
2022-03-20 STEMM: Self-learning with Speech-text Manifold Mixup for Speech Translation Qingkai Fang, Rong Ye, Lei Li, Yang Feng, Mingxuan Wang et.al. 2203.10426v1 link
2022-03-18 Under the Morphosyntactic Lens: A Multifaceted Evaluation of Gender Bias in Speech Translation Beatrice Savoldi, Marco Gaido, Luisa Bentivogli, Matteo Negri, Marco Turchi et.al. 2203.09866v1 link
2022-03-16 Sample, Translate, Recombine: Leveraging Audio Alignments for Data Augmentation in End-to-end Speech Translation Tsz Kin Lam, Shigehiko Schamoni, Stefan Riezler et.al. 2203.08757v1 null
2022-03-04 Comprehension of Subtitles from Re-Translating Simultaneous Speech Translation Dávid Javorský, Dominik Macháček, Ondřej Bojar et.al. 2203.02458v1 null
2022-07-06 SHAS: Approaching optimal Segmentation for End-to-End Speech Translation Ioannis Tsiamas, Gerard I. Gállego, José A. R. Fonollosa, Marta R. Costa-jussà et.al. 2202.04774v3 link
2022-09-04 Prabhupadavani: A Code-mixed Speech Translation Data for 25 Languages Jivnesh Sandhan, Ayush Daksh, Om Adideva Paranjay, Laxmidhar Behera, Pawan Goyal et.al. 2201.11391v2 link
2022-01-26 Tackling data scarcity in speech translation using zero-shot multilingual machine translation techniques Tu Anh Dinh, Danni Liu, Jan Niehues et.al. 2201.11172v1 link
2022-06-26 CVSS Corpus and Massively Multilingual Speech-to-Speech Translation Ye Jia, Michelle Tadmor Ramanovich, Quan Wang, Heiga Zen et.al. 2201.03713v3 link
2022-05-25 Regularizing End-to-End Speech Translation with Triangular Decomposition Agreement Yichao Du, Zhirui Zhang, Weizhi Wang, Boxing Chen, Jun Xie, Tong Xu et.al. 2112.10991v2 link
2022-05-04 Textless Speech-to-Speech Translation on Real Data Ann Lee, Hongyu Gong, Paul-Ambroise Duquenne, Holger Schwenk, Peng-Jen Chen, Changhan Wang, Sravya Popuri, Yossi Adi, Juan Pino, Jiatao Gu, Wei-Ning Hsu et.al. 2112.08352v2 null
2021-11-08 Visualization: the missing factor in Simultaneous Speech Translation Sara Papi, Matteo Negri, Marco Turchi et.al. 2111.00514v2 null
2022-06-17 Decision Attentive Regularization to Improve Simultaneous Speech Translation Systems Mohd Abbas Zaidi, Beomseok Lee, Sangha Kim, Chanwoo Kim et.al. 2110.15729v2 null
2021-10-26 Assessing Evaluation Metrics for Speech-to-Speech Translation Elizabeth Salesky, Julian Mäder, Severin Klinger et.al. 2110.13877v1 null
2022-01-12 Direct Simultaneous Speech-to-Speech Translation with Variational Monotonic Multihead Attention Xutai Ma, Hongyu Gong, Danni Liu, Ann Lee, Yun Tang, Peng-Jen Chen, Wei-Ning Hsu, Phillip Koehn, Juan Pino et.al. 2110.08250v2 null
2022-07-15 From Start to Finish: Latency Reduction Strategies for Incremental Speech Synthesis in Simultaneous Speech-to-Speech Translation Danni Liu, Changhan Wang, Hongyu Gong, Xutai Ma, Yun Tang, Juan Pino et.al. 2110.08214v3 null
2021-09-27 Fast-MD: Fast Multi-Decoder End-to-End Speech Translation with Non-Autoregressive Hidden Intermediates Hirofumi Inaguma, Siddharth Dalmia, Brian Yan, Shinji Watanabe et.al. 2109.12804v1 null
2021-09-15 Is "moby dick" a Whale or a Bird? Named Entities and Terminology in Speech Translation Marco Gaido, Susana Rodríguez, Matteo Negri, Luisa Bentivogli, Marco Turchi et.al. 2109.07439v1 link
2021-09-09 Speechformer: Reducing Information Loss in Direct Speech Translation Sara Papi, Marco Gaido, Matteo Negri, Marco Turchi et.al. 2109.04574v1 link
2021-09-09 Non-autoregressive End-to-end Speech Translation with Parallel Autoregressive Rescoring Hirofumi Inaguma, Yosuke Higuchi, Kevin Duh, Tatsuya Kawahara, Shinji Watanabe et.al. 2109.04411v1 null
2021-08-09 The HW-TSC's Offline Speech Translation Systems for IWSLT 2021 Evaluation Minghan Wang, Yuxia Wang, Chang Su, Jiaxin Guo, Yingtao Zhang, Yujia Liu, Min Zhang, Shimin Tao, Xingshan Zeng, Liangyou Li, Hao Yang, Ying Qin et.al. 2108.03845v1 null
2021-07-24 The USYD-JD Speech Translation System for IWSLT 2021 Liang Ding, Di Wu, Dacheng Tao et.al. 2107.11572v1 null
2021-07-20 Simultaneous Speech Translation for Live Subtitling: from Delay to Display Alina Karakanta, Sara Papi, Matteo Negri, Marco Turchi et.al. 2107.08807v2 link
2022-05-17 Translatotron 2: High-quality direct speech-to-speech translation with voice preservation Ye Jia, Michelle Tadmor Ramanovich, Tal Remez, Roi Pomerantz et.al. 2107.08661v5 null
2021-08-14 FST: the FAIR Speech Translation System for the IWSLT21 Multilingual Shared Task Yun Tang, Hongyu Gong, Xian Li, Changhan Wang, Juan Pino, Holger Schwenk, Naman Goyal et.al. 2107.06959v2 null
2021-07-13 The IWSLT 2021 BUT Speech Translation Systems Hari Krishna Vydana, Martin Karafi'at, Luk'as Burget, "Honza" Cernock'y et.al. 2107.06155v1 null
2021-07-13 Zero-shot Speech Translation Tu Anh Dinh et.al. 2107.06010v1 null
2021-07-12 Improving Speech Translation by Understanding and Learning from the Auxiliary Text Translation Task Yun Tang, Juan Pino, Xian Li, Changhan Wang, Dmitriy Genzel et.al. 2107.05782v1 null
2022-03-21 Direct speech-to-speech translation with discrete units Ann Lee, Peng-Jen Chen, Changhan Wang, Jiatao Gu, Sravya Popuri, Xutai Ma, Adam Polyak, Yossi Adi, Qing He, Yun Tang, Juan Pino, Wei-Ning Hsu et.al. 2107.05604v2 null
2021-07-07 Efficient Transformer for Direct Speech Translation Belen Alastruey, Gerard I. Gállego, Marta R. Costa-jussà et.al. 2107.03069v1 null
2021-07-08 The NiuTrans End-to-End Speech Translation System for IWSLT 2021 Offline Task Chen Xu, Xiaoqian Liu, Xiaowen Liu, Laohu Wang, Canan Huang, Tong Xiao, Jingbo Zhu et.al. 2107.02444v2 null
2021-07-06 ESPnet-ST IWSLT 2021 Offline Speech Translation System Hirofumi Inaguma, Brian Yan, Siddharth Dalmia, Pengcheng Guo, Jiatong Shi, Kevin Duh, Shinji Watanabe et.al. 2107.00636v2 null
2021-07-09 The USTC-NELSLIP Systems for Simultaneous Speech Translation Task at IWSLT 2021 Dan Liu, Mengge Du, Xiaoxi Li, Yuchen Hu, Lirong Dai et.al. 2107.00279v2 null
2021-06-30 IMS' Systems for the IWSLT 2021 Low-Resource Speech Translation Task Pavel Denisov, Manuel Mager, Ngoc Thang Vu et.al. 2106.16055v1 null
2021-06-17 Lost in Interpreting: Speech Translation from Source or Interpreter? Dominik Macháček, Matúš Žilinec, Ondřej Bojar et.al. 2106.09343v1 null
2021-06-09 RealTranS: End-to-End Simultaneous Speech Translation with Convolutional Weighted-Shrinking Transformer Xingshan Zeng, Liangyou Li, Qun Liu et.al. 2106.04833v1 null
2021-07-12 Lightweight Adapter Tuning for Multilingual Speech Translation Hang Le, Juan Pino, Changhan Wang, Jiatao Gu, Didier Schwab, Laurent Besacier et.al. 2106.01463v2 link
2021-06-02 Cascade versus Direct Speech Translation: Do the Differences Still Make a Difference? Luisa Bentivogli, Mauro Cettolo, Marco Gaido, Alina Karakanta, Alberto Martinelli, Matteo Negri, Marco Turchi et.al. 2106.01045v1 null
2021-06-22 Multilingual Speech Translation with Unified Transformer: Huawei Noah's Ark Lab at IWSLT 2021 Xingshan Zeng, Liangyou Li, Qun Liu et.al. 2106.00197v2 null
2021-05-28 How to Split: the Effect of Word Segmentation on Gender Bias in Speech Translation Marco Gaido, Beatrice Savoldi, Luisa Bentivogli, Matteo Negri, Marco Turchi et.al. 2105.13782v1 link
2021-06-30 The Volctrans Neural Speech Translation System for IWSLT 2021 Chengqi Zhao, Zhicheng Liu, Jian Tong, Tao Wang, Mingxuan Wang, Rong Ye, Qianqian Dong, Jun Cao, Lei Li et.al. 2105.07319v2 link
2021-06-15 Stacked Acoustic-and-Textual Encoding: Integrating the Pre-trained Models into Speech Translation Encoders Chen Xu, Bojie Hu, Yanyang Li, Yuhao Zhang, shen huang, Qi Ju, Tong Xiao, Jingbo Zhu et.al. 2105.05752v2 null
2021-05-11 Investigating the Reordering Capability in CTC-based Non-Autoregressive End-to-End Speech Translation Shun-Po Chuang, Yung-Sung Chuang, Chih-Chiang Chang, Hung-yi Lee et.al. 2105.04840v1 link
2021-06-28 End-to-End Speech Translation with Pre-trained Models and Adapters: UPC at IWSLT 2021 Gerard I. Gállego, Ioannis Tsiamas, Carlos Escolano, José A. R. Fonollosa, Marta R. Costa-jussà et.al. 2105.04512v2 link
2021-07-02 AlloST: Low-resource Speech Translation without Source Transcription Yao-Fei Cheng, Hung-Shin Lee, Hsin-Min Wang et.al. 2105.00171v3 link
2021-06-14 Impact of Encoding and Segmentation Strategies on End-to-End Simultaneous Speech Translation Ha Nguyen, Yannick Estève, Laurent Besacier et.al. 2104.14470v2 null
2021-10-14 Beyond Voice Activity Detection: Hybrid Audio Segmentation for Direct Speech Translation Marco Gaido, Matteo Negri, Mauro Cettolo, Marco Turchi et.al. 2104.11710v2 null
2021-06-18 End-to-end Speech Translation via Cross-modal Progressive Training Rong Ye, Mingxuan Wang, Lei Li et.al. 2104.10380v2 link
2021-04-14 Large-Scale Self- and Semi-Supervised Learning for Speech Translation Changhan Wang, Anne Wu, Juan Pino, Alexei Baevski, Michael Auli, Alexis Conneau et.al. 2104.06678v1 null
2021-04-13 Source and Target Bidirectional Knowledge Distillation for End-to-end Speech Translation Hirofumi Inaguma, Tatsuya Kawahara, Shinji Watanabe et.al. 2104.06457v1 null
2021-04-27 BSTC: A Large-Scale Chinese-English Speech Translation Dataset Ruiqing Zhang, Xiyang Wang, Chuanqiang Zhang, Zhongjun He, Hua Wu, Zhi Li, Haifeng Wang, Ying Chen, Qinfei Li et.al. 2104.03575v4 null
2021-06-30 Towards the evaluation of automatic simultaneous speech translation from a communicative perspective Claudio Fantinuoli, Bianca Prandi et.al. 2103.08364v2 null
2021-03-04 An Empirical Study of End-to-end Simultaneous Speech Translation Decoding Strategies Ha Nguyen, Yannick Estève, Laurent Besacier et.al. 2103.03233v1 null
2021-09-14 Fused Acoustic and Text Encoding for Multimodal Bilingual Pretraining and Speech Translation Renjie Zheng, Junkun Chen, Mingbo Ma, Liang Huang et.al. 2102.05766v2 null
2021-02-02 CTC-based Compression for Direct Speech Translation Marco Gaido, Mauro Cettolo, Matteo Negri, Marco Turchi et.al. 2102.01578v1 link
2021-06-15 NeurST: Neural Speech Translation Toolkit Chengqi Zhao, Mingxuan Wang, Qianqian Dong, Rong Ye, Lei Li et.al. 2012.10018v3 link
2020-12-09 On Knowledge Distillation for Direct Speech Translation Marco Gaido, Mattia A. Di Gangi, Matteo Negri, Marco Turchi et.al. 2012.04964v1 link
2020-12-09 Breeding Gender-aware Direct Speech Translation Systems Marco Gaido, Beatrice Savoldi, Luisa Bentivogli, Matteo Negri, Marco Turchi et.al. 2012.04955v1 null
2020-11-24 Tight Integrated End-to-End Training for Cascaded Speech Translation Parnia Bahar, Tobias Bieschke, Ralf Schlüter, Hermann Ney et.al. 2011.12167v1 null
2020-11-11 Simultaneous Speech-to-Speech Translation System with Neural Incremental ASR, MT, and TTS Katsuhito Sudoh, Takatomo Kano, Sashi Novitasari, Tomoya Yanagita, Sakriani Sakti, Satoshi Nakamura et.al. 2011.04845v2 null
2020-11-03 SimulMT to SimulST: Adapting Simultaneous Text Translation to End-to-End Simultaneous Speech Translation Xutai Ma, Juan Pino, Philipp Koehn et.al. 2011.02048v1 link
2020-11-02 Dual-decoder Transformer for Joint Automatic Speech Recognition and Multilingual Speech Translation Hang Le, Juan Pino, Changhan Wang, Jiatao Gu, Didier Schwab, Laurent Besacier et.al. 2011.00747v1 link

(back to top)

Legal

Publish Date Title Authors PDF Code
2024-08-15 ArabLegalEval: A Multitask Benchmark for Assessing Arabic Legal Knowledge in Large Language Models Faris Hijazi, Somayah AlHarbi, Abdulaziz AlHussein, Harethah Abu Shairah, Reem AlZahrani, Hebah AlShamlan, Omar Knio, George Turkiyyah et.al. 2408.07983v1 link
2024-08-13 ELLA: Empowering LLMs for Interpretable, Accurate and Informative Legal Advice Yutong Hu, Kangcheng Luo, Yansong Feng et.al. 2408.07137v1 link
2024-08-08 Redefining Accountability: Navigating Legal Challenges of Participant Liability in Decentralized Autonomous Organizations Aneta Napieralska, Przemysław Kępczyński et.al. 2408.04717v1 null
2024-08-05 A Multi-Source Heterogeneous Knowledge Injected Prompt Learning Method for Legal Charge Prediction Jingyun Sun, Chi Wei, Yang Li et.al. 2408.02233v1 null
2024-08-01 DeliLaw: A Chinese Legal Counselling System Based on a Large Language Model Nan Xie, Yuelin Bai, Hengyuan Gao, Feiteng Fang, Qixuan Zhao, Zhijian Li, Ziqiang Xue, Liang Zhu, Shiwen Ni, Min Yang et.al. 2408.00357v1 null
2024-07-27 LawLLM: Law Large Language Model for the US Legal System Dong Shu, Haoran Zhao, Xukun Liu, David Demeter, Mengnan Du, Yongfeng Zhang et.al. 2407.21065v1 null
2024-07-30 A Three Steps Methodological Approach to Legal Governance Validation Pompeu Casanovas, Mustafa Hashmi, Louis de Koker, Ho-Pun Lam et.al. 2407.20691v1 null
2024-07-30 The Future of International Data Transfers: Managing Legal Risk with a User-Held Data Model Paulius Jurcys, Marcelo Corrales Compagnucci, Mark Fenwick et.al. 2407.20514v1 null
2024-07-29 Legal Aspects of Decentralized and Platform-Driven Economies Marcelo Corrales Compagnucci, Toshiyuki Kono, Shinto Teramoto et.al. 2407.20301v1 null
2024-08-09 Legal Minds, Algorithmic Decisions: How LLMs Apply Constitutional Principles in Complex Scenarios Camilla Bignotti, Carolina Camassa et.al. 2407.19760v2 null
2024-07-28 SaulLM-54B & SaulLM-141B: Scaling Up Domain Adaptation for the Legal Domain Pierre Colombo, Telmo Pires, Malik Boudiaf, Rui Melo, Dominic Culver, Sofia Morgado, Etienne Malaboeuf, Gabriel Hautreux, Johanne Charpentier, Michael Desa et.al. 2407.19584v1 null
2024-07-26 Optimizing Numerical Estimation and Operational Efficiency in the Legal Domain through Large Language Models Jia-Hong Huang, Chao-Chun Yang, Yixian Shen, Alessio M. Pacces, Evangelos Kanoulas et.al. 2407.19041v1 null
2024-07-05 Challenges and Considerations in Annotating Legal Data: A Comprehensive Overview Harshil Darji, Jelena Mitrović, Michael Granitzer et.al. 2407.17503v1 null
2024-07-23 Lawma: The Power of Specialization for Legal Tasks Ricardo Dominguez-Olmedo, Vedant Nanda, Rediet Abebe, Stefan Bechtold, Christoph Engel, Jens Frankenreiter, Krishna Gummadi, Moritz Hardt, Michael Livermore et.al. 2407.16615v1 null
2024-07-19 LeKUBE: A Legal Knowledge Update BEnchmark Changyue Wang, Weihang Su, Hu Yiran, Qingyao Ai, Yueyue Wu, Cheng Luo, Yiqun Liu, Min Zhang, Shaoping Ma et.al. 2407.14192v1 null
2024-05-28 The Cost of Arbitrariness for Individuals: Examining the Legal and Technical Challenges of Model Multiplicity Prakhar Ganesh, Ihsan Ibrahim Daldaban, Ignacio Cofone, Golnoosh Farnadi et.al. 2407.13070v1 null
2024-07-20 Applicability of Large Language Models and Generative Models for Legal Case Judgement Summarization Aniket Deroy, Kripabandhu Ghosh, Saptarshi Ghosh et.al. 2407.12848v2 null
2024-08-12 Across Platforms and Languages: Dutch Influencers and Legal Disclosures on Instagram, YouTube and TikTok Haoyang Gui, Thales Bertaglia, Catalina Goanta, Sybe de Vries, Gerasimos Spanakis et.al. 2407.12451v2 null
2024-07-07 Auditing of AI: Legal, Ethical and Technical Approaches Jakob Mokander et.al. 2407.06235v1 null
2024-07-07 IL-TUR: Benchmark for Indian Legal Text Understanding and Reasoning Abhinav Joshi, Shounak Paul, Akshat Sharma, Pawan Goyal, Saptarshi Ghosh, Ashutosh Modi et.al. 2407.05399v1 null
2024-08-06 Enabling Discriminative Reasoning in LLMs for Legal Judgment Prediction Chenlong Deng, Kelong Mao, Yuyao Zhang, Zhicheng Dou et.al. 2407.01964v4 link
2024-06-28 Learning Interpretable Legal Case Retrieval via Knowledge-Guided Case Reformulation Chenlong Deng, Kelong Mao, Zhicheng Dou et.al. 2406.19760v1 link
2024-06-27 CLERC: A Dataset for Legal Case Retrieval and Retrieval-Augmented Analysis Generation Abe Bohan Hou, Orion Weller, Guanghui Qin, Eugene Yang, Dawn Lawrie, Nils Holzenberger, Andrew Blair-Stanek, Benjamin Van Durme et.al. 2406.17186v2 link
2024-06-24 eagerlearners at SemEval2024 Task 5: The Legal Argument Reasoning Task in Civil Procedure Hoorieh Sabzevari, Mohammadmostafa Rostamkhani, Sauleh Eetemadi et.al. 2406.16490v1 link
2024-04-26 Examining the Legal Status of Digital Assets as Property: A Comparative Analysis of Jurisdictional Approaches Luke Lee et.al. 2406.15391v1 null
2024-06-21 GiusBERTo: A Legal Language Model for Personal Data De-identification in Italian Court of Auditors Decisions Giulio Salierno, Rosamaria Bertè, Luca Attias, Carla Morrone, Dario Pettazzoni, Daniela Battisti et.al. 2406.15032v1 null
2024-06-21 InternLM-Law: An Open Source Chinese Legal Large Language Model Zhiwei Fei, Songyang Zhang, Xiaoyu Shen, Dawei Zhu, Xiao Wang, Maosong Cao, Fengzhe Zhou, Yining Li, Wenwei Zhang, Dahua Lin, Kai Chen, Jidong Ge et.al. 2406.14887v1 null
2024-06-17 Enhancing Criminal Case Matching through Diverse Legal Factors Jie Zhao, Ziyu Guan, Wei Zhao, Yue Jiang et.al. 2406.11172v1 null
2024-06-16 Towards Supporting Legal Argumentation with NLP: Is More Data Really All You Need? T. Y. S. S Santosh, Kevin D. Ashley, Katie Atkinson, Matthias Grabmair et.al. 2406.10974v1 null
2024-06-15 Applications of Generative AI in Healthcare: algorithmic, ethical, legal and societal considerations Onyekachukwu R. Okonji, Kamol Yunusov, Bonnie Gordon et.al. 2406.10632v1 null
2024-06-10 The Legal Duty to Search for Less Discriminatory Algorithms Emily Black, Logan Koepke, Pauline Kim, Solon Barocas, Mingwei Hsu et.al. 2406.06817v1 null
2024-06-10 AGB-DE: A Corpus for the Automated Legal Assessment of Clauses in German Consumer Contracts Daniel Braun, Florian Matthes et.al. 2406.06809v1 link
2024-06-07 On Ambiguity and the Expressive Function of Law: The Role of Pragmatics in Smart Legal Ecosystems Pompeu Casanovas et.al. 2406.05084v1 null
2024-06-07 LawGPT: A Chinese Legal Knowledge-Enhanced Large Language Model Zhi Zhou, Jiang-Xin Shi, Peng-Xiao Song, Xiao-Wen Yang, Yi-Xuan Jin, Lan-Zhe Guo, Yu-Feng Li et.al. 2406.04614v1 link
2024-06-06 Legal Documents Drafting with Fine-Tuned Pre-Trained Large Language Model Chun-Hsien Lin, Pu-Jen Cheng et.al. 2406.04202v1 link
2024-06-06 Legal Judgment Reimagined: PredEx and the Rise of Intelligent AI Interpretation in Indian Courts Shubham Kumar Nigam, Anurag Sharma, Danush Khanna, Noel Shallum, Kripabandhu Ghosh, Arnab Bhattacharya et.al. 2406.04136v1 link
2024-06-05 Knowledge-Infused Legal Wisdom: Navigating LLM Consultation through the Lens of Diagnostics and Positive-Unlabeled Reinforcement Learning Yang Wu, Chenghao Wang, Ece Gumusel, Xiaozhong Liu et.al. 2406.03600v1 null
2024-06-30 Unveiling Themes in Judicial Proceedings: A Cross-Country Study Using Topic Modeling on Legal Documents from India and the UK Krish Didwania, Dr. Durga Toshniwal, Amit Agarwal et.al. 2406.00040v2 null
2024-05-30 Hallucination-Free? Assessing the Reliability of Leading AI Legal Research Tools Varun Magesh, Faiz Surani, Matthew Dahl, Mirac Suzgun, Christopher D. Manning, Daniel E. Ho et.al. 2405.20362v1 null
2024-05-27 Explainable machine learning multi-label classification of Spanish legal judgements Francisco de Arriba-Pérez, Silvia García-Méndez, Francisco J. González-Castaño, Jaime González-González et.al. 2405.17610v1 null
2024-05-23 Artificial Intelligence (AI) in Legal Data Mining Aniket Deroy, Naksatra Kumar Bailung, Kripabandhu Ghosh, Saptarshi Ghosh, Abhijnan Chakraborty et.al. 2405.14707v1 null
2024-05-23 ChronosLex: Time-aware Incremental Training for Temporal Generalization of Legal Classification Tasks T. Y. S. S Santosh, Tuan-Quang Vuong, Matthias Grabmair et.al. 2405.14211v1 null
2024-05-20 CaseGNN++: Graph Contrastive Learning for Legal Case Retrieval with Graph Augmentation Yanran Tang, Ruihong Qiu, Yilun Liu, Xue Li, Zi Huang et.al. 2405.11791v1 link
2024-05-17 Empowering Prior to Court Legal Analysis: A Transparent and Accessible Dataset for Defensive Statement Classification and Interpretation Yannis Spyridis, Jean-Paul, Haneen Deeb, Vasileios Argyriou et.al. 2405.10702v1 null
2024-05-16 Co-Matching: Towards Human-Machine Collaborative Legal Case Matching Chen Huang, Xinwei Yang, Yang Deng, Wenqiang Lei, JianCheng Lv, Tat-Seng Chua et.al. 2405.10248v1 null
2024-05-09 Letter to the Editor: What are the legal and ethical considerations of submitting radiology reports to ChatGPT? Siddharth Agarwal, David Wood, Robin Carpenter, Yiran Wei, Marc Modat, Thomas C Booth et.al. 2405.05647v1 null
2024-05-01 A Legal Framework for Natural Language Processing Model Training in Portugal Rúben Almeida, Evelin Amorim et.al. 2405.00536v1 null
2024-05-02 Towards A Structured Overview of Use Cases for Natural Language Processing in the Legal Domain: A German Perspective Juraj Vladika, Stephen Meisenbacher, Martina Preis, Alexandra Klymenko, Florian Matthes et.al. 2404.18759v2 null
2024-04-26 Enhancing Legal Compliance and Regulation Analysis with Large Language Models Shabnam Hassani et.al. 2404.17522v1 null
2024-04-25 Legal Aspects for Software Developers Interested in Generative AI Applications Steffen Herbold, Brian Valerius, Anamaria Mojica-Hanke, Isabella Lex, Joel Mittel et.al. 2404.16630v1 null
2024-04-22 Rethinking Legal Compliance Automation: Opportunities with Large Language Models Shabnam Hassani, Mehrdad Sabetzadeh, Daniel Amyot, Jain Liao et.al. 2404.14356v1 null
2024-04-16 BayesJudge: Bayesian Kernel Language Modelling with Confidence Uncertainty in Legal Judgment Prediction Ubaid Azam, Imran Razzak, Shelly Vishwakarma, Hakim Hacid, Dell Zhang, Shoaib Jameel et.al. 2404.10481v1 null
2024-04-15 LegalPro-BERT: Classification of Legal Provisions by fine-tuning BERT Large Language Model Amit Tewari et.al. 2404.10097v1 null
2024-04-15 Debunking Robot Rights Metaphysically, Ethically, and Legally Abeba Birhane, Jelle van Dijk, Frank Pasquale et.al. 2404.10072v1 null
2024-06-27 Software Engineering Methods For AI-Driven Deductive Legal Reasoning Rohan Padhye et.al. 2404.09868v2 null
2024-05-23 A Legal Risk Taxonomy for Generative Artificial Intelligence David Atkinson, Jacob Morrison et.al. 2404.09479v3 null
2024-04-08 Text clustering applied to data augmentation in legal contexts Lucas José Gonçalves Freitas, Thaís Rodrigues, Guilherme Rodrigues, Pamella Edokawa, Ariane Farias et.al. 2404.08683v1 null
2024-04-10 Leveraging open-source models for legal language modeling and analysis: a case study on the Indian constitution Vikhyath Gupta, Srinivasa Rao P et.al. 2404.06751v1 null
2024-04-08 Privacy and Security of Women's Reproductive Health Apps in a Changing Legal Landscape Shalini Saini, Nitesh Saxena et.al. 2404.05876v1 null
2024-04-04 CBR-RAG: Case-Based Reasoning for Retrieval Augmented Generation in LLMs for Legal Question Answering Nirmalie Wiratunga, Ramitha Abeyratne, Lasal Jayawardena, Kyle Martin, Stewart Massie, Ikechukwu Nkisi-Orji, Ruvan Weerasinghe, Anne Liret, Bruno Fleisch et.al. 2404.04302v1 link
2024-04-04 NLP at UC Santa Cruz at SemEval-2024 Task 5: Legal Answer Validation using Few-Shot Multi-Choice QA Anish Pahilajani, Samyak Rajesh Jain, Devasha Trivedi et.al. 2404.03150v1 link
2024-05-03 Automated Transparency: A Legal and Empirical Analysis of the Digital Services Act Transparency Database Rishabh Kaushal, Jacob van de Kerkhof, Catalina Goanta, Gerasimos Spanakis, Adriana Iamnitchi et.al. 2404.02894v2 null
2024-04-02 FLawN-T5: An Empirical Examination of Effective Instruction-Tuning Data Mixtures for Legal Reasoning Joel Niklaus, Lucia Zheng, Arya D. McCarthy, Christopher Hahn, Brian M. Rosen, Peter Henderson, Daniel E. Ho, Garrett Honke, Percy Liang, Christopher Manning et.al. 2404.02127v1 link
2024-03-31 Mind Your Neighbours: Leveraging Analogous Instances for Rhetorical Role Labeling for Legal Documents T. Y. S. S Santosh, Hassan Sarwat, Ahmed Abdou, Matthias Grabmair et.al. 2404.01344v1 null
2024-04-01 Exploring the Nexus of Large Language Models and Legal Systems: A Short Survey Weicong Qin, Zhongxiang Sun et.al. 2404.00990v1 null
2024-04-01 Towards an In-Depth Comprehension of Case Relevance for Better Legal Retrieval Haitao Li, You Chen, Zhekai Ge, Qingyao Ai, Yiqun Liu, Quan Zhou, Shuai Huo et.al. 2404.00947v1 null
2024-03-31 Query-driven Relevant Paragraph Extraction from Legal Judgments T. Y. S. S Santosh, Elvin Quero Hernandez, Matthias Grabmair et.al. 2404.00595v1 null
2024-03-31 LexAbSumm: Aspect-based Summarization of Legal Decisions T. Y. S. S Santosh, Mahmoud Aly, Matthias Grabmair et.al. 2404.00594v1 null
2024-03-30 Automatic explanation of the classification of Spanish legal judgments in jurisdiction-dependent law categories with tree estimators Jaime González-González, Francisco de Arriba-Pérez, Silvia García-Méndez, Andrea Busto-Castiñeira, Francisco J. González-Castaño et.al. 2404.00437v1 null
2024-03-28 Beyond Borders: Investigating Cross-Jurisdiction Transfer in Legal Case Summarization T. Y. S. S Santosh, Vatsal Venkatkrishna, Saptarshi Ghosh, Matthias Grabmair et.al. 2403.19317v1 null
2024-03-27 High Recall, Small Data: The Challenges of Within-System Evaluation in a Live Legal Search System Gineke Wiggers, Suzan Verberne, Arjen de Vries, Roel van der Burg et.al. 2403.18962v1 null
2024-03-27 A Path Towards Legal Autonomy: An interoperable and explainable approach to extracting, transforming, loading and computing legal information using large language models, expert systems and Bayesian networks Axel Constant, Hannes Westermann, Bryan Wilson, Alex Kiefer, Ines Hipolito, Sylvain Pronovost, Steven Swanson, Mahault Albarracin, Maxwell J. D. Ramstead et.al. 2403.18537v1 null
2024-03-27 DELTA: Pre-train a Discriminative Encoder for Legal Case Retrieval via Structural Word Alignment Haitao Li, Qingyao Ai, Xinyan Han, Jia Chen, Qian Dong, Yiqun Liu, Chong Chen, Qi Tian et.al. 2403.18435v1 null
2024-03-27 Leveraging Large Language Models for Relevance Judgments in Legal Case Retrieval Shengjie Ma, Chong Chen, Qi Chu, Jiaxin Mao et.al. 2403.18405v1 null
2024-03-26 Juru: Legal Brazilian Large Language Model from Reputable Sources Roseval Malaquias Junior, Ramon Pires, Roseli Romero, Rodrigo Nogueira et.al. 2403.18140v1 null
2024-03-26 GPTs and Language Barrier: A Cross-Lingual Legal QA Examination Ha-Thanh Nguyen, Hiroaki Yamada, Ken Satoh et.al. 2403.18098v1 null
2024-03-26 Enhancing Legal Document Retrieval: A Multi-Phase Approach with Large Language Models Hai-Long Nguyen, Duc-Minh Nguyen, Tan-Minh Nguyen, Ha-Thanh Nguyen, Thi-Hai-Yen Vuong, Ken Satoh et.al. 2403.18093v1 null
2024-06-12 CaseLink: Inductive Graph Learning for Legal Case Retrieval Yanran Tang, Ruihong Qiu, Hongzhi Yin, Xue Li, Zi Huang et.al. 2403.17780v3 link
2024-04-16 Towards Explainability in Legal Outcome Prediction Models Josef Valvoda, Ryan Cotterell et.al. 2403.16852v2 link
2024-03-22 "The Law Doesn't Work Like a Computer": Exploring Software Licensing Issues Faced by Legal Practitioners Nathan Wintersgill, Trevor Stalnaker, Laura A. Heymann, Oscar Chaparro, Denys Poshyvanyk et.al. 2403.14927v1 link
2024-03-20 PARAMANU-AYN: An Efficient Novel Generative and Instruction-tuned Language Model for Indian Legal Case Documents Mitodru Niyogi, Arnab Bhattacharya et.al. 2403.13681v1 null
2024-03-20 Improving Legal Case Retrieval with Brain Signals Ruizhe Zhang, Qingyao Ai, Ziyi Ye, Yueyue Wu, Xiaohui Xie, Yiqun Liu et.al. 2403.13242v1 null
2024-07-02 Towards Unsupervised Question Answering System with Multi-level Summarization for Legal Text M Manvith Prabhu, Haricharana Srinivasa, Anand Kumar M et.al. 2403.13107v2 null
2024-03-17 Evaluation Ethics of LLMs in Legal Domain Ruizhe Zhang, Haitao Li, Yueyue Wu, Qingyao Ai, Yiqun Liu, Min Zhang, Shaoping Ma et.al. 2403.11152v1 null
2024-03-16 Human Centered AI for Indian Legal Text Analytics Sudipto Ghosh, Devanshu Verma, Balaji Ganesan, Purnima Bindal, Vikas Kumar, Vasudha Bhatnagar et.al. 2403.10944v1 null
2024-03-14 Caveat Lector: Large Language Models in Legal Practice Eliza Mik et.al. 2403.09163v1 null
2024-05-08 Legally Binding but Unfair? Towards Assessing Fairness of Privacy Policies Vincent Freiberger, Erik Buchmann et.al. 2403.08115v2 null
2024-03-11 Exploring Large Language Models and Hierarchical Frameworks for Classification of Large Unstructured Legal Documents Nishchal Prasad, Mohand Boughanem, Taoufiq Dkaki et.al. 2403.06872v1 link
2024-03-06 VLSP 2023 -- LTER: A Summary of the Challenge on Legal Textual Entailment Recognition Vu Tran, Ha-Thanh Nguyen, Trung Vo, Son T. Luu, Hoang-Anh Dang, Ngoc-Cam Le, Thi-Thuy Le, Minh-Tien Nguyen, Truong-Son Nguyen, Le-Minh Nguyen et.al. 2403.03435v1 null
2024-03-03 Logic Rules as Explanations for Legal Case Retrieval Zhongxiang Sun, Kepu Zhang, Weijie Yu, Haoyu Wang, Jun Xu et.al. 2403.01457v1 link
2024-03-08 Evault for legal records Jeba N, Anas S, Anuragav S, Abhishek R, Sachin K et.al. 2403.01186v2 null
2024-02-25 Gender Biased Legal Case Retrieval System on Users' Decision Process Ruizhe Zhang, Qingyao Ai, Yiqun Liu, Yueyue Wu, Beining Wang et.al. 2403.00814v1 null
2024-06-14 EUROPA: A Legal Multilingual Keyphrase Generation Dataset Olivier Salaün, Frédéric Piedboeuf, Guillaume Le Berre, David Alfonso Hermelo, Philippe Langlais et.al. 2403.00252v2 link
2024-03-04 Improving Legal Judgement Prediction in Romanian with Long Text Encoders Mihai Masala, Traian Rebedea, Horia Velicu et.al. 2402.19170v2 null
2024-07-02 Leveraging Large Language Models for Learning Complex Legal Concepts through Storytelling Hang Jiang, Xiajie Zhang, Robert Mahari, Daniel Kessler, Eric Ma, Tal August, Irene Li, Alex 'Sandy' Pentland, Yoon Kim, Deb Roy, Jad Kabbara et.al. 2402.17019v4 link
2024-06-17 **InSaAF: Incorporating Safety through Accuracy and Fairness Are LLMs ready for the Indian Legal Domain?** Yogesh Tripathi, Raghav Donakanti, Sahil Girhepuje, Ishan Kavathekar, Bhaskara Hanuma Vedula, Gokul S Krishnan, Shreya Goyal, Anmol Goel, Balaraman Ravindran, Ponnurangam Kumaraguru et.al. 2402.10567v4
2024-02-12 Large Language Models "Ad Referendum": How Good Are They at Machine Translation in the Legal Domain? Vicent Briva-Iglesias, Joao Lucas Cavalheiro Camargo, Gokhan Dogru et.al. 2402.07681v1 null
2024-06-06 Through the Lens of Split Vote: Exploring Disagreement, Difficulty and Calibration in Legal Case Outcome Classification Shanshan Xu, T. Y. S. S Santosh, Oana Ichim, Barbara Plank, Matthias Grabmair et.al. 2402.07214v3 null
2024-02-06 LegalLens: Leveraging LLMs for Legal Violation Identification in Unstructured Text Dor Bernsohn, Gil Semo, Yaron Vazana, Gila Hayat, Ben Hagag, Joel Niklaus, Rohit Saha, Kyryl Truskovskyi et.al. 2402.04335v1 link
2024-02-29 Advancing Legal Reasoning: The Integration of AI to Navigate Complexities and Biases in Global Jurisprudence with Semi-Automated Arbitration Processes (SAAPs) Michael De'Shazer et.al. 2402.04140v3 null
2024-05-03 (A)I Am Not a Lawyer, But...: Engaging Legal Experts towards Responsible LLM Policies for Legal Advice Inyoung Cheong, King Xia, K. J. Kevin Feng, Quan Ze Chen, Amy X. Zhang et.al. 2402.01864v2 null
2024-01-30 Aalap: AI Assistant for Legal & Paralegal Functions in India Aman Tiwari, Prathamesh Kalamkar, Atreyo Banerjee, Saurabh Karn, Varun Hemachandran, Smita Gupta et.al. 2402.01758v1 null
2024-01-18 Legal and ethical implications of applications based on agreement technologies: the case of auction-based road intersections José-Antonio Santos, Alberto Fernández, Mar Moreno-Rebato, Holger Billhardt, José-A. Rodríguez-García, Sascha Ossowski et.al. 2402.01673v1 null
2024-01-10 Promises and pitfalls of artificial intelligence for legal applications Sayash Kapoor, Peter Henderson, Arvind Narayanan et.al. 2402.01656v1 null
2024-01-31 Employing Label Models on ChatGPT Answers Improves Legal Text Entailment Performance Chau Nguyen, Le-Minh Nguyen et.al. 2401.17897v1 null
2024-04-13 PILOT: Legal Case Outcome Prediction with Case Law Lang Cao, Zifeng Wang, Cao Xiao, Jimeng Sun et.al. 2401.15770v3 null
2024-02-28 LegalDuet: Learning Effective Representations for Legal Judgment Prediction through a Dual-View Legal Clue Reasoning Pengjie Liu, Zhenghao Liu, Xiaoyuan Yi, Liner Yang, Shuo Wang, Yu Gu, Ge Yu, Xing Xie, Shuang-hua Yang et.al. 2401.15371v2 null
2024-01-26 A Korean Legal Judgment Prediction Dataset for Insurance Disputes Alice Saebom Kwak, Cheonkam Jeong, Ji Weon Lim, Byeongcheol Min et.al. 2401.14654v1 null
2024-01-25 Automated legal reasoning with discretion to act using s(LAW) Joaquín Arias, Mar Moreno-Rebato, José A. Rodríguez-García, Sascha Ossowski et.al. 2401.14511v1 null
2024-01-22 Streamlining Advanced Taxi Assignment Strategies based on Legal Analysis Holger Billhardt, José-Antonio Santos, Alberto Fernández, Mar Moreno, Sascha Ossowski, José A. Rodríguez et.al. 2401.12324v1 null
2024-01-22 The Right Model for the Job: An Evaluation of Legal Multi-Label Classification Baselines Martina Forster, Claudia Schulz, Prudhvi Nokku, Melicaalsadat Mirsafian, Jaykumar Kasundra, Stavroula Skylaki et.al. 2401.11852v1 null
2024-01-09 Answer Retrieval in Legal Community Question Answering Arian Askari, Zihui Yang, Zhaochun Ren, Suzan Verberne et.al. 2401.04852v1 link
2024-01-07 CAPTAIN at COLIEE 2023: Efficient Methods for Legal Information Retrieval and Entailment Tasks Chau Nguyen, Phuong Nguyen, Thanh Tran, Dat Nguyen, An Trieu, Tin Pham, Anh Dang, Le-Minh Nguyen et.al. 2401.03551v1 link
2024-06-21 Large Legal Fictions: Profiling Legal Hallucinations in Large Language Models Matthew Dahl, Varun Magesh, Mirac Suzgun, Daniel E. Ho et.al. 2401.01301v2 link
2024-01-02 Discovering Significant Topics from Legal Decisions with Selective Inference Jerrold Soh et.al. 2401.01068v1 null
2023-12-31 Viz: A QLoRA-based Copyright Marketplace for Legally Compliant Generative AI Dipankar Sarkar et.al. 2401.00503v1 null
2023-12-19 CaseGNN: Graph Neural Networks for Legal Case Retrieval with Text-Attributed Graphs Yanran Tang, Ruihong Qiu, Yilun Liu, Xue Li, Zi Huang et.al. 2312.11229v2 link
2024-04-02 Social, Legal, Ethical, Empathetic, and Cultural Rules: Compilation and Reasoning (Extended Version) Nicolas Troquard, Martina De Sanctis, Paola Inverardi, Patrizio Pelliccione, Gian Luca Scoccia et.al. 2312.09699v2 null
2024-04-15 Explicitly Integrating Judgment Prediction with Legal Document Retrieval: A Law-Guided Generative Approach Weicong Qin, Zelin Cao, Weijie Yu, Zihua Si, Sirui Chen, Jun Xu et.al. 2312.09591v2 link
2023-12-14 Weaving Pathways for Justice with GPT: LLM-driven automated drafting of interactive legal applications Quinten Steenhuis, David Colarusso, Bryce Willey et.al. 2312.09198v1 link
2023-12-13 SLJP: Semantic Extraction based Legal Judgment Prediction Prameela Madambakam, Shathanaa Rajmohan, Himangshu Sharma, Tummepalli Anka Chandrahas Purushotham Gupta et.al. 2312.07979v1 null
2023-12-10 Multi-Defendant Legal Judgment Prediction via Hierarchical Reasoning Yougang Lyu, Jitai Hao, Zihan Wang, Kai Zhao, Shen Gao, Pengjie Ren, Zhumin Chen, Fang Wang, Zhaochun Ren et.al. 2312.05762v1 link
2023-12-06 Boosting legal case retrieval by query content selection with large language models Youchao Zhou, Heyan Huang, Zhijing Wu et.al. 2312.03494v1 link
2023-12-03 Towards Mitigating Perceived Unfairness in Contracts from a Non-Legal Stakeholder's Perspective Anmol Singhal, Preethu Rose Anish, Shirish Karande, Smita Ghaisas et.al. 2312.01398v1 null
2023-12-01 The Ethics of Automating Legal Actors Josef Valvoda, Alec Thompson, Ryan Cotterell, Simone Teufel et.al. 2312.00584v1 null
2023-12-01 Questioning Biases in Case Judgment Summaries: Legal Datasets or Large Language Models? Aniket Deroy, Subhankar Maity et.al. 2312.00554v1 null
2024-06-13 Japanese Tort-case Dataset for Rationale-supported Legal Judgment Prediction Hiroaki Yamada, Takenobu Tokunaga, Ryutaro Ohara, Akira Tokutsu, Keisuke Takeshita, Mihoko Sumida et.al. 2312.00480v2 null
2023-11-27 Justifiable Artificial Intelligence: Engineering Large Language Models for Legal Applications Sabine Wehnert et.al. 2311.15716v1 null
2024-02-17 Legal Requirements Analysis Sallam Abualhaija, Marcello Ceci, Lionel Briand et.al. 2311.13871v3 null
2023-11-22 Intention and Context Elicitation with Large Language Models in the Legal Aid Intake Process Nick Goodson, Rongfei Lu et.al. 2311.13281v1 null
2023-11-22 Enhancing Logical Reasoning in Large Language Models to Facilitate Legal Applications Ha-Thanh Nguyen, Wachara Fungwacharakorn, Ken Satoh et.al. 2311.13095v1 null
2023-11-21 Development of a Legal Document AI-Chatbot Pranav Nataraj Devaraj, Rakesh Teja P V, Aaryav Gangrade, Manoj Kumar R et.al. 2311.12719v1 null
2023-11-20 Multi-Task Faces (MTF) Data Set: A Legally and Ethically Compliant Collection of Face Images for Various Classification Tasks Rami Haffar, David Sánchez, Josep Domingo-Ferrer et.al. 2311.11882v1 link
2023-10-19 Proceedings of the 3rd International Workshop on Mining and Learning in the Legal Domain (MLLD-23) Masoud Makrehchi, Dell Zhang, Alina Petrova, John Armour et.al. 2311.10733v1 null
2024-02-28 BLT: Can Large Language Models Handle Basic Legal Text? Andrew Blair-Stanek, Nils Holzenberger, Benjamin Van Durme et.al. 2311.09693v2 link
2023-11-15 Explainable Text Classification Techniques in Legal Document Review: Locating Rationales without Using Human Annotated Training Text Snippets Christian Mahoney, Peter Gronvall, Nathaniel Huber-Fliflet, Jianping Zhang et.al. 2311.09133v1 null
2023-11-15 Large Language Models are legal but they are not: Making the case for a powerful LegalLLM Thanmay Jayakumar, Fauzan Farooqui, Luqman Farooqui et.al. 2311.08890v1 null
2023-11-14 Exploring Semi-supervised Hierarchical Stacked Encoder for Legal Judgement Prediction Nishchal Prasad, Mohand Boughanem, Taoufiq Dkaki et.al. 2311.08103v1 link
2024-03-02 Translating Legalese: Enhancing Public Understanding of Court Opinions with Legal Summarizers Elliott Ash, Aniket Kesari, Suresh Naidu, Lena Song, Dominik Stammbach et.al. 2311.06534v2 null
2023-11-10 Citation Recommendation on Scholarly Legal Articles Doğukan Arslan, Saadet Sena Erdoğan, Gülşen Eryiğit et.al. 2311.05902v1 link
2023-11-09 Legal-HNet: Mixing Legal Long-Context Tokens with Hartley Transform Daniele Giofré, Sneha Ghantasala et.al. 2311.05089v1 null
2023-11-01 From Text to Structure: Using Large Language Models to Support the Development of Legal Expert Systems Samyar Janatian, Hannes Westermann, Jinzhe Tan, Jaromir Savelka, Karim Benyekhlef et.al. 2311.04911v1 link
2024-02-05 An energy-based comparative analysis of common approaches to text classification in the Legal domain Sinan Gultekin, Achille Globo, Andrea Zugarini, Marco Ernandes, Leonardo Rigutini et.al. 2311.01256v2 null
2024-01-02 Caseformer: Pre-training for Legal Case Retrieval Based on Inter-Case Distinctions Weihang Su, Qingyao Ai, Yueyue Wu, Yixiao Ma, Haitao Li, Yiqun Liu, Zhijing Wu, Min Zhang et.al. 2311.00333v2 link
2023-10-28 Using Large Language Models to Support Thematic Analysis in Empirical Legal Studies Jakub Drápal, Hannes Westermann, Jaromir Savelka et.al. 2310.18729v1 null
2023-10-28 MILDSum: A Novel Benchmark Dataset for Multilingual Summarization of Indian Legal Case Judgments Debtanu Datta, Shubham Soni, Rajdeep Mukherjee, Saptarshi Ghosh et.al. 2310.18600v1 link
2023-10-27 Modeling Legal Reasoning: LM Annotation at the Edge of Human Agreement Rosamond Thalken, Edward H. Stiglitz, David Mimno, Matthew Wilkens et.al. 2310.18440v1 link
2023-10-26 LeCaRDv2: A Large-Scale Chinese Legal Case Retrieval Dataset Haitao Li, Yunqiu Shao, Yueyue Wu, Qingyao Ai, Yixiao Ma, Yiqun Liu et.al. 2310.17609v1 null
2023-10-26 Harnessing GPT-3.5-turbo for Rhetorical Role Prediction in Legal Cases Anas Belfathi, Nicolas Hernandez, Laura Monceaux et.al. 2310.17413v1 null
2023-10-25 Human-centred explanation of rule-based decision-making systems in the legal domain Suzan Zuurmond, AnneMarie Borg, Matthijs van Kempen, Remi Wieten et.al. 2310.16704v1 null
2023-10-24 DALE: Generative Data Augmentation for Low-Resource Legal NLP Sreyan Ghosh, Chandra Kiran Evuru, Sonal Kumar, S Ramaneswaran, S Sakshi, Utkarsh Tyagi, Dinesh Manocha et.al. 2310.15799v1 link
2023-10-24 Navigating ICT In-House Procurement in Finland: Evaluating Legal Frameworks and Practical Challenges Reetta Ghezzi, Minnamaria Korhonen, Hannu Vilpponen, Tommi Mikkonen et.al. 2310.15643v1 null
2023-11-03 Can ChatGPT Perform Reasoning Using the IRAC Method in Analyzing Legal Scenarios Like a Lawyer? Xiaoxi Kang, Lizhen Qu, Lay-Ki Soon, Adnan Trakic, Terry Yue Zhuo, Patrick Charles Emerton, Genevieve Grant et.al. 2310.14880v2 link
2023-10-19 Do Language Models Learn about Legal Entity Types during Pretraining? Claire Barale, Michael Rovatsos, Nehal Bhuta et.al. 2310.13092v1 link
2023-10-19 Exploring Graph Neural Networks for Indian Legal Judgment Prediction Mann Khatri, Mirza Yusuf, Yaman Kumar, Rajiv Ratn Shah, Ponnurangam Kumaraguru et.al. 2310.12800v1 null
2023-10-19 Transformer-based Entity Legal Form Classification Alexander Arimond, Mauro Molteni, Dominik Jany, Zornitsa Manolova, Damian Borth, Andreas G. F. Hoepner et.al. 2310.12766v1 link
2023-10-18 Automated Attribute Extraction from Legal Proceedings Subinay Adhikary, Sagnik Das, Sagnik Saha, Procheta Sen, Dwaipayan Roy, Kripabandhu Ghosh et.al. 2310.12131v1 null
2023-10-18 A Comprehensive Evaluation of Large Language Models on Legal Judgment Prediction Ruihao Shui, Yixin Cao, Xiang Wang, Tat-Seng Chua et.al. 2310.11761v1 link
2023-10-17 Nonet at SemEval-2023 Task 6: Methodologies for Legal Evaluation Shubham Kumar Nigam, Aniket Deroy, Noel Shallum, Ayush Kumar Mishra, Anup Roy, Shubham Kumar Mishra, Arnab Bhattacharya, Saptarshi Ghosh, Kripabandhu Ghosh et.al. 2310.11049v1 link
2023-10-25 Legal NLP Meets MiCAR: Advancing the Analysis of Crypto White Papers Carolina Camassa et.al. 2310.10333v3 null
2023-10-16 Prediction of Arabic Legal Rulings using Large Language Models Adel Ammar, Anis Koubaa, Bilel Benjdira, Omar Najar, Serry Sibaee et.al. 2310.10260v1 null
2023-10-15 Improving Access to Justice for the Indian Population: A Benchmark for Evaluating Translation of Legal Text to Indian Languages Sayan Mahapatra, Debtanu Datta, Shubham Soni, Adrijit Goswami, Saptarshi Ghosh et.al. 2310.09765v1 null
2023-10-13 Precedent-Enhanced Legal Judgment Prediction with LLM and Domain-Model Collaboration Yiquan Wu, Siying Zhou, Yifei Liu, Weiming Lu, Xiaozhong Liu, Yating Zhang, Changlong Sun, Fei Wu, Kun Kuang et.al. 2310.09241v1 null
2023-10-11 Empirical Analysis of the Impact of Legal Tender Digital Currency on Monetary Policy -Based on China's Data Ruimin Song, TIntian Zhao, Chunhui Zhou et.al. 2310.07326v1 null
2023-10-12 Automated Argument Generation from Legal Facts Oscar Tuvey, Procheta Sen et.al. 2310.05680v3 null
2024-02-18 LAiW: A Chinese Legal Large Language Models Benchmark Yongfu Dai, Duanyu Feng, Jimin Huang, Haochen Jia, Qianqian Xie, Yifang Zhang, Weiguang Han, Wei Tian, Hao Wang et.al. 2310.05620v2 link
2023-10-08 Enhancing Pre-Trained Language Models with Sentence Position Embeddings for Rhetorical Roles Recognition in Legal Opinions Anas Belfathi, Nicolas Hernandez, Laura Monceaux et.al. 2310.05276v1 null
2023-10-07 Investigating the Influence of Legal Case Retrieval Systems on Users' Decision Process Beining Wang, Ruizhe Zhang, Yueyue Wu, Qingyao Ai, Min Zhang, Yiqun Liu et.al. 2310.04735v1 null
2023-10-06 Marketing to Children Through Online Targeted Advertising: Targeting Mechanisms and Legal Aspects Tinhinane Medjkoune, Oana Goga, Juliette Senechal et.al. 2310.04104v1 null
2023-10-10 LEEC: A Legal Element Extraction Dataset with an Extensive Domain-Specific Label System Xue Zongyue, Liu Huanghai, Hu Yiran, Kong Kangle, Wang Chenlu, Liu Yun, Shen Weixing et.al. 2310.01271v2 link
2023-10-02 Comparative Analysis of Technical and Legal Frameworks of Various National Digial Identity Solutions Montassar Naghmouchi, Maryline Laurent, Claire Levallois-Barth, Nesrine Kaaniche et.al. 2310.01006v1 null
2023-09-29 STRONG -- Structure Controllable Legal Opinion Summary Generation Yang Zhong, Diane Litman et.al. 2309.17280v1 link
2023-09-29 Interpretable Long-Form Legal Question Answering with Retrieval-Augmented Large Language Models Antoine Louis, Gijs van Dijck, Gerasimos Spanakis et.al. 2309.17050v1 link
2023-09-28 LawBench: Benchmarking Legal Knowledge of Large Language Models Zhiwei Fei, Xiaoyu Shen, Dawei Zhu, Fengzhe Zhou, Zhuo Han, Songyang Zhang, Kai Chen, Zongwen Shen, Jidong Ge et.al. 2309.16289v1 link
2023-12-18 Question-Answering Approach to Evaluating Legal Summaries Huihui Xu, Kevin Ashley et.al. 2309.15016v2 link
2023-10-16 Legal Question-Answering in the Indian Context: Efficacy, Challenges, and Potential of Modern AI Models Shubham Kumar Nigam, Shubham Kumar Mishra, Ayush Kumar Mishra, Noel Shallum, Arnab Bhattacharya et.al. 2309.14735v2 null
2024-01-01 The Cambridge Law Corpus: A Dataset for Legal AI Research Andreas Östling, Holli Sargeant, Huiyuan Xie, Ludwig Bull, Alexander Terenin, Leif Jonsson, Måns Magnusson, Felix Steffek et.al. 2309.12269v4 null
2023-10-13 Legitimate Interest is the New Consent -- Large-Scale Measurement and Legal Compliance of IAB Europe TCF Paywalls Victor Morel, Cristiana Santos, Viktor Fredholm, Adam Thunberg et.al. 2309.11625v3 null
2023-09-23 DISC-LawLLM: Fine-tuning Large Language Models for Intelligent Legal Services Shengbin Yue, Wei Chen, Siyuan Wang, Bingxuan Li, Chenchen Shen, Shujun Liu, Yuxuan Zhou, Yao Xiao, Song Yun, Xuanjing Huang, Zhongyu Wei et.al. 2309.11325v2 link
2023-09-25 A Hierarchical Neural Framework for Classification and its Explanation in Large Unstructured Legal Documents Nishchal Prasad, Mohand Boughanem, Taoufik Dkaki et.al. 2309.10563v2 null
2023-09-16 NOWJ1@ALQAC 2023: Enhancing Legal Task Performance with Classic Statistical Models and Pre-trained Language Models Tan-Minh Nguyen, Xuan-Hoa Nguyen, Ngoc-Duy Mai, Minh-Quan Hoang, Van-Huan Nguyen, Hoang-Viet Nguyen, Ha-Thanh Nguyen, Thi-Hai-Yen Vuong et.al. 2309.09070v1 null
2023-09-16 Constructing a Knowledge Graph for Vietnamese Legal Cases with Heterogeneous Graphs Thi-Hai-Yen Vuong, Minh-Quan Hoang, Tan-Minh Nguyen, Hoang-Trung Nguyen, Ha-Thanh Nguyen et.al. 2309.09069v1 null
2023-09-15 Resolving Legalese: A Multilingual Exploration of Negation Scope Resolution in Legal Documents Ramona Christen, Anastassia Shaitarova, Matthias Stürmer, Joel Niklaus et.al. 2309.08695v1 link
2023-09-15 Encoded Summarization: Summarizing Documents into Continuous Vector Space for Legal Case Retrieval Vu Tran, Minh Le Nguyen, Satoshi Tojo, Ken Satoh et.al. 2309.08187v1 null
2023-12-21 FedJudge: Federated Legal Large Language Model Linan Yue, Qi Liu, Yichao Du, Weibo Gao, Ye Liu, Fangzhou Yao et.al. 2309.08173v2 link
2023-08-11 India's Progress in Space Exploration and International Legal Challenges in Meeting Goals within International Space Boundaries: A Review Jayanthi Vajiram, Utkarsh Maurya, Negha Senthil et.al. 2309.06560v1 null
2023-09-11 Black-Box Analysis: GPTs Across Time in Legal Textual Entailment Task Ha-Thanh Nguyen, Randy Goebel, Francesca Toni, Kostas Stathis, Ken Satoh et.al. 2309.05501v1 null
2023-09-11 NeCo@ALQAC 2023: Legal Domain Knowledge Acquisition for Low-Resource Languages through Data Enrichment Hai-Long Nguyen, Dieu-Quynh Nguyen, Hoang-Trung Nguyen, Thu-Trang Pham, Huu-Dong Nguyen, Thach-Anh Nguyen, Thi-Hai-Yen Vuong, Ha-Thanh Nguyen et.al. 2309.05500v1 null
2024-02-05 NESTLE: a No-Code Tool for Statistical Analysis of Legal Corpus Kyoungyeon Cho, Seungkum Han, Young Rok Choi, Wonseok Hwang et.al. 2309.04146v2 null
2023-09-06 Prompt-based Effective Input Reformulation for Legal Case Retrieval Yanran Tang, Ruihong Qiu, Xue Li et.al. 2309.02962v1 link
2023-09-01 ALJP: An Arabic Legal Judgment Prediction in Personal Status Cases Using Machine Learning Models Salwa Abbara, Mona Hafez, Aya Kazzaz, Areej Alhothali, Alhanouf Alsolami et.al. 2309.00238v1 null
2023-09-05 Is the U.S. Legal System Ready for AI's Challenges to Human Values? Inyoung Cheong, Aylin Caliskan, Tadayoshi Kohno et.al. 2308.15906v3 null
2023-08-20 LegalBench: A Collaboratively Built Benchmark for Measuring Legal Reasoning in Large Language Models Neel Guha, Julian Nyarko, Daniel E. Ho, Christopher Ré, Adam Chilton, Aditya Narayana, Alex Chohlas-Wood, Austin Peters, Brandon Waldon, Daniel N. Rockmore, Diego Zambrano, Dmitry Talisman, Enam Hoque, Faiz Surani, Frank Fagan, Galit Sarfaty, Gregory M. Dickinson, Haggai Porat, Jason Hegland, Jessica Wu, Joe Nudell, Joel Niklaus, John Nay, Jonathan H. Choi, Kevin Tobia, Margaret Hagan, Megan Ma, Michael Livermore, Nikon Rasumov-Rahe, Nils Holzenberger, Noam Kolt, Peter Henderson, Sean Rehaag, Sharad Goel, Shang Gao, Spencer Williams, Sunny Gandhi, Tom Zur, Varun Iyer, Zehua Li et.al. 2308.11462v1 link
2023-08-08 SILO Language Models: Isolating Legal Risk In a Nonparametric Datastore Sewon Min, Suchin Gururangan, Eric Wallace, Hannaneh Hajishirzi, Noah A. Smith, Luke Zettlemoyer et.al. 2308.04430v1 link
2023-08-04 Legal Summarisation through LLMs: The PRODIGIT Project Thiago Dal Pont, Federico Galli, Andrea Loreggia, Giuseppe Pisano, Riccardo Rovatti, Giovanni Sartor et.al. 2308.04416v1 null
2023-08-08 Large Language Model Prompt Chaining for Long Legal Document Classification Dietrich Trautmann et.al. 2308.04138v1 null
2023-08-02 Exploring the psychology of GPT-4's Moral and Legal Reasoning Guilherme F. C. F. Almeida, José Luiz Nunes, Neele Engelmann, Alex Wiegmann, Marcelo de Araújo et.al. 2308.01264v1 null
2023-07-31 Adversarially Robust Neural Legal Judgement Systems Rohit Raj, V Susheela Devi et.al. 2308.00165v1 null
2023-07-27 Exploration of legal implications of air and space travel for international and domestic travel and the Environment Jayanthi Vajiram, Negha Senthil, Nean Adhith. P, Ritikaa. VN et.al. 2307.14661v1 null
2023-07-25 An Intent Taxonomy of Legal Case Retrieval Yunqiu Shao, Haitao Li, Yueyue Wu, Yiqun Liu, Qingyao Ai, Jiaxin Mao, Yixiao Ma, Shaoping Ma et.al. 2307.13298v1 null
2023-07-17 Legal Syllogism Prompting: Teaching Large Language Models for Legal Judgment Prediction Cong Jiang, Xiaolei Yang et.al. 2307.08321v1 link
2023-07-11 Argumentative Segmentation Enhancement for Legal Summarization Huihui Xu, Kevin Ashley et.al. 2307.05081v1 null
2023-07-10 Legal Decision-making for Highway Automated Driving Xiaohan Ma, Wenhao Yu, Chengxiang Zhao, Changjun Wang, Wenhui Zhou, Guangming Zhao, Mingyue Ma, Weida Wang, Lin Yang, Rui Mu, Hong Wang, Jun Li et.al. 2307.04327v1 null
2023-07-07 Specification, Validation and Verification of Social, Legal, Ethical, Empathetic and Cultural Requirements for Autonomous Agents Sinem Getir Yaman, Ana Cavalcanti, Radu Calinescu, Colin Paterson, Pedro Ribeiro, Beverley Townsend et.al. 2307.03697v1 null
2024-01-18 Towards Open Federated Learning Platforms: Survey and Vision from Technical and Legal Perspectives Moming Duan et.al. 2307.02140v2 link
2023-07-04 Racial Bias Trends in the Text of US Legal Opinions Rohan Jinturkar et.al. 2307.01693v1 null
2023-06-29 Towards Grammatical Tagging for the Legal Language of Cybersecurity Gianpietro Castiglione, Giampaolo Bella, Daniele Francesco Santamaria et.al. 2306.17042v1 null
2023-06-29 Beyond Logic Programming for Legal Reasoning Ha-Thanh Nguyen, Francesca Toni, Kostas Stathis, Ken Satoh et.al. 2306.16632v1 null
2023-06-28 ChatLaw: Open-Source Legal Large Language Model with Integrated External Knowledge Bases Jiaxi Cui, Zongjian Li, Yang Yan, Bohua Chen, Li Yuan et.al. 2306.16092v1 link
2023-06-09 Legal and ethical considerations regarding the use of ChatGPT in education Fereniki Panagopoulou, Christina Parpoula, Kostas Karpouzis et.al. 2306.10037v1 null
2023-06-22 Explaining Legal Concepts with Augmented Large Language Models (GPT-4) Jaromir Savelka, Kevin D. Ashley, Morgan A. Gray, Hannes Westermann, Huihui Xu et.al. 2306.09525v2 null
2023-06-12 Large Language Models as Tax Attorneys: A Case Study in Legal Capabilities Emergence John J. Nay, David Karamardian, Sarah B. Lawsky, Wenting Tao, Meghana Bhat, Raghav Jain, Aaron Travis Lee, Jonathan H. Choi, Jungo Kasai et.al. 2306.07075v1 null
2023-06-09 Towards the Exploitation of LLM-based Chatbot for Providing Legal Support to Palestinian Cooperatives Rabee Qasem, Banan Tantour, Mohammed Maree et.al. 2306.05827v1 null
2023-06-08 NOWJ at COLIEE 2023 -- Multi-Task and Ensemble Approaches in Legal Information Processing Thi-Hai-Yen Vuong, Hai-Long Nguyen, Tan-Minh Nguyen, Hoang-Trung Nguyen, Thai-Binh Nguyen, Ha-Thanh Nguyen et.al. 2306.04903v1 null
2023-06-08 Improving Vietnamese Legal Question--Answering System based on Automatic Data Enrichment Thi-Hai-Yen Vuong, Ha-Thanh Nguyen, Quang-Huy Nguyen, Le-Minh Nguyen, Xuan-Hieu Phan et.al. 2306.04841v1 null
2023-06-03 FlairNLP at SemEval-2023 Task 6b: Extraction of Legal Named Entities from Legal Texts using Contextual String Embeddings Vinay N Ramesh, Rohan Eswara et.al. 2306.02182v1 link
2023-06-03 TransDocAnalyser: A Framework for Offline Semi-structured Handwritten Document Analysis in the Legal Domain Sagar Chakraborty, Gaurav Harit, Saptarshi Ghosh et.al. 2306.02142v1 link
2023-06-06 MultiLegalPile: A 689GB Multilingual Legal Corpus Joel Niklaus, Veton Matoshi, Matthias Stürmer, Ilias Chalkidis, Daniel E. Ho et.al. 2306.02069v2 null
2023-06-14 How Ready are Pre-trained Abstractive Models and LLMs for Legal Case Judgement Summarization? Aniket Deroy, Kripabandhu Ghosh, Saptarshi Ghosh et.al. 2306.01248v2 null
2023-06-01 Towards Argument-Aware Abstractive Summarization of Long Legal Opinions with Summary Reranking Mohamed Elaraby, Yang Zhong, Diane Litman et.al. 2306.00672v1 null
2023-05-29 Datasets for Portuguese Legal Semantic Textual Similarity: Comparing weak supervision and an annotation process approaches Daniel da Silva Junior, Paulo Roberto dos S. Corval, Aline Paes, Daniel de Oliveira et.al. 2306.00007v1 null
2023-05-09 Stronger Together: on the Articulation of Ethical Charters, Legal Tools, and Technical Documentation in ML Giada Pistilli, Carlos Munoz Ferrandis, Yacine Jernite, Margaret Mitchell et.al. 2305.18615v1 null
2023-05-20 CDJUR-BR -- A Golden Collection of Legal Document from Brazilian Justice with Fine-Grained Named Entities Antonio Mauricio, Vladia Pinheiro, Vasco Furtado, João Araújo Monteiro Neto, Francisco das Chagas Jucá Bomfim, André Câmara Ferreira da Costa, Raquel Silveira, Nilsiton Aragão et.al. 2305.18315v1 null
2023-05-25 Prototype-Based Interpretability for Legal Citation Prediction Chu Fei Luo, Rohan Bhambhoria, Samuel Dahan, Xiaodan Zhu et.al. 2305.16490v1 null
2023-05-24 Automated Refugee Case Analysis: An NLP Pipeline for Supporting Legal Practitioners Claire Barale, Michael Rovatsos, Nehal Bhuta et.al. 2305.15533v1 link
2023-05-23 Adversarial Machine Learning and Cybersecurity: Risks, Challenges, and Legal Implications Micah Musser, Andrew Lohn, James X. Dempsey, Jonathan Spring, Ram Shankar Siva Kumar, Brenda Leong, Christina Liaghati, Cindy Martinez, Crystal D. Grant, Daniel Rohrer, Heather Frase, Jonathan Elliott, John Bansemer, Mikel Rodriguez, Mitt Regan, Rumman Chowdhury, Stefan Hermanek et.al. 2305.14553v1 null
2023-11-01 Towards Legally Enforceable Hate Speech Detection for Public Forums Chu Fei Luo, Rohan Bhambhoria, Xiaodan Zhu, Samuel Dahan et.al. 2305.13677v2 link
2023-05-20 Proceedings of the International Workshop on Methodologies for Translating Legal Norms into Formal Representations (LN2FR 2022) in association with 35th International Conference on Legal Knowledge and Information Systems (JURIX 2022) Georg Borges, Ken Satoh, Erich Schweighofer et.al. 2305.12203v1 null
2023-05-04 Late-Binding Scholarship in the Age of AI: Navigating Legal and Normative Challenges of a New Form of Knowledge Production Bill Tomlinson, Andrew W. Torrance, Rebecca W. Black, Donald J. Patterson et.al. 2305.11058v1 null
2023-05-15 Legal Extractive Summarization of U.S. Court Opinions Emmanuel Bauer, Dominik Stammbach, Nianlong Gu, Elliott Ash et.al. 2305.08428v1 link
2023-05-22 LeXFiles and LegalLAMA: Facilitating English Multinational Legal Language Model Development Ilias Chalkidis, Nicolas Garneau, Catalina Goanta, Daniel Martin Katz, Anders Søgaard et.al. 2305.07507v2 link
2023-05-11 THUIR@COLIEE 2023: More Parameters and Legal Knowledge for Legal Case Entailment Haitao Li, Changyue Wang, Weihang Su, Yueyue Wu, Qingyao Ai, Yiqun Liu et.al. 2305.06817v1 link
2023-05-11 THUIR@COLIEE 2023: Incorporating Structural Knowledge into Pre-trained Language Models for Legal Case Retrieval Haitao Li, Weihang Su, Changyue Wang, Yueyue Wu, Qingyao Ai, Yiqun Liu et.al. 2305.06812v1 link
2023-05-10 Extracting Complex Named Entities in Legal Documents via Weakly Supervised Object Detection Hsiu-Wei Yang, Abhinav Agrawal et.al. 2305.05836v1 null
2023-05-09 An Exploration of Encoder-Decoder Approaches to Multi-Label Classification for Legal and Biomedical Text Yova Kementchedjhieva, Ilias Chalkidis et.al. 2305.05627v1 link
2023-05-09 CaseEncoder: A Knowledge-enhanced Pre-trained Model for Legal Case Encoding Yixiao Ma, Yueyue Wu, Weihang Su, Qingyao Ai, Yiqun Liu et.al. 2305.05393v1 null
2023-05-08 Unlocking Practical Applications in Legal Domain: Evaluation of GPT for Zero-Shot Semantic Annotation of Legal Texts Jaromir Savelka et.al. 2305.04417v1 null
2023-05-06 Rhetorical Role Labeling of Legal Documents using Transformers and Graph Neural Networks Anshika Gupta, Shaz Furniturewala, Vijay Kumari, Yashvardhan Sharma et.al. 2305.04100v1 null
2023-05-04 ChatGPT and Works Scholarly: Best Practices and Legal Pitfalls in Writing with AI Bill Tomlinson, Andrew W. Torrance, Rebecca W. Black et.al. 2305.03722v1 null
2023-05-03 CiteCaseLAW: Citation Worthiness Detection in Caselaw for Legal Assistive Writing Mann Khatri, Pritish Wadhwa, Gitansh Satija, Reshma Sheik, Yaman Kumar, Rajiv Ratn Shah, Ponnurangam Kumaraguru et.al. 2305.03508v1 null
2023-05-04 Analyzing Hong Kong's Legal Judgments from a Computational Linguistics point-of-view Sankalok Sen et.al. 2305.02558v1 null
2023-05-02 MultiLegalSBD: A Multilingual Legal Sentence Boundary Detection Dataset Tobias Brugger, Matthias Stürmer, Joel Niklaus et.al. 2305.01211v1 link
2023-04-27 Analyzing Vietnamese Legal Questions Using Deep Neural Networks with Biaffine Classifiers Nguyen Anh Tu, Hoang Thi Thu Uyen, Tu Minh Phuong, Ngo Xuan Bach et.al. 2304.14447v1 null
2023-04-21 The Dark Side of ChatGPT: Legal and Ethical Challenges from Stochastic Parrots and Hallucination Zihao Li et.al. 2304.14347v1 null
2023-04-22 SAILER: Structure-aware Pre-trained Language Model for Legal Case Retrieval Haitao Li, Qingyao Ai, Jia Chen, Qian Dong, Yueyue Wu, Yiqun Liu, Chong Chen, Qi Tian et.al. 2304.11370v1 link
2023-05-01 SemEval 2023 Task 6: LegalEval - Understanding Legal Texts Ashutosh Modi, Prathamesh Kalamkar, Saurabh Karn, Aman Tiwari, Abhinav Joshi, Sai Kiran Tanikella, Shouvik Kumar Guha, Sachin Malhan, Vivek Raghavan et.al. 2304.09548v3 null
2023-06-29 How well do SOTA legal reasoning models support abductive reasoning? Ha-Thanh Nguyen, Randy Goebel, Francesca Toni, Kostas Stathis, Ken Satoh et.al. 2304.06912v2 null
2023-09-15 Exploring the State of the Art in Legal QA Systems Abdelrahman Abdallah, Bhawna Piryani, Adam Jatowt et.al. 2304.06623v3 link
2023-04-12 FALQU: Finding Answers to Legal Questions Behrooz Mansouri, Ricardo Campos et.al. 2304.05611v1 link
2023-04-25 Context-Aware Classification of Legal Document Pages Pavlos Fragkogiannis, Martina Forster, Grace E. Lee, Dell Zhang et.al. 2304.02787v2 null
2023-03-25 (Legal Design) Research through Litigation Reuben Kirkham et.al. 2303.14336v1 null
2023-07-19 Understand Legal Documents with Contextualized Large Language Models Xin Jin, Yuchen Wang et.al. 2303.12135v4 null
2023-03-16 A Short Survey of Viewing Large Language Models in Legal Aspect Zhongxiang Sun et.al. 2303.09136v1 link
2023-03-14 Are Models Trained on Indian Legal Data Fair? Sahil Girhepuje, Anmol Goel, Gokul S Krishnan, Shreya Goyal, Satyendra Pandey, Ponnurangam Kumaraguru, Balaraman Ravindran et.al. 2303.07247v2 null
2023-08-04 Meaningful human command: Advance control directives as a method to enable moral and legal responsibility for autonomous weapons systems Susannah Kate Devitt et.al. 2303.06813v3 null
2023-03-07 German BERT Model for Legal Named Entity Recognition Harshil Darji, Jelena Mitrović, Michael Granitzer et.al. 2303.05388v1 null
2023-03-08 Automatic Detection of Industry Sectors in Legal Articles Using Machine Learning Approaches Hui Yang, Stella Hadjiantoni, Yunfei Long, Ruta Petraityte, Berthold Lausen et.al. 2303.05387v1 null
2023-02-23 Natural Language Processing in the Legal Domain Daniel Martin Katz, Dirk Hartung, Lauritz Gerlach, Abhik Jana, Michael J. Bommarito II et.al. 2302.12039v1 null
2023-02-21 Combining Blockchain and Biometrics: A Survey on Technical Aspects and a First Legal Analysis Mahdi Ghafourian, Bilgesu Sumer, Ruben Vera-Rodriguez, Julian Fierrez, Ruben Tolosana, Aythami Moralez, Els Kindt et.al. 2302.10883v1 null
2023-02-12 AIDA: Legal Judgment Predictions for Non-Professional Fact Descriptions via Partial-and-Imbalanced Domain Adaptation Guangyi Xiao, Xinlong Liu, Hao Chen, Jingzhi Guo, Zhiguo Gong et.al. 2302.07728v1 null
2023-02-13 Joint Span Segmentation and Rhetorical Role Labeling with Data Augmentation for Legal Documents T. Y. S. S. Santosh, Philipp Bock, Matthias Grabmair et.al. 2302.06448v1 null
2023-03-20 Minding rights: Mapping ethical and legal foundations of 'neurorights' Sjors Ligthart, Marcello Ienca, Gerben Meynen, Fruzsina Molnar-Gabor, Roberto Andorno, Christoph Bublitz, Paul Catley, Lisa Claydon, Thomas Douglas, Nita Farahany, Joseph J. Fins, Sara Goering, Pim Haselager, Fabrice Jotterand, Andrea Lavazza, Allan McCay, Abel Wajnerman Paz, Stephen Rainey, Jesper Ryberg, Philipp Kellmeyer et.al. 2302.06281v2 null
2023-02-14 A Brief Report on LawGPT 1.0: A Virtual Legal Assistant Based on GPT-3 Ha-Thanh Nguyen et.al. 2302.05729v2 null
2023-02-03 Leveraging task dependency and contrastive learning for Legal Judgement Prediction on the European Court of Human Rights T. Y. S. S Santosh, Marcel Perez San Blas, Phillip Kemper, Matthias Grabmair et.al. 2302.00768v2 null
2023-02-13 Zero-shot Transfer of Article-aware Legal Outcome Classification for European Court of Human Rights Cases T. Y. S. S Santosh, Oana Ichim, Matthias Grabmair et.al. 2302.00609v3 null
2023-01-30 LEXTREME: A Multi-Lingual and Multi-Task Benchmark for the Legal Domain Joel Niklaus, Veton Matoshi, Pooja Rani, Andrea Galassi, Matthias Stürmer, Ilias Chalkidis et.al. 2301.13126v1 link
2023-01-29 Diverse legal case search Ruizhe Zhang, Qingyao Ai, Yueyue Wu, Yixiao Ma, Yiqun Liu et.al. 2301.12504v1 null
2023-01-30 Large Language Models as Fiduciaries: A Case Study Toward Robustly Communicating With Artificial Intelligence Through Legal Standards John J. Nay et.al. 2301.10095v2 null
2023-07-15 On left legal semigroups Attila Nagy et.al. 2301.08793v2 null
2023-01-19 Legal Obligation and Ethical Best Practice: Towards Meaningful Verbal Consent for Voice Assistants William Seymour, Mark Cote, Jose Such et.al. 2301.08091v1 null
2023-01-07 Graph-based Keyword Planning for Legal Clause Generation from Topics Sagar Joshi, Sumanth Balaji, Aparna Garimella, Vasudeva Varma et.al. 2301.06901v1 link
2023-01-06 MAUD: An Expert-Annotated Legal NLP Dataset for Merger Agreement Understanding Steven H. Wang, Antoine Scardigli, Leonard Tang, Wei Chen, Dimitry Levkin, Anya Chen, Spencer Ball, Thomas Woodside, Oliver Zhang, Dan Hendrycks et.al. 2301.00876v2 link
2022-12-13 Attentive Deep Neural Networks for Legal Document Retrieval Ha-Thanh Nguyen, Manh-Kien Phi, Xuan-Bach Ngo, Vu Tran, Le-Minh Nguyen, Minh-Phuong Tu et.al. 2212.13899v1 null
2022-12-19 What to Read in a Contract? Party-Specific Summarization of Important Obligations, Entitlements, and Prohibitions in Legal Documents Abhilasha Sancheti, Aparna Garimella, Balaji Vasan Srinivasan, Rachel Rudinger et.al. 2212.09825v1 null
2022-12-19 E-NER -- An Annotated Named Entity Recognition Corpus of Legal Text Ting Wai Terence Au, Ingemar J. Cox, Vasileios Lampos et.al. 2212.09306v1 link
2022-12-16 Law to Binary Tree -- An Formal Interpretation of Legal Natural Language Ha-Thanh Nguyen, Vu Tran, Ngoc-Cam Le, Thi-Thuy Le, Quang-Huy Nguyen, Le-Minh Nguyen, Ken Satoh et.al. 2212.08335v1 null
2022-12-16 LegalRelectra: Mixed-domain Language Modeling for Long-range Legal Text Comprehension Wenyue Hua, Yuchen Zhang, Zhe Chen, Josie Li, Melanie Weber et.al. 2212.08204v1 null
2023-06-01 No driver, No Regulation? --Online Legal Driving Behavior Monitoring for Self-driving Vehicles Wenhao Yu, Chengxiang Zhao, Jiaxin Liu, Yingkai Yang, Xiaohan Ma, Jun Li, Weida Wang, Hong Wang, Ding Zhao, Xiaosong Hu et.al. 2212.04156v3 null
2022-12-06 Formal Modeling and Analysis of Legal Contracts using ContractCheck Alan Khoja, Martin Kölbl, Stefan Leue, Rüdiger Wilhelmi et.al. 2212.03349v1 null
2022-12-05 Legal Prompt Engineering for Multilingual Legal Judgement Prediction Dietrich Trautmann, Alina Petrova, Frank Schilder et.al. 2212.02199v1 null
2022-12-08 Legal Prompting: Teaching a Language Model to Think Like a Lawyer Fangyi Yu, Lee Quartey, Frank Schilder et.al. 2212.01326v2 null
2022-11-30 BudgetLongformer: Can we Cheaply Pretrain a SotA Legal Language Model From Scratch? Joel Niklaus, Daniele Giofré et.al. 2211.17135v1 null
2022-11-15 DeepParliament: A Legal domain Benchmark & Dataset for Parliament Bills Prediction Ankit Pal et.al. 2211.15424v1 link
2022-11-23 Agent-Specific Deontic Modality Detection in Legal Language Abhilasha Sancheti, Aparna Garimella, Balaji Vasan Srinivasan, Rachel Rudinger et.al. 2211.12752v1 null
2022-11-21 Legal and Political Stance Detection of SCOTUS Language Noah Bergam, Emily Allaway, Kathleen McKeown et.al. 2211.11724v1 link
2022-11-15 Exploiting Contrastive Learning and Numerical Evidence for Improving Confusing Legal Judgment Prediction Leilei Gan, Baokui Li, Kun Kuang, Yi Yang, Fei Wu et.al. 2211.08238v1 null
2022-11-15 An Efficient Active Learning Pipeline for Legal Text Classification Sepideh Mamooler, Rémi Lebret, Stéphane Massonnet, Karl Aberer et.al. 2211.08112v1 null
2022-11-06 Computing and Exploiting Document Structure to Improve Unsupervised Extractive Summarization of Legal Case Decisions Yang Zhong, Diane Litman et.al. 2211.03229v1 link
2023-04-18 Knowledge is Power: Understanding Causality Makes Legal judgment Prediction Models More Generalizable and Robust Haotian Chen, Lingwei Zhang, Yiran Liu, Fanchao Chen, Yang Yu et.al. 2211.03046v2 null
2022-11-05 Privacy-Preserving Models for Legal Natural Language Processing Ying Yin, Ivan Habernal et.al. 2211.02956v1 link
2022-11-05 The Legal Argument Reasoning Task in Civil Procedure Leonard Bongard, Lena Held, Ivan Habernal et.al. 2211.02950v1 link
2022-11-04 Miko Team: Deep Learning Approach for Legal Question Answering in ALQAC 2022 Hieu Nguyen Van, Dat Nguyen, Phuong Minh Nguyen, Minh Le Nguyen et.al. 2211.02200v1 null
2022-11-03 Data-efficient End-to-end Information Extraction for Statistical Legal Analysis Wonseok Hwang, Saehee Eom, Hanuhl Lee, Hai Jin Park, Minjoon Seo et.al. 2211.01692v1 null
2022-11-10 Processing Long Legal Documents with Pre-trained Transformers: Modding LegalBERT and Longformer Dimitris Mamakas, Petros Tsotsi, Ion Androutsopoulos, Ilias Chalkidis et.al. 2211.00974v2 null
2022-11-01 ClassActionPrediction: A Challenging Benchmark for Legal Judgment Prediction of Class Action Cases in the US Gil Semo, Dor Bernsohn, Ben Hagag, Gila Hayat, Joel Niklaus et.al. 2211.00582v1 link
2022-10-31 Do Charge Prediction Models Learn Legal Theory? Zhenwei An, Quzhe Huang, Cong Jiang, Yansong Feng, Dongyan Zhao et.al. 2210.17108v1 link
2022-10-30 Validity Assessment of Legal Will Statements as Natural Language Inference Alice Saebom Kwak, Jacob O. Israelsen, Clayton T. Morrison, Derek E. Bambauer, Mihai Surdeanu et.al. 2210.16989v1 link
2022-10-25 Deconfounding Legal Judgment Prediction for European Court of Human Rights Cases Towards Better Alignment with Experts T. Y. S. S Santosh, Shanshan Xu, Oana Ichim, Matthias Grabmair et.al. 2210.13836v1 link
2022-11-04 Parameter-Efficient Legal Domain Adaptation Jonathan Li, Rohan Bhambhoria, Xiaodan Zhu et.al. 2210.13712v2 null
2022-10-24 Toward an Intelligent Tutoring System for Argument Mining in Legal Texts Hannes Westermann, Jaromir Savelka, Vern R. Walker, Kevin D. Ashley, Karim Benyekhlef et.al. 2210.13635v1 null
2022-10-24 EUR-Lex-Sum: A Multi- and Cross-lingual Dataset for Long-form Summarization in the Legal Domain Dennis Aumiller, Ashish Chouhan, Michael Gertz et.al. 2210.13448v1 link
2022-10-24 Legal-Tech Open Diaries: Lesson learned on how to develop and deploy light-weight models in the era of humongous Language Models Stelios Maroudas, Sotiris Legkas, Prodromos Malakasiotis, Ilias Chalkidis et.al. 2210.13086v1 null
2022-10-22 Extractive Summarization of Legal Decisions using Multi-task Learning and Maximal Marginal Relevance Abhishek Agarwal, Shanshan Xu, Matthias Grabmair et.al. 2210.12437v1 null
2022-12-08 Modelling and Explaining Legal Case-based Reasoners through Classifiers Xinghan Liu, Emiliano Lorini, Antonino Rotolo, Giovanni Sartor et.al. 2210.11217v2 null
2023-04-26 Law Article-Enhanced Legal Case Matching: a Causal Learning Approach Zhongxiang Sun, Jun Xu, Xiao Zhang, Zhenhua Dong, Ji-Rong Wen et.al. 2210.11012v2 link
2022-10-19 Multi-granularity Argument Mining in Legal Texts Huihui Xu, Kevin Ashley et.al. 2210.09472v2 null
2023-04-05 Conversion of Legal Agreements into Smart Legal Contracts using NLP Eason Chen, Niall Roche, Yuen-Hsien Tseng, Walter Hernandez, Jiangbo Shangguan, Alastair Moore et.al. 2210.08954v2 null
2022-10-15 AraLegal-BERT: A pretrained language model for Arabic Legal text Muhammad AL-Qurishi, Sarah AlQaseemi, Riad Soussi et.al. 2210.08284v1 null
2022-10-14 Legal Case Document Summarization: Extractive and Abstractive Methods and their Evaluation Abhay Shukla, Paheli Bhattacharya, Soham Poddar, Rajdeep Mukherjee, Kripabandhu Ghosh, Pawan Goyal, Saptarshi Ghosh et.al. 2210.07544v1 link
2022-10-11 Legal Element-oriented Modeling with Multi-view Contrastive Learning for Legal Case Retrieval Zhaowei Wang et.al. 2210.05188v1 null
2022-10-01 Using Argumentation Schemes to Model Legal Reasoning Trevor Bench-Capon, Katie Atkinson et.al. 2210.00315v1 null
2022-11-12 Multi-stage Information Retrieval for Vietnamese Legal Texts Nhat-Minh Pham, Ha-Thanh Nguyen, Trong-Hop Do et.al. 2209.14494v2 null
2023-05-16 Law Informs Code: A Legal Informatics Approach to Aligning Artificial Intelligence with Humans John J. Nay et.al. 2209.13020v14 null
2022-09-26 Legal Case Document Similarity: You Need Both Network and Text Paheli Bhattacharya, Kripabandhu Ghosh, Arindam Pal, Saptarshi Ghosh et.al. 2209.12474v1 link
2022-09-25 An Empirical Study on Cross-X Transfer for Legal Judgment Prediction Joel Niklaus, Matthias Stürmer, Ilias Chalkidis et.al. 2209.12325v1 link
2022-09-13 LegalBench: Prototyping a Collaborative Benchmark for Legal Reasoning Neel Guha, Daniel E. Ho, Julian Nyarko, Christopher Ré et.al. 2209.06120v1 link
2023-05-15 Pre-trained Language Models for the Legal Domain: A Case Study on Indian Law Shounak Paul, Arpan Mandal, Pawan Goyal, Saptarshi Ghosh et.al. 2209.06049v5 null
2022-08-29 Bias Impact Analysis of AI in Consumer Mobile Health Technologies: Legal, Technical, and Policy Kristine Gloria, Nidhi Rastogi, Stevie DeGroff et.al. 2209.05440v1 null
2022-09-11 Eiger: Auditable, executable, flexible legal regulations Alexander Bernauer, Richard A. Eisenberg et.al. 2209.04939v1 null
2023-05-28 Early Verification of Legal Compliance via Bounded Satisfiability Checking Nick Feng, Lina Marsso, Mehrdad Sabetzadeh, Marsha Chechik et.al. 2209.04052v3 link
2022-09-18 An Argumentation-Based Legal Reasoning Approach for DL-Ontology Zhe Yu, Yiwei Lu et.al. 2209.03070v2 null
2022-09-06 From Legal Contracts to Legal Calculi: the code-driven normativity Silvia Crafa et.al. 2209.02353v1 null
2022-09-20 ArgLegalSumm: Improving Abstractive Summarization of Legal Documents with Argument Mining Mohamed Elaraby, Diane Litman et.al. 2209.01650v2 link
2022-09-02 Entity Graph Extraction from Legal Acts -- a Prototype for a Use Case in Policy Design Analysis Anna Wróblewska, Bartosz Pieliński, Karolina Seweryn, Karol Saputa, Aleksandra Wichrowska, Sylwia Sysko-Romańczuk, Hanna Schreiber et.al. 2209.00944v1 null
2022-09-01 Unsupervised Simplification of Legal Texts Mert Cemri, Tolga Çukur, Aykut Koç et.al. 2209.00557v1 null
2022-10-06 On the Role of Negative Precedent in Legal Outcome Prediction Josef Valvoda, Ryan Cotterell, Simone Teufel et.al. 2208.08225v2 link
2023-05-17 Mining Legal Arguments in Court Decisions Ivan Habernal, Daniel Faber, Nicola Recchia, Sebastian Bretthauer, Iryna Gurevych, Indra Spiecker genannt Döhmann, Christoph Burchard et.al. 2208.06178v2 link
2022-08-08 Valid Widgets Contain Legal Subwidgets Nathan Donagi et.al. 2208.03866v1 null
2022-08-06 Preventing or Mitigating Adversarial Supply Chain Attacks; a legal analysis Kaspar Rosager Ludvigsen, Shishir Nagaraja, Angela Daly et.al. 2208.03466v1 null
2022-09-01 Upgrading the protection of children from manipulative and addictive strategies in online games: Legal and technical solutions beyond privacy regulation Tommaso Crepax, Jan Tobias Muehlberg et.al. 2207.09928v2 null
2022-07-10 Developing an NLP-based Recommender System for the Ethical, Legal, and Social Implications of Synthetic Biology Damien Dablain, Lilian Huang, Brandon Sepulvado et.al. 2207.06360v1 null
2022-07-09 Explainable Legal Case Matching via Inverse Optimal Transport-based Rationale Extraction Weijie Yu, Zhongxiang Sun, Jun Xu, Zhenhua Dong, Xu Chen, Hongteng Xu, Ji-Rong Wen et.al. 2207.04182v1 link
2022-07-15 Sequence-aware multimodal page classification of Brazilian legal documents Pedro H. Luz de Araujo, Ana Paula G. S. de Almeida, Fabricio A. Braz, Nilton C. da Silva, Flavio de Barros Vidal, Teofilo E. de Campos et.al. 2207.00748v2 link
2022-11-29 Pile of Law: Learning Responsible Data Filtering from the Law and a 256GB Open-Source Legal Dataset Peter Henderson, Mark S. Krass, Lucia Zheng, Neel Guha, Christopher D. Manning, Dan Jurafsky, Daniel E. Ho et.al. 2207.00220v2 link
2022-07-20 Cybersecurity Law: Legal Jurisdiction and Authority Feras A. Batarseh et.al. 2206.09465v3 null
2022-06-15 Legal Provocations for HCI in the Design and Development of Trustworthy Autonomous Systems Lachlan D. Urquhart, Glenn McGarry, Andy Crabtree et.al. 2206.07506v1 null
2022-09-13 Indian Legal Text Summarization: A Text Normalisation-based Approach Satyajit Ghosh, Mousumi Dutta, Tanaya Das et.al. 2206.06238v2 null
2022-06-13 Tackling Algorithmic Disability Discrimination in the Hiring Process: An Ethical, Legal and Technical Analysis Maarten Buyl, Christina Cociancig, Cristina Frattone, Nele Roekens et.al. 2206.06149v1 null
2022-10-05 A Multi-Task Benchmark for Korean Legal Language Understanding and Judgement Prediction Wonseok Hwang, Dongjun Lee, Kyoungyeon Cho, Hanuhl Lee, Minjoon Seo et.al. 2206.05224v2 link
2022-06-08 Realistic Zero-Shot Cross-Lingual Transfer in Legal Topic Classification Stratos Xenouleas, Alexia Tsoukara, Giannis Panagiotakis, Ilias Chalkidis, Ion Androutsopoulos et.al. 2206.03785v1 null
2022-05-30 Billions of Parameters Are Worth More Than In-domain Training Data: A case study in the Legal Case Entailment Task Guilherme Moraes Rosa, Luiz Bonifacio, Vitor Jeronymo, Hugo Abonizio, Roberto Lotufo, Rodrigo Nogueira et.al. 2205.15172v1 link
2022-05-17 An Evaluation Framework for Legal Document Summarization Ankan Mullick, Abhilash Nandy, Manav Nitin Kapadnis, Sohan Patnaik, R Raghav, Roshni Kar et.al. 2205.08478v1 link
2022-05-15 Regulating Facial Processing Technologies: Tensions Between Legal and Technical Considerations in the Application of Illinois BIPA Rui-Jie Yew, Alice Xiang et.al. 2205.07299v1 null
2022-05-13 The Case for a Legal Compliance API for the Enforcement of the EU's Digital Services Act on Social Media Platforms Catalina Goanta, Thales Bertaglia, Adriana Iamnitchi et.al. 2205.06666v1 null
2022-05-06 Fine-grained Intent Classification in the Legal Domain Ankan Mullick, Abhilash Nandy, Manav Nitin Kapadnis, Sohan Patnaik, R Raghav et.al. 2205.03509v1 null
2022-04-19 Sharing and Caring: Creating a Culture of Constructive Criticism in Computational Legal Studies Corinna Coupette, Dirk Hartung et.al. 2205.01071v1 null
2022-04-16 nigam@COLIEE-22: Legal Case Retrieval and Entailment using Cascading of Lexical and Semantic-based models Shubham Kumar Nigam, Navansh Goel et.al. 2204.07853v1 link
2022-03-10 State of the Art in Artificial Intelligence applied to the Legal Domain João Dias, Pedro A. Santos, Nuno Cordeiro, Ana Antunes, Bruno Martins, Jorge Baptista, Carlos Gonçalves et.al. 2204.07047v1 null
2022-04-11 A Survey on Legal Judgment Prediction: Datasets, Metrics, Models and Challenges Junyun Cui, Xiaoyu Shen, Feiping Nie, Zheng Wang, Jinglong Wang, Yulong Chen et.al. 2204.04859v1 null
2022-04-02 Recordism: A social-scientific prospect of blockchain from social, legal, financial, and technological perspectives Zihao Li, Hao Xu, Yang Fang, Boyuan Zhao, Lei Zhang et.al. 2204.00823v1 null
2022-04-02 HLDC: Hindi Legal Documents Corpus Arnav Kapoor, Mudit Dhawan, Anmol Goel, T. H. Arjun, Akshala Bhatnagar, Vibhu Agrawal, Amul Agrawal, Arnab Bhattacharya, Ponnurangam Kumaraguru, Ashutosh Modi et.al. 2204.00806v1 link
2022-03-29 An Evaluation Dataset for Legal Word Embedding: A Case Study On Chinese Codex Chun-Hsien Lin, Pu-Jen Cheng et.al. 2203.15173v1 link
2022-04-12 Gender and Racial Stereotype Detection in Legal Opinion Word Embeddings Sean Matthews, John Hudzina, Dawn Sepehr et.al. 2203.13369v2 null
2022-03-16 LEVEN: A Large-Scale Chinese Legal Event Detection Dataset Feng Yao, Chaojun Xiao, Xiaozhi Wang, Zhiyuan Liu, Lei Hou, Cunchao Tu, Juanzi Li, Yun Liu, Weixing Shen, Maosong Sun et.al. 2203.08556v1 link
2022-03-15 Toward Improving Attentive Neural Networks in Legal Text Processing Ha-Thanh Nguyen et.al. 2203.08244v1 null
2022-03-14 FairLex: A Multilingual Benchmark for Evaluating Fairness in Legal Text Processing Ilias Chalkidis, Tommaso Pasini, Sheng Zhang, Letizia Tomada, Sebastian Felix Schwemer, Anders Søgaard et.al. 2203.07228v1 link
2022-03-08 An Uncommon Task: Participatory Design in Legal AI Fernando Delgado, Solon Barocas, Karen Levy et.al. 2203.06246v1 null
2022-03-05 Prediction of terrorism pattern accompanied by cyber-terrorism and the development direction of corresponding legal systems Daegeon Kim et.al. 2203.03620v1 null
2022-03-04 Information retrieval and structural complexity of legal trees Yanik-Pascal Förster, Alessia Annibale, Luca Gamberi, Evan Tzanis, Pierpaolo Vivo et.al. 2203.02259v1 null
2022-03-03 LegalVis: Exploring and Inferring Precedent Citations in Legal Documents Lucas E. Resck, Jean R. Ponciano, Luis Gustavo Nonato, Jorge Poco et.al. 2203.02001v1 null
2022-04-06 Enhancing Legal Argument Mining with Domain Pre-training and Neural Networks Gechuan Zhang, Paul Nulty, David Lillis et.al. 2202.13457v2 link
2022-02-25 Measuring Shocks to Central Bank Independence using Legal Rulings Stefan Griller, Florian Huber, Michael Pfarrhofer et.al. 2202.12695v1 null
2022-02-13 Transformer-based Approaches for Legal Text Processing Ha-Thanh Nguyen, Minh-Phuong Nguyen, Thi-Hai-Yen Vuong, Minh-Quan Bui, Minh-Chau Nguyen, Tran-Binh Dang, Vu Tran, Le-Minh Nguyen, Ken Satoh et.al. 2202.06397v1 null
2022-02-07 To Tune or Not To Tune? Zero-shot Models for Legal Case Entailment Guilherme Moraes Rosa, Ruan Chaves Rodrigues, Roberto de Alencar Lotufo, Rodrigo Nogueira et.al. 2202.03120v1 link
2022-02-05 Classification on Sentence Embeddings for Legal Assistance Arka Mitra et.al. 2202.02639v1 null
2022-01-31 Bankruptcy Shocks and Legal Labor Markets: Evidence from the Court Competition Era Chad Brown, Jeronimo Carballo, Alessandro Peri et.al. 2202.00044v1 null
2022-01-31 Don't let Ricci v. DeStefano Hold You Back: A Bias-Aware Legal Solution to the Hiring Paradox Jad Salem, Deven R. Desai, Swati Gupta et.al. 2201.13367v1 null
2022-01-31 Guided Semi-Supervised Non-negative Matrix Factorization on Legal Documents Pengyu Li, Christine Tseng, Yaxuan Zheng, Joyce A. Chew, Longxiu Huang, Benjamin Jarman, Deanna Needell et.al. 2201.13324v1 null
2022-09-19 Corpus for Automatic Structuring of Legal Documents Prathamesh Kalamkar, Aman Tiwari, Astha Agarwal, Saurabh Karn, Smita Gupta, Vivek Raghavan, Ashutosh Modi et.al. 2201.13125v2 null
2022-04-19 Expert Finding in Legal Community Question Answering Arian Askari, Suzan Verberne, Gabriella Pasi et.al. 2201.07667v3 link
2022-01-17 Data-Centric Machine Learning in the Legal Domain Hannes Westermann, Jaromir Savelka, Vern R. Walker, Kevin D. Ashley, Karim Benyekhlef et.al. 2201.06653v1 null
2022-01-14 Sequence-to-Sequence Models for Extracting Information from Registration and Legal Documents Ramon Pires, Fábio C. de Souza, Guilherme Rosa, Roberto A. Lotufo, Rodrigo Nogueira et.al. 2201.05658v1 link
2022-01-01 Interpretable Low-Resource Legal Decision Making Rohan Bhambhoria, Hui Liu, Samuel Dahan, Xiaodan Zhu et.al. 2201.01164v1 null
2021-12-29 LeSICiN: A Heterogeneous Graph-based Approach for Automatic Legal Statute Identification from Indian Legal Documents Shounak Paul, Pawan Goyal, Saptarshi Ghosh et.al. 2112.14731v1 link
2021-12-21 Sentence Embeddings and High-speed Similarity Search for Fast Computer Assisted Annotation of Legal Documents Hannes Westermann, Jaromir Savelka, Vern R. Walker, Kevin D. Ashley, Karim Benyekhlef et.al. 2112.11494v1 null
2021-12-15 Lex Rosetta: Transfer of Predictive Models Across Languages, Jurisdictions, and Legal Domains Jaromir Savelka, Hannes Westermann, Karim Benyekhlef, Charlotte S. Alexander, Jayla C. Grant, David Restrepo Amariles, Rajaa El Hamdani, Sébastien Meeùs, Michał Araszkiewicz, Kevin D. Ashley, Alexandra Ashley, Karl Branting, Mattia Falduti, Matthias Grabmair, Jakub Harašta, Tereza Novotná, Elizabeth Tippett, Shiwanni Johnson et.al. 2112.07882v1 link
2021-12-15 Cross-Domain Generalization and Knowledge Transfer in Transformers Trained on Legal Data Jaromir Savelka, Hannes Westermann, Karim Benyekhlef et.al. 2112.07870v1 null
2021-12-14 Discovering Explanatory Sentences in Legal Case Decisions Using Pre-trained Language Models Jaromir Savelka, Kevin D. Ashley et.al. 2112.07165v1 link
2021-12-23 Ergo -- a programming language for Smart Legal Contracts Niall Roche, Walter Hernandez, Eason Chen, Jérôme Siméon, Dan Selman et.al. 2112.07064v2 null
2021-12-13 Dependency Learning for Legal Judgment Prediction with a Unified Text-to-Text Transformer Yunyun Huang, Xiaoyu Shen, Chuanyi Li, Jidong Ge, Bin Luo et.al. 2112.06370v1 link
2021-12-10 Computer-Assisted Creation of Boolean Search Rules for Text Classification in the Legal Domain Hannes Westermann, Jaromir Savelka, Vern R. Walker, Kevin D. Ashley, Karim Benyekhlef et.al. 2112.05807v1 null
2022-11-07 Semantic Segmentation of Legal Documents via Rhetorical Roles Vijit Malik, Rishabh Sanjay, Shouvik Kumar Guha, Angshuman Hazarika, Shubham Nigam, Arnab Bhattacharya, Ashutosh Modi et.al. 2112.01836v2 link
2021-12-11 Zero-Shot Cross-Lingual Transfer in Legal Domain Using Transformer Models Zein Shaheen, Gerhard Wohlgenannt, Dmitry Mouromtsev et.al. 2111.14192v2 null
2021-11-05 From impact refugees to deterritorialized states: foresighting extreme legal-policy cases in asteroid impact scenarios Elisa Simó-Soler, Eloy Peña-Asensio et.al. 2111.13643v1 null
2021-11-23 Robust Deep Reinforcement Learning for Extractive Legal Summarization Duy-Hung Nguyen, Bao-Sinh Nguyen, Nguyen Viet Dung Nghiem, Dung Tien Le, Mim Amina Khatun, Minh-Tien Nguyen, Hung Le et.al. 2111.07158v2 null
2021-11-14 Critical Sentence Identification in Legal Cases Using Multi-Class Classification Sahan Jayasinghe, Lakith Rambukkanage, Ashan Silva, Nisansa de Silva, Amal Shehan Perera et.al. 2111.05721v2 null
2021-11-03 Building Legal Datasets Jerrold Soh et.al. 2111.02034v1 null
2021-10-05 LegalNLP -- Natural Language Processing methods for the Brazilian Legal Language Felipe Maia Polo, Gabriel Caiaffa Floriano Mendonça, Kauê Capellato J. Parreira, Lucka Gianvechio, Peterson Cordeiro, Jonathan Batista Ferreira, Leticia Maria Paz de Lima, Antônio Carlos do Amaral Maia, Renato Vicente et.al. 2110.15709v1 link
2021-10-15 Law Smells: Defining and Detecting Problematic Patterns in Legal Drafting Corinna Coupette, Dirk Hartung, Janis Beckedorf, Maximilian Böther, Daniel Martin Katz et.al. 2110.11984v1 null
2021-10-21 Pacta sunt servanda: legal contracts in Stipula Silvia Crafa, Cosimo Laneve, Giovanni Sartor et.al. 2110.11069v1 null
2021-10-12 A Survey on Legal Question Answering Systems Jorge Martinez-Gil et.al. 2110.07333v1 null
2021-10-09 Dynamic Logic of Legal Competences Huimin Dong, Olivier Roy et.al. 2110.04454v1 null
2021-10-07 Cookie Banners, What's the Purpose? Analyzing Cookie Banner Text Through a Legal Lens Cristiana Santos, Arianna Rossi, Lorena Sánchez Chamorro, Kerstin Bongard-Blanchy, Ruba Abu-Salma et.al. 2110.02597v2 null

(back to top)

Speech Recognition

Publish Date Title Authors PDF Code
2024-08-15 Enhancing Large Language Model-based Speech Recognition by Contextualization for Rare and Ambiguous Words Kento Nozawa, Takashi Masuko, Toru Taniguchi et.al. 2408.08027v1 null
2024-08-12 Enhancing Dialogue Speech Recognition with Robust Contextual Awareness via Noise Representation Learning Wonjun Lee, San Kim, Gary Geunbae Lee et.al. 2408.06043v1 null
2024-08-11 LI-TTA: Language Informed Test-Time Adaptation for Automatic Speech Recognition Eunseop Yoon, Hee Suk Yoon, John Harvill, Mark Hasegawa-Johnson, Chang D. Yoo et.al. 2408.05769v1 null
2024-08-09 MooER: LLM-based Speech Recognition and Translation Models from Moore Threads Junhao Xu, Zhenlin Liang, Yi Liu, Yichao Hu, Jian Li, Yajun Zheng, Meng Cai, Hua Wang et.al. 2408.05101v1 link
2024-08-05 Clustering and Mining Accented Speech for Inclusive and Fair Speech Recognition Jaeyoung Kim, Han Lu, Soheil Khorram, Anshuman Tripathi, Qian Zhang, Hasim Sak et.al. 2408.02582v1 null
2024-08-08 The NPU-ASLP System Description for Visual Speech Recognition in CNVSRC 2024 He Wang, Lei Xie et.al. 2408.02369v2 link
2024-08-01 SynesLM: A Unified Approach for Audio-visual Speech Recognition and Translation via Language Model and Synthetic Data Yichen Lu, Jiaqi Song, Xuankai Chang, Hengwei Bian, Soumi Maiti, Shinji Watanabe et.al. 2408.00624v1 link
2024-07-18 Handling Numeric Expressions in Automatic Speech Recognition Christian Huber, Alexander Waibel et.al. 2408.00004v1 null
2024-07-31 On the Problem of Text-To-Speech Model Selection for Synthetic Data Generation in Automatic Speech Recognition Nick Rossenbach, Ralf Schlüter, Sakriani Sakti et.al. 2407.21476v1 null
2024-07-30 Self-Supervised Models in Automatic Whispered Speech Recognition Aref Farhadipour, Homa Asadi, Volker Dellwo et.al. 2407.21211v1 null
2024-07-10 Dynamic Encoder Size Based on Data-Driven Layer-wise Pruning for Speech Recognition Jingjing Xu, Wei Zhou, Zijian Yang, Eugen Beck, Ralf Schlueter et.al. 2407.18930v1 null
2024-08-07 Dynamic Language Group-Based MoE: Enhancing Code-Switching Speech Recognition with Hierarchical Routing Hukai Huang, Shenghui Lu, Yahui Shan, He Qu, Wenhao Guan, Qingyang Hong, Lin Li et.al. 2407.18581v2 link
2024-07-26 Enhancing Dysarthric Speech Recognition for Unseen Speakers via Prototype-Based Adaptation Shiyao Wang, Shiwan Zhao, Jiaming Zhou, Aobo Kong, Yong Qin et.al. 2407.18461v1 link
2024-07-25 On the Effect of Purely Synthetic Training Data for Different Automatic Speech Recognition Architectures Nick Rossenbach, Benedikt Hilmes, Ralf Schlüter et.al. 2407.17997v1 null
2024-07-25 Scaling A Simple Approach to Zero-Shot Speech Recognition Jinming Zhao, Vineel Pratap, Michael Auli et.al. 2407.17852v1 link
2024-07-24 A Comparative Analysis of Bilingual and Trilingual Wav2Vec Models for Automatic Speech Recognition in Multilingual Oral History Archives Jan Lehečka, Josef V. Psutka, Luboš Šmídl, Pavel Ircing, Josef Psutka et.al. 2407.17160v1 null
2024-07-23 Quantifying the Role of Textual Predictability in Automatic Speech Recognition Sean Robertson, Gerald Penn, Ewan Dunbar et.al. 2407.16537v1 null
2024-07-23 The CHiME-8 DASR Challenge for Generalizable and Array Agnostic Distant Automatic Speech Recognition and Diarization Samuele Cornell, Taejin Park, Steve Huang, Christoph Boeddeker, Xuankai Chang, Matthew Maciejewski, Matthew Wiesner, Paola Garcia, Shinji Watanabe et.al. 2407.16447v1 null
2024-07-07 Morse Code-Enabled Speech Recognition for Individuals with Visual and Hearing Impairments Ritabrata Roy Choudhury et.al. 2407.14525v1 null
2024-07-19 Reexamining Racial Disparities in Automatic Speech Recognition Performance: The Role of Confounding by Provenance Changye Li, Trevor Cohen, Serguei Pakhomov et.al. 2407.13982v1 null
2024-07-03 Self-supervised ASR Models and Features For Dysarthric and Elderly Speech Recognition Shujie Hu, Xurong Xie, Mengzhe Geng, Zengrui Jin, Jiajun Deng, Guinan Li, Yi Wang, Mingyu Cui, Tianzi Wang, Helen Meng, Xunying Liu et.al. 2407.13782v1 null
2024-07-18 Low-Resourced Speech Recognition for Iu Mien Language via Weakly-Supervised Phoneme-based Multilingual Pre-training Lukuan Dong, Donghong Qin, Fengbo Bai, Fanhua Song, Yan Liu, Chen Xu, Zhijian Ou et.al. 2407.13292v1 null
2024-06-29 Error Correction by Paying Attention to Both Acoustic and Confidence References for Automatic Speech Recognition Yuchun Shu, Bo Hu, Yifeng He, Hao Shi, Longbiao Wang, Jianwu Dang et.al. 2407.12817v1 null
2024-07-14 Improving Neural Biasing for Contextual Speech Recognition by Early Context Injection and Text Perturbation Ruizhe Huang, Mahsa Yarmohammadi, Sanjeev Khudanpur, Daniel Povey et.al. 2407.10303v1 null
2024-07-13 Empowering Whisper as a Joint Multi-Talker and Target-Talker Speech Recognition System Lingwei Meng, Jiawen Kang, Yuejiao Wang, Zengrui Jin, Xixin Wu, Xunying Liu, Helen Meng et.al. 2407.09817v1 null
2024-07-13 A Streaming Multi-Channel End-to-End Speech Recognition System with Realistic Evaluations Xiangzhu Kong, Tianqi Ning, Hao Huang, Zhijian Ou et.al. 2407.09807v1 link
2024-07-09 Tailored Design of Audio-Visual Speech Recognition Models using Branchformers David Gimeno-Gómez, Carlos-D. Martínez-Hinarejos et.al. 2407.06606v1 link
2024-07-10 Seed-ASR: Understanding Diverse Speech and Contexts with LLM-based Speech Recognition Ye Bai, Jingping Chen, Jitong Chen, Wei Chen, Zhuo Chen, Chuang Ding, Linhao Dong, Qianqian Dong, Yujiao Du, Kepan Gao, Lu Gao, Yi Guo, Minglun Han, Ting Han, Wenchao Hu, Xinying Hu, Yuxiang Hu, Deyu Hua, Lu Huang, Mingkun Huang, Youjia Huang, Jishuo Jin, Fanliu Kong, Zongwei Lan, Tianyu Li, Xiaoyang Li, Zeyang Li, Zehua Lin, Rui Liu, Shouda Liu, Lu Lu, Yizhou Lu, Jingting Ma, Shengtao Ma, Yulin Pei, Chen Shen, Tian Tan, Xiaogang Tian, Ming Tu, Bo Wang, Hao Wang, Yuping Wang, Yuxuan Wang, Hanzhang Xia, Rui Xia, Shuangyi Xie, Hongmin Xu, Meng Yang, Bihong Zhang, Jun Zhang, Wanyi Zhang, Yang Zhang, Yawei Zhang, Yijie Zheng, Ming Zou et.al. 2407.04675v2 null
2024-07-05 Speculative Speech Recognition by Audio-Prefixed Low-Rank Adaptation of Language Models Bolaji Yusuf, Murali Karthick Baskar, Andrew Rosenberg, Bhuvana Ramabhadran et.al. 2407.04641v1 null
2024-07-04 Improving Accented Speech Recognition using Data Augmentation based on Unsupervised Text-to-Speech Synthesis Cong-Thanh Do, Shuhei Imai, Rama Doddipatla, Thomas Hain et.al. 2407.04047v1 null
2024-07-04 Learning Video Temporal Dynamics with Cross-Modal Attention for Robust Audio-Visual Speech Recognition Sungnyun Kim, Kangwook Jang, Sangmin Bae, Hoirin Kim, Se-Young Yun et.al. 2407.03563v1 null
2024-07-03 Codec-ASR: Training Performant Automatic Speech Recognition Systems with Discrete Speech Representations Kunal Dhawan, Nithin Rao Koluguri, Ante Jukić, Ryan Langman, Jagadeesh Balam, Boris Ginsburg et.al. 2407.03495v1 null
2024-07-03 Qifusion-Net: Layer-adapted Stream/Non-stream Model for End-to-End Multi-Accent Speech Recognition Jinming Chen, Jingyi Fang, Yuanzhong Zheng, Yaoxuan Wang, Haojun Fei et.al. 2407.03026v1 null
2024-07-02 Pinyin Regularization in Error Correction for Chinese Speech Recognition with Large Language Models Zhiyuan Tang, Dong Wang, Shen Huang, Shidong Shang et.al. 2407.01909v1 link
2024-06-28 Less is More: Accurate Speech Recognition & Translation without Web-Scale Data Krishna C. Puvvada, Piotr Żelasko, He Huang, Oleksii Hrinchuk, Nithin Rao Koluguri, Kunal Dhawan, Somshubra Majumdar, Elena Rastorgueva, Zhehuai Chen, Vitaly Lavrukhin, Jagadeesh Balam, Boris Ginsburg et.al. 2406.19674v1 null
2024-06-27 Zero-Query Adversarial Attack on Black-box Automatic Speech Recognition Systems Zheng Fang, Tao Wang, Lingchen Zhao, Shenyi Zhang, Bowen Li, Yunjie Ge, Qi Li, Chao Shen, Qian Wang et.al. 2406.19311v1 null
2024-06-27 Streaming Decoder-Only Automatic Speech Recognition with Discrete Speech Units: A Pilot Study Peikun Chen, Sining Sun, Changhao Shan, Qing Yang, Lei Xie et.al. 2406.18862v1 link
2024-06-26 Dynamic Data Pruning for Automatic Speech Recognition Qiao Xiao, Pingchuan Ma, Adriana Fernandez-Lopez, Boqian Wu, Lu Yin, Stavros Petridis, Mykola Pechenizkiy, Maja Pantic, Decebal Constantin Mocanu, Shiwei Liu et.al. 2406.18373v1 null
2024-06-26 MSR-86K: An Evolving, Multilingual Corpus with 86,300 Hours of Transcribed Audio for Speech Recognition Research Song Li, Yongbin You, Xuezhi Wang, Zhengkun Tian, Ke Ding, Guanglu Wan et.al. 2406.18301v1 null
2024-06-26 Automatic Speech Recognition for Hindi Anish Saha, A. G. Ramakrishnan et.al. 2406.18135v1 null
2024-07-12 ArzEn-LLM: Code-Switched Egyptian Arabic-English Translation and Speech Recognition Using LLMs Ahmed Heakl, Youssef Zaghloul, Mennatullah Ali, Rania Hossam, Walid Gomaa et.al. 2406.18120v2 link
2024-06-25 Sequential Editing for Lifelong Training of Speech Recognition Models Devang Kulshreshtha, Saket Dingliwal, Brady Houston, Nikolaos Pappas, Srikanth Ronanki et.al. 2406.17935v1 null
2024-06-25 Automatic speech recognition for the Nepali language using CNN, bidirectional LSTM and ResNet Manish Dhakal, Arman Chhetri, Aman Kumar Gupta, Prabin Lamichhane, Suraj Pandey, Subarna Shakya et.al. 2406.17825v1 link
2024-06-25 MSRS: Training Multimodal Speech Recognition Models from Scratch with Sparse Mask Optimization Adriana Fernandez-Lopez, Honglie Chen, Pingchuan Ma, Lu Yin, Qiao Xiao, Stavros Petridis, Shiwei Liu, Maja Pantic et.al. 2406.17614v1 null
2024-06-23 Contextualized End-to-end Automatic Speech Recognition with Intermediate Biasing Loss Muhammad Shakeel, Yui Sudo, Yifan Peng, Shinji Watanabe et.al. 2406.16120v1 null
2024-08-01 Decoder-only Architecture for Streaming End-to-end Speech Recognition Emiru Tsunoo, Hayato Futami, Yosuke Kashiwagi, Siddhant Arora, Shinji Watanabe et.al. 2406.16107v2 null
2024-06-21 Perception of Phonological Assimilation by Neural Speech Recognition Models Charlotte Pouw, Marianne de Heer Kloots, Afra Alishahi, Willem Zuidema et.al. 2406.15265v1 null
2024-06-19 Joint vs Sequential Speaker-Role Detection and Automatic Speech Recognition for Air-traffic Control Alexander Blatt, Aravind Krishnan, Dietrich Klakow et.al. 2406.13842v1 null
2024-06-24 Children's Speech Recognition through Discrete Token Enhancement Vrunda N. Sukhadia, Shammur Absar Chowdhury et.al. 2406.13431v2 null
2024-06-16 Automatic Speech Recognition for Biomedical Data in Bengali Language Shariar Kabir, Nazmun Nahar, Shyamasree Saha, Mamunur Rashid et.al. 2406.12931v1 null
2024-06-18 Bridging the Gap: Integrating Pre-trained Speech Enhancement and Recognition Models for Robust Speech Recognition Kuan-Chen Wang, You-Jin Li, Wei-Lun Chen, Yu-Wen Chen, Yi-Ching Wang, Ping-Cheng Yeh, Chao Zhang, Yu Tsao et.al. 2406.12699v1 null
2024-06-18 Rapid Language Adaptation for Multilingual E2E Speech Recognition Using Encoder Prompting Yosuke Kashiwagi, Hayato Futami, Emiru Tsunoo, Siddhant Arora, Shinji Watanabe et.al. 2406.12611v1 null
2024-06-18 Unsupervised Online Continual Learning for Automatic Speech Recognition Steven Vander Eeckt, Hugo Van hamme et.al. 2406.12503v1 link
2024-06-18 SyncVSR: Data-Efficient Visual Speech Recognition with End-to-End Crossmodal Audio Token Synchronization Young Jin Ahn, Jungwoo Park, Sangha Park, Jonghyun Choi, Kee-Eung Kim et.al. 2406.12233v1 link
2024-06-16 Continual Test-time Adaptation for End-to-end Speech Recognition on Noisy Speech Guan-Ting Lin, Wei-Ping Huang, Hung-yi Lee et.al. 2406.11064v1 null
2024-06-16 Imperceptible Rhythm Backdoor Attacks: Exploring Rhythm Transformation for Embedding Undetectable Vulnerabilities on Speech Recognition Wenhan Yao, Jiangkun Yang, Yongqiang He, Jia Liu, Weiping Wen et.al. 2406.10932v1 null
2024-06-14 CNVSRC 2023: The First Chinese Continuous Visual Speech Recognition Challenge Chen Chen, Zehua Liu, Xiaolou Li, Lantian Li, Dong Wang et.al. 2406.10313v1 null
2024-06-12 Improving child speech recognition with augmented child-like speech Yuanyuan Zhang, Zhengjun Yue, Tanvina Patel, Odette Scharenborg et.al. 2406.10284v1 null
2024-06-14 Whisper-Flamingo: Integrating Visual Features into Whisper for Audio-Visual Speech Recognition and Translation Andrew Rouditchenko, Yuan Gong, Samuel Thomas, Leonid Karlinsky, Hilde Kuehne, Rogerio Feris, James Glass et.al. 2406.10082v1 link
2024-06-14 An efficient text augmentation approach for contextualized Mandarin speech recognition Naijun Zheng, Xucheng Wan, Kai Liu, Ziqing Du, Zhou Huan et.al. 2406.09950v1 null
2024-06-14 Perceiver-Prompt: Flexible Speaker Adaptation in Whisper for Chinese Disordered Speech Recognition Yicong Jiang, Tianzi Wang, Xurong Xie, Juan Liu, Wei Sun, Nan Yan, Hui Chen, Lan Wang, Xunying Liu, Feng Tian et.al. 2406.09873v1 null
2024-06-13 Multi-Modal Retrieval For Large Language Model Based Speech Recognition Jari Kolehmainen, Aditya Gourav, Prashanth Gurunath Shivakumar, Yile Gu, Ankur Gandhe, Ariya Rastrow, Grant Strimel, Ivan Bulyko et.al. 2406.09618v1 null
2024-06-13 Speech ReaLLM -- Real-time Streaming Speech Recognition with Multimodal LLMs by Teaching the Flow of Time Frank Seide, Morrie Doulaty, Yangyang Shi, Yashesh Gaur, Junteng Jia, Chunyang Wu et.al. 2406.09569v1 null
2024-06-13 Language Complexity and Speech Recognition Accuracy: Orthographic Complexity Hurts, Phonological Complexity Doesn't Chihiro Taguchi, David Chiang et.al. 2406.09202v1 link
2024-06-13 Transcription-Free Fine-Tuning of Speech Separation Models for Noisy and Reverberant Multi-Speaker Automatic Speech Recognition William Ravenscroft, George Close, Stefan Goetze, Thomas Hain, Mohammad Soleymanpour, Anurag Chowdhury, Mark C. Fuhs et.al. 2406.08914v1 null
2024-06-13 A Single-Step Non-Autoregressive Automatic Speech Recognition Architecture with High Accuracy and Inference Speed Ziyang Zhuang, Chenfeng Miao, Kun Zou, Shuai Gong, Ming Fang, Tao Wei, Zijian Li, Wei Hu, Shaojun Wang, Jing Xiao et.al. 2406.08835v1 null
2024-06-12 Training Data Augmentation for Dysarthric Automatic Speech Recognition by Text-to-Dysarthric-Speech Synthesis Wing-Zin Leung, Mattias Cross, Anton Ragni, Stefan Goetze et.al. 2406.08568v1 null
2024-06-12 Neural Blind Source Separation and Diarization for Distant Speech Recognition Yoshiaki Bando, Tomohiko Nakamura, Shinji Watanabe et.al. 2406.08396v1 null
2024-06-12 Towards Unsupervised Speech Recognition Without Pronunciation Models Junrui Ni, Liming Wang, Yang Zhang, Kaizhi Qian, Heting Gao, Mark Hasegawa-Johnson, Chang D. Yoo et.al. 2406.08380v1 null
2024-06-11 Tag and correct: high precision post-editing approach to correction of speech recognition errors Tomasz Ziętkiewicz et.al. 2406.07589v1 null
2024-06-11 AS-70: A Mandarin stuttered speech dataset for automatic speech recognition and stuttering event detection Rong Gong, Hongfei Xue, Lezhi Wang, Xin Xu, Qisheng Li, Lei Xie, Hui Bu, Shaomei Wu, Jiaming Zhou, Yong Qin, Binbin Zhang, Jun Du, Jia Bin, Ming Li et.al. 2406.07256v1 null
2024-06-11 Reading Miscue Detection in Primary School through Automatic Speech Recognition Lingyun Gao, Cristian Tejedor-Garcia, Helmer Strik, Catia Cucchiarini et.al. 2406.07060v1 null
2024-06-06 LipGER: Visually-Conditioned Generative Error Correction for Robust Automatic Speech Recognition Sreyan Ghosh, Sonal Kumar, Ashish Seth, Purva Chiniya, Utkarsh Tyagi, Ramani Duraiswami, Dinesh Manocha et.al. 2406.04432v1 link
2024-06-06 Speed of Light Exact Greedy Decoding for RNN-T Speech Recognition Models on GPU Daniel Galvez, Vladimir Bataev, Hainan Xu, Tim Kaldewey et.al. 2406.03791v1 null
2024-06-11 Enhancing CTC-based speech recognition with diverse modeling units Shiyi Han, Zhihong Lei, Mingbin Xu, Xingyu Na, Zhen Huang et.al. 2406.03274v2 null
2024-06-05 Error-preserving Automatic Speech Recognition of Young English Learners' Language Janick Michot, Manuela Hürlimann, Jan Deriu, Luzia Sauer, Katsiaryna Mlynchyk, Mark Cieliebak et.al. 2406.03235v1 link
2024-06-15 Task Arithmetic can Mitigate Synthetic-to-Real Gap in Automatic Speech Recognition Hsuan Su, Hua Farn, Fan-Yun Sun, Shang-Tse Chen, Hung-yi Lee et.al. 2406.02925v2 null
2024-06-04 Keyword-Guided Adaptation of Automatic Speech Recognition Aviv Shamsian, Aviv Navon, Neta Glazer, Gill Hetz, Joseph Keshet et.al. 2406.02649v1 null
2024-05-03 Combining X-Vectors and Bayesian Batch Active Learning: Two-Stage Active Learning Pipeline for Speech Recognition Ognjen Kundacina, Vladimir Vincan, Dragisa Miskovic et.al. 2406.02566v1 null
2024-04-24 Gated Low-rank Adaptation for personalized Code-Switching Automatic Speech Recognition on the low-spec devices Gwantae Kim, Bokyeung Lee, Donghyeon Kim, Hanseok Ko et.al. 2406.02562v1 null
2024-04-23 Breaking Walls: Pioneering Automatic Speech Recognition for Central Kurdish: End-to-End Transformer Paradigm Abdulhady Abas Abdullah, Hadi Veisi, Tarik Rashid et.al. 2406.02561v1 null
2024-03-27 PhoWhisper: Automatic Speech Recognition for Vietnamese Thanh-Thien Le, Linh The Nguyen, Dat Quoc Nguyen et.al. 2406.02555v1 link
2024-06-04 Whistle: Data-Efficient Multilingual and Crosslingual Speech Recognition via Weakly Phonetic Supervision Saierdaer Yusuyin, Te Ma, Hao Huang, Wenbo Zhao, Zhijian Ou et.al. 2406.02166v1 link
2024-05-27 ViSpeR: Multilingual Audio-Visual Speech Recognition Sanath Narayan, Yasser Abdelaziz Dahou Djilali, Ankit Singh, Eustache Le Bihan, Hakim Hacid et.al. 2406.00038v1 null
2024-05-27 Federating Dynamic Models using Early-Exit Architectures for Automatic Speech Recognition on Heterogeneous Clients Mohamed Nabih Ali, Alessio Brutti, Daniele Falavigna et.al. 2405.17376v1 null
2024-05-24 Denoising LM: Pushing the Limits of Error Correction Models for Speech Recognition Zijin Gu, Tatiana Likhomanenko, He Bai, Erik McDermott, Ronan Collobert, Navdeep Jaitly et.al. 2405.15216v1 null
2024-05-22 Joint Optimization of Streaming and Non-Streaming Automatic Speech Recognition with Multi-Decoder and Knowledge Distillation Muhammad Shakeel, Yui Sudo, Yifan Peng, Shinji Watanabe et.al. 2405.13514v1 null
2024-05-22 Contextualized Automatic Speech Recognition with Dynamic Vocabulary Yui Sudo, Yosuke Fukumoto, Muhammad Shakeel, Yifan Peng, Shinji Watanabe et.al. 2405.13344v1 null
2024-05-28 FairLENS: Assessing Fairness in Law Enforcement Speech Recognition Yicheng Wang, Mark Cusick, Mohamed Laila, Kate Puech, Zhengping Ji, Xia Hu, Michael Wilson, Noah Spitzer-Williams, Bryan Wheeler, Yasser Ibrahim et.al. 2405.13166v2 null
2024-05-15 Continued Pretraining for Domain Adaptation of Wav2vec2.0 in Automatic Speech Recognition for Elementary Math Classroom Settings Ahmed Adel Attia, Dorottya Demszky, Tolulope Ogunremi, Jing Liu, Carol Espy-Wilson et.al. 2405.13018v1 null
2024-03-14 Multilingual Audio-Visual Speech Recognition with Hybrid CTC/RNN-T Fast Conformer Maxime Burchi, Krishna C. Puvvada, Jagadeesh Balam, Boris Ginsburg, Radu Timofte et.al. 2405.12983v1 null
2024-05-17 Acoustic modeling for Overlapping Speech Recognition: JHU Chime-5 Challenge System Vimal Manohar, Szu-Jui Chen, Zhiqi Wang, Yusuke Fujita, Shinji Watanabe, Sanjeev Khudanpur et.al. 2405.11078v1 link
2024-05-16 Listen Again and Choose the Right Answer: A New Paradigm for Automatic Speech Recognition with Large Language Models Yuchen Hu, Chen Chen, Chengwei Qin, Qiushi Zhu, Eng Siong Chng, Ruizhe Li et.al. 2405.10025v1 null
2024-05-15 Towards Evaluating the Robustness of Automatic Speech Recognition Systems via Audio Style Transfer Weifei Jin, Yuxin Cao, Junjie Su, Qi Shen, Kai Ye, Derui Wang, Jie Hao, Ziyao Liu et.al. 2405.09470v1 null
2024-05-10 Lost in Transcription: Identifying and Quantifying the Accuracy Biases of Automatic Speech Recognition Systems Against Disfluent Speech Dena Mujtaba, Nihar R. Mahapatra, Megan Arney, J. Scott Yaruss, Hope Gerlach-Houck, Caryn Herring, Jia Bin et.al. 2405.06150v1 null
2024-05-09 The RoyalFlush Automatic Speech Diarization and Recognition System for In-Car Multi-Channel Automatic Speech Recognition Challenge Jingguang Tian, Shuaishuai Ye, Shunfei Chen, Yang Xiang, Zhaohui Yin, Xinhui Hu, Xinkang Xu et.al. 2405.05498v1 null
2024-05-06 MMGER: Multi-modal and Multi-granularity Generative Error Correction with LLM for Joint Accent and Speech Recognition Bingshen Mu, Yangze Li, Qijie Shao, Kun Wei, Xucheng Wan, Naijun Zheng, Huan Zhou, Lei Xie et.al. 2405.03152v1 null
2024-05-02 Low-resource speech recognition and dialect identification of Irish in a multi-task framework Liam Lonergan, Mengjie Qian, Neasa Ní Chiaráin, Christer Gobl, Ailbhe Ní Chasaide et.al. 2405.01293v1 null
2024-05-02 Deep Learning Models in Speech Recognition: Measuring GPU Energy Consumption, Impact of Noise and Model Quantization for Edge Deployment Aditya Chakravarty et.al. 2405.01004v1 link
2024-07-24 Confides: A Visual Analytics Solution for Automated Speech Recognition Analysis and Exploration Sunwoo Ha, Chaehun Lim, R. Jordan Crouser, Alvitta Ottley et.al. 2405.00223v2 null
2024-04-30 EfficientASR: Speech Recognition Network Compression via Attention Redundancy and Chunk-Level FFN Optimization Jianzong Wang, Ziqi Liang, Xulong Zhang, Ning Cheng, Jing Xiao et.al. 2404.19214v1 null
2024-04-26 Child Speech Recognition in Human-Robot Interaction: Problem Solved? Ruben Janssens, Eva Verhelst, Giulio Antonio Abbo, Qiaoqiao Ren, Maria Jose Pinto Bernal, Tony Belpaeme et.al. 2404.17394v1 null
2024-04-26 Automatic Speech Recognition System-Independent Word Error Rate Estimation Chanho Park, Mingjie Chen, Thomas Hain et.al. 2404.16743v2 null
2024-04-25 Developing Acoustic Models for Automatic Speech Recognition in Swedish Giampiero Salvi et.al. 2404.16547v1 null
2024-04-23 Killkan: The Automatic Speech Recognition Dataset for Kichwa with Morphosyntactic Information Chihiro Taguchi, Jefferson Saransig, Dayana Velásquez, David Chiang et.al. 2404.15501v1 link
2024-04-23 Rethinking Processing Distortions: Disentangling the Impact of Speech Enhancement Errors on Speech Recognition Performance Tsubasa Ochiai, Kazuma Iwamoto, Marc Delcroix, Rintaro Ikeshita, Hiroshi Sato, Shoko Araki, Shigeru Katagiri et.al. 2404.14860v1 null
2024-04-20 Semantically Corrected Amharic Automatic Speech Recognition Samuael Adnew, Paul Pu Liang et.al. 2404.13362v1 link
2024-04-19 Efficient infusion of self-supervised representations in Automatic Speech Recognition Darshan Prabhu, Sai Ganesh Mirishkar, Pankaj Wasnik et.al. 2404.12628v1 null
2024-07-26 Automatic Speech Recognition Advancements for Indigenous Languages of the Americas Monica Romero, Sandra Gomez, Ivan G. Torre et.al. 2404.08368v2 null
2024-05-28 VietMed: A Dataset and Benchmark for Automatic Speech Recognition of Vietnamese in the Medical Domain Khai Le-Duc et.al. 2404.05659v2 link
2024-04-04 Transducers with Pronunciation-aware Embeddings for Automatic Speech Recognition Hainan Xu, Zhehuai Chen, Fei Jia, Boris Ginsburg et.al. 2404.04295v1 null
2024-04-03 Mai Ho'omāuna i ka 'Ai: Language Models Improve Automatic Speech Recognition in Hawaiian Kaavya Chaparala, Guido Zarrella, Bruce Torres Fischer, Larry Kimura, Oiwi Parker Jones et.al. 2404.03073v1 null
2024-04-02 BRAVEn: Improving Self-Supervised Pre-training for Visual and Auditory Speech Recognition Alexandros Haliassos, Andreas Zinonos, Rodrigo Mira, Stavros Petridis, Maja Pantic et.al. 2404.02098v1 link
2024-03-28 Multi-Stage Multi-Modal Pre-Training for Automatic Speech Recognition Yash Jain, David Chan, Pranav Dheram, Aparna Khare, Olabanji Shonibare, Venkatesh Ravichandran, Shalini Ghosh et.al. 2403.19822v1 null
2024-03-04 JEP-KD: Joint-Embedding Predictive Architecture Based Knowledge Distillation for Visual Speech Recognition Chang Sun, Hong Yang, Bo Qin et.al. 2403.18843v1 null
2024-04-11 DANCER: Entity Description Augmented Named Entity Corrector for Automatic Speech Recognition Yi-Cheng Wang, Hsin-Wei Wang, Bi-Cheng Yan, Chi-Han Lin, Berlin Chen et.al. 2403.17645v3 null
2024-03-20 Advanced Long-Content Speech Recognition With Factorized Neural Transducer Xun Gong, Yu Wu, Jinyu Li, Shujie Liu, Rui Zhao, Xie Chen, Yanmin Qian et.al. 2403.13423v1 null
2024-03-18 AdaMER-CTC: Connectionist Temporal Classification with Adaptive Maximum Entropy Regularization for Automatic Speech Recognition SooHwan Eom, Eunseop Yoon, Hee Suk Yoon, Chanwoo Kim, Mark Hasegawa-Johnson, Chang D. Yoo et.al. 2403.11578v1 null
2024-03-14 More than words: Advancements and challenges in speech recognition for singing Anna Kruspe et.al. 2403.09298v1 null
2024-05-21 Skipformer: A Skip-and-Recover Strategy for Efficient Speech Recognition Wenjing Zhu, Sining Sun, Changhao Shan, Peng Fan, Qing Yang et.al. 2403.08258v2 null
2024-03-13 SpeechColab Leaderboard: An Open-Source Platform for Automatic Speech Recognition Evaluation Jiayu Du, Jinpeng Li, Guoguo Chen, Wei-Qiang Zhang et.al. 2403.08196v1 link
2024-03-13 Automatic Speech Recognition (ASR) for the Diagnosis of pronunciation of Speech Sound Disorders in Korean children Taekyung Ahn, Yeonjung Hong, Younggon Im, Do Hyung Kim, Dayoung Kang, Joo Won Jeong, Jae Won Kim, Min Jung Kim, Ah-ra Cho, Dae-Hyun Jang, Hosung Nam et.al. 2403.08187v1 null
2024-03-12 Gujarati-English Code-Switching Speech Recognition using ensemble prediction of spoken language Yash Sharma, Basil Abraham, Preethi Jyothi et.al. 2403.08011v1 null
2024-03-11 The evaluation of a code-switched Sepedi-English automatic speech recognition system Amanda Phaladi, Thipe Modipa et.al. 2403.07947v1 null
2024-03-08 Speech Robust Bench: A Robustness Benchmark For Speech Recognition Muhammad A. Shah, David Solans Noguero, Mikko A. Heikkila, Nicolas Kourtellis et.al. 2403.07937v1 null
2024-03-12 Beyond the Labels: Unveiling Text-Dependency in Paralinguistic Speech Recognition Datasets Jan Pešán, Santosh Kesiraju, Lukáš Burget, Jan ''Honza'' Černocký et.al. 2403.07767v1 null
2024-03-09 Aligning Speech to Languages to Enhance Code-switching Speech Recognition Hexin Liu, Xiangyu Zhang, Leibny Paola Garcia, Andy W. H. Khong, Eng Siong Chng, Shinji Watanabe et.al. 2403.05887v1 null
2024-05-30 A New Benchmark for Evaluating Automatic Speech Recognition in the Arabic Call Domain Qusai Abo Obaidah, Muhy Eddin Za'ter, Adnan Jaljuli, Ali Mahboub, Asma Hakouz, Bashar Al-Rfooh, Yazan Estaitia et.al. 2403.04280v2 null
2024-03-07 A Study of Dropout-Induced Modality Bias on Robustness to Missing Video Frames for Audio-Visual Speech Recognition Yusheng Dai, Hang Chen, Jun Du, Ruoyu Wang, Shihao Chen, Jiefeng Ma, Haotian Wang, Chin-Hui Lee et.al. 2403.04245v1 link
2024-03-05 AIx Speed: Playback Speed Optimization Using Listening Comprehension of Speech Recognition Models Kazuki Kawamura, Jun Rekimoto et.al. 2403.02938v1 null
2024-04-18 Automatic Speech Recognition using Advanced Deep Learning Approaches: A survey Hamza Kheddar, Mustapha Hemis, Yassine Himeur et.al. 2403.01255v2 null
2024-03-01 Post-decoder Biasing for End-to-End Speech Recognition of Multi-turn Medical Interview Heyang Liu, Yu Wang, Yanfeng Wang et.al. 2403.00370v1 null
2024-02-29 Probing the Information Encoded in Neural-based Acoustic Models of Automatic Speech Recognition Systems Quentin Raymondaud, Mickael Rouvier, Richard Dufour et.al. 2402.19443v1 null
2024-02-29 Inappropriate Pause Detection In Dysarthric Speech Using Large-Scale Speech Recognition Jeehyun Lee, Yerin Choi, Tae-Jin Song, Myoung-Wan Koo et.al. 2402.18923v1 null
2024-06-04 Exploration of Adapter for Noise Robust Automatic Speech Recognition Hao Shi, Tatsuya Kawahara et.al. 2402.18275v3 null
2024-06-19 Twists, Humps, and Pebbles: Multilingual Speech Recognition Models Exhibit Gender Performance Gaps Giuseppe Attanasio, Beatrice Savoldi, Dennis Fucci, Dirk Hovy et.al. 2402.17954v2 link
2024-02-27 An Effective Mixture-Of-Experts Approach For Code-Switching Speech Recognition Leveraging Encoder Disentanglement Tzu-Ting Yang, Hsin-Wei Wang, Yi-Cheng Wang, Chi-Han Lin, Berlin Chen et.al. 2402.17189v1 null
2024-04-01 ArEEG_Chars: Dataset for Envisioned Speech Recognition using EEG for Arabic Characters Hazem Darwish, Abdalrahman Al Malah, Khloud Al Jallad, Nada Ghneim et.al. 2402.15733v2 null
2024-02-20 How do Hyenas deal with Human Speech? Speech Recognition and Translation with ConfHyena Marco Gaido, Sara Papi, Matteo Negri, Luisa Bentivogli et.al. 2402.13208v1 link
2024-02-20 Not All Weights Are Created Equal: Enhancing Energy Efficiency in On-Device Streaming Speech Recognition Yang Li, Yuan Shangguan, Yuhao Wang, Liangzhen Lai, Ernie Chang, Changsheng Zhao, Yangyang Shi, Vikas Chandra et.al. 2402.13076v1 null
2024-02-20 Comparison of Conventional Hybrid and CTC/Attention Decoders for Continuous Visual Speech Recognition David Gimeno-Gómez, Carlos-D. Martínez-Hinarejos et.al. 2402.13004v1 null
2024-06-16 OWSM-CTC: An Open Encoder-Only Speech Foundation Model for Speech Recognition, Translation, and Language Identification Yifan Peng, Yui Sudo, Muhammad Shakeel, Shinji Watanabe et.al. 2402.12654v2 null
2024-01-04 AntiDeepFake: AI for Deep Fake Speech Recognition Enkhtogtokh Togootogtokh, Christian Klasen et.al. 2402.10218v1 null
2024-02-09 Self-consistent context aware conformer transducer for speech recognition Konstantin Kolokolov, Pavel Pekichev, Karthik Raghunathan et.al. 2402.06592v1 null
2024-02-08 It's Never Too Late: Fusing Acoustic Information into Large Language Models for Automatic Speech Recognition Chen Chen, Ruizhe Li, Yuchen Hu, Sabato Marco Siniscalchi, Pin-Yu Chen, Ensiong Chng, Chao-Han Huck Yang et.al. 2402.05457v1 null
2023-10-15 Large Vocabulary Spontaneous Speech Recognition for Tigrigna Ataklti Kahsu, Solomon Teferra et.al. 2402.04254v1 null
2024-02-05 A Comprehensive Study of the Current State-of-the-Art in Nepali Automatic Speech Recognition Systems Rupak Raj Ghimire, Bal Krishna Bal, Prakash Poudyal et.al. 2402.03050v1 null
2024-02-03 Predicting positive transfer for improved low-resource speech recognition using acoustic pseudo-tokens Nay San, Georgios Paraskevopoulos, Aryaman Arora, Xiluo He, Prabhjot Kaur, Oliver Adams, Dan Jurafsky et.al. 2402.02302v1 null
2024-02-01 Introduction to speech recognition Gabriel Dauphin et.al. 2402.01778v1 null
2024-01-31 Exploring the limits of decoder-only models trained on public speech recognition corpora Ankit Gupta, George Saon, Brian Kingsbury et.al. 2402.00235v1 null
2024-02-08 Computation and Parameter Efficient Multi-Modal Fusion Transformer for Cued Speech Recognition Lei Liu, Li Liu, Haizhou Li et.al. 2401.17604v2 null
2024-01-28 Byte Pair Encoding Is All You Need For Automatic Bengali Speech Recognition Ahnaf Mozib Samin et.al. 2401.15532v1 null
2024-01-26 Toward Practical Automatic Speech Recognition and Post-Processing: a Call for Explainable Error Benchmark Guideline Seonmin Koo, Chanjun Park, Jinsung Kim, Jaehyung Seo, Sugyeong Eo, Hyeonseok Moon, Heuiseok Lim et.al. 2401.14625v1 null
2024-01-19 Contextualized Automatic Speech Recognition with Attention-Based Bias Phrase Boosted Beam Search Yui Sudo, Muhammad Shakeel, Yosuke Fukumoto, Yifan Peng, Shinji Watanabe et.al. 2401.10449v1 null
2024-01-19 Investigating Training Strategies and Model Robustness of Low-Rank Adaptation for Language Modeling in Speech Recognition Yu Yu, Chao-Han Huck Yang, Tuan Dinh, Sungho Ryu, Jari Kolehmainen, Roger Ren, Denis Filimonov, Prashanth G. Shivakumar, Ankur Gandhe, Ariya Rastow, Jia Xu, Ivan Bulyko, Andreas Stolcke et.al. 2401.10447v1 null
2024-01-19 Large Language Models are Efficient Learners of Noise-Robust Speech Recognition Yuchen Hu, Chen Chen, Chao-Han Huck Yang, Ruizhe Li, Chao Zhang, Pin-Yu Chen, EnSiong Chng et.al. 2401.10446v1 link
2024-01-18 AGADIR: Towards Array-Geometry Agnostic Directional Speech Recognition Ju Lin, Niko Moritz, Yiteng Huang, Ruiming Xie, Ming Sun, Christian Fuegen, Frank Seide et.al. 2401.10411v1 null
2024-01-18 Multilingual Visual Speech Recognition with a Single Model by Learning with Discrete Visual Speech Units Minsu Kim, Jeong Hun Yeo, Jeongsoo Choi, Se Jin Park, Yong Man Ro et.al. 2401.09802v1 null
2024-01-18 SlideAVSR: A Dataset of Paper Explanation Videos for Audio-Visual Speech Recognition Hao Wang, Shuhei Kurita, Shuichiro Shimizu, Daisuke Kawahara et.al. 2401.09759v1 null
2024-01-17 Two-pass Endpoint Detection for Speech Recognition Anirudh Raju, Aparna Khare, Di He, Ilya Sklyar, Long Chen, Sam Alptekin, Viet Anh Trinh, Zhe Zhang, Colin Vaz, Venkatesh Ravichandran, Roland Maas, Ariya Rastrow et.al. 2401.08916v1 null
2024-01-15 SeMaScore : a new evaluation metric for automatic speech recognition tasks Zitha Sasindran, Harsha Yelchuri, T. V. Prabhakar et.al. 2401.07506v1 null
2024-01-13 Joint Unsupervised and Supervised Training for Automatic Speech Recognition via Bilevel Optimization A F M Saif, Xiaodong Cui, Han Shen, Songtao Lu, Brian Kingsbury, Tianyi Chen et.al. 2401.06980v1 link
2024-02-29 The NPU-ASLP-LiAuto System Description for Visual Speech Recognition in CNVSRC 2023 He Wang, Pengcheng Guo, Wei Chen, Pan Zhou, Lei Xie et.al. 2401.06788v2 link
2024-01-12 Dynamic Behaviour of Connectionist Speech Recognition with Strong Latency Constraints Giampiero Salvi et.al. 2401.06588v1 null
2024-01-12 LCB-net: Long-Context Biasing for Audio-Visual Speech Recognition Fan Yu, Haoxu Wang, Xian Shi, Shiliang Zhang et.al. 2401.06390v1 link
2024-01-11 UCorrect: An Unsupervised Framework for Automatic Speech Recognition Error Correction Jiaxin Guo, Minghan Wang, Xiaosong Qiao, Daimeng Wei, Hengchao Shang, Zongyao Li, Zhengzhe Yu, Yinglu Li, Chang Su, Min Zhang, Shimin Tao, Hao Yang et.al. 2401.05689v1 null
2024-01-10 Useful Blunders: Can Automated Speech Recognition Errors Improve Downstream Dementia Classification? Changye Li, Weizhe Xu, Trevor Cohen, Serguei Pakhomov et.al. 2401.05551v1 null
2024-01-09 Continuously Learning New Words in Automatic Speech Recognition Christian Huber, Alexander Waibel et.al. 2401.04482v1 null
2024-01-08 Cross-Speaker Encoding Network for Multi-Talker Speech Recognition Jiawen Kang, Lingwei Meng, Mingyu Cui, Haohan Guo, Xixin Wu, Xunying Liu, Helen Meng et.al. 2401.04152v1 null
2024-02-21 ICMC-ASR: The ICASSP 2024 In-Car Multi-Channel Automatic Speech Recognition Challenge He Wang, Pengcheng Guo, Yue Li, Ao Zhang, Jiayao Sun, Lei Xie, Wei Chen, Pan Zhou, Hui Bu, Xin Xu, Binbin Zhang, Zhuo Chen, Jian Wu, Longbiao Wang, Eng Siong Chng, Sun Li et.al. 2401.03473v3 null
2024-04-08 MLCA-AVSR: Multi-Layer Cross Attention Fusion based Audio-Visual Speech Recognition He Wang, Pengcheng Guo, Pan Zhou, Lei Xie et.al. 2401.03424v3 null
2024-01-05 A unified multichannel far-field speech recognition system: combining neural beamforming with attention based end-to-end model Dongdi Zhao, Jianbo Ma, Lu Lu, Jinke Li, Xuan Ji, Lei Zhu, Fuming Fang, Ming Liu, Feijun Jiang et.al. 2401.02673v1 null
2024-01-04 Task Oriented Dialogue as a Catalyst for Self-Supervised Automatic Speech Recognition David M. Chan, Shalini Ghosh, Hitesh Tulsiani, Ariya Rastrow, Björn Hoffmeister et.al. 2401.02417v1 link
2024-01-04 CTC Blank Triggered Dynamic Layer-Skipping for Efficient CTC-based Speech Recognition Junfeng Hou, Peiyao Wang, Jincheng Zhang, Meng Yang, Minwei Feng, Jingcheng Yin et.al. 2401.02046v1 null
2024-01-03 Hallucinations in Neural Automatic Speech Recognition: Identifying Errors and Hallucinatory Models Rita Frieske, Bertram E. Shi et.al. 2401.01572v1 null
2024-01-01 Enhancing Pre-trained ASR System Fine-tuning for Dysarthric Speech Recognition using Adversarial Data Augmentation Huimeng Wang, Zengrui Jin, Mengzhe Geng, Shujie Hu, Guinan Li, Tianzi Wang, Haoning Xu, Xunying Liu et.al. 2401.00662v1 null
2024-05-02 Stateful Conformer with Cache-based Inference for Streaming Automatic Speech Recognition Vahid Noroozi, Somshubra Majumdar, Ankur Kumar, Jagadeesh Balam, Boris Ginsburg et.al. 2312.17279v3 null
2023-12-22 BLSTM-Based Confidence Estimation for End-to-End Speech Recognition Atsunori Ogawa, Naohiro Tawara, Takatomo Kano, Marc Delcroix et.al. 2312.14609v1 null
2024-02-09 Multimodal Attention Merging for Improved Speech Recognition and Audio Event Classification Anirudh S. Sundar, Chao-Han Huck Yang, David M. Chan, Shalini Ghosh, Venkatesh Ravichandran, Phani Sankar Nidadavolu et.al. 2312.14378v2 null
2023-12-21 BANSpEmo: A Bangla Emotional Speech Recognition Dataset Md Gulzar Hussain, Mahmuda Rahman, Babe Sultana, Ye Shiren et.al. 2312.14020v1 null
2023-12-20 Stable Distillation: Regularizing Continued Pre-training for Low-Resource Automatic Speech Recognition Ashish Seth, Sreyan Ghosh, S. Umesh, Dinesh Manocha et.al. 2312.12783v1 link
2024-01-11 Automated speech audiometry: Can it work using open-source pre-trained Kaldi-NL automatic speech recognition? Gloria Araiza-Illan, Luke Meyer, Khiet P. Truong, Deniz Baskent et.al. 2312.12269v2 null
2023-12-18 Improved Long-Form Speech Recognition by Jointly Modeling the Primary and Non-primary Speakers Guru Prakash Arumugam, Shuo-yiin Chang, Tara N. Sainath, Rohit Prabhavalkar, Quan Wang, Shaan Bijwadia et.al. 2312.11123v1 null
2023-12-18 Speaker Mask Transformer for Multi-talker Overlapped Speech Recognition Peng Shen, Xugang Lu, Hisashi Kawai et.al. 2312.10959v1 null
2024-05-13 Conformer-Based Speech Recognition On Extreme Edge-Computing Devices Mingbin Xu, Alex Jin, Sicheng Wang, Mu Su, Tim Ng, Henry Mason, Shiyi Han, Zhihong Lei, Yaqiao Deng, Zhen Huang, Mahesh Krishnamoorthy et.al. 2312.10359v3 null
2023-12-19 On Robustness to Missing Video for Audiovisual Speech Recognition Oscar Chang, Otavio Braga, Hank Liao, Dmitriy Serdyuk, Olivier Siohan et.al. 2312.10088v2 null
2023-12-19 Revisiting the Entropy Semiring for Neural Speech Recognition Oscar Chang, Dongseong Hwang, Olivier Siohan et.al. 2312.10087v2 null
2023-12-15 On the compression of shallow non-causal ASR models using knowledge distillation and tied-and-reduced decoder for low-latency on-device speech recognition Nagaraj Adiga, Jinhwan Park, Chintigari Shiva Kumar, Shatrughan Singh, Kyungmin Lee, Chanwoo Kim, Dhananjaya Gowda et.al. 2312.09842v1 null
2023-12-15 Automatic channel selection and spatial feature integration for multi-channel speech recognition across various array topologies Bingshen Mu, Pengcheng Guo, Dake Guo, Pan Zhou, Wei Chen, Lei Xie et.al. 2312.09746v1 null
2023-12-15 LiteVSR: Efficient Visual Speech Recognition by Learning from Speech Representations of Unlabeled Data Hendrik Laux, Emil Mededovic, Ahmed Hallawa, Lukas Martin, Arne Peine, Anke Schmeink et.al. 2312.09727v1 null
2023-12-15 Leveraging Language ID to Calculate Intermediate CTC Loss for Enhanced Code-Switching Speech Recognition Tzu-Ting Yang, Hsin-Wei Wang, Berlin Chen et.al. 2312.09583v1 null
2023-12-15 IR-UWB Radar-Based Contactless Silent Speech Recognition of Vowels, Consonants, Words, and Phrases Sunghwa Lee, Younghoon Shin, Myungjong Kim, Jiwon Seo et.al. 2312.09572v1 null
2024-01-12 Attention-Guided Adaptation for Code-Switching Speech Recognition Bobbi Aditya, Mahdin Rohmatillah, Liang-Hsuan Tai, Jen-Tzung Chien et.al. 2312.08856v2 null
2023-12-14 Hourglass-AVSR: Down-Up Sampling-based Computational Efficiency Model for Audio-Visual Speech Recognition Fan Yu, Haoxu Wang, Ziyang Ma, Shiliang Zhang et.al. 2312.08850v1 null
2023-12-14 Towards Automatic Data Augmentation for Disordered Speech Recognition Zengrui Jin, Xurong Xie, Tianzi Wang, Mengzhe Geng, Jiajun Deng, Guinan Li, Shujie Hu, Xunying Liu et.al. 2312.08641v1 null
2023-12-13 PhasePerturbation: Speech Data Augmentation via Phase Perturbation for Automatic Speech Recognition Chengxi Lei, Satwinder Singh, Feng Hou, Xiaoyun Jia, Ruili Wang et.al. 2312.08571v1 null
2024-01-16 USM-Lite: Quantization and Sparsity Aware Fine-tuning for Speech Recognition with Universal Speech Models Shaojin Ding, David Qiu, David Rim, Yanzhang He, Oleg Rybakov, Bo Li, Rohit Prabhavalkar, Weiran Wang, Tara N. Sainath, Zhonglin Han, Jian Li, Amir Yazdanbakhsh, Shivani Agrawal et.al. 2312.08553v3 null
2023-12-11 Deep Photonic Reservoir Computer for Speech Recognition Enrico Picco, Alessandro Lupo, Serge Massar et.al. 2312.06558v1 null
2023-12-06 An Integration of Pre-Trained Speech and Language Models for End-to-End Speech Recognition Yukiya Hono, Koh Mitsuda, Tianyu Zhao, Kentaro Mitsui, Toshiaki Wakatsuki, Kei Sawada et.al. 2312.03668v1 null
2023-11-29 FAT-HuBERT: Front-end Adaptive Training of Hidden-unit BERT for Distortion-Invariant Robust Speech Recognition Dongning Yang, Wei Wang, Yanmin Qian et.al. 2311.17790v1 null
2023-11-29 Adapting OpenAI's Whisper for Speech Recognition on Code-Switch Mandarin-English SEAME and ASRU2019 Datasets Yuhang Yang, Yizhou Peng, Xionghu Zhong, Hao Huang, Eng Siong Chng et.al. 2311.17382v1 null
2023-11-25 Multilingual self-supervised speech representations improve the speech recognition of low-resource African languages with codeswitching Tolúlopé Ògúnrèmí, Christopher D. Manning, Dan Jurafsky et.al. 2311.15077v1 null
2023-11-21 Speaker-Adapted End-to-End Visual Speech Recognition for Continuous Spanish David Gimeno-Gómez, Carlos-D. Martínez-Hinarejos et.al. 2311.12480v1 null
2023-11-20 How does end-to-end speech recognition training impact speech enhancement artifacts? Kazuma Iwamoto, Tsubasa Ochiai, Marc Delcroix, Rintaro Ikeshita, Hiroshi Sato, Shoko Araki, Shigeru Katagiri et.al. 2311.11599v1 null
2023-11-19 Label-Synchronous Neural Transducer for Adaptable Online E2E Speech Recognition Keqi Deng, Philip C. Woodland et.al. 2311.11353v1 null
2023-11-17 GhostVec: A New Threat to Speaker Privacy of End-to-End Speech Recognition System Xiaojiao Chen, Sheng Li, Jiyi Li, Hao Huang, Yang Cao, Liang He et.al. 2311.10689v1 null
2023-11-09 Improving Whispered Speech Recognition Performance using Pseudo-whispered based Data Augmentation Zhaofeng Lin, Tanvina Patel, Odette Scharenborg et.al. 2311.05179v1 link
2023-11-08 GPU-Accelerated WFST Beam Search Decoder for CTC-based Speech Recognition Daniel Galvez, Tim Kaldewey et.al. 2311.04996v1 link
2023-11-07 A comparative analysis between Conformer-Transducer, Whisper, and wav2vec2 for improving the child speech recognition Andrei Barcovschi, Rishabh Jain, Peter Corcoran et.al. 2311.04936v1 link
2023-11-07 Fine-tuning convergence model in Bengali speech recognition Zhu Ruiying, Shen Meng et.al. 2311.04122v1 null
2023-11-06 Pseudo-Labeling for Domain-Agnostic Bangla Automatic Speech Recognition Rabindra Nath Nandi, Mehadi Hasan Menon, Tareq Al Muntasir, Sagor Sarker, Quazi Sarwar Muhtaseem, Md. Tariqul Islam, Shammur Absar Chowdhury, Firoj Alam et.al. 2311.03196v1 link
2023-10-20 Intelligibility prediction with a pretrained noise-robust automatic speech recognition model Zehai Tu, Ning Ma, Jon Barker et.al. 2310.19817v1 null
2023-10-29 MUST: A Multilingual Student-Teacher Learning approach for low-resource speech recognition Muhammad Umar Farooq, Rehan Ahmad, Thomas Hain et.al. 2310.18865v1 null
2023-10-27 MixRep: Hidden Representation Mixup for Low-Resource Speech Recognition Jiamin Xie, John H. L. Hansen et.al. 2310.18450v1 link
2023-10-27 TorchAudio 2.1: Advancing speech recognition, self-supervised learning, and audio processing components for PyTorch Jeff Hwang, Moto Hira, Caroline Chen, Xiaohui Zhang, Zhaoheng Ni, Guangzhi Sun, Pingchuan Ma, Ruizhe Huang, Vineel Pratap, Yuekai Zhang, Anurag Kumar, Chin-Yun Yu, Chuang Zhu, Chunxi Liu, Jacob Kahn, Mirco Ravanelli, Peng Sun, Shinji Watanabe, Yangyang Shi, Yumeng Tao, Robin Scheibler, Samuele Cornell, Sean Kim, Stavros Petridis et.al. 2310.17864v1 link
2023-10-25 Back Transcription as a Method for Evaluating Robustness of Natural Language Understanding Models to Speech Recognition Errors Marek Kubis, Paweł Skórzewski, Marcin Sowański, Tomasz Ziętkiewicz et.al. 2310.16609v1 link
2023-10-27 Accented Speech Recognition With Accent-specific Codebooks Darshan Prabhu, Preethi Jyothi, Sriram Ganapathy, Vinit Unni et.al. 2310.15970v3 link
2023-10-28 Key Frame Mechanism For Efficient Conformer Based End-to-end Speech Recognition Peng Fan, Changhao Shan, Sining Sun, Qing Yang, Jianwei Zhang et.al. 2310.14954v2 link
2023-10-23 Intuitive Multilingual Audio-Visual Speech Recognition with a Single-Trained Model Joanna Hong, Se Jin Park, Yong Man Ro et.al. 2310.14946v1 null
2023-10-22 Conversational Speech Recognition by Learning Audio-textual Cross-modal Contextual Representation Kun Wei, Bei Li, Hang Lv, Quan Lu, Ning Jiang, Lei Xie et.al. 2310.14278v1 null
2023-10-17 Audio-AdapterFusion: A Task-ID-free Approach for Efficient and Non-Destructive Multi-task Speech Recognition Hillary Ngai, Rohan Agrawal, Neeraj Gaur, Ronny Huang, Parisa Haghani, Pedro Moreno Mengibar et.al. 2310.13015v1 null
2023-10-17 Generative error correction for code-switching speech recognition using large language models Chen Chen, Yuchen Hu, Chao-Han Huck Yang, Hexin Liu, Sabato Marco Siniscalchi, Eng Siong Chng et.al. 2310.13013v1 null
2023-10-17 Multi-stage Large Language Model Correction for Speech Recognition Jie Pu, Thai-Son Nguyen, Sebastian Stüker et.al. 2310.11532v1 null
2024-03-05 Zipformer: A faster and better encoder for automatic speech recognition Zengwei Yao, Liyong Guo, Xiaoyu Yang, Wei Kang, Fangjun Kuang, Yifan Yang, Zengrui Jin, Long Lin, Daniel Povey et.al. 2310.11230v3 link
2023-10-27 VoxArabica: A Robust Dialect-Aware Arabic Speech Recognition System Abdul Waheed, Bashar Talafha, Peter Sullivan, AbdelRahim Elmadany, Muhammad Abdul-Mageed et.al. 2310.11069v4 null
2023-10-17 Iterative Shallow Fusion of Backward Language Model for End-to-End Speech Recognition Atsunori Ogawa, Takafumi Moriya, Naoyuki Kamo, Naohiro Tawara, Marc Delcroix et.al. 2310.11010v1 null
2023-10-17 Advanced accent/dialect identification and accentedness assessment with multi-embedding models and automatic speech recognition Shahram Ghorbani, John H. L. Hansen et.al. 2310.11004v1 null
2023-10-17 Correction Focused Language Model Training for Speech Recognition Yingyi Ma, Zhe Liu, Ozlem Kalinli et.al. 2310.11003v1 null
2023-10-16 Personalization of CTC-based End-to-End Speech Recognition Using Pronunciation-Driven Subword Tokenization Zhihong Lei, Ernest Pusateri, Shiyi Han, Leo Liu, Mingbin Xu, Tim Ng, Ruchir Travadi, Youyuan Zhang, Mirko Hannemann, Man-Hung Siu, Zhen Huang et.al. 2310.09988v1 null
2024-03-04 Improved Contextual Recognition In Automatic Speech Recognition Systems By Semantic Lattice Rescoring Ankitha Sudarshan, Vinay Samuel, Parth Patwa, Ibtihel Amara, Aman Chadha et.al. 2310.09680v4 null
2023-10-13 SALM: Speech-augmented Language Model with In-context Learning for Speech Recognition and Translation Zhehuai Chen, He Huang, Andrei Andrusenko, Oleksii Hrinchuk, Krishna C. Puvvada, Jason Li, Subhankar Ghosh, Jagadeesh Balam, Boris Ginsburg et.al. 2310.09424v1 link
2023-10-12 On the Relevance of Phoneme Duration Variability of Synthesized Training Data for Automatic Speech Recognition Nick Rossenbach, Benedikt Hilmes, Ralf Schlüter et.al. 2310.08132v1 null
2023-10-10 Acoustic Model Fusion for End-to-end Speech Recognition Zhihong Lei, Mingbin Xu, Shiyi Han, Leo Liu, Zhen Huang, Tim Ng, Yuanyuan Zhang, Ernest Pusateri, Mirko Hannemann, Yaqiao Deng, Man-Hung Siu et.al. 2310.07062v1 null
2023-10-10 No Pitch Left Behind: Addressing Gender Unbalance in Automatic Speech Recognition through Pitch Manipulation Dennis Fucci, Marco Gaido, Matteo Negri, Mauro Cettolo, Luisa Bentivogli et.al. 2310.06590v1 link
2023-10-16 Whispering LLaMA: A Cross-Modal Generative Error Correction Framework for Speech Recognition Srijith Radhakrishnan, Chao-Han Huck Yang, Sumeer Ahmad Khan, Rohit Kumar, Narsis A. Kiani, David Gomez-Cabrero, Jesper N. Tegner et.al. 2310.06434v2 link
2023-10-10 Discriminative Speech Recognition Rescoring with Pre-trained Language Models Prashanth Gurunath Shivakumar, Jari Kolehmainen, Yile Gu, Ankur Gandhe, Ariya Rastrow, Ivan Bulyko et.al. 2310.06248v1 null
2023-10-07 Spike-Triggered Contextual Biasing for End-to-End Mandarin Speech Recognition Kaixun Huang, Ao Zhang, Binbin Zhang, Tianyi Xu, Xingchen Song, Lei Xie et.al. 2310.04657v1 null
2023-12-15 Dementia Assessment Using Mandarin Speech with an Attention-based Speech Recognition Encoder Zih-Jyun Lin, Yi-Ju Chen, Po-Chih Kuo, Likai Huang, Chaur-Jong Hu, Cheng-Yu Chen et.al. 2310.03985v2 link
2023-10-06 The North System for Formosa Speech Recognition Challenge 2023 Li-Wei Chen, Kai-Chen Cheng, Hung-Shin Lee et.al. 2310.03443v2 null
2023-10-05 Neural Language Model Pruning for Automatic Speech Recognition Leonardo Emili, Thiago Fraga-Silva, Ernest Pusateri, Markus Nußbaum-Thom, Youssef Oualil et.al. 2310.03424v1 null
2023-10-08 BA-MoE: Boundary-Aware Mixture-of-Experts Adapter for Code-Switching Speech Recognition Peikun Chen, Fan Yu, Yuhao Lian, Hongfei Xue, Xucheng Wan, Naijun Zheng, Huan Zhou, Lei Xie et.al. 2310.02629v2 null
2023-10-03 Unsupervised Speech Recognition with N-Skipgram and Positional Unigram Matching Liming Wang, Mark Hasegawa-Johnson, Chang D. Yoo et.al. 2310.02382v1 link
2023-10-02 One model to rule them all ? Towards End-to-End Joint Speaker Diarization and Speech Recognition Samuele Cornell, Jee-weon Jung, Shinji Watanabe, Stefano Squartini et.al. 2310.01688v1 null
2023-09-29 Federated Learning with Differential Privacy for End-to-End Speech Recognition Martin Pelikan, Sheikh Shams Azam, Vitaly Feldman, Jan "Honza" Silovsky, Kunal Talwar, Tatiana Likhomanenko et.al. 2310.00098v1 null
2023-09-29 AV-CPL: Continuous Pseudo-Labeling for Audio-Visual Speech Recognition Andrew Rouditchenko, Ronan Collobert, Tatiana Likhomanenko et.al. 2309.17395v1 null
2023-09-29 Enhancing Code-switching Speech Recognition with Interactive Language Biases Hexin Liu, Leibny Paola Garcia, Xiangyu Zhang, Andy W. H. Khong, Sanjeev Khudanpur et.al. 2309.16953v1 null
2023-09-29 SSHR: Leveraging Self-supervised Hierarchical Representations for Multilingual Automatic Speech Recognition Hongfei Xue, Qijie Shao, Kaixun Huang, Peikun Chen, Lei Xie, Jie Liu et.al. 2309.16937v1 null
2023-09-26 Unsupervised Pre-Training for Vietnamese Automatic Speech Recognition in the HYKIST Project Khai Le-Duc et.al. 2309.15869v1 null
2023-09-27 Exploring Speech Recognition, Translation, and Understanding with Discrete Speech Units: A Comparative Study Xuankai Chang, Brian Yan, Kwanghee Choi, Jeeweon Jung, Yichen Lu, Soumi Maiti, Roshan Sharma, Jiatong Shi, Jinchuan Tian, Shinji Watanabe, Yuya Fujita, Takashi Maekaku, Pengcheng Guo, Yao-Fei Cheng, Pavel Denisov, Kohei Saijo, Hsiu-Hsuan Wang et.al. 2309.15800v1 null
2023-09-26 Learning from Flawed Data: Weakly Supervised Automatic Speech Recognition Dongji Gao, Hainan Xu, Desh Raj, Leibny Paola Garcia Perera, Daniel Povey, Sanjeev Khudanpur et.al. 2309.15796v1 link
2023-10-16 HyPoradise: An Open Baseline for Generative Speech Recognition with Large Language Models Chen Chen, Yuchen Hu, Chao-Han Huck Yang, Sabato Macro Siniscalchi, Pin-Yu Chen, Eng Siong Chng et.al. 2309.15701v2 link
2023-10-10 Generative Speech Recognition Error Correction with Large Language Models and Task-Activating Prompting Chao-Han Huck Yang, Yile Gu, Yi-Chieh Liu, Shalini Ghosh, Ivan Bulyko, Andreas Stolcke et.al. 2309.15649v2 null
2023-10-10 Low-rank Adaptation of Large Language Model Rescoring for Parameter-Efficient Speech Recognition Yu Yu, Chao-Han Huck Yang, Jari Kolehmainen, Prashanth G. Shivakumar, Yile Gu, Sungho Ryu, Roger Ren, Qi Luo, Aditya Gourav, I-Fan Chen, Yi-Chieh Liu, Tuan Dinh, Ankur Gandhe, Denis Filimonov, Shalini Ghosh, Andreas Stolcke, Ariya Rastow, Ivan Bulyko et.al. 2309.15223v2 null
2023-09-26 Updated Corpora and Benchmarks for Long-Form Speech Recognition Jennifer Drexler Fox, Desh Raj, Natalie Delworth, Quinn McNamara, Corey Miller, Migüel Jetté et.al. 2309.15013v1 link
2023-09-25 On the Impact of Quantization and Pruning of Self-Supervised Speech Models for Downstream Speech Recognition Tasks "In-the-Wild'' Arthur Pimentel, Heitor Guimarães, Anderson R. Avila, Mehdi Rezagholizadeh, Tiago H. Falk et.al. 2309.14462v1 null
2023-09-21 Bridging the Gaps of Both Modality and Language: Synchronous Bilingual CTC for Speech Translation and Speech Recognition Chen Xu, Xiaoqian Liu, Erfeng He, Yuhao Zhang, Qianqian Dong, Tong Xiao, Jingbo Zhu, Dapeng Man, Wu Yang et.al. 2309.12234v1 link
2024-01-08 Sparsely Shared LoRA on Whisper for Child Speech Recognition Wei Liu, Ying Qin, Zhiyuan Peng, Tan Lee et.al. 2309.11756v2 null
2023-09-20 AudioFool: Fast, Universal and synchronization-free Cross-Domain Attack on Speech Recognition Mohamad Fakih, Rouwaida Kanj, Fadi Kurdahi, Mohammed E. Fouda et.al. 2309.11462v1 null
2023-09-25 Leveraging Data Collection and Unsupervised Learning for Code-switched Tunisian Arabic Automatic Speech Recognition Ahmed Amine Ben Abdallah, Ata Kabboudi, Amir Kanoun, Salah Zaiem et.al. 2309.11327v2 null
2023-09-20 Directional Source Separation for Robust Speech Recognition on Smart Glasses Tiantian Feng, Ju Lin, Yiteng Huang, Weipeng He, Kaustubh Kalgaonkar, Niko Moritz, Li Wan, Xin Lei, Ming Sun, Frank Seide et.al. 2309.10993v1 null
2023-09-19 Discrete Audio Representation as an Alternative to Mel-Spectrograms for Speaker and Speech Recognition Krishna C. Puvvada, Nithin Rao Koluguri, Kunal Dhawan, Jagadeesh Balam, Boris Ginsburg et.al. 2309.10922v1 null
2023-09-19 End-to-End Speech Recognition Contextualization with Large Language Models Egor Lakomkin, Chunyang Wu, Yassir Fathullah, Ozlem Kalinli, Michael L. Seltzer, Christian Fuegen et.al. 2309.10917v1 null
2023-09-19 Harnessing the Zero-Shot Power of Instruction-Tuned Large Language Model in End-to-End Speech Recognition Yosuke Higuchi, Tetsuji Ogawa, Tetsunori Kobayashi et.al. 2309.10524v1 null
2023-09-16 Improving Speech Recognition for African American English With Audio Classification Shefali Garg, Zhouyuan Huo, Khe Chai Sim, Suzan Schwartz, Mason Chua, Alëna Aksënova, Tsendsuren Munkhdalai, Levi King, Darryl Wright, Zion Mengesha, Dongseong Hwang, Tara Sainath, Françoise Beaufays, Pedro Moreno Mengibar et.al. 2309.09996v1 null
2023-09-18 Instruction-Following Speech Recognition Cheng-I Jeff Lai, Zhiyun Lu, Liangliang Cao, Ruoming Pang et.al. 2309.09843v1 null
2023-09-18 Training dynamic models using early exits for automatic speech recognition on resource-constrained devices George August Wright, Umberto Cappellazzo, Salah Zaiem, Desh Raj, Lucas Ondel Yang, Daniele Falavigna, Alessio Brutti et.al. 2309.09546v1 null
2023-09-19 Enhancing Multilingual Speech Recognition through Language Prompt Tuning and Frame-Level Language Adapter Song Li, Yongbin You, Xuezhi Wang, Ke Ding, Guanglu Wan et.al. 2309.09443v2 null
2023-09-18 Are Soft Prompts Good Zero-shot Learners for Speech Recognition? Dianwen Ng, Chong Zhang, Ruixi Zhang, Yukun Ma, Fabian Ritter-Gutierrez, Trung Hieu Nguyen, Chongjia Ni, Shengkui Zhao, Eng Siong Chng, Bin Ma et.al. 2309.09413v1 null
2023-09-16 Decoder-only Architecture for Speech Recognition with CTC Prompts and Text Data Augmentation Emiru Tsunoo, Hayato Futami, Yosuke Kashiwagi, Siddhant Arora, Shinji Watanabe et.al. 2309.08876v1 null
2023-12-27 Augmenting conformers with structured state-space sequence models for online speech recognition Haozhe Shan, Albert Gu, Zhong Meng, Weiran Wang, Krzysztof Choromanski, Tara Sainath et.al. 2309.08551v2 null
2023-09-15 Visual Speech Recognition for Low-resource Languages with Automatic Labels From Whisper Model Jeong Hun Yeo, Minsu Kim, Shinji Watanabe, Yong Man Ro et.al. 2309.08535v1 link
2023-09-15 Chunked Attention-based Encoder-Decoder Model for Streaming Speech Recognition Mohammad Zeineldeen, Albert Zeyer, Ralf Schlüter, Hermann Ney et.al. 2309.08436v1 null
2023-09-15 Unimodal Aggregation for CTC-based Speech Recognition Ying Fang, Xiaofei Li et.al. 2309.08150v1 link
2023-09-21 Folding Attention: Memory and Power Optimization for On-Device Transformer-based Streaming Speech Recognition Yang Li, Liangzhen Lai, Yuan Shangguan, Forrest N. Iandola, Ernie Chang, Yangyang Shi, Vikas Chandra et.al. 2309.07988v2 null
2023-09-18 Voxtlm: unified decoder-only models for consolidating speech recognition/synthesis and speech/text continuation tasks Soumi Maiti, Yifan Peng, Shukjae Choi, Jee-weon Jung, Xuankai Chang, Shinji Watanabe et.al. 2309.07937v2 null
2023-09-18 Kid-Whisper: Towards Bridging the Performance Gap in Automatic Speech Recognition for Children VS. Adults Ahmed Adel Attia, Jing Liu, Wei Ai, Dorottya Demszky, Carol Espy-Wilson et.al. 2309.07927v2 null
2023-09-21 CPPF: A contextual and post-processing-free model for automatic speech recognition Lei Zhang, Zhengkun Tian, Xiang Chen, Jiaming Sun, Hongyu Xiang, Ke Ding, Guanglu Wan et.al. 2309.07413v2 null
2023-09-09 Mask-CTC-based Encoder Pre-training for Streaming End-to-End Speech Recognition Huaibo Zhao, Yosuke Higuchi, Yusuke Kida, Tetsuji Ogawa, Tetsunori Kobayashi et.al. 2309.04654v1 null
2023-09-08 End-to-End Speech Recognition and Disfluency Removal with Acoustic Language Model Pretraining Saksham Bassi, Giulio Duregon, Siddhartha Jalagam, David Roth et.al. 2309.04516v1 link
2023-10-07 Text-Only Domain Adaptation for End-to-End Speech Recognition through Down-Sampling Acoustic Representation Jiaxu Zhu, Weinan Tong, Yaoxun Xu, Changhe Song, Zhiyong Wu, Zhao You, Dan Su, Dong Yu, Helen Meng et.al. 2309.02459v2 null
2023-09-05 Bring the Noise: Introducing Noise Robustness to Pretrained Automatic Speech Recognition Patrick Eickhoff, Matthias Möller, Theresa Pekarek Rosin, Johannes Twiefel, Stefan Wermter et.al. 2309.02145v1 null
2023-10-07 SememeASR: Boosting Performance of End-to-End Speech Recognition against Domain and Long-Tailed Data Shift with Sememe Semantic Knowledge Jiaxu Zhu, Changhe Song, Zhiyong Wu, Helen Meng et.al. 2309.01437v2 null
2023-09-01 OpenIns3D: Snap and Lookup for 3D Open-vocabulary Instance Segmentation Zhening Huang, Xiaoyang Wu, Xi Chen, Hengshuang Zhao, Lei Zhu, Joan Lasenby et.al. 2309.00616v1 link
2023-09-01 Point-Bind & Point-LLM: Aligning Point Cloud with Multi-modality for 3D Understanding, Generation, and Instruction Following Ziyu Guo, Renrui Zhang, Xiangyang Zhu, Yiwen Tang, Xianzheng Ma, Jiaming Han, Kexin Chen, Peng Gao, Xianzhi Li, Hongsheng Li, Pheng-Ann Heng et.al. 2309.00615v1 link
2023-09-01 Iterative Multi-granular Image Editing using Diffusion Models K J Joseph, Prateksha Udhayanan, Tripti Shukla, Aishwarya Agarwal, Srikrishna Karanam, Koustava Goswami, Balaji Vasan Srinivasan et.al. 2309.00613v1 null
2023-09-01 CityDreamer: Compositional Generative Model of Unbounded 3D Cities Haozhe Xie, Zhaoxi Chen, Fangzhou Hong, Ziwei Liu et.al. 2309.00610v1 null
2023-09-01 Time Series Analysis of Urban Liveability Alex Levering, Diego Marcos, Devis Tuia et.al. 2309.00594v1 null
2023-09-01 Discrete Morphological Neural Networks Diego Marcondes, Junior Barrera et.al. 2309.00588v1 link
2023-09-01 Mechanism of feature learning in convolutional neural networks Daniel Beaglehole, Adityanarayanan Radhakrishnan, Parthe Pandit, Mikhail Belkin et.al. 2309.00570v1 link
2023-09-01 Amyloid-Beta Axial Plane PET Synthesis from Structural MRI: An Image Translation Approach for Screening Alzheimer's Disease Fernando Vega, Abdoljalil Addeh, M. Ethan MacDonald et.al. 2309.00569v1 null
2023-09-01 Impact of Image Context for Single Deep Learning Face Morphing Attack Detection Joana Pimenta, Iurii Medvedev, Nuno Gonçalves et.al. 2309.00549v1 null
2023-09-01 Trust your Good Friends: Source-free Domain Adaptation by Reciprocal Neighborhood Clustering Shiqi Yang, Yaxing Wang, Joost van de Weijer, Luis Herranz, Shangling Jui, Jian Yang et.al. 2309.00528v1 null
2023-09-01 SQLdepth: Generalizable Self-Supervised Fine-Structured Monocular Depth Estimation Youhong Wang, Yunji Liang, Hao Xu, Shaohui Jiao, Hongkai Yu et.al. 2309.00526v1 null
2023-09-01 A Machine Vision Method for Correction of Eccentric Error: Based on Adaptive Enhancement Algorithm Fanyi Wang, Pin Cao, Yihui Zhang, Haotian Hu, Yongying Yang et.al. 2309.00514v1 null
2023-09-01 Multi-stage Deep Learning Artifact Reduction for Computed Tomography Jiayang Shi, Daniel M. Pelt, K. Joost Batenburg et.al. 2309.00494v1 null
2023-09-01 Asymmetric double-winged multi-view clustering network for exploring Diverse and Consistent Information Qun Zheng, Xihong Yang, Siwei Wang, Xinru An, Qi Liu et.al. 2309.00474v1 null
2023-09-01 General and Practical Tuning Method for Off-the-Shelf Graph-Based Index: SISAP Indexing Challenge Report by Team UTokyo Yutaro Oguri, Yusuke Matsui et.al. 2309.00472v1 link
2023-09-01 An Improved Encoder-Decoder Framework for Food EnergyEstimation Jack Ma, Jiangpeng He, Fengqing Zhu et.al. 2309.00468v1 null
2023-09-01 A Theoretical and Practical Framework for Evaluating Uncertainty Calibration in Object Detection Pedro Conde, Rui L. Lopes, Cristiano Premebida et.al. 2309.00464v1 link
2023-09-01 dacl10k: Benchmark for Semantic Bridge Damage Segmentation Johannes Flotzinger, Philipp J. Rösch, Thomas Braml et.al. 2309.00460v1 null
2023-09-01 Unsupervised bias discovery in medical image segmentation Nicolás Gaggion, Rodrigo Echeveste, Lucas Mansilla, Diego H. Milone, Enzo Ferrante et.al. 2309.00451v1 null
2023-09-01 Improving the matching of deformable objects by learning to detect keypoints Felipe Cadar, Welerson, Vaishnavi Kanagasabapathi, Guilherme Potje, Renato Martins, Erickson R. Nascimento et.al. 2309.00434v1 null
2023-09-01 CPSP: Learning Speech Concepts From Phoneme Supervision Chunyu Qiang, Hao Li, Yixin Tian, Ruibo Fu, Tao Wang, Longbiao Wang, Jianwu Dang et.al. 2309.00424v1 null
2023-09-01 Selective Scene Text Removal Hayato Mitani, Akisato Kimura, Seiichi Uchida et.al. 2309.00410v1 null
2023-09-01 Fine-grained Recognition with Learnable Semantic Data Augmentation Yifan Pu, Yizeng Han, Yulin Wang, Junlan Feng, Chao Deng, Gao Huang et.al. 2309.00399v1 null
2023-09-01 VideoGen: A Reference-Guided Latent Diffusion Approach for High Definition Text-to-Video Generation Xin Li, Wenqing Chu, Ye Wu, Weihang Yuan, Fanglong Liu, Qi Zhang, Fu Li, Haocheng Feng, Errui Ding, Jingdong Wang et.al. 2309.00398v1 null
2023-09-01 Dense Voxel 3D Reconstruction Using a Monocular Event Camera Haodong Chen, Vera Chung, Li Tan, Xiaoming Chen et.al. 2309.00385v1 null
2023-09-01 Long-Term Memorability On Advertisements Harini S I, Somesh Singh, Yaman K Singla, Aanisha Bhattacharyya, Veeky Baths, Changyou Chen, Rajiv Ratn Shah, Balaji Krishnamurthy et.al. 2309.00378v1 null
2023-09-01 On the Localization of Ultrasound Image Slices within Point Distribution Models Lennart Bastian, Vincent Bürgin, Ha Young Kim, Alexander Baumann, Benjamin Busam, Mahdi Saleh, Nassir Navab et.al. 2309.00372v1 null
2023-09-01 Large Content And Behavior Models To Understand, Simulate, And Optimize Content And Behavior Ashmit Khandelwal, Aditya Agrawal, Aanisha Bhattacharyya, Yaman K Singla, Somesh Singh, Uttaran Bhattacharya, Ishita Dasgupta, Stefano Petrangeli, Rajiv Ratn Shah, Changyou Chen, Balaji Krishnamurthy et.al. 2309.00359v1 null
2023-09-01 How You Split Matters: Data Leakage and Subject Characteristics Studies in Longitudinal Brain MRI Analysis Dewinda Julianensi Rumala et.al. 2309.00350v1 null
2023-09-01 MuraNet: Multi-task Floor Plan Recognition with Relation Attention Lingxiao Huang, Jung-Hsuan Wu, Chiching Wei, Wilson Li et.al. 2309.00348v1 null
2023-09-01 Towards Contrastive Learning in Music Video Domain Karel Veldkamp, Mariya Hendriksen, Zoltán Szlávik, Alexander Keijser et.al. 2309.00347v1 null
2023-09-01 Robust Point Cloud Processing through Positional Embedding Jianqiao Zheng, Xueqian Li, Sameera Ramasinghe, Simon Lucey et.al. 2309.00339v1 null
2023-09-01 Human trajectory prediction using LSTM with Attention mechanism Amin Manafi Soltan Ahmadi, Samaneh Hoseini Semnani et.al. 2309.00331v1 null
2023-09-01 Mi-Go: Test Framework which uses YouTube as Data Source for Evaluating Speech Recognition Models like OpenAI's Whisper Tomasz Wojnar, Jaroslaw Hryszko, Adam Roman et.al. 2309.00329v1 null
2023-09-01 ARFA: An Asymmetric Receptive Field Autoencoder Model for Spatiotemporal Prediction Wenxuan Zhang, Xuechao Zou, Li Wu, Jianqiang Huang, Xiaoying Wang et.al. 2309.00314v1 null
2023-09-01 Fusing Monocular Images and Sparse IMU Signals for Real-time Human Motion Capture Shaohua Pan, Qi Ma, Xinyu Yi, Weifeng Hu, Xiong Wang, Xingkang Zhou, Jijunnan Li, Feng Xu et.al. 2309.00310v1 link
2023-09-01 Efficient Surrogate Models for Materials Science Simulations: Machine Learning-based Prediction of Microstructure Properties Binh Duong Nguyen, Pavlo Potapenko, Aytekin Dermici, Kishan Govinda, Stefan Sandfeld et.al. 2309.00305v1 null
2023-09-01 Fine-Grained Spatiotemporal Motion Alignment for Contrastive Video Representation Learning Minghao Zhu, Xiao Lin, Ronghao Dang, Chengju Liu, Qijun Chen et.al. 2309.00297v1 null
2023-09-01 Fast Diffusion EM: a diffusion model for blind inverse problems with application to deconvolution Charles Laroche, Andrés Almansa, Eva Coupete et.al. 2309.00287v1 null
2023-09-01 SparseSat-NeRF: Dense Depth Supervised Neural Radiance Fields for Sparse Satellite Images Lulin Zhang, Ewelina Rupnik et.al. 2309.00277v1 link
2023-09-01 Application of Machine Learning in Melanoma Detection and the Identification of 'Ugly Duckling' and Suspicious Naevi: A Review Fatima Al Zegair, Nathasha Naranpanawa, Brigid Betz-Stablein, Monika Janda, H. Peter Soyer, Shekhar S. Chandra et.al. 2309.00265v1 null
2023-09-01 Interpretable Medical Imagery Diagnosis with Self-Attentive Transformers: A Review of Explainable AI for Health Care Tin Lai et.al. 2309.00252v1 null
2023-09-01 MIMOCrypt: Multi-User Privacy-Preserving Wi-Fi Sensing via MIMO Encryption Jun Luo, Hangcheng Cao, Hongbo Jiang, Yanbing Yang, Zhe Chen et.al. 2309.00250v1 null
2023-09-01 DiffuGen: Adaptable Approach for Generating Labeled Image Datasets using Stable Diffusion Models Michael Shenoda, Edward Kim et.al. 2309.00248v1 link
2023-09-01 Object-Centric Multiple Object Tracking Zixu Zhao, Jiaze Wang, Max Horn, Yizhuo Ding, Tong He, Zechen Bai, Dominik Zietlow, Carl-Johann Simon-Gabriel, Bing Shuai, Zhuowen Tu, Thomas Brox, Bernt Schiele, Yanwei Fu, Francesco Locatello, Zheng Zhang, Tianjun Xiao et.al. 2309.00233v1 null
2023-09-01 What Makes Good Open-Vocabulary Detector: A Disassembling Perspective Jincheng Li, Chunyu Xie, Xiaoyu Wu, Bin Wang, Dawei Leng et.al. 2309.00227v1 null
2023-09-01 Human-Inspired Facial Sketch Synthesis with Dynamic Adaptation Fei Gao, Yifan Zhu, Chang Jiang, Nannan Wang et.al. 2309.00216v1 link
2023-09-01 Towards Addressing the Misalignment of Object Proposal Evaluation for Vision-Language Tasks via Semantic Grounding Joshua Feinglass, Yezhou Yang et.al. 2309.00215v1 null
2023-09-01 Gap and Overlap Detection in Automated Fiber Placement Assef Ghamisi, Homayoun Najjaran et.al. 2309.00206v1 null
2023-09-01 Diffusion Model with Clustering-based Conditioning for Food Image Generation Yue Han, Jiangpeng He, Mridul Gupta, Edward J. Delp, Fengqing Zhu et.al. 2309.00199v1 null
2023-09-01 DARC: Distribution-Aware Re-Coloring Model for Generalizable Nucleus Segmentation Shengcong Chen, Changxing Ding, Dacheng Tao, Hao Chen et.al. 2309.00188v1 null
2023-09-01 Vision-aided nonlinear control framework for shake table tests Zhongwei Chen, T. Y. Yang, Yifei Xiao, Xiao Pan, Wanyan Yang et.al. 2309.00187v1 null
2023-08-31 Typing on Any Surface: A Deep Learning-based Method for Real-Time Keystroke Detection in Augmented Reality Xingyu Fu, Mingze Xi et.al. 2309.00174v1 null
2023-08-31 RepCodec: A Speech Representation Codec for Speech Tokenization Zhichao Huang, Chutong Meng, Tom Ko et.al. 2309.00169v1 link
2023-08-31 Pose-Graph Attentional Graph Neural Network for Lidar Place Recognition Milad Ramezani, Liang Wang, Joshua Knights, Zhibin Li, Pauline Pounds, Peyman Moghadam et.al. 2309.00168v1 null
2023-08-31 BuilDiff: 3D Building Shape Generation using Single-Image Conditional Point Cloud Diffusion Models Yao Wei, George Vosselman, Michael Ying Yang et.al. 2309.00158v1 null
2023-08-31 Optimized Deep Feature Selection for Pneumonia Detection: A Novel RegNet and XOR-Based PSO Approach Fatemehsadat Ghanadi Ladani, Samaneh Hosseini Semnani et.al. 2309.00147v1 null
2023-08-31 Self-supervised Semantic Segmentation: Consistency over Transformation Sanaz Karimijafarbigloo, Reza Azad, Amirhossein Kazerouni, Yury Velichko, Ulas Bagci, Dorit Merhof et.al. 2309.00143v1 link
2023-08-31 Improving vision-inspired keyword spotting using dynamic module skipping in streaming conformer encoder Alexandre Bittar, Paul Dixon, Mohammad Samragh, Kumari Nishu, Devang Naik et.al. 2309.00140v1 null
2023-08-31 Fuzzy Approach for Audio-Video Emotion Recognition in Computer Games for Children Pavel Kozlov, Alisher Akram, Pakizar Shamoi et.al. 2309.00138v1 null
2023-08-31 Distraction-free Embeddings for Robust VQA Atharvan Dogra, Deeksha Varshney, Ashwin Kalyan, Ameet Deshpande, Neeraj Kumar et.al. 2309.00133v1 null
2023-08-31 QS-TTS: Towards Semi-Supervised Text-to-Speech Synthesis via Vector-Quantized Self-Supervised Speech Representation Learning Haohan Guo, Fenglong Xie, Jiawen Kang, Yujia Xiao, Xixin Wu, Helen Meng et.al. 2309.00126v1 null
2023-08-31 Segmentação e contagem de troncos de madeira utilizando deep learning e processamento de imagens João V. C. Mazzochin, Gustavo Tiecker, Erick O. Rodrigues et.al. 2309.00123v1 null
2023-08-31 Beyond Self-Attention: Deformable Large Kernel Attention for Medical Image Segmentation Reza Azad, Leon Niggemeier, Michael Huttemann, Amirhossein Kazerouni, Ehsan Khodapanah Aghdam, Yury Velichko, Ulas Bagci, Dorit Merhof et.al. 2309.00121v1 null
2023-08-31 Laplacian-Former: Overcoming the Limitations of Vision Transformers in Local Texture Detection Reza Azad, Amirhossein Kazerouni, Babak Azad, Ehsan Khodapanah Aghdam, Yury Velichko, Ulas Bagci, Dorit Merhof et.al. 2309.00108v1 null
2023-08-31 Unsupervised evaluation of GAN sample quality: Introducing the TTJac Score Egor Sevriugov, Ivan Oseledets et.al. 2309.00107v1 null
2023-08-31 Open-Vocabulary Semantic Segmentation via Attribute Decomposition-Aggregation Chaofan Ma, Yuhuan Yang, Chen Ju, Fei Zhang, Ya Zhang, Yanfeng Wang et.al. 2309.00096v1 null
2023-08-31 Few-shot Diagnosis of Chest x-rays Using an Ensemble of Random Discriminative Subspaces Kshitiz, Garvit Garg, Angshuman Paul et.al. 2309.00081v1 link
2023-08-31 SoDaCam: Software-defined Cameras via Single-Photon Imaging Varun Sundar, Andrei Ardelean, Tristan Swedish, Claudio Brusschini, Edoardo Charbon, Mohit Gupta et.al. 2309.00066v1 null
2023-08-31 STint: Self-supervised Temporal Interpolation for Geospatial Data Nidhin Harilal, Bri-Mathias Hodge, Aneesh Subramanian, Claire Monteleoni et.al. 2309.00059v1 null
2023-08-31 Bellybutton: Accessible and Customizable Deep-Learning Image Segmentation Sam Dillavou, Jesse M. Hanlan, Anthony T. Chieco, Hongyi Xiao, Sage Fulco, Kevin T. Turner, Douglas J. Durian et.al. 2309.00058v1 null
2023-08-31 FACET: Fairness in Computer Vision Evaluation Benchmark Laura Gustafson, Chloe Rolland, Nikhila Ravi, Quentin Duval, Aaron Adcock, Cheng-Yang Fu, Melissa Hall, Candace Ross et.al. 2309.00035v1 null
2023-08-31 Audio-Driven Dubbing for User Generated Contents via Style-Aware Semi-Parametric Synthesis Linsen Song, Wayne Wu, Chaoyou Fu, Chen Change Loy, Ran He et.al. 2309.00030v1 null
2023-08-31 Vision-Based Cranberry Crop Ripening Assessment Faith Johnson, Jack Lowry, Kristin Dana, Peter Oudemans et.al. 2309.00028v1 null
2023-08-31 A Sequential Framework for Detection and Classification of Abnormal Teeth in Panoramic X-rays Tudor Dascalu, Shaqayeq Ramezanzade, Azam Bakhshandeh, Lars Bjorndal, Bulat Ibragimov et.al. 2309.00027v1 link
2023-08-31 PointLLM: Empowering Large Language Models to Understand Point Clouds Runsen Xu, Xiaolong Wang, Tai Wang, Yilun Chen, Jiangmiao Pang, Dahua Lin et.al. 2308.16911v1 link
2023-08-31 StyleInV: A Temporal Style Modulated Inversion Network for Unconditional Video Generation Yuhan Wang, Liming Jiang, Chen Change Loy et.al. 2308.16909v1 link
2023-08-31 Fine-Grained Cross-View Geo-Localization Using a Correlation-Aware Homography Estimator Xiaolong Wang, Runsen Xu, Zuofan Cui, Zeyu Wan, Yu Zhang et.al. 2308.16906v1 link
2023-08-31 InterDiff: Generating 3D Human-Object Interactions with Physics-Informed Diffusion Sirui Xu, Zhengyuan Li, Yu-Xiong Wang, Liang-Yan Gui et.al. 2308.16905v1 link
2023-08-31 PointOcc: Cylindrical Tri-Perspective View for Point-based 3D Semantic Occupancy Prediction Sicheng Zuo, Wenzhao Zheng, Yuanhui Huang, Jie Zhou, Jiwen Lu et.al. 2308.16896v1 link
2023-08-31 EMDB: The Electromagnetic Database of Global 3D Human Pose and Shape in the Wild Manuel Kaufmann, Jie Song, Chen Guo, Kaiyue Shen, Tianjian Jiang, Chengcheng Tang, Juan Zarate, Otmar Hilliges et.al. 2308.16894v1 link
2023-08-31 Language-Conditioned Path Planning Amber Xie, Youngwoon Lee, Pieter Abbeel, Stephen James et.al. 2308.16893v1 null
2023-09-01 GNFactor: Multi-Task Real Robot Learning with Generalizable Neural Feature Fields Yanjie Ze, Ge Yan, Yueh-Hua Wu, Annabella Macaluso, Yuying Ge, Jianglong Ye, Nicklas Hansen, Li Erran Li, Xiaolong Wang et.al. 2308.16891v2 link
2023-08-31 TouchStone: Evaluating Vision-Language Models by Language Models Shuai Bai, Shusheng Yang, Jinze Bai, Peng Wang, Xingxuan Zhang, Junyang Lin, Xinggang Wang, Chang Zhou, Jingren Zhou et.al. 2308.16890v1 null
2023-08-31 Text2Scene: Text-driven Indoor Scene Stylization with Part-aware Details Inwoo Hwang, Hyeonwoo Kim, Young Min Kim et.al. 2308.16880v1 null
2023-08-31 SportsSloMo: A New Benchmark and Baselines for Human-centric Video Frame Interpolation Jiaben Chen, Huaizu Jiang et.al. 2308.16876v1 null
2023-08-31 Holistic Processing of Colour Images Using Novel Quaternion-Valued Wavelets on the Plane Neil D. Dizon, Jeffrey A. Hogan et.al. 2308.16875v1 null
2023-08-31 Self-pruning Graph Neural Network for Predicting Inflammatory Disease Activity in Multiple Sclerosis from Brain MR Images Chinmay Prabhakar, Hongwei Bran Li, Johannes C. Paetzold, Timo Loehr, Chen Niu, Mark Mühlau, Daniel Rueckert, Benedikt Wiestler, Bjoern Menze et.al. 2308.16863v1 link
2023-08-31 Diffusion Models for Interferometric Satellite Aperture Radar Alexandre Tuel, Thomas Kerdreux, Claudia Hulbert, Bertrand Rouet-Leduc et.al. 2308.16847v1 null
2023-08-31 Machine learning of microscopic structure-dynamics relationships in complex molecular systems Martina Crippa, Annalisa Cardellini, Matteo Cioni, Gábor Csányi, Giovanni M. Pavan et.al. 2308.16829v1 link
2023-08-31 Coarse-to-Fine Amodal Segmentation with Shape Prior Jianxiong Gao, Xuelin Qian, Yikai Wang, Tianjun Xiao, Tong He, Zheng Zhang, Yanwei Fu et.al. 2308.16825v1 null
2023-08-31 BTSeg: Barlow Twins Regularization for Domain Adaptation in Semantic Segmentation Johannes Künzel, Anna Hilsmann, Peter Eisert et.al. 2308.16819v1 null
2023-08-31 Multiscale Residual Learning of Graph Convolutional Sequence Chunks for Human Motion Prediction Mohsen Zand, Ali Etemad, Michael Greenspan et.al. 2308.16801v1 null
2023-09-01 Ref-Diff: Zero-shot Referring Image Segmentation with Generative Models Minheng Ni, Yabo Zhang, Kailai Feng, Xiaoming Li, Yiwen Guo, Wangmeng Zuo et.al. 2308.16777v2 null
2023-08-31 Towards High-Fidelity Text-Guided 3D Face Generation and Manipulation Using only Images Cuican Yu, Guansong Lu, Yihan Zeng, Jian Sun, Xiaodan Liang, Huibin Li, Zongben Xu, Songcen Xu, Wei Zhang, Hang Xu et.al. 2308.16758v1 null
2023-08-31 Unsupervised CT Metal Artifact Reduction by Plugging Diffusion Priors in Dual Domains Xuan Liu, Yaoqin Xie, Songhui Diao, Shan Tan, Xiaokun Liang et.al. 2308.16742v1 link
2023-08-31 Socratis: Are large multimodal models emotionally aware? Katherine Deng, Arijit Ray, Reuben Tan, Saadia Gabriel, Bryan A. Plummer, Kate Saenko et.al. 2308.16741v1 null
2023-08-31 Parsing is All You Need for Accurate Gait Recognition in the Wild Jinkai Zheng, Xinchen Liu, Shuai Wang, Lihao Wang, Chenggang Yan, Wu Liu et.al. 2308.16739v1 link
2023-08-31 US-SFNet: A Spatial-Frequency Domain-based Multi-branch Network for Cervical Lymph Node Lesions Diagnoses in Ultrasound Images Yubiao Yue, Jun Xue, Haihua Liang, Bingchun Luo, Zhenzhang Li et.al. 2308.16738v1 null
2023-08-31 Post-Deployment Adaptation with Access to Source Data via Federated Learning and Source-Target Remote Gradient Alignment Felix Wagner, Zeju Li, Pramit Saha, Konstantinos Kamnitsas et.al. 2308.16735v1 link
2023-08-30 ASTER: Automatic Speech Recognition System Accessibility Testing for Stutterers Yi Liu, Yuekang Li, Gelei Deng, Felix Juefei-Xu, Yao Du, Cen Zhang, Chengwei Liu, Yeting Li, Lei Ma, Yang Liu et.al. 2308.15742v1 null
2023-08-28 Unsupervised Active Learning: Optimizing Labeling Cost-Effectiveness for Automatic Speech Recognition Zhisheng Zheng, Ziyang Ma, Yu Wang, Xie Chen et.al. 2308.14814v1 null
2023-08-23 KinSPEAK: Improving speech recognition for Kinyarwanda via semi-supervised learning methods Antoine Nzeyimana et.al. 2308.11863v1 null
2023-09-05 Identifying depression-related topics in smartphone-collected free-response speech recordings using an automatic speech recognition system and a deep learning topic model Yuezhou Zhang, Amos A Folarin, Judith Dineley, Pauline Conde, Valeria de Angel, Shaoxiong Sun, Yatharth Ranjan, Zulqarnain Rashid, Callum Stewart, Petroula Laiou, Heet Sankesara, Linglong Qian, Faith Matcham, Katie M White, Carolin Oetzmann, Femke Lamers, Sara Siddi, Sara Simblett, Björn W. Schuller, Srinivasan Vairavan, Til Wykes, Josep Maria Haro, Brenda WJH Penninx, Vaibhav A Narayan, Matthew Hotopf, Richard JB Dobson, Nicholas Cummins, RADAR-CNS consortium et.al. 2308.11773v2 null
2023-08-20 Indonesian Automatic Speech Recognition with XLSR-53 Panji Arisaputra, Amalia Zahra et.al. 2308.11589v1 null
2023-08-22 Convoifilter: A case study of doing cocktail party speech recognition Thai-Binh Nguyen, Alexander Waibel et.al. 2308.11380v1 null
2023-08-14 Improving Audio-Visual Speech Recognition by Lip-Subword Correlation Based Visual Pre-training and Cross-Modal Fusion Encoder Yusheng Dai, Hang Chen, Jun Du, Xiaofei Ding, Ning Ding, Feijun Jiang, Chin-Hui Lee et.al. 2308.08488v1 link
2023-08-16 Radio2Text: Streaming Speech Recognition Using mmWave Radio Signals Running Zhao, Jiangtao Yu, Hang Zhao, Edith C. H. Ngai et.al. 2308.08125v1 null
2023-08-15 AKVSR: Audio Knowledge Empowered Visual Speech Recognition by Compressing Audio Knowledge of a Pretrained Model Jeong Hun Yeo, Minsu Kim, Jeongsoo Choi, Dae Hoe Kim, Yong Man Ro et.al. 2308.07593v1 null
2023-08-14 Integrating Emotion Recognition with Speech Recognition and Speaker Diarisation for Conversations Wen Wu, Chao Zhang, Philip C. Woodland et.al. 2308.07145v1 link
2023-08-12 Alternative Pseudo-Labeling for Semi-Supervised Automatic Speech Recognition Han Zhu, Dongji Gao, Gaofeng Cheng, Daniel Povey, Pengyuan Zhang, Yonghong Yan et.al. 2308.06547v1 null
2023-08-11 Lip2Vec: Efficient and Robust Visual Speech Recognition via Latent-to-Latent Visual to Audio Representation Mapping Yasser Abdelaziz Dahou Djilali, Sanath Narayan, Haithem Boussaid, Ebtessam Almazrouei, Merouane Debbah et.al. 2308.06112v1 null
2023-08-10 A Novel Self-training Approach for Low-resource Speech Recognition Satwinder Singh, Feng Hou, Ruili Wang et.al. 2308.05269v1 null
2023-08-09 Conformer-based Target-Speaker Automatic Speech Recognition for Single-Channel Audio Yang Zhang, Krishna C. Puvvada, Vitaly Lavrukhin, Boris Ginsburg et.al. 2308.05218v1 link
2023-08-07 Cuing Without Sharing: A Federated Cued Speech Recognition Framework via Mutual Knowledge Distillation Yuxuan Zhang, Lei Liu, Li Liu et.al. 2308.03432v1 link
2023-08-07 Federated Representation Learning for Automatic Speech Recognition Guruprasad V Ramesh, Gopinath Chennupati, Milind Rao, Anit Kumar Sahu, Ariya Rastrow, Jasha Droppo et.al. 2308.02013v2 null
2023-08-02 Careful Whisper -- leveraging advances in automatic speech recognition for robust and interpretable aphasia subtype classification Laurin Wagner, Mario Zusag, Theresa Bloder et.al. 2308.01327v1 null
2023-07-28 The timing bottleneck: Why timing and overlap are mission-critical for conversational user interfaces, speech recognition and dialogue systems Andreas Liesenfeld, Alianda Lopez, Mark Dingemanse et.al. 2307.15493v1 null
2023-07-27 Say Goodbye to RNN-T Loss: A Novel CIF-based Transducer Architecture for Automatic Speech Recognition Tian-Hao Zhang, Dinghao Zhou, Guiping Zhong, Baoxiang Li et.al. 2307.14132v2 null
2023-07-24 Adaptation of Whisper models to child speech recognition Rishabh Jain, Andrei Barcovschi, Mariam Yiwere, Peter Corcoran, Horia Cucu et.al. 2307.13008v1 link
2023-07-24 Integration of Frame- and Label-synchronous Beam Search for Streaming Encoder-decoder Speech Recognition Emiru Tsunoo, Hayato Futami, Yosuke Kashiwagi, Siddhant Arora, Shinji Watanabe et.al. 2307.12767v1 null
2023-07-24 Robust Automatic Speech Recognition via WavAugment Guided Phoneme Adversarial Training Gege Qi, Yuefeng Chen, Xiaofeng Mao, Xiaojun Jia, Ranjie Duan, Rong Zhang, Hui Xue et.al. 2307.12498v1 null
2023-07-23 A meta learning scheme for fast accent domain expansion in Mandarin speech recognition Ziwei Zhu, Changhao Shan, Bihong Zhang, Jian Yu et.al. 2307.12262v1 null
2023-07-21 Prompting Large Language Models with Speech Recognition Abilities Yassir Fathullah, Chunyang Wu, Egor Lakomkin, Junteng Jia, Yuan Shangguan, Ke Li, Jinxi Guo, Wenhan Xiong, Jay Mahadeokar, Ozlem Kalinli, Christian Fuegen, Mike Seltzer et.al. 2307.11795v1 null
2023-07-20 Transsion TSUP's speech recognition system for ASRU 2023 MADASR Challenge Xiaoxiao Li, Gaosheng Zhang, An Zhu, Weiyong Li, Shuming Fang, Xiaoyue Yang, Jianchao Zhu et.al. 2307.11778v1 null
2023-07-20 Globally Normalising the Transducer for Streaming Speech Recognition Rogier van Dalen et.al. 2307.10975v1 null
2023-10-06 Zero-shot Domain-sensitive Speech Recognition with Prompt-conditioning Fine-tuning Feng-Ting Liao, Yung-Chieh Chan, Yi-Chang Chen, Chan-Jan Hsu, Da-shan Shiu et.al. 2307.10274v2 link
2023-07-17 TST: Time-Sparse Transducer for Automatic Speech Recognition Xiaohui Zhang, Mangui Liang, Zhengkun Tian, Jiangyan Yi, Jianhua Tao et.al. 2307.08323v1 null
2023-08-03 Adapting Large Language Model with Speech for Fully Formatted End-to-End Speech Recognition Shaoshi Ling, Yuxuan Hu, Shuangbei Qian, Guoli Ye, Yao Qian, Yifan Gong, Ed Lin, Michael Zeng et.al. 2307.08234v2 link
2023-07-17 Towards Stealthy Backdoor Attacks against Speech Recognition via Elements of Sound Hanbo Cai, Pengcheng Zhang, Hai Dong, Yan Xiao, Stefanos Koffas, Yiming Li et.al. 2307.08208v1 link
2023-07-12 Sumformer: A Linear-Complexity Alternative to Self-Attention for Speech Recognition Titouan Parcollet, Rogier van Dalen, Shucong Zhang, Sourav Bhattacharya et.al. 2307.07421v1 null
2023-10-18 Replay to Remember: Continual Layer-Specific Fine-tuning for German Speech Recognition Theresa Pekarek Rosin, Stefan Wermter et.al. 2307.07280v2 null
2023-07-13 Personalization for BERT-based Discriminative Speech Recognition Rescoring Jari Kolehmainen, Yile Gu, Aditya Gourav, Prashanth Gurunath Shivakumar, Ankur Gandhe, Ariya Rastrow, Ivan Bulyko et.al. 2307.06832v1 null
2023-07-13 Exploring the Integration of Large Language Models into Automatic Speech Recognition Systems: An Empirical Study Zeping Min, Jinbo Wang et.al. 2307.06530v1 null
2023-07-14 Language-Routing Mixture of Experts for Multilingual and Code-Switching Speech Recognition Wenxuan Wang, Guodong Ma, Yuke Li, Binbin Du et.al. 2307.05956v2 null
2023-07-10 SparseVSR: Lightweight and Noise Robust Visual Speech Recognition Adriana Fernandez-Lopez, Honglie Chen, Pingchuan Ma, Alexandros Haliassos, Stavros Petridis, Maja Pantic et.al. 2307.04552v1 null
2023-07-06 Gammatonegram Representation for End-to-End Dysarthric Speech Processing Tasks: Speech Recognition, Speaker Identification, and Intelligibility Assessment Aref Farhadipour, Hadi Veisi et.al. 2307.03296v1 link
2023-07-05 Online Hybrid CTC/Attention End-to-End Automatic Speech Recognition Architecture Haoran Miao, Gaofeng Cheng, Pengyuan Zhang, Yonghong Yan et.al. 2307.02351v1 null
2023-07-05 Using Data Augmentations and VTLN to Reduce Bias in Dutch End-to-End Speech Recognition Systems Tanvina Patel, Odette Scharenborg et.al. 2307.02009v1 null
2023-07-04 Boosting Norwegian Automatic Speech Recognition Javier de la Rosa, Rolv-Arild Braaten, Per Egil Kummervold, Freddy Wetjen, Svein Arne Brygfjeld et.al. 2307.01672v1 null
2023-06-29 Automatic Speech Recognition of Non-Native Child Speech for Language Learning Applications Simone Wills, Yu Bai, Cristian Tejedor-Garcia, Catia Cucchiarini, Helmer Strik et.al. 2306.16710v1 null
2023-06-28 Prompting Large Language Models for Zero-Shot Domain Adaptation in Speech Recognition Yuang Li, Yu Wu, Jinyu Li, Shujie Liu et.al. 2306.16007v1 null
2023-06-27 Confidence-based Ensembles of End-to-End Speech Recognition Models Igor Gitman, Vitaly Lavrukhin, Aleksandr Laptev, Boris Ginsburg et.al. 2306.15824v1 null
2023-06-27 Scaling Laws for Discriminative Speech Recognition Rescoring Models Yile Gu, Prashanth Gurunath Shivakumar, Jari Kolehmainen, Ankur Gandhe, Ariya Rastrow, Ivan Bulyko et.al. 2306.15815v1 null
2023-06-27 Hyper-parameter Adaptation of Conformer ASR Systems for Elderly and Dysarthric Speech Recognition Tianzi Wang, Shoukang Hu, Jiajun Deng, Zengrui Jin, Mengzhe Geng, Yi Wang, Helen Meng, Xunying Liu et.al. 2306.15265v1 null
2023-06-26 Factorised Speaker-environment Adaptive Training of Conformer Speech Recognition Systems Jiajun Deng, Guinan Li, Xurong Xie, Zengrui Jin, Mingyu Cui, Tianzi Wang, Shujie Hu, Mengzhe Geng, Xunying Liu et.al. 2306.14608v1 null
2023-06-24 An Analysis of Personalized Speech Recognition System Development for the Deaf and Hard-of-Hearing Lester Phillip Violeta, Tomoki Toda et.al. 2306.13953v1 null
2023-06-26 Towards Effective and Compact Contextual Representation for Conformer Transducer Speech Recognition Systems Mingyu Cui, Jiawen Kang, Jiajun Deng, Xi Yin, Yutao Xie, Xie Chen, Xunying Liu et.al. 2306.13307v2 null
2023-06-21 A Reference-less Quality Metric for Automatic Speech Recognition via Contrastive-Learning of a Multi-Language Model with Self-Supervision Kamer Ali Yuksel, Thiago Ferreira, Ahmet Gunduz, Mohamed Al-Badrashiny, Golara Javadi et.al. 2306.13114v1 link
2023-06-21 NoRefER: a Referenceless Quality Metric for Automatic Speech Recognition via Semi-Supervised Language Model Fine-Tuning with Contrastive Learning Kamer Ali Yuksel, Thiago Ferreira, Golara Javadi, Mohamed El-Badrashiny, Ahmet Gunduz et.al. 2306.12577v1 link
2023-06-21 Federated Self-Learning with Weak Supervision for Speech Recognition Milind Rao, Gopinath Chennupati, Gautam Tiwari, Anit Kumar Sahu, Anirudh Raju, Ariya Rastrow, Jasha Droppo et.al. 2306.12015v1 null
2023-06-20 Multi-pass Training and Cross-information Fusion for Low-resource End-to-end Accented Speech Recognition Xuefei Wang, Yanhua Long, Yijie Li, Haoran Wei et.al. 2306.11309v1 null
2023-06-19 Rehearsal-Free Online Continual Learning for Automatic Speech Recognition Steven Vander Eeckt, Hugo Van hamme et.al. 2306.10860v1 link
2023-06-18 MIR-GAN: Refining Frame-Level Modality-Invariant Representations with Adversarial Network for Audio-Visual Speech Recognition Yuchen Hu, Chen Chen, Ruizhe Li, Heqing Zou, Eng Siong Chng et.al. 2306.10567v1 link
2023-06-18 Hearing Lips in Noise: Universal Viseme-Phoneme Mapping and Transfer for Robust Audio-Visual Speech Recognition Yuchen Hu, Ruizhe Li, Chen Chen, Chengwei Qin, Qiushi Zhu, Eng Siong Chng et.al. 2306.10563v1 link
2023-09-19 SURT 2.0: Advances in Transducer-based Multi-talker Speech Recognition Desh Raj, Daniel Povey, Sanjeev Khudanpur et.al. 2306.10559v2 link
2023-06-15 Distillation Strategies for Discriminative Speech Recognition Rescoring Prashanth Gurunath Shivakumar, Jari Kolehmainen, Yile Gu, Ankur Gandhe, Ariya Rastrow, Ivan Bulyko et.al. 2306.09452v1 null
2023-06-15 MobileASR: A resource-aware on-device personalisation framework for automatic speech recognition in mobile phones Zitha Sasindran, Harsha Yelchuri, Pooja Rao, T. V. Prabhakar et.al. 2306.09384v1 null
2023-09-16 Unified model for code-switching speech recognition and language identification based on a concatenated tokenizer Kunal Dhawan, Dima Rekesh, Boris Ginsburg et.al. 2306.08753v3 link
2023-06-14 Learning Cross-lingual Mappings for Data Augmentation to Improve Low-Resource Speech Recognition Muhammad Umar Farooq, Thomas Hain et.al. 2306.08577v1 null
2023-06-14 Research on an improved Conformer end-to-end Speech Recognition Model with R-Drop Structure Weidong Ji, Shijie Zan, Guohui Zhou, Xu Wang et.al. 2306.08329v1 null
2023-06-14 Automated Speaker Independent Visual Speech Recognition: A Comprehensive Survey Praneeth Nemani, G. Sai Krishna, Supriya Kundrapu et.al. 2306.08314v1 null
2023-06-09 Improving Frame-level Classifier for Word Timings with Non-peaky CTC in End-to-End Automatic Speech Recognition Xianzhao Chen, Yist Y. Lin, Kang Wang, Yi He, Zejun Ma et.al. 2306.07949v1 null
2023-06-09 A Theory of Unsupervised Speech Recognition Liming Wang, Mark Hasegawa-Johnson, Chang D. Yoo et.al. 2306.07926v1 link
2023-06-13 Statistical Beamformer Exploiting Non-stationarity and Sparsity with Spatially Constrained ICA for Robust Speech Recognition Ui-Hyeop Shin, Hyung-Min Park et.al. 2306.07562v1 null
2023-06-12 Parameter-efficient Dysarthric Speech Recognition Using Adapter Fusion and Householder Transformation Jinzi Qi, Hugo Van hamme et.al. 2306.07090v1 null
2023-06-12 Multi-View Frequency-Attention Alternative to CNN Frontends for Automatic Speech Recognition Belen Alastruey, Lukas Drude, Jahn Heymann, Simon Wiesler et.al. 2306.06954v1 null
2023-06-10 OpenSR: Open-Modality Speech Recognition via Maintaining Multi-Modality Alignment Xize Cheng, Tao Jin, Linjun Li, Wang Lin, Xinyu Duan, Zhou Zhao et.al. 2306.06410v1 link
2023-06-06 Improving Fairness and Robustness in End-to-End Speech Recognition through unsupervised clustering Irina-Elena Veliche, Pascale Fung et.al. 2306.06083v1 null
2023-06-08 Language-specific Acoustic Boundary Learning for Mandarin-English Code-switching Speech Recognition Zhiyun Fan, Linhao Dong, Chen Shen, Zhenlin Liang, Jun Zhang, Lu Lu, Zejun Ma et.al. 2306.05279v1 null
2023-06-07 Lenient Evaluation of Japanese Speech Recognition: Modeling Naturally Occurring Spelling Inconsistency Shigeki Karita, Richard Sproat, Haruko Ishikawa et.al. 2306.04530v1 null
2023-06-07 Transfer Learning of Transformer-based Speech Recognition Models from Czech to Slovak Jan Lehečka, Josef V. Psutka, Josef Psutka et.al. 2306.04399v1 null
2023-06-07 Arabic Dysarthric Speech Recognition Using Adversarial and Signal-Based Augmentation Massa Baali, Ibrahim Almakky, Shady Shehata, Fakhri Karray et.al. 2306.04368v1 link
2023-09-12 RescueSpeech: A German Corpus for Speech Recognition in Search and Rescue Domain Sangeet Sagar, Mirco Ravanelli, Bernd Kiefer, Ivana Kruijff Korbayova, Josef van Genabith et.al. 2306.04054v2 null
2023-06-02 Streaming Speech-to-Confusion Network Speech Recognition Denis Filimonov, Prabhat Pandey, Ariya Rastrow, Ankur Gandhe, Andreas Stolcke et.al. 2306.03778v1 null
2023-06-01 Some voices are too common: Building fair speech recognition systems using the Common Voice dataset Lucas Maison, Yannick Estève et.al. 2306.03773v1 null
2023-06-05 N-Shot Benchmarking of Whisper on Diverse Arabic Speech Recognition Bashar Talafha, Abdul Waheed, Muhammad Abdul-Mageed et.al. 2306.02902v1 null
2023-06-05 OTF: Optimal Transport based Fusion of Supervised and Self-Supervised Learning Models for Automatic Speech Recognition Li Fu, Siqi Li, Qingtao Li, Fangzhu Li, Liping Deng, Lu Fan, Meng Chen, Youzheng Wu, Xiaodong He et.al. 2306.02541v1 null
2023-06-05 Incorporating L2 Phonemes Using Articulatory Features for Robust Speech Recognition Jisung Wang, Haram Lee, Myungwoo Oh et.al. 2306.02534v1 null
2023-06-21 SGEM: Test-Time Adaptation for Automatic Speech Recognition via Sequential-Level Generalized Entropy Minimization Changhun Kim, Joonhyung Park, Hajin Shim, Eunho Yang et.al. 2306.01981v4 link
2023-06-02 Improved Training for End-to-End Streaming Automatic Speech Recognition Model with Punctuation Hanbyul Kim, Seunghyun Seo, Lukas Lee, Seolki Baek et.al. 2306.01296v1 null
2023-06-01 Bypass Temporal Classification: Weakly Supervised Automatic Speech Recognition with Imperfect Transcripts Dongji Gao, Matthew Wiesner, Hainan Xu, Leibny Paola Garcia, Daniel Povey, Sanjeev Khudanpur et.al. 2306.01031v1 null
2023-08-15 Adaptive Contextual Biasing for Transducer Based Streaming Speech Recognition Tianyi Xu, Zhanheng Yang, Kaixun Huang, Pengcheng Guo, Ao Zhang, Biao Li, Changru Chen, Chao Li, Lei Xie et.al. 2306.00804v3 null
2023-06-01 SlothSpeech: Denial-of-service Attack Against Speech Recognition Models Mirazul Haque, Rutvij Shah, Simin Chen, Berrak Şişman, Cong Liu, Wei Yang et.al. 2306.00794v1 link
2023-06-01 Adaptation and Optimization of Automatic Speech Recognition (ASR) for the Maritime Domain in the Field of VHF Communication Emin Cagatay Nakilcioglu, Maximilian Reimann, Ole John et.al. 2306.00614v1 null
2023-05-31 ViLaS: Integrating Vision and Language into Automatic Speech Recognition Minglun Han, Feilong Chen, Ziyi Ni, Linghui Meng, Jing Shi, Shuang Xu, Bo Xu et.al. 2305.19972v1 null
2023-05-31 Accurate and Structured Pruning for Efficient Automatic Speech Recognition Huiqiang Jiang, Li Lyna Zhang, Yuang Li, Yu Wu, Shijie Cao, Ting Cao, Yuqing Yang, Jinyu Li, Mao Yang, Lili Qiu et.al. 2305.19549v1 null
2023-05-29 HyperConformer: Multi-head HyperMixer for Efficient Speech Recognition Florian Mai, Juan Zuluaga-Gomez, Titouan Parcollet, Petr Motlicek et.al. 2305.18281v1 link
2023-05-30 speech and noise dual-stream spectrogram refine network with speech distortion loss for robust speech recognition Haoyu Lu, Nan Li, Tongtong Song, Longbiao Wang, Jianwu Dang, Xiaobao Wang, Shiliang Zhang et.al. 2305.17860v2 link
2023-05-28 RASR2: The RWTH ASR Toolkit for Generic Sequence-to-sequence Speech Recognition Wei Zhou, Eugen Beck, Simon Berger, Ralf Schlüter, Hermann Ney et.al. 2305.17782v1 null
2023-07-19 Synthesizing Speech Test Cases with Text-to-Speech? An Empirical Study on the False Alarms in Automated Speech Recognition Testing Julia Kaiwen Lau, Kelvin Kai Wen Kong, Julian Hao Yong, Per Hoong Tan, Zhou Yang, Zi Qian Yong, Joshua Chern Wey Low, Chun Yong Chong, Mei Kuan Lim, David Lo et.al. 2305.17445v3 link
2023-05-26 2-bit Conformer quantization for automatic speech recognition Oleg Rybakov, Phoenix Meadowlark, Shaojin Ding, David Qiu, Jian Li, David Rim, Yanzhang He et.al. 2305.16619v1 null
2023-05-25 INTapt: Information-Theoretic Adversarial Prompt Tuning for Enhanced Non-Native Speech Recognition Eunseop Yoon, Hee Suk Yoon, John Harvill, Mark Hasegawa-Johnson, Chang D. Yoo et.al. 2305.16371v1 null
2023-05-29 InterFormer: Interactive Local and Global Features Fusion for Automatic Speech Recognition Zhi-Hao Lai, Tian-Hao Zhang, Qi Liu, Xinyuan Qian, Li-Fang Wei, Song-Lu Chen, Feng Chen, Xu-Cheng Yin et.al. 2305.16342v2 null
2023-06-29 Weakly-Supervised Speech Pre-training: A Case Study on Target Speech Recognition Wangyou Zhang, Yanmin Qian et.al. 2305.16286v2 null
2023-05-25 Unified Modeling of Multi-Talker Overlapped Speech Recognition and Diarization with a Sidecar Separator Lingwei Meng, Jiawen Kang, Mingyu Cui, Haibin Wu, Xixin Wu, Helen Meng et.al. 2305.16263v1 null
2023-05-25 VioLA: Unified Codec Language Models for Speech Recognition, Synthesis, and Translation Tianrui Wang, Long Zhou, Ziqiang Zhang, Yu Wu, Shujie Liu, Yashesh Gaur, Zhuo Chen, Jinyu Li, Furu Wei et.al. 2305.16107v1 null
2023-05-24 Iteratively Improving Speech Recognition and Voice Conversion Mayank Kumar Singh, Naoya Takahashi, Onoe Naoyuki et.al. 2305.15055v1 null
2023-05-23 Improving the Gap in Visual Speech Recognition Between Normal and Silent Speech Based on Metric Learning Sara Kashiwagi, Keitaro Tanaka, Qi Feng, Shigeo Morishima et.al. 2305.14203v1 null

(back to top)

Audio Forenisc

Publish Date Title Authors PDF Code
2022-11-29 Synthetic Voice Detection and Audio Splicing Detection using SE-Res2Net-Conformer Architecture Lei Wang, Benedict Yeoh, Jun Wah Ng et.al. 2210.03581v2 null
2024-05-03 Towards Unconstrained Audio Splicing Detection and Localization with Neural Networks Denise Moussa, Germans Hirsch, Christian Riess et.al. 2207.14682v4 null
2014-11-26 Audio Splicing Detection and Localization Using Environmental Signature Hong Zhao, Yifan Chen, Rui Wang, Hafiz Malik et.al. 1411.7084v1 null

(back to top)

About

arxiv daily for speech translation, legal. Ref: Vincentqyw/cv-arxiv-daily


Languages

Language:Python 100.0%