LearnNLP/nlp_arxiv_daily

Updated on 2024.08.19

Table of Contents

Speech Translation
Legal
Speech Recognition
Audio Forensic

Speech Translation

Publish Date	Title	Authors	PDF	Code
2024-08-14	CMU's IWSLT 2024 Simultaneous Speech Translation System	Xi Xu, Siqi Ouyang, Brian Yan, Patrick Fernandes, William Chen, Lei Li, Graham Neubig, Shinji Watanabe et.al.	2408.07452v1	null
2024-07-31	Towards Achieving Human Parity on End-to-end Simultaneous Speech Translation via LLM Agent	Shanbo Cheng, Zhichao Huang, Tom Ko, Hang Li, Ningxin Peng, Lu Xu, Qini Zhang et.al.	2407.21646v1	null
2024-07-31	Contrastive Feedback Mechanism for Simultaneous Speech Translation	Haotian Tan, Sakriani Sakti et.al.	2407.20524v2	null
2024-07-08	Analyzing Speech Unit Selection for Textless Speech-to-Speech Translation	Jarod Duret, Yannick Estève, Titouan Parcollet et.al.	2407.18332v1	null
2024-07-22	LLaST: Improved End-to-end Speech Translation System Leveraged by Large Language Models	Xi Chen, Songyang Zhang, Qibing Bai, Kai Chen, Satoshi Nakamura et.al.	2407.15415v1	link
2024-07-18	Preset-Voice Matching for Privacy Regulated Speech-to-Speech Translation Systems	Daniel Platnick, Bishoy Abdelnour, Eamon Earl, Rahul Kumar, Zahra Rezaei, Thomas Tsangaris, Faraj Lagum et.al.	2407.13153v1	null
2024-06-26	Navigating the Minefield of MT Beam Search in Cascaded Streaming Speech Translation	Rastislav Rabatin, Frank Seide, Ernie Chang et.al.	2407.11010v1	null
2024-07-01	Cross-Lingual Transfer Learning for Speech Translation	Rao Ma, Yassir Fathullah, Mengjie Qian, Siyuan Tang, Mark Gales, Kate Knill et.al.	2407.01130v1	null
2024-06-30	NAIST Simultaneous Speech Translation System for IWSLT 2024	Yuka Ko, Ryo Fukuda, Yuta Nishikawa, Yasumasa Kano, Tomoya Yanagita, Kosuke Doi, Mana Makinae, Haotian Tan, Makoto Sakai, Sakriani Sakti, Katsuhito Sudoh, Satoshi Nakamura et.al.	2407.00826v1	null
2024-06-27	Leveraging Synthetic Audio Data for End-to-End Low-Resource Speech Translation	Yasmin Moslem et.al.	2406.17363v2	null
2024-06-24	Blending LLMs into Cascaded Speech Translation: KIT's Offline Speech Translation System for IWSLT 2024	Sai Koneru, Thai-Binh Nguyen, Ngoc-Quan Pham, Danni Liu, Zhaolin Li, Alexander Waibel, Jan Niehues et.al.	2406.16777v1	null
2024-06-20	SimulSeamless: FBK at IWSLT 2024 Simultaneous Speech Translation	Sara Papi, Marco Gaido, Matteo Negri, Luisa Bentivogli et.al.	2406.14177v1	link
2024-06-16	CoSTA: Code-Switched Speech Translation using Aligned Speech-Text Interleaving	Bhavani Shankar, Preethi Jyothi, Pushpak Bhattacharyya et.al.	2406.10993v1	null
2024-06-15	Lightweight Audio Segmentation for Long-form Speech Translation	Jaesong Lee, Soyoon Kim, Hanbyul Kim, Joon Son Chung et.al.	2406.10549v1	null
2024-06-12	Soft Language Identification for Language-Agnostic Many-to-One End-to-End Speech Translation	Peidong Wang, Jian Xue, Jinyu Li, Junkun Chen, Aswin Shanmugam Subramanian et.al.	2406.10276v1	null
2024-06-14	Diffusion Synthesizer for Efficient Multilingual Speech to Speech Translation	Nameer Hirschkind, Xiao Yu, Mahesh Kumar Nandwana, Joseph Liu, Eloi DuBois, Dao Le, Nicolas Thiebaut, Colin Sinclair, Kyle Spence, Charles Shang, Zoe Abrams, Morgan McGuire et.al.	2406.10223v1	null
2024-06-14	Exploring the Correlation between Human and Machine Evaluation of Simultaneous Speech Translation	Xiaoman Wang, Claudio Fantinuoli et.al.	2406.10091v1	null
2024-06-11	CTC-based Non-autoregressive Textless Speech-to-Speech Translation	Qingkai Fang, Zhengrui Ma, Yan Zhou, Min Zhang, Yang Feng et.al.	2406.07330v1	link
2024-06-11	Can We Achieve High-quality Direct Speech-to-Speech Translation without Parallel Speech Data?	Qingkai Fang, Shaolei Zhang, Zhengrui Ma, Min Zhang, Yang Feng et.al.	2406.07289v1	null
2024-06-06	Label-Synchronous Neural Transducer for E2E Simultaneous Speech Translation	Keqi Deng, Philip C. Woodland et.al.	2406.04541v1	link
2024-06-06	Evaluating the IWSLT2023 Speech Translation Tasks: Human Annotations, Automatic Metrics, and Segmentation	Matthias Sperber, Ondřej Bojar, Barry Haddow, Dávid Javorský, Xutai Ma, Matteo Negri, Jan Niehues, Peter Polák, Elizabeth Salesky, Katsuhito Sudoh, Marco Turchi et.al.	2406.03881v1	null
2024-06-05	StreamSpeech: Simultaneous Speech-to-Speech Translation with Multi-task Learning	Shaolei Zhang, Qingkai Fang, Shoutao Guo, Zhengrui Ma, Min Zhang, Yang Feng et.al.	2406.03049v1	link
2024-06-04	Textless Acoustic Model with Self-Supervised Distillation for Noise-Robust Expressive Speech-to-Speech Translation	Min-Jae Hwang, Ilia Kulikov, Benjamin Peloquin, Hongyu Gong, Peng-Jen Chen, Ann Lee et.al.	2406.02733v1	null
2024-06-04	SimulTron: On-Device Simultaneous Speech to Speech Translation	Alex Agranovich, Eliya Nachmani, Oleg Rybakov, Yifan Ding, Ye Jia, Nadav Bar, Heiga Zen, Michelle Tadmor Ramanovich et.al.	2406.02133v1	null
2024-06-01	Recent Advances in End-to-End Simultaneous Speech Translation	Xiaoqian Liu, Guoqiang Hu, Yangfan Du, Erfeng He, YingFeng Luo, Chen Xu, Tong Xiao, Jingbo Zhu et.al.	2406.00497v1	null
2024-05-30	SeamlessExpressiveLM: Speech Language Model for Expressive Speech-to-Speech Translation with Chain-of-Thought	Hongyu Gong, Bandhav Veluri et.al.	2405.20410v1	null
2024-05-28	TransVIP: Speech to Speech Translation System with Voice and Isochrony Preservation	Chenyang Le, Yao Qian, Dongmei Wang, Long Zhou, Shujie Liu, Xiaofei Wang, Midia Yousefi, Yanmin Qian, Jinyu Li, Sheng Zhao, Michael Zeng et.al.	2405.17809v1	null
2024-05-22	DiffNorm: Self-Supervised Normalization for Non-autoregressive Speech-to-speech Translation	Weiting Tan, Jingyu Zhang, Lingfeng Shen, Daniel Khashabi, Philipp Koehn et.al.	2405.13274v1	link
2024-05-21	MELD-ST: An Emotion-aware Speech Translation Dataset	Sirou Chen, Sakiko Yahata, Shuichiro Shimizu, Zhengdong Yang, Yihang Li, Chenhui Chu, Sadao Kurohashi et.al.	2405.13233v1	null
2024-03-25	Advancing Speech Translation: A Corpus of Mandarin-English Conversational Telephone Speech	Shannon Wotherspoon, William Hartmann, Matthew Snover et.al.	2404.11619v1	null
2024-03-19	MSLM-S2ST: A Multitask Speech Language Model for Textless Speech-to-Speech Translation with Speaker Style Preservation	Yifan Peng, Ilia Kulikov, Yilin Yang, Sravya Popuri, Hui Lu, Changhan Wang, Hongyu Gong et.al.	2403.12408v1	null
2024-03-08	FFSTC: Fongbe to French Speech Translation Corpus	D. Fortune Kponou, Frejus A. A. Laleye, Eugene C. Ezin et.al.	2403.05488v1	null
2024-06-26	Compact Speech Translation Models via Discrete Speech Units Pretraining	Tsz Kin Lam, Alexandra Birch, Barry Haddow et.al.	2402.19333v2	null
2024-02-25	Direct Punjabi to English speech translation using discrete units	Prabhjot Kaur, L. Andrew M. Bush, Weisong Shi et.al.	2402.15967v1	null
2024-05-17	Speech Translation with Speech Foundation Models and Large Language Models: What is There and What is Missing?	Marco Gaido, Sara Papi, Matteo Negri, Luisa Bentivogli et.al.	2402.12025v2	null
2024-06-05	Pushing the Limits of Zero-shot End-to-End Speech Translation	Ioannis Tsiamas, Gerard I. Gállego, José A. R. Fonollosa, Marta R. Costa-jussà et.al.	2402.10422v2	link
2024-02-02	A Case Study on Filtering for End-to-End Speech Translation	Md Mahfuz Ibn Alam, Antonios Anastasopoulos et.al.	2402.01945v1	null
2024-01-17	TranSentence: Speech-to-speech Translation via Language-agnostic Sentence-level Speech Encoding without Language-parallel Data	Seung-Bin Kim, Sang-Hoon Lee, Seong-Whan Lee et.al.	2401.12992v1	null
2024-01-11	R-BI: Regularized Batched Inputs enhance Incremental Decoding Framework for Low-Latency Simultaneous Speech Translation	Jiaxin Guo, Zhanglin Wu, Zongyao Li, Hengchao Shang, Daimeng Wei, Xiaoyu Chen, Zhiqiang Rao, Shaojun Li, Hao Yang et.al.	2401.05700v1	null
2023-12-21	Speech Translation with Large Language Models: An Industrial Practice	Zhichao Huang, Rong Ye, Tom Ko, Qianqian Dong, Shanbo Cheng, Mingxuan Wang, Hang Li et.al.	2312.13585v1	null
2023-12-18	Soft Alignment of Modality Space for End-to-end Speech Translation	Yuhao Zhang, Kaiqi Kou, Bei Li, Chen Xu, Chunliang Zhang, Tong Xiao, Jingbo Zhu et.al.	2312.10952v1	null
2023-12-08	Seamless: Multilingual Expressive and Streaming Speech Translation	Seamless Communication, Loïc Barrault, Yu-An Chung, Mariano Coria Meglioli, David Dale, Ning Dong, Mark Duppenthaler, Paul-Ambroise Duquenne, Brian Ellis, Hady Elsahar, Justin Haaheim, John Hoffman, Min-Jae Hwang, Hirofumi Inaguma, Christopher Klaiber, Ilia Kulikov, Pengwei Li, Daniel Licht, Jean Maillard, Ruslan Mavlyutov, Alice Rakotoarison, Kaushik Ram Sadagopan, Abinesh Ramakrishnan, Tuan Tran, Guillaume Wenzek, Yilin Yang, Ethan Ye, Ivan Evtimov, Pierre Fernandez, Cynthia Gao, Prangthip Hansanti, Elahe Kalbassi, Amanda Kallet, Artyom Kozhevnikov, Gabriel Mejia Gonzalez, Robin San Roman, Christophe Touret, Corinne Wong, Carleigh Wood, Bokai Yu, Pierre Andrews, Can Balioglu, Peng-Jen Chen, Marta R. Costa-jussà, Maha Elbayad, Hongyu Gong, Francisco Guzmán, Kevin Heffernan, Somya Jain, Justine Kao, Ann Lee, Xutai Ma, Alex Mourachko, Benjamin Peloquin, Juan Pino, Sravya Popuri, Christophe Ropers, Safiyyah Saleem, Holger Schwenk, Anna Sun, Paden Tomasello, Changhan Wang, Jeff Wang, Skyler Wang, Mary Williamson et.al.	2312.05187v1	link
2024-03-26	AV2AV: Direct Audio-Visual Speech to Audio-Visual Speech Translation with Unified Audio-Visual Speech Representation	Jeongsoo Choi, Se Jin Park, Minsu Kim, Yong Man Ro et.al.	2312.02512v2	link
2023-11-07	Rethinking and Improving Multi-task Learning for End-to-end Speech Translation	Yuhao Zhang, Chen Xu, Bei Li, Hao Chen, Tong Xiao, Chunliang Zhang, Jingbo Zhu et.al.	2311.03810v1	link
2023-11-01	End-to-End Single-Channel Speaker-Turn Aware Conversational Speech Translation	Juan Zuluaga-Gomez, Zhaocheng Huang, Xing Niu, Rohit Paturi, Sundararajan Srinivasan, Prashant Mathur, Brian Thompson, Marcello Federico et.al.	2311.00697v1	link
2023-10-31	Towards a Deep Understanding of Multilingual End-to-End Speech Translation	Haoran Sun, Xiaohu Zhao, Yikun Lei, Shaolin Zhu, Deyi Xiong et.al.	2310.20456v1	link
2023-10-26	DiffS2UT: A Semantic Preserving Diffusion Model for Textless Direct Speech-to-Speech Translation	Yongxin Zhu, Zhujin Gao, Xinyuan Zhou, Zhongyi Ye, Linli Xu et.al.	2310.17570v1	null
2023-10-24	Integrating Language Models into Direct Speech Translation: An Inference-Time Solution to Control Gender Inflection	Dennis Fucci, Marco Gaido, Sara Papi, Mauro Cettolo, Matteo Negri, Luisa Bentivogli et.al.	2310.15752v1	link
2023-10-23	How To Build Competitive Multi-gender Speech Translation Models For Controlling Speaker Gender Translation	Marco Gaido, Dennis Fucci, Matteo Negri, Luisa Bentivogli et.al.	2310.15114v1	link
2023-10-23	Long-Form Speech Translation through Segmentation with Finite-State Decoding Constraints on Large Language Models	Arya D. McCarthy, Hao Zhang, Shankar Kumar, Felix Stahlberg, Ke Wu et.al.	2310.13678v2	null
2023-10-23	Towards Real-World Streaming Speech Translation for Code-Switched Speech	Belen Alastruey, Matthias Sperber, Christian Gollan, Dominic Telaar, Tim Ng, Aashish Agarwal et.al.	2310.12648v2	link
2023-10-17	Long-form Simultaneous Speech Translation: Thesis Proposal	Peter Polák et.al.	2310.11141v1	null
2023-10-13	Dialect Transfer for Swiss German Speech Translation	Claudio Paonessa, Yanick Schraner, Jan Deriu, Manuela Hürlimann, Manfred Vogel, Mark Cieliebak et.al.	2310.09088v1	null
2023-10-11	DASpeech: Directed Acyclic Transformer for Fast and High-quality Speech-to-Speech Translation	Qingkai Fang, Yan Zhou, Yang Feng et.al.	2310.07403v1	link
2023-10-11	Enhancing expressivity transfer in textless speech-to-speech translation	Jarod Duret, Benjamin O'Brien, Yannick Estève, Titouan Parcollet et.al.	2310.07279v1	null
2023-10-06	Improving Stability in Simultaneous Speech Translation: A Revision-Controllable Decoding Approach	Junkun Chen, Jian Xue, Peidong Wang, Jing Pan, Jinyu Li et.al.	2310.04399v1	null
2023-10-03	Tuning Large language model for End-to-end Speech Translation	Hao Zhang, Nianwen Si, Yaqi Chen, Wenlin Zhang, Xukui Yang, Dan Qu, Xiaolin Jiao et.al.	2310.02050v1	null
2023-10-07	LAE-ST-MoE: Boosted Language-Aware Encoder Using Speech Translation Auxiliary Task for E2E Code-switching ASR	Guodong Ma, Wenxuan Wang, Yuke Li, Yuting Yang, Binbin Du, Haoran Fu et.al.	2309.16178v2	null
2023-09-27	Enhancing End-to-End Conversational Speech Translation Through Target Language Context Utilization	Amir Hussein, Brian Yan, Antonios Anastasopoulos, Shinji Watanabe, Sanjeev Khudanpur et.al.	2309.15686v1	null
2023-09-21	Bridging the Gaps of Both Modality and Language: Synchronous Bilingual CTC for Speech Translation and Speech Recognition	Chen Xu, Xiaoqian Liu, Erfeng He, Yuhao Zhang, Qianqian Dong, Tong Xiao, Jingbo Zhu, Dapeng Man, Wu Yang et.al.	2309.12234v1	link
2024-04-25	SpeechAlign: a Framework for Speech Translation Alignment Evaluation	Belen Alastruey, Aleix Sant, Gerard I. Gállego, David Dale, Marta R. Costa-jussà et.al.	2309.11585v2	null
2023-09-20	Long-Form End-to-End Speech Translation via Latent Alignment Segmentation	Peter Polák, Ondřej Bojar et.al.	2309.11384v1	null
2023-09-20	Incremental Blockwise Beam Search for Simultaneous Speech Translation with Controllable Quality-Latency Tradeoff	Peter Polák, Brian Yan, Shinji Watanabe, Alex Waibel, Ondřej Bojar et.al.	2309.11379v1	null
2024-01-22	DiariST: Streaming Speech Translation with Speaker Diarization	Mu Yang, Naoyuki Kanda, Xiaofei Wang, Junkun Chen, Peidong Wang, Jian Xue, Jinyu Li, Takuya Yoshioka et.al.	2309.08007v2	link
2024-07-19	Speech-to-Speech Translation with Discrete-Unit-Based Style Transfer	Yongqi Wang, Jionghao Bai, Rongjie Huang, Ruiqi Li, Zhiqing Hong, Zhou Zhao et.al.	2309.07566v2	null
2023-09-14	Direct Text to Speech Translation System using Acoustic Units	Victoria Mingote, Pablo Gimeno, Luis Vicente, Sameer Khurana, Antoine Laurent, Jarod Duret et.al.	2309.07478v1	null
2024-07-17	End-to-End Evaluation for Low-Latency Simultaneous Speech Translation	Christian Huber, Tu Anh Dinh, Carlos Mullov, Ngoc Quan Pham, Thai Binh Nguyen, Fabian Retkowski, Stefan Constantin, Enes Yavuz Ugan, Danni Liu, Zhaolin Li, Sai Koneru, Jan Niehues, Alexander Waibel et.al.	2308.03415v3	null
2023-07-17	Multilingual Speech-to-Speech Translation into Multiple Target Languages	Hongyu Gong, Ning Dong, Sravya Popuri, Vedanuj Goswami, Ann Lee, Juan Pino et.al.	2307.08655v1	null
2023-07-17	Improving End-to-End Speech Translation by Imitation-Based Knowledge Distillation with Synthetic Transcripts	Rebekka Hubert, Artem Sokolov, Stefan Riezler et.al.	2307.08426v1	link
2023-07-10	The NPU-MSXF Speech-to-Speech Translation System for IWSLT 2023 Speech-to-Speech Translation Task	Kun Song, Yi Lei, Peikun Chen, Yiqing Cao, Kun Wei, Yongmao Zhang, Lei Xie, Ning Jiang, Guoqing Zhao et.al.	2307.04630v1	null
2023-07-03	Implicit Memory Transformer for Computationally Efficient Simultaneous Speech Translation	Matthew Raffel, Lizhong Chen et.al.	2307.01381v1	link
2023-07-03	Shiftable Context: Addressing Training-Inference Context Mismatch in Simultaneous Speech Translation	Matthew Raffel, Drew Penney, Lizhong Chen et.al.	2307.01377v1	link
2023-06-20	HK-LegiCoST: Leveraging Non-Verbatim Transcripts for Speech Translation	Cihan Xiao, Henry Li Xinyuan, Jinyi Yang, Dongji Gao, Matthew Wiesner, Kevin Duh, Sanjeev Khudanpur et.al.	2306.11252v1	link
2023-06-14	Tagged End-to-End Simultaneous Speech Translation Training using Simultaneous Interpretation Data	Yuka Ko, Ryo Fukuda, Yuta Nishikawa, Yasumasa Kano, Katsuhito Sudoh, Satoshi Nakamura et.al.	2306.08582v1	null
2023-06-13	NAVER LABS Europe's Multilingual Speech Translation Systems for the IWSLT 2023 Low-Resource Track	Edward Gow-Smith, Alexandre Berard, Marcely Zanon Boito, Ioan Calapodescu et.al.	2306.07763v1	null
2023-06-13	Modality Adaption or Regularization? A Case Study on End-to-End Speech Translation	Yuchen Han, Chen Xu, Tong Xiao, Jingbo Zhu et.al.	2306.07650v1	link
2023-07-12	KIT's Multilingual Speech Translation System for IWSLT 2023	Danni Liu, Thai Binh Nguyen, Sai Koneru, Enes Yavuz Ugan, Ngoc-Quan Pham, Tuan-Nam Nguyen, Tu Anh Dinh, Carlos Mullov, Alexander Waibel, Jan Niehues et.al.	2306.05320v3	link
2023-06-13	PolyVoice: Language Models for Speech to Speech Translation	Qianqian Dong, Zhiying Huang, Qiao Tian, Chen Xu, Tom Ko, Yunlong Zhao, Siyuan Feng, Tang Li, Kexin Wang, Xuxin Cheng, Fengpeng Yue, Ye Bai, Xi Chen, Lu Lu, Zejun Ma, Yuping Wang, Mingxuan Wang, Yuxuan Wang et.al.	2306.02982v2	null
2023-06-02	Speech Translation with Foundation Models and Optimal Transport: UPC at IWSLT23	Ioannis Tsiamas, Gerard I. Gállego, José A. R. Fonollosa, Marta R. Costa-jussà et.al.	2306.01327v1	null
2023-06-01	Learning When to Speak: Latency and Quality Trade-offs for Simultaneous Speech-to-Speech Translation with Offline Models	Liam Dugan, Anshul Wadhawan, Kyle Spence, Chris Callison-Burch, Morgan McGuire, Victor Zordan et.al.	2306.01201v1	link
2024-01-25	Improved Cross-Lingual Transfer Learning For Automatic Speech Translation	Sameer Khurana, Nauman Dawalatabad, Antoine Laurent, Luis Vicente, Pablo Gimeno, Victoria Mingote, James Glass et.al.	2306.00789v4	null
2023-07-25	StyleS2ST: Zero-shot Style Transfer for Direct Speech-to-speech Translation	Kun Song, Yi Ren, Yi Lei, Chunfeng Wang, Kun Wei, Lei Xie, Xiang Yin, Zejun Ma et.al.	2305.17732v4	null
2024-01-16	Translatotron 3: Speech to Speech Translation with Monolingual Data	Eliya Nachmani, Alon Levkovitch, Yifan Ding, Chulayuth Asawaroengchai, Heiga Zen, Michelle Tadmor Ramanovich et.al.	2305.17547v3	null
2023-05-27	CTC-based Non-autoregressive Speech Translation	Chen Xu, Xiaoqian Liu, Xiaowen Liu, Qingxuan Sun, Yuhao Zhang, Murun Yang, Qianqian Dong, Tom Ko, Mingxuan Wang, Tong Xiao, Anxiang Ma, Jingbo Zhu et.al.	2305.17358v1	link
2023-05-26	Inter-connection: Effective Connection between Pre-trained Encoder and Decoder for Speech Translation	Yuta Nishikawa, Satoshi Nakamura et.al.	2305.16897v1	null
2023-06-18	End-to-End Simultaneous Speech Translation with Differentiable Segmentation	Shaolei Zhang, Yang Feng et.al.	2305.16093v2	link
2024-02-20	Textless Low-Resource Speech-to-Speech Translation With Unit Language Models	Anuj Diwan, Anirudh Srinivasan, David Harwath, Eunsol Choi et.al.	2305.15405v2	link
2023-05-24	AV-TranSpeech: Audio-Visual Robust Speech-to-Speech Translation	Rongjie Huang, Huadai Liu, Xize Cheng, Yi Ren, Linjun Li, Zhenhui Ye, Jinzheng He, Lichao Zhang, Jinglin Liu, Xiang Yin, Zhou Zhao et.al.	2305.15403v1	null
2023-05-25	CMOT: Cross-modal Mixup via Optimal Transport for Speech Translation	Yan Zhou, Qingkai Fang, Yang Feng et.al.	2305.14635v2	link
2023-05-23	Improving speech translation by fusing speech and text	Wenbiao Yin, Zhicheng Liu, Chengqi Zhao, Tao Wang, Jian Tong, Rong Ye et.al.	2305.14042v1	null
2023-05-22	Improving Metrics for Speech Translation	Claudio Paonessa, Dominik Frefel, Manfred Vogel et.al.	2305.12918v1	null
2023-05-22	Duplex Diffusion Models Improve Speech-to-Speech Translation	Xianchao Wu et.al.	2305.12628v1	null
2023-05-19	DUB: Discrete Unit Back-translation for Speech Translation	Dong Zhang, Rong Ye, Tom Ko, Mingxuan Wang, Yaqian Zhou et.al.	2305.11411v1	link
2023-07-20	AlignAtt: Using Attention-based Audio-Translation Alignments as a Guide for Simultaneous Speech Translation	Sara Papi, Marco Turchi, Matteo Negri et.al.	2305.11408v2	link
2023-10-17	The Interpreter Understands Your Meaning: End-to-end Spoken Language Understanding Aided by Speech Translation	Mutian He, Philip N. Garner et.al.	2305.09652v2	link
2023-05-15	Understanding and Bridging the Modality Gap for Speech Translation	Qingkai Fang, Yang Feng et.al.	2305.08706v1	link
2023-05-12	Improving Cascaded Unsupervised Speech Translation with Denoising Back-translation	Yu-Kuan Fu, Liang-Hsuan Tseng, Jiatong Shi, Chen-An Li, Tsu-Yuan Hsu, Shinji Watanabe, Hung-yi Lee et.al.	2305.07455v1	null
2023-12-18	Improving Speech Translation Accuracy and Time Efficiency with Fine-tuned wav2vec 2.0-based Speech Segmentation	Ryo Fukuda, Katsuhito Sudoh, Satoshi Nakamura et.al.	2304.12659v2	link
2023-04-20	Improving Speech Translation by Cross-Modal Multi-Grained Contrastive Learning	Hao Zhang, Nianwen Si, Yaqi Chen, Wenlin Zhang, Xukui Yang, Dan Qu, Wei-Qiang Zhang et.al.	2304.10309v1	null
2023-04-20	Decouple Non-parametric Knowledge Distillation For End-to-end Speech Translation	Hao Zhang, Nianwen Si, Yaqi Chen, Wenlin Zhang, Xukui Yang, Dan Qu, Zhen Li et.al.	2304.10295v1	null
2023-04-10	Enhancing Speech-to-Speech Translation with Multiple TTS Targets	Jiatong Shi, Yun Tang, Ann Lee, Hirofumi Inaguma, Changhan Wang, Juan Pino, Shinji Watanabe et.al.	2304.04618v1	null
2023-04-25	Selective Data Augmentation for Robust Speech Translation	Rajul Acharya, Ashish Panda, Sunil Kumar Kopparapu et.al.	2304.03169v2	null
2023-10-26	Adapting Offline Speech Translation Models for Streaming with Future-Aware Distillation and Inference	Biao Fu, Minpeng Liao, Kai Fan, Zhongqiang Huang, Boxing Chen, Yidong Chen, Xiaodong Shi et.al.	2303.07914v2	link
2023-03-09	MixSpeech: Cross-Modality Self-Learning with Audio-Visual Stream Mixup for Visual Speech Translation and Recognition	Xize Cheng, Linjun Li, Tao Jin, Rongjie Huang, Wang Lin, Zehan Wang, Huangdai Liu, Ye Wang, Aoxiong Yin, Zhou Zhao et.al.	2303.05309v1	link
2023-02-21	Efficient CTC Regularization via Coarse Labels for End-to-End Speech Translation	Biao Zhang, Barry Haddow, Rico Sennrich et.al.	2302.10871v1	link
2023-06-05	Pre-training for Speech Translation: CTC Meets Optimal Transport	Phuong-Hang Le, Hongyu Gong, Changhan Wang, Juan Pino, Benjamin Lecouteux, Didier Schwab et.al.	2301.11716v3	link
2023-01-25	A Holistic Cascade System, benchmark, and Human Evaluation Protocol for Expressive Speech-to-Speech Translation	Wen-Chin Huang, Benjamin Peloquin, Justine Kao, Changhan Wang, Hongyu Gong, Elizabeth Salesky, Yossi Adi, Ann Lee, Peng-Jen Chen et.al.	2301.10606v1	null
2023-11-01	SegAugment: Maximizing the Utility of Speech Translation Data with Segmentation-based Augmentations	Ioannis Tsiamas, José A. R. Fonollosa, Marta R. Costa-jussà et.al.	2212.09699v3	link
2023-07-07	WACO: Word-Aligned Contrastive Learning for Speech Translation	Siqi Ouyang, Rong Ye, Lei Li et.al.	2212.09359v3	link
2022-12-17	AdaTranS: Adapting with Boundary-based Shrinking for End-to-End Speech Translation	Xingshan Zeng, Liangyou Li, Qun Liu et.al.	2212.08911v1	null
2022-12-16	BLASER: A Text-Free Speech-to-Speech Translation Evaluation Metric	Mingda Chen, Paul-Ambroise Duquenne, Pierre Andrews, Justine Kao, Alexandre Mourachko, Holger Schwenk, Marta R. Costa-jussà et.al.	2212.08486v1	link
2023-05-26	UnitY: Two-pass Direct Speech-to-speech Translation with Discrete Units	Hirofumi Inaguma, Sravya Popuri, Ilia Kulikov, Peng-Jen Chen, Changhan Wang, Yu-An Chung, Yun Tang, Ann Lee, Shinji Watanabe, Juan Pino et.al.	2212.08055v2	link
2023-05-11	Attention as a Guide for Simultaneous Speech Translation	Sara Papi, Matteo Negri, Marco Turchi et.al.	2212.07850v2	link
2022-12-12	Direct Speech-to-speech Translation without Textual Annotation using Bottleneck Features	Junhui Zhang, Junjie Pan, Xiang Yin, Zejun Ma et.al.	2212.05805v1	null
2022-12-11	End-to-End Speech Translation of Arabic to English Broadcast News	Fethi Bougares, Salim Jouili et.al.	2212.05479v1	null
2022-12-07	M3ST: Mix at Three Levels for Speech Translation	Xuxin Cheng, Qianqian Dong, Fengpeng Yue, Tom Ko, Mingxuan Wang, Yuexian Zou et.al.	2212.03657v1	null
2022-12-04	Improving End-to-end Speech Translation by Leveraging Auxiliary Speech and Text Data	Yuhao Zhang, Chen Xu, Bojie Hu, Chunliang Zhang, Tong Xiao, Jingbo Zhu et.al.	2212.01778v1	null
2022-11-22	ArzEn-ST: A Three-way Speech Translation Corpus for Code-Switched Egyptian Arabic - English	Injy Hamed, Nizar Habash, Slim Abdennadher, Ngoc Thang Vu et.al.	2211.12000v1	null
2023-06-01	MT Metrics Correlate with Human Ratings of Simultaneous Speech Translation	Dominik Macháček, Ondřej Bojar, Raj Dabre et.al.	2211.08633v2	link
2022-11-11	Speech-to-Speech Translation For A Real-world Unwritten Language	Peng-Jen Chen, Kevin Tran, Yilin Yang, Jingfei Du, Justine Kao, Yu-An Chung, Paden Tomasello, Paul-Ambroise Duquenne, Holger Schwenk, Hongyu Gong, Hirofumi Inaguma, Sravya Popuri, Changhan Wang, Juan Pino, Wei-Ning Hsu, Ann Lee et.al.	2211.06474v1	null
2022-11-11	Align, Write, Re-order: Explainable End-to-End Speech Translation via Operation Sequence Generation	Motoi Omachi, Brian Yan, Siddharth Dalmia, Yuya Fujita, Shinji Watanabe et.al.	2211.05967v1	null
2022-11-09	Efficient Speech Translation with Pre-trained Models	Zhaolin Li, Jan Niehues et.al.	2211.04939v1	null
2022-11-08	SpeechMatrix: A Large-Scale Mined Corpus of Multilingual Speech-to-Speech Translations	Paul-Ambroise Duquenne, Hongyu Gong, Ning Dong, Jingfei Du, Ann Lee, Vedanuj Goswami, Changhan Wang, Juan Pino, Benoît Sagot, Holger Schwenk et.al.	2211.04508v1	null
2022-10-31	Textless Direct Speech-to-Speech Translation with Discrete Speech Representation	Xinjian Li, Ye Jia, Chung-Cheng Chiu et.al.	2211.00115v1	null
2022-10-31	Joint Pre-Training with Speech and Bilingual Text for Direct Speech to Speech Translation	Kun Wei, Long Zhou, Ziqiang Zhang, Liping Chen, Shujie Liu, Lei He, Jinyu Li, Furu Wei et.al.	2210.17027v1	link
2023-03-14	Efficient Speech Translation with Dynamic Latent Perceivers	Ioannis Tsiamas, Gerard I. Gállego, José A. R. Fonollosa, Marta R. Costa-jussà et.al.	2210.16264v2	link
2022-10-26	Improving Speech-to-Speech Translation Through Unlabeled Text	Xuan-Phi Nguyen, Sravya Popuri, Changhan Wang, Yun Tang, Ilia Kulikov, Hongyu Gong et.al.	2210.14514v1	null
2022-11-24	Does Joint Training Really Help Cascaded Speech Translation?	Viet Anh Khoa Tran, David Thulke, Yingbo Gao, Christian Herold, Hermann Ney et.al.	2210.13700v2	link
2023-05-20	Joint Speech Translation and Named Entity Recognition	Marco Gaido, Sara Papi, Matteo Negri, Marco Turchi et.al.	2210.11987v2	link
2023-03-11	Named Entity Detection and Injection for Direct Speech Translation	Marco Gaido, Yun Tang, Ilia Kulikov, Rongqing Huang, Hongyu Gong, Hirofumi Inaguma et.al.	2210.11981v2	null
2022-10-18	Simple and Effective Unsupervised Speech Translation	Changhan Wang, Hirofumi Inaguma, Peng-Jen Chen, Ilia Kulikov, Yun Tang, Wei-Ning Hsu, Michael Auli, Juan Pino et.al.	2210.10191v1	null
2022-10-18	Discrete Cross-Modal Alignment Enables Zero-Shot Speech Translation	Chen Wang, Yuchen Liu, Boxing Chen, Jiajun Zhang, Wei Luo, Zhongqiang Huang, Chengqing Zong et.al.	2210.09556v1	link
2022-10-16	RedApt: An Adaptor for wav2vec 2 Encoding Faster and Smaller Speech Translation without Quality Compromise	Jinming Zhao, Hao Yang, Gholamreza Haffari, Ehsan Shareghi et.al.	2210.08475v1	null
2023-02-08	Generating Synthetic Speech from SpokenVocab for Speech Translation	Jinming Zhao, Gholamreza Haffari, Ehsan Shareghi et.al.	2210.08174v2	link
2022-11-09	Code-Switching without Switching: Language Agnostic End-to-End Speech Translation	Christian Huber, Enes Yavuz Ugan, Alexander Waibel et.al.	2210.01512v2	null
2023-07-25	Direct Speech Translation for Automatic Subtitling	Sara Papi, Marco Gaido, Alina Karakanta, Mauro Cettolo, Matteo Negri, Marco Turchi et.al.	2209.13192v2	link
2022-08-08	A High-Quality and Large-Scale Dataset for English-Vietnamese Speech Translation	Linh The Nguyen, Nguyen Luong Tran, Long Doan, Manh Luong, Dat Quoc Nguyen et.al.	2208.04243v1	link
2022-07-01	On the Impact of Noises in Crowd-Sourced Data for Speech Translation	Siqi Ouyang, Rong Ye, Lei Li et.al.	2206.13756v2	link
2022-06-20	Over-Generation Cannot Be Rewarded: Length-Adaptive Average Lagging for Simultaneous Speech Translation	Sara Papi, Marco Gaido, Matteo Negri, Marco Turchi et.al.	2206.05807v3	link
2022-06-14	The YiTrans End-to-End Speech Translation System for IWSLT 2022 Offline Shared Task	Ziqiang Zhang, Junyi Ao, Long Zhou, Shujie Liu, Furu Wei, Jinyu Li et.al.	2206.05777v2	link
2023-03-02	TranSpeech: Speech-to-Speech Translation With Bilateral Perturbation	Rongjie Huang, Jinglin Liu, Huadai Liu, Yi Ren, Lichao Zhang, Jinzheng He, Zhou Zhao et.al.	2205.12523v2	null
2022-11-04	Non-Parametric Domain Adaptation for End-to-End Speech Translation	Yichao Du, Weizhi Wang, Zhirui Zhang, Boxing Chen, Tong Xu, Jun Xie, Enhong Chen et.al.	2205.11211v6	link
2022-05-18	Leveraging Pseudo-labeled Data to Improve Direct Speech-to-Speech Translation	Qianqian Dong, Fengpeng Yue, Tom Ko, Mingxuan Wang, Qibing Bai, Yu Zhang et.al.	2205.08993v1	link
2022-05-14	Multiformer: A Head-Configurable Transformer-Based Model for Direct Speech Translation	Gerard Sant, Gerard I. Gállego, Belen Alastruey, Marta R. Costa-Jussà et.al.	2205.07100v1	null
2022-05-13	Who Are We Talking About? Handling Person Names in Speech Translation	Marco Gaido, Matteo Negri, Marco Turchi et.al.	2205.06755v1	link
2022-05-05	Efficient yet Competitive Speech Translation: FBK@IWSLT2022	Marco Gaido, Sara Papi, Dennis Fucci, Giuseppe Fiameni, Matteo Negri, Marco Turchi et.al.	2205.02629v1	link
2022-05-05	Cross-modal Contrastive Learning for Speech Translation	Rong Ye, Mingxuan Wang, Lei Li et.al.	2205.02444v1	link
2022-05-04	ON-TRAC Consortium Systems for the IWSLT 2022 Dialect and Low-resource Speech Translation Tasks	Marcely Zanon Boito, John Ortega, Hugo Riguidel, Antoine Laurent, Loïc Barrault, Fethi Bougares, Firas Chaabani, Ha Nguyen, Florentin Barbier, Souhir Gahbiche, Yannick Estève et.al.	2205.01987v1	null
2022-04-22	LibriS2S: A German-English Speech-to-Speech Translation Corpus	Pedro Jeuris, Jan Niehues et.al.	2204.10593v1	link
2022-10-03	Exploring Continuous Integrate-and-Fire for Adaptive Simultaneous Speech Translation	Chih-Chiang Chang, Hung-yi Lee et.al.	2204.09595v3	link
2022-04-19	On the Locality of Attention in Direct Speech Translation	Belen Alastruey, Javier Ferrando, Gerard I. Gállego, Marta R. Costa-jussà et.al.	2204.09028v1	null
2022-04-19	Blockwise Streaming Transformer for Spoken Language Understanding and Simultaneous Speech Translation	Keqi Deng, Shinji Watanabe, Jiatong Shi, Siddhant Arora et.al.	2204.08920v1	null
2022-05-11	CUNI-KIT System for Simultaneous Speech Translation Task at IWSLT 2022	Peter Polák, Ngoc-Quan Ngoc, Tuan-Nam Nguyen, Danni Liu, Carlos Mullov, Jan Niehues, Ondřej Bojar, Alexander Waibel et.al.	2204.06028v2	null
2022-04-11	Unified Speech-Text Pre-training for Speech Translation and Recognition	Yun Tang, Hongyu Gong, Ning Dong, Changhan Wang, Wei-Ning Hsu, Jiatao Gu, Alexei Baevski, Xian Li, Abdelrahman Mohamed, Michael Auli, Juan Pino et.al.	2204.05409v1	null
2022-07-01	Large-Scale Streaming End-to-End Speech Translation with Neural Transducers	Jian Xue, Peidong Wang, Jinyu Li, Matt Post, Yashesh Gaur et.al.	2204.05352v2	null
2022-04-11	End-to-End Speech Translation for Code Switched Speech	Orion Weller, Matthias Sperber, Telmo Pires, Hendra Setiawan, Christian Gollan, Dominic Telaar, Matthias Paulik et.al.	2204.05076v1	link
2023-06-06	GigaST: A 10,000-hour Pseudo Speech Translation Corpus	Rong Ye, Chengqi Zhao, Tom Ko, Chutong Meng, Tao Wang, Mingxuan Wang, Jun Cao et.al.	2204.03939v2	null
2022-11-16	Does Simultaneous Speech Translation need Simultaneous Models?	Sara Papi, Marco Gaido, Matteo Negri, Marco Turchi et.al.	2204.03783v3	link
2022-09-13	Enhanced Direct Speech-to-Speech Translation Using Self-supervised Pre-training and Data Augmentation	Sravya Popuri, Peng-Jen Chen, Changhan Wang, Juan Pino, Yossi Adi, Jiatao Gu, Wei-Ning Hsu, Ann Lee et.al.	2204.02967v3	null
2022-07-13	Speech Segmentation Optimization using Segmented Bilingual Speech Corpus for End-to-end Speech Translation	Ryo Fukuda, Katsuhito Sudoh, Satoshi Nakamura et.al.	2203.15479v2	link
2022-03-29	Multilingual Simultaneous Speech Translation	Shashank Subramanya, Jan Niehues et.al.	2203.14835v2	null
2022-06-27	Leveraging unsupervised and weakly-supervised data to improve direct speech-to-speech translation	Ye Jia, Yifan Ding, Ankur Bapna, Colin Cherry, Yu Zhang, Alexis Conneau, Nobuyuki Morioka et.al.	2203.13339v2	null
2022-03-20	STEMM: Self-learning with Speech-text Manifold Mixup for Speech Translation	Qingkai Fang, Rong Ye, Lei Li, Yang Feng, Mingxuan Wang et.al.	2203.10426v1	link
2022-03-18	Under the Morphosyntactic Lens: A Multifaceted Evaluation of Gender Bias in Speech Translation	Beatrice Savoldi, Marco Gaido, Luisa Bentivogli, Matteo Negri, Marco Turchi et.al.	2203.09866v1	link
2022-03-16	Sample, Translate, Recombine: Leveraging Audio Alignments for Data Augmentation in End-to-end Speech Translation	Tsz Kin Lam, Shigehiko Schamoni, Stefan Riezler et.al.	2203.08757v1	null
2022-03-04	Comprehension of Subtitles from Re-Translating Simultaneous Speech Translation	Dávid Javorský, Dominik Macháček, Ondřej Bojar et.al.	2203.02458v1	null
2022-07-06	SHAS: Approaching optimal Segmentation for End-to-End Speech Translation	Ioannis Tsiamas, Gerard I. Gállego, José A. R. Fonollosa, Marta R. Costa-jussà et.al.	2202.04774v3	link
2022-09-04	Prabhupadavani: A Code-mixed Speech Translation Data for 25 Languages	Jivnesh Sandhan, Ayush Daksh, Om Adideva Paranjay, Laxmidhar Behera, Pawan Goyal et.al.	2201.11391v2	link
2022-01-26	Tackling data scarcity in speech translation using zero-shot multilingual machine translation techniques	Tu Anh Dinh, Danni Liu, Jan Niehues et.al.	2201.11172v1	link
2022-06-26	CVSS Corpus and Massively Multilingual Speech-to-Speech Translation	Ye Jia, Michelle Tadmor Ramanovich, Quan Wang, Heiga Zen et.al.	2201.03713v3	link
2022-05-25	Regularizing End-to-End Speech Translation with Triangular Decomposition Agreement	Yichao Du, Zhirui Zhang, Weizhi Wang, Boxing Chen, Jun Xie, Tong Xu et.al.	2112.10991v2	link
2022-05-04	Textless Speech-to-Speech Translation on Real Data	Ann Lee, Hongyu Gong, Paul-Ambroise Duquenne, Holger Schwenk, Peng-Jen Chen, Changhan Wang, Sravya Popuri, Yossi Adi, Juan Pino, Jiatao Gu, Wei-Ning Hsu et.al.	2112.08352v2	null
2021-11-08	Visualization: the missing factor in Simultaneous Speech Translation	Sara Papi, Matteo Negri, Marco Turchi et.al.	2111.00514v2	null
2022-06-17	Decision Attentive Regularization to Improve Simultaneous Speech Translation Systems	Mohd Abbas Zaidi, Beomseok Lee, Sangha Kim, Chanwoo Kim et.al.	2110.15729v2	null
2021-10-26	Assessing Evaluation Metrics for Speech-to-Speech Translation	Elizabeth Salesky, Julian Mäder, Severin Klinger et.al.	2110.13877v1	null
2022-01-12	Direct Simultaneous Speech-to-Speech Translation with Variational Monotonic Multihead Attention	Xutai Ma, Hongyu Gong, Danni Liu, Ann Lee, Yun Tang, Peng-Jen Chen, Wei-Ning Hsu, Philipp Koehn, Juan Pino et.al.	2110.08250v2	null
2022-07-15	From Start to Finish: Latency Reduction Strategies for Incremental Speech Synthesis in Simultaneous Speech-to-Speech Translation	Danni Liu, Changhan Wang, Hongyu Gong, Xutai Ma, Yun Tang, Juan Pino et.al.	2110.08214v3	null
2021-09-27	Fast-MD: Fast Multi-Decoder End-to-End Speech Translation with Non-Autoregressive Hidden Intermediates	Hirofumi Inaguma, Siddharth Dalmia, Brian Yan, Shinji Watanabe et.al.	2109.12804v1	null
2021-09-15	Is "moby dick" a Whale or a Bird? Named Entities and Terminology in Speech Translation	Marco Gaido, Susana Rodríguez, Matteo Negri, Luisa Bentivogli, Marco Turchi et.al.	2109.07439v1	link
2021-09-09	Speechformer: Reducing Information Loss in Direct Speech Translation	Sara Papi, Marco Gaido, Matteo Negri, Marco Turchi et.al.	2109.04574v1	link
2021-09-09	Non-autoregressive End-to-end Speech Translation with Parallel Autoregressive Rescoring	Hirofumi Inaguma, Yosuke Higuchi, Kevin Duh, Tatsuya Kawahara, Shinji Watanabe et.al.	2109.04411v1	null
2021-08-09	The HW-TSC's Offline Speech Translation Systems for IWSLT 2021 Evaluation	Minghan Wang, Yuxia Wang, Chang Su, Jiaxin Guo, Yingtao Zhang, Yujia Liu, Min Zhang, Shimin Tao, Xingshan Zeng, Liangyou Li, Hao Yang, Ying Qin et.al.	2108.03845v1	null
2021-07-24	The USYD-JD Speech Translation System for IWSLT 2021	Liang Ding, Di Wu, Dacheng Tao et.al.	2107.11572v1	null
2021-07-20	Simultaneous Speech Translation for Live Subtitling: from Delay to Display	Alina Karakanta, Sara Papi, Matteo Negri, Marco Turchi et.al.	2107.08807v2	link
2022-05-17	Translatotron 2: High-quality direct speech-to-speech translation with voice preservation	Ye Jia, Michelle Tadmor Ramanovich, Tal Remez, Roi Pomerantz et.al.	2107.08661v5	null
2021-08-14	FST: the FAIR Speech Translation System for the IWSLT21 Multilingual Shared Task	Yun Tang, Hongyu Gong, Xian Li, Changhan Wang, Juan Pino, Holger Schwenk, Naman Goyal et.al.	2107.06959v2	null
2021-07-13	The IWSLT 2021 BUT Speech Translation Systems	Hari Krishna Vydana, Martin Karafiát, Lukáš Burget, "Honza" Černocký et.al.	2107.06155v1	null
2021-07-13	Zero-shot Speech Translation	Tu Anh Dinh et.al.	2107.06010v1	null
2021-07-12	Improving Speech Translation by Understanding and Learning from the Auxiliary Text Translation Task	Yun Tang, Juan Pino, Xian Li, Changhan Wang, Dmitriy Genzel et.al.	2107.05782v1	null
2022-03-21	Direct speech-to-speech translation with discrete units	Ann Lee, Peng-Jen Chen, Changhan Wang, Jiatao Gu, Sravya Popuri, Xutai Ma, Adam Polyak, Yossi Adi, Qing He, Yun Tang, Juan Pino, Wei-Ning Hsu et.al.	2107.05604v2	null
2021-07-07	Efficient Transformer for Direct Speech Translation	Belen Alastruey, Gerard I. Gállego, Marta R. Costa-jussà et.al.	2107.03069v1	null
2021-07-08	The NiuTrans End-to-End Speech Translation System for IWSLT 2021 Offline Task	Chen Xu, Xiaoqian Liu, Xiaowen Liu, Laohu Wang, Canan Huang, Tong Xiao, Jingbo Zhu et.al.	2107.02444v2	null
2021-07-06	ESPnet-ST IWSLT 2021 Offline Speech Translation System	Hirofumi Inaguma, Brian Yan, Siddharth Dalmia, Pengcheng Guo, Jiatong Shi, Kevin Duh, Shinji Watanabe et.al.	2107.00636v2	null
2021-07-09	The USTC-NELSLIP Systems for Simultaneous Speech Translation Task at IWSLT 2021	Dan Liu, Mengge Du, Xiaoxi Li, Yuchen Hu, Lirong Dai et.al.	2107.00279v2	null
2021-06-30	IMS' Systems for the IWSLT 2021 Low-Resource Speech Translation Task	Pavel Denisov, Manuel Mager, Ngoc Thang Vu et.al.	2106.16055v1	null
2021-06-17	Lost in Interpreting: Speech Translation from Source or Interpreter?	Dominik Macháček, Matúš Žilinec, Ondřej Bojar et.al.	2106.09343v1	null
2021-06-09	RealTranS: End-to-End Simultaneous Speech Translation with Convolutional Weighted-Shrinking Transformer	Xingshan Zeng, Liangyou Li, Qun Liu et.al.	2106.04833v1	null
2021-07-12	Lightweight Adapter Tuning for Multilingual Speech Translation	Hang Le, Juan Pino, Changhan Wang, Jiatao Gu, Didier Schwab, Laurent Besacier et.al.	2106.01463v2	link
2021-06-02	Cascade versus Direct Speech Translation: Do the Differences Still Make a Difference?	Luisa Bentivogli, Mauro Cettolo, Marco Gaido, Alina Karakanta, Alberto Martinelli, Matteo Negri, Marco Turchi et.al.	2106.01045v1	null
2021-06-22	Multilingual Speech Translation with Unified Transformer: Huawei Noah's Ark Lab at IWSLT 2021	Xingshan Zeng, Liangyou Li, Qun Liu et.al.	2106.00197v2	null
2021-05-28	How to Split: the Effect of Word Segmentation on Gender Bias in Speech Translation	Marco Gaido, Beatrice Savoldi, Luisa Bentivogli, Matteo Negri, Marco Turchi et.al.	2105.13782v1	link
2021-06-30	The Volctrans Neural Speech Translation System for IWSLT 2021	Chengqi Zhao, Zhicheng Liu, Jian Tong, Tao Wang, Mingxuan Wang, Rong Ye, Qianqian Dong, Jun Cao, Lei Li et.al.	2105.07319v2	link
2021-06-15	Stacked Acoustic-and-Textual Encoding: Integrating the Pre-trained Models into Speech Translation Encoders	Chen Xu, Bojie Hu, Yanyang Li, Yuhao Zhang, Shen Huang, Qi Ju, Tong Xiao, Jingbo Zhu et.al.	2105.05752v2	null
2021-05-11	Investigating the Reordering Capability in CTC-based Non-Autoregressive End-to-End Speech Translation	Shun-Po Chuang, Yung-Sung Chuang, Chih-Chiang Chang, Hung-yi Lee et.al.	2105.04840v1	link
2021-06-28	End-to-End Speech Translation with Pre-trained Models and Adapters: UPC at IWSLT 2021	Gerard I. Gállego, Ioannis Tsiamas, Carlos Escolano, José A. R. Fonollosa, Marta R. Costa-jussà et.al.	2105.04512v2	link
2021-07-02	AlloST: Low-resource Speech Translation without Source Transcription	Yao-Fei Cheng, Hung-Shin Lee, Hsin-Min Wang et.al.	2105.00171v3	link
2021-06-14	Impact of Encoding and Segmentation Strategies on End-to-End Simultaneous Speech Translation	Ha Nguyen, Yannick Estève, Laurent Besacier et.al.	2104.14470v2	null
2021-10-14	Beyond Voice Activity Detection: Hybrid Audio Segmentation for Direct Speech Translation	Marco Gaido, Matteo Negri, Mauro Cettolo, Marco Turchi et.al.	2104.11710v2	null
2021-06-18	End-to-end Speech Translation via Cross-modal Progressive Training	Rong Ye, Mingxuan Wang, Lei Li et.al.	2104.10380v2	link
2021-04-14	Large-Scale Self- and Semi-Supervised Learning for Speech Translation	Changhan Wang, Anne Wu, Juan Pino, Alexei Baevski, Michael Auli, Alexis Conneau et.al.	2104.06678v1	null
2021-04-13	Source and Target Bidirectional Knowledge Distillation for End-to-end Speech Translation	Hirofumi Inaguma, Tatsuya Kawahara, Shinji Watanabe et.al.	2104.06457v1	null
2021-04-27	BSTC: A Large-Scale Chinese-English Speech Translation Dataset	Ruiqing Zhang, Xiyang Wang, Chuanqiang Zhang, Zhongjun He, Hua Wu, Zhi Li, Haifeng Wang, Ying Chen, Qinfei Li et.al.	2104.03575v4	null
2021-06-30	Towards the evaluation of automatic simultaneous speech translation from a communicative perspective	Claudio Fantinuoli, Bianca Prandi et.al.	2103.08364v2	null
2021-03-04	An Empirical Study of End-to-end Simultaneous Speech Translation Decoding Strategies	Ha Nguyen, Yannick Estève, Laurent Besacier et.al.	2103.03233v1	null
2021-09-14	Fused Acoustic and Text Encoding for Multimodal Bilingual Pretraining and Speech Translation	Renjie Zheng, Junkun Chen, Mingbo Ma, Liang Huang et.al.	2102.05766v2	null
2021-02-02	CTC-based Compression for Direct Speech Translation	Marco Gaido, Mauro Cettolo, Matteo Negri, Marco Turchi et.al.	2102.01578v1	link
2021-06-15	NeurST: Neural Speech Translation Toolkit	Chengqi Zhao, Mingxuan Wang, Qianqian Dong, Rong Ye, Lei Li et.al.	2012.10018v3	link
2020-12-09	On Knowledge Distillation for Direct Speech Translation	Marco Gaido, Mattia A. Di Gangi, Matteo Negri, Marco Turchi et.al.	2012.04964v1	link
2020-12-09	Breeding Gender-aware Direct Speech Translation Systems	Marco Gaido, Beatrice Savoldi, Luisa Bentivogli, Matteo Negri, Marco Turchi et.al.	2012.04955v1	null
2020-11-24	Tight Integrated End-to-End Training for Cascaded Speech Translation	Parnia Bahar, Tobias Bieschke, Ralf Schlüter, Hermann Ney et.al.	2011.12167v1	null
2020-11-11	Simultaneous Speech-to-Speech Translation System with Neural Incremental ASR, MT, and TTS	Katsuhito Sudoh, Takatomo Kano, Sashi Novitasari, Tomoya Yanagita, Sakriani Sakti, Satoshi Nakamura et.al.	2011.04845v2	null
2020-11-03	SimulMT to SimulST: Adapting Simultaneous Text Translation to End-to-End Simultaneous Speech Translation	Xutai Ma, Juan Pino, Philipp Koehn et.al.	2011.02048v1	link
2020-11-02	Dual-decoder Transformer for Joint Automatic Speech Recognition and Multilingual Speech Translation	Hang Le, Juan Pino, Changhan Wang, Jiatao Gu, Didier Schwab, Laurent Besacier et.al.	2011.00747v1	link

Legal

Publish Date	Title	Authors	PDF	Code
2024-08-15	ArabLegalEval: A Multitask Benchmark for Assessing Arabic Legal Knowledge in Large Language Models	Faris Hijazi, Somayah AlHarbi, Abdulaziz AlHussein, Harethah Abu Shairah, Reem AlZahrani, Hebah AlShamlan, Omar Knio, George Turkiyyah et.al.	2408.07983v1	link
2024-08-13	ELLA: Empowering LLMs for Interpretable, Accurate and Informative Legal Advice	Yutong Hu, Kangcheng Luo, Yansong Feng et.al.	2408.07137v1	link
2024-08-08	Redefining Accountability: Navigating Legal Challenges of Participant Liability in Decentralized Autonomous Organizations	Aneta Napieralska, Przemysław Kępczyński et.al.	2408.04717v1	null
2024-08-05	A Multi-Source Heterogeneous Knowledge Injected Prompt Learning Method for Legal Charge Prediction	Jingyun Sun, Chi Wei, Yang Li et.al.	2408.02233v1	null
2024-08-01	DeliLaw: A Chinese Legal Counselling System Based on a Large Language Model	Nan Xie, Yuelin Bai, Hengyuan Gao, Feiteng Fang, Qixuan Zhao, Zhijian Li, Ziqiang Xue, Liang Zhu, Shiwen Ni, Min Yang et.al.	2408.00357v1	null
2024-07-27	LawLLM: Law Large Language Model for the US Legal System	Dong Shu, Haoran Zhao, Xukun Liu, David Demeter, Mengnan Du, Yongfeng Zhang et.al.	2407.21065v1	null
2024-07-30	A Three Steps Methodological Approach to Legal Governance Validation	Pompeu Casanovas, Mustafa Hashmi, Louis de Koker, Ho-Pun Lam et.al.	2407.20691v1	null
2024-07-30	The Future of International Data Transfers: Managing Legal Risk with a User-Held Data Model	Paulius Jurcys, Marcelo Corrales Compagnucci, Mark Fenwick et.al.	2407.20514v1	null
2024-07-29	Legal Aspects of Decentralized and Platform-Driven Economies	Marcelo Corrales Compagnucci, Toshiyuki Kono, Shinto Teramoto et.al.	2407.20301v1	null
2024-08-09	Legal Minds, Algorithmic Decisions: How LLMs Apply Constitutional Principles in Complex Scenarios	Camilla Bignotti, Carolina Camassa et.al.	2407.19760v2	null
2024-07-28	SaulLM-54B & SaulLM-141B: Scaling Up Domain Adaptation for the Legal Domain	Pierre Colombo, Telmo Pires, Malik Boudiaf, Rui Melo, Dominic Culver, Sofia Morgado, Etienne Malaboeuf, Gabriel Hautreux, Johanne Charpentier, Michael Desa et.al.	2407.19584v1	null
2024-07-26	Optimizing Numerical Estimation and Operational Efficiency in the Legal Domain through Large Language Models	Jia-Hong Huang, Chao-Chun Yang, Yixian Shen, Alessio M. Pacces, Evangelos Kanoulas et.al.	2407.19041v1	null
2024-07-05	Challenges and Considerations in Annotating Legal Data: A Comprehensive Overview	Harshil Darji, Jelena Mitrović, Michael Granitzer et.al.	2407.17503v1	null
2024-07-23	Lawma: The Power of Specialization for Legal Tasks	Ricardo Dominguez-Olmedo, Vedant Nanda, Rediet Abebe, Stefan Bechtold, Christoph Engel, Jens Frankenreiter, Krishna Gummadi, Moritz Hardt, Michael Livermore et.al.	2407.16615v1	null
2024-07-19	LeKUBE: A Legal Knowledge Update BEnchmark	Changyue Wang, Weihang Su, Hu Yiran, Qingyao Ai, Yueyue Wu, Cheng Luo, Yiqun Liu, Min Zhang, Shaoping Ma et.al.	2407.14192v1	null
2024-05-28	The Cost of Arbitrariness for Individuals: Examining the Legal and Technical Challenges of Model Multiplicity	Prakhar Ganesh, Ihsan Ibrahim Daldaban, Ignacio Cofone, Golnoosh Farnadi et.al.	2407.13070v1	null
2024-07-20	Applicability of Large Language Models and Generative Models for Legal Case Judgement Summarization	Aniket Deroy, Kripabandhu Ghosh, Saptarshi Ghosh et.al.	2407.12848v2	null
2024-08-12	Across Platforms and Languages: Dutch Influencers and Legal Disclosures on Instagram, YouTube and TikTok	Haoyang Gui, Thales Bertaglia, Catalina Goanta, Sybe de Vries, Gerasimos Spanakis et.al.	2407.12451v2	null
2024-07-07	Auditing of AI: Legal, Ethical and Technical Approaches	Jakob Mokander et.al.	2407.06235v1	null
2024-07-07	IL-TUR: Benchmark for Indian Legal Text Understanding and Reasoning	Abhinav Joshi, Shounak Paul, Akshat Sharma, Pawan Goyal, Saptarshi Ghosh, Ashutosh Modi et.al.	2407.05399v1	null
2024-08-06	Enabling Discriminative Reasoning in LLMs for Legal Judgment Prediction	Chenlong Deng, Kelong Mao, Yuyao Zhang, Zhicheng Dou et.al.	2407.01964v4	link
2024-06-28	Learning Interpretable Legal Case Retrieval via Knowledge-Guided Case Reformulation	Chenlong Deng, Kelong Mao, Zhicheng Dou et.al.	2406.19760v1	link
2024-06-27	CLERC: A Dataset for Legal Case Retrieval and Retrieval-Augmented Analysis Generation	Abe Bohan Hou, Orion Weller, Guanghui Qin, Eugene Yang, Dawn Lawrie, Nils Holzenberger, Andrew Blair-Stanek, Benjamin Van Durme et.al.	2406.17186v2	link
2024-06-24	eagerlearners at SemEval2024 Task 5: The Legal Argument Reasoning Task in Civil Procedure	Hoorieh Sabzevari, Mohammadmostafa Rostamkhani, Sauleh Eetemadi et.al.	2406.16490v1	link
2024-04-26	Examining the Legal Status of Digital Assets as Property: A Comparative Analysis of Jurisdictional Approaches	Luke Lee et.al.	2406.15391v1	null
2024-06-21	GiusBERTo: A Legal Language Model for Personal Data De-identification in Italian Court of Auditors Decisions	Giulio Salierno, Rosamaria Bertè, Luca Attias, Carla Morrone, Dario Pettazzoni, Daniela Battisti et.al.	2406.15032v1	null
2024-06-21	InternLM-Law: An Open Source Chinese Legal Large Language Model	Zhiwei Fei, Songyang Zhang, Xiaoyu Shen, Dawei Zhu, Xiao Wang, Maosong Cao, Fengzhe Zhou, Yining Li, Wenwei Zhang, Dahua Lin, Kai Chen, Jidong Ge et.al.	2406.14887v1	null
2024-06-17	Enhancing Criminal Case Matching through Diverse Legal Factors	Jie Zhao, Ziyu Guan, Wei Zhao, Yue Jiang et.al.	2406.11172v1	null
2024-06-16	Towards Supporting Legal Argumentation with NLP: Is More Data Really All You Need?	T. Y. S. S Santosh, Kevin D. Ashley, Katie Atkinson, Matthias Grabmair et.al.	2406.10974v1	null
2024-06-15	Applications of Generative AI in Healthcare: algorithmic, ethical, legal and societal considerations	Onyekachukwu R. Okonji, Kamol Yunusov, Bonnie Gordon et.al.	2406.10632v1	null
2024-06-10	The Legal Duty to Search for Less Discriminatory Algorithms	Emily Black, Logan Koepke, Pauline Kim, Solon Barocas, Mingwei Hsu et.al.	2406.06817v1	null
2024-06-10	AGB-DE: A Corpus for the Automated Legal Assessment of Clauses in German Consumer Contracts	Daniel Braun, Florian Matthes et.al.	2406.06809v1	link
2024-06-07	On Ambiguity and the Expressive Function of Law: The Role of Pragmatics in Smart Legal Ecosystems	Pompeu Casanovas et.al.	2406.05084v1	null
2024-06-07	LawGPT: A Chinese Legal Knowledge-Enhanced Large Language Model	Zhi Zhou, Jiang-Xin Shi, Peng-Xiao Song, Xiao-Wen Yang, Yi-Xuan Jin, Lan-Zhe Guo, Yu-Feng Li et.al.	2406.04614v1	link
2024-06-06	Legal Documents Drafting with Fine-Tuned Pre-Trained Large Language Model	Chun-Hsien Lin, Pu-Jen Cheng et.al.	2406.04202v1	link
2024-06-06	Legal Judgment Reimagined: PredEx and the Rise of Intelligent AI Interpretation in Indian Courts	Shubham Kumar Nigam, Anurag Sharma, Danush Khanna, Noel Shallum, Kripabandhu Ghosh, Arnab Bhattacharya et.al.	2406.04136v1	link
2024-06-05	Knowledge-Infused Legal Wisdom: Navigating LLM Consultation through the Lens of Diagnostics and Positive-Unlabeled Reinforcement Learning	Yang Wu, Chenghao Wang, Ece Gumusel, Xiaozhong Liu et.al.	2406.03600v1	null
2024-06-30	Unveiling Themes in Judicial Proceedings: A Cross-Country Study Using Topic Modeling on Legal Documents from India and the UK	Krish Didwania, Dr. Durga Toshniwal, Amit Agarwal et.al.	2406.00040v2	null
2024-05-30	Hallucination-Free? Assessing the Reliability of Leading AI Legal Research Tools	Varun Magesh, Faiz Surani, Matthew Dahl, Mirac Suzgun, Christopher D. Manning, Daniel E. Ho et.al.	2405.20362v1	null
2024-05-27	Explainable machine learning multi-label classification of Spanish legal judgements	Francisco de Arriba-Pérez, Silvia García-Méndez, Francisco J. González-Castaño, Jaime González-González et.al.	2405.17610v1	null
2024-05-23	Artificial Intelligence (AI) in Legal Data Mining	Aniket Deroy, Naksatra Kumar Bailung, Kripabandhu Ghosh, Saptarshi Ghosh, Abhijnan Chakraborty et.al.	2405.14707v1	null
2024-05-23	ChronosLex: Time-aware Incremental Training for Temporal Generalization of Legal Classification Tasks	T. Y. S. S Santosh, Tuan-Quang Vuong, Matthias Grabmair et.al.	2405.14211v1	null
2024-05-20	CaseGNN++: Graph Contrastive Learning for Legal Case Retrieval with Graph Augmentation	Yanran Tang, Ruihong Qiu, Yilun Liu, Xue Li, Zi Huang et.al.	2405.11791v1	link
2024-05-17	Empowering Prior to Court Legal Analysis: A Transparent and Accessible Dataset for Defensive Statement Classification and Interpretation	Yannis Spyridis, Jean-Paul, Haneen Deeb, Vasileios Argyriou et.al.	2405.10702v1	null
2024-05-16	Co-Matching: Towards Human-Machine Collaborative Legal Case Matching	Chen Huang, Xinwei Yang, Yang Deng, Wenqiang Lei, JianCheng Lv, Tat-Seng Chua et.al.	2405.10248v1	null
2024-05-09	Letter to the Editor: What are the legal and ethical considerations of submitting radiology reports to ChatGPT?	Siddharth Agarwal, David Wood, Robin Carpenter, Yiran Wei, Marc Modat, Thomas C Booth et.al.	2405.05647v1	null
2024-05-01	A Legal Framework for Natural Language Processing Model Training in Portugal	Rúben Almeida, Evelin Amorim et.al.	2405.00536v1	null
2024-05-02	Towards A Structured Overview of Use Cases for Natural Language Processing in the Legal Domain: A German Perspective	Juraj Vladika, Stephen Meisenbacher, Martina Preis, Alexandra Klymenko, Florian Matthes et.al.	2404.18759v2	null
2024-04-26	Enhancing Legal Compliance and Regulation Analysis with Large Language Models	Shabnam Hassani et.al.	2404.17522v1	null
2024-04-25	Legal Aspects for Software Developers Interested in Generative AI Applications	Steffen Herbold, Brian Valerius, Anamaria Mojica-Hanke, Isabella Lex, Joel Mittel et.al.	2404.16630v1	null
2024-04-22	Rethinking Legal Compliance Automation: Opportunities with Large Language Models	Shabnam Hassani, Mehrdad Sabetzadeh, Daniel Amyot, Jain Liao et.al.	2404.14356v1	null
2024-04-16	BayesJudge: Bayesian Kernel Language Modelling with Confidence Uncertainty in Legal Judgment Prediction	Ubaid Azam, Imran Razzak, Shelly Vishwakarma, Hakim Hacid, Dell Zhang, Shoaib Jameel et.al.	2404.10481v1	null
2024-04-15	LegalPro-BERT: Classification of Legal Provisions by fine-tuning BERT Large Language Model	Amit Tewari et.al.	2404.10097v1	null
2024-04-15	Debunking Robot Rights Metaphysically, Ethically, and Legally	Abeba Birhane, Jelle van Dijk, Frank Pasquale et.al.	2404.10072v1	null
2024-06-27	Software Engineering Methods For AI-Driven Deductive Legal Reasoning	Rohan Padhye et.al.	2404.09868v2	null
2024-05-23	A Legal Risk Taxonomy for Generative Artificial Intelligence	David Atkinson, Jacob Morrison et.al.	2404.09479v3	null
2024-04-08	Text clustering applied to data augmentation in legal contexts	Lucas José Gonçalves Freitas, Thaís Rodrigues, Guilherme Rodrigues, Pamella Edokawa, Ariane Farias et.al.	2404.08683v1	null
2024-04-10	Leveraging open-source models for legal language modeling and analysis: a case study on the Indian constitution	Vikhyath Gupta, Srinivasa Rao P et.al.	2404.06751v1	null
2024-04-08	Privacy and Security of Women's Reproductive Health Apps in a Changing Legal Landscape	Shalini Saini, Nitesh Saxena et.al.	2404.05876v1	null
2024-04-04	CBR-RAG: Case-Based Reasoning for Retrieval Augmented Generation in LLMs for Legal Question Answering	Nirmalie Wiratunga, Ramitha Abeyratne, Lasal Jayawardena, Kyle Martin, Stewart Massie, Ikechukwu Nkisi-Orji, Ruvan Weerasinghe, Anne Liret, Bruno Fleisch et.al.	2404.04302v1	link
2024-04-04	NLP at UC Santa Cruz at SemEval-2024 Task 5: Legal Answer Validation using Few-Shot Multi-Choice QA	Anish Pahilajani, Samyak Rajesh Jain, Devasha Trivedi et.al.	2404.03150v1	link
2024-05-03	Automated Transparency: A Legal and Empirical Analysis of the Digital Services Act Transparency Database	Rishabh Kaushal, Jacob van de Kerkhof, Catalina Goanta, Gerasimos Spanakis, Adriana Iamnitchi et.al.	2404.02894v2	null
2024-04-02	FLawN-T5: An Empirical Examination of Effective Instruction-Tuning Data Mixtures for Legal Reasoning	Joel Niklaus, Lucia Zheng, Arya D. McCarthy, Christopher Hahn, Brian M. Rosen, Peter Henderson, Daniel E. Ho, Garrett Honke, Percy Liang, Christopher Manning et.al.	2404.02127v1	link
2024-03-31	Mind Your Neighbours: Leveraging Analogous Instances for Rhetorical Role Labeling for Legal Documents	T. Y. S. S Santosh, Hassan Sarwat, Ahmed Abdou, Matthias Grabmair et.al.	2404.01344v1	null
2024-04-01	Exploring the Nexus of Large Language Models and Legal Systems: A Short Survey	Weicong Qin, Zhongxiang Sun et.al.	2404.00990v1	null
2024-04-01	Towards an In-Depth Comprehension of Case Relevance for Better Legal Retrieval	Haitao Li, You Chen, Zhekai Ge, Qingyao Ai, Yiqun Liu, Quan Zhou, Shuai Huo et.al.	2404.00947v1	null
2024-03-31	Query-driven Relevant Paragraph Extraction from Legal Judgments	T. Y. S. S Santosh, Elvin Quero Hernandez, Matthias Grabmair et.al.	2404.00595v1	null
2024-03-31	LexAbSumm: Aspect-based Summarization of Legal Decisions	T. Y. S. S Santosh, Mahmoud Aly, Matthias Grabmair et.al.	2404.00594v1	null
2024-03-30	Automatic explanation of the classification of Spanish legal judgments in jurisdiction-dependent law categories with tree estimators	Jaime González-González, Francisco de Arriba-Pérez, Silvia García-Méndez, Andrea Busto-Castiñeira, Francisco J. González-Castaño et.al.	2404.00437v1	null
2024-03-28	Beyond Borders: Investigating Cross-Jurisdiction Transfer in Legal Case Summarization	T. Y. S. S Santosh, Vatsal Venkatkrishna, Saptarshi Ghosh, Matthias Grabmair et.al.	2403.19317v1	null
2024-03-27	High Recall, Small Data: The Challenges of Within-System Evaluation in a Live Legal Search System	Gineke Wiggers, Suzan Verberne, Arjen de Vries, Roel van der Burg et.al.	2403.18962v1	null
2024-03-27	A Path Towards Legal Autonomy: An interoperable and explainable approach to extracting, transforming, loading and computing legal information using large language models, expert systems and Bayesian networks	Axel Constant, Hannes Westermann, Bryan Wilson, Alex Kiefer, Ines Hipolito, Sylvain Pronovost, Steven Swanson, Mahault Albarracin, Maxwell J. D. Ramstead et.al.	2403.18537v1	null
2024-03-27	DELTA: Pre-train a Discriminative Encoder for Legal Case Retrieval via Structural Word Alignment	Haitao Li, Qingyao Ai, Xinyan Han, Jia Chen, Qian Dong, Yiqun Liu, Chong Chen, Qi Tian et.al.	2403.18435v1	null
2024-03-27	Leveraging Large Language Models for Relevance Judgments in Legal Case Retrieval	Shengjie Ma, Chong Chen, Qi Chu, Jiaxin Mao et.al.	2403.18405v1	null
2024-03-26	Juru: Legal Brazilian Large Language Model from Reputable Sources	Roseval Malaquias Junior, Ramon Pires, Roseli Romero, Rodrigo Nogueira et.al.	2403.18140v1	null
2024-03-26	GPTs and Language Barrier: A Cross-Lingual Legal QA Examination	Ha-Thanh Nguyen, Hiroaki Yamada, Ken Satoh et.al.	2403.18098v1	null
2024-03-26	Enhancing Legal Document Retrieval: A Multi-Phase Approach with Large Language Models	Hai-Long Nguyen, Duc-Minh Nguyen, Tan-Minh Nguyen, Ha-Thanh Nguyen, Thi-Hai-Yen Vuong, Ken Satoh et.al.	2403.18093v1	null
2024-06-12	CaseLink: Inductive Graph Learning for Legal Case Retrieval	Yanran Tang, Ruihong Qiu, Hongzhi Yin, Xue Li, Zi Huang et.al.	2403.17780v3	link
2024-04-16	Towards Explainability in Legal Outcome Prediction Models	Josef Valvoda, Ryan Cotterell et.al.	2403.16852v2	link
2024-03-22	"The Law Doesn't Work Like a Computer": Exploring Software Licensing Issues Faced by Legal Practitioners	Nathan Wintersgill, Trevor Stalnaker, Laura A. Heymann, Oscar Chaparro, Denys Poshyvanyk et.al.	2403.14927v1	link
2024-03-20	PARAMANU-AYN: An Efficient Novel Generative and Instruction-tuned Language Model for Indian Legal Case Documents	Mitodru Niyogi, Arnab Bhattacharya et.al.	2403.13681v1	null
2024-03-20	Improving Legal Case Retrieval with Brain Signals	Ruizhe Zhang, Qingyao Ai, Ziyi Ye, Yueyue Wu, Xiaohui Xie, Yiqun Liu et.al.	2403.13242v1	null
2024-07-02	Towards Unsupervised Question Answering System with Multi-level Summarization for Legal Text	M Manvith Prabhu, Haricharana Srinivasa, Anand Kumar M et.al.	2403.13107v2	null
2024-03-17	Evaluation Ethics of LLMs in Legal Domain	Ruizhe Zhang, Haitao Li, Yueyue Wu, Qingyao Ai, Yiqun Liu, Min Zhang, Shaoping Ma et.al.	2403.11152v1	null
2024-03-16	Human Centered AI for Indian Legal Text Analytics	Sudipto Ghosh, Devanshu Verma, Balaji Ganesan, Purnima Bindal, Vikas Kumar, Vasudha Bhatnagar et.al.	2403.10944v1	null
2024-03-14	Caveat Lector: Large Language Models in Legal Practice	Eliza Mik et.al.	2403.09163v1	null
2024-05-08	Legally Binding but Unfair? Towards Assessing Fairness of Privacy Policies	Vincent Freiberger, Erik Buchmann et.al.	2403.08115v2	null
2024-03-11	Exploring Large Language Models and Hierarchical Frameworks for Classification of Large Unstructured Legal Documents	Nishchal Prasad, Mohand Boughanem, Taoufiq Dkaki et.al.	2403.06872v1	link
2024-03-06	VLSP 2023 -- LTER: A Summary of the Challenge on Legal Textual Entailment Recognition	Vu Tran, Ha-Thanh Nguyen, Trung Vo, Son T. Luu, Hoang-Anh Dang, Ngoc-Cam Le, Thi-Thuy Le, Minh-Tien Nguyen, Truong-Son Nguyen, Le-Minh Nguyen et.al.	2403.03435v1	null
2024-03-03	Logic Rules as Explanations for Legal Case Retrieval	Zhongxiang Sun, Kepu Zhang, Weijie Yu, Haoyu Wang, Jun Xu et.al.	2403.01457v1	link
2024-03-08	Evault for legal records	Jeba N, Anas S, Anuragav S, Abhishek R, Sachin K et.al.	2403.01186v2	null
2024-02-25	Gender Biased Legal Case Retrieval System on Users' Decision Process	Ruizhe Zhang, Qingyao Ai, Yiqun Liu, Yueyue Wu, Beining Wang et.al.	2403.00814v1	null
2024-06-14	EUROPA: A Legal Multilingual Keyphrase Generation Dataset	Olivier Salaün, Frédéric Piedboeuf, Guillaume Le Berre, David Alfonso Hermelo, Philippe Langlais et.al.	2403.00252v2	link
2024-03-04	Improving Legal Judgement Prediction in Romanian with Long Text Encoders	Mihai Masala, Traian Rebedea, Horia Velicu et.al.	2402.19170v2	null
2024-07-02	Leveraging Large Language Models for Learning Complex Legal Concepts through Storytelling	Hang Jiang, Xiajie Zhang, Robert Mahari, Daniel Kessler, Eric Ma, Tal August, Irene Li, Alex 'Sandy' Pentland, Yoon Kim, Deb Roy, Jad Kabbara et.al.	2402.17019v4	link
2024-06-17	InSaAF: Incorporating Safety through Accuracy and Fairness | Are LLMs ready for the Indian Legal Domain?	Yogesh Tripathi, Raghav Donakanti, Sahil Girhepuje, Ishan Kavathekar, Bhaskara Hanuma Vedula, Gokul S Krishnan, Shreya Goyal, Anmol Goel, Balaraman Ravindran, Ponnurangam Kumaraguru et.al.	2402.10567v4
2024-02-12	Large Language Models "Ad Referendum": How Good Are They at Machine Translation in the Legal Domain?	Vicent Briva-Iglesias, Joao Lucas Cavalheiro Camargo, Gokhan Dogru et.al.	2402.07681v1	null
2024-06-06	Through the Lens of Split Vote: Exploring Disagreement, Difficulty and Calibration in Legal Case Outcome Classification	Shanshan Xu, T. Y. S. S Santosh, Oana Ichim, Barbara Plank, Matthias Grabmair et.al.	2402.07214v3	null
2024-02-06	LegalLens: Leveraging LLMs for Legal Violation Identification in Unstructured Text	Dor Bernsohn, Gil Semo, Yaron Vazana, Gila Hayat, Ben Hagag, Joel Niklaus, Rohit Saha, Kyryl Truskovskyi et.al.	2402.04335v1	link
2024-02-29	Advancing Legal Reasoning: The Integration of AI to Navigate Complexities and Biases in Global Jurisprudence with Semi-Automated Arbitration Processes (SAAPs)	Michael De'Shazer et.al.	2402.04140v3	null
2024-05-03	(A)I Am Not a Lawyer, But...: Engaging Legal Experts towards Responsible LLM Policies for Legal Advice	Inyoung Cheong, King Xia, K. J. Kevin Feng, Quan Ze Chen, Amy X. Zhang et.al.	2402.01864v2	null
2024-01-30	Aalap: AI Assistant for Legal & Paralegal Functions in India	Aman Tiwari, Prathamesh Kalamkar, Atreyo Banerjee, Saurabh Karn, Varun Hemachandran, Smita Gupta et.al.	2402.01758v1	null
2024-01-18	Legal and ethical implications of applications based on agreement technologies: the case of auction-based road intersections	José-Antonio Santos, Alberto Fernández, Mar Moreno-Rebato, Holger Billhardt, José-A. Rodríguez-García, Sascha Ossowski et.al.	2402.01673v1	null
2024-01-10	Promises and pitfalls of artificial intelligence for legal applications	Sayash Kapoor, Peter Henderson, Arvind Narayanan et.al.	2402.01656v1	null
2024-01-31	Employing Label Models on ChatGPT Answers Improves Legal Text Entailment Performance	Chau Nguyen, Le-Minh Nguyen et.al.	2401.17897v1	null
2024-04-13	PILOT: Legal Case Outcome Prediction with Case Law	Lang Cao, Zifeng Wang, Cao Xiao, Jimeng Sun et.al.	2401.15770v3	null
2024-02-28	LegalDuet: Learning Effective Representations for Legal Judgment Prediction through a Dual-View Legal Clue Reasoning	Pengjie Liu, Zhenghao Liu, Xiaoyuan Yi, Liner Yang, Shuo Wang, Yu Gu, Ge Yu, Xing Xie, Shuang-hua Yang et.al.	2401.15371v2	null
2024-01-26	A Korean Legal Judgment Prediction Dataset for Insurance Disputes	Alice Saebom Kwak, Cheonkam Jeong, Ji Weon Lim, Byeongcheol Min et.al.	2401.14654v1	null
2024-01-25	Automated legal reasoning with discretion to act using s(LAW)	Joaquín Arias, Mar Moreno-Rebato, José A. Rodríguez-García, Sascha Ossowski et.al.	2401.14511v1	null
2024-01-22	Streamlining Advanced Taxi Assignment Strategies based on Legal Analysis	Holger Billhardt, José-Antonio Santos, Alberto Fernández, Mar Moreno, Sascha Ossowski, José A. Rodríguez et.al.	2401.12324v1	null
2024-01-22	The Right Model for the Job: An Evaluation of Legal Multi-Label Classification Baselines	Martina Forster, Claudia Schulz, Prudhvi Nokku, Melicaalsadat Mirsafian, Jaykumar Kasundra, Stavroula Skylaki et.al.	2401.11852v1	null
2024-01-09	Answer Retrieval in Legal Community Question Answering	Arian Askari, Zihui Yang, Zhaochun Ren, Suzan Verberne et.al.	2401.04852v1	link
2024-01-07	CAPTAIN at COLIEE 2023: Efficient Methods for Legal Information Retrieval and Entailment Tasks	Chau Nguyen, Phuong Nguyen, Thanh Tran, Dat Nguyen, An Trieu, Tin Pham, Anh Dang, Le-Minh Nguyen et.al.	2401.03551v1	link
2024-06-21	Large Legal Fictions: Profiling Legal Hallucinations in Large Language Models	Matthew Dahl, Varun Magesh, Mirac Suzgun, Daniel E. Ho et.al.	2401.01301v2	link
2024-01-02	Discovering Significant Topics from Legal Decisions with Selective Inference	Jerrold Soh et.al.	2401.01068v1	null
2023-12-31	Viz: A QLoRA-based Copyright Marketplace for Legally Compliant Generative AI	Dipankar Sarkar et.al.	2401.00503v1	null
2023-12-19	CaseGNN: Graph Neural Networks for Legal Case Retrieval with Text-Attributed Graphs	Yanran Tang, Ruihong Qiu, Yilun Liu, Xue Li, Zi Huang et.al.	2312.11229v2	link
2024-04-02	Social, Legal, Ethical, Empathetic, and Cultural Rules: Compilation and Reasoning (Extended Version)	Nicolas Troquard, Martina De Sanctis, Paola Inverardi, Patrizio Pelliccione, Gian Luca Scoccia et.al.	2312.09699v2	null
2024-04-15	Explicitly Integrating Judgment Prediction with Legal Document Retrieval: A Law-Guided Generative Approach	Weicong Qin, Zelin Cao, Weijie Yu, Zihua Si, Sirui Chen, Jun Xu et.al.	2312.09591v2	link
2023-12-14	Weaving Pathways for Justice with GPT: LLM-driven automated drafting of interactive legal applications	Quinten Steenhuis, David Colarusso, Bryce Willey et.al.	2312.09198v1	link
2023-12-13	SLJP: Semantic Extraction based Legal Judgment Prediction	Prameela Madambakam, Shathanaa Rajmohan, Himangshu Sharma, Tummepalli Anka Chandrahas Purushotham Gupta et.al.	2312.07979v1	null
2023-12-10	Multi-Defendant Legal Judgment Prediction via Hierarchical Reasoning	Yougang Lyu, Jitai Hao, Zihan Wang, Kai Zhao, Shen Gao, Pengjie Ren, Zhumin Chen, Fang Wang, Zhaochun Ren et.al.	2312.05762v1	link
2023-12-06	Boosting legal case retrieval by query content selection with large language models	Youchao Zhou, Heyan Huang, Zhijing Wu et.al.	2312.03494v1	link
2023-12-03	Towards Mitigating Perceived Unfairness in Contracts from a Non-Legal Stakeholder's Perspective	Anmol Singhal, Preethu Rose Anish, Shirish Karande, Smita Ghaisas et.al.	2312.01398v1	null
2023-12-01	The Ethics of Automating Legal Actors	Josef Valvoda, Alec Thompson, Ryan Cotterell, Simone Teufel et.al.	2312.00584v1	null
2023-12-01	Questioning Biases in Case Judgment Summaries: Legal Datasets or Large Language Models?	Aniket Deroy, Subhankar Maity et.al.	2312.00554v1	null
2024-06-13	Japanese Tort-case Dataset for Rationale-supported Legal Judgment Prediction	Hiroaki Yamada, Takenobu Tokunaga, Ryutaro Ohara, Akira Tokutsu, Keisuke Takeshita, Mihoko Sumida et.al.	2312.00480v2	null
2023-11-27	Justifiable Artificial Intelligence: Engineering Large Language Models for Legal Applications	Sabine Wehnert et.al.	2311.15716v1	null
2024-02-17	Legal Requirements Analysis	Sallam Abualhaija, Marcello Ceci, Lionel Briand et.al.	2311.13871v3	null
2023-11-22	Intention and Context Elicitation with Large Language Models in the Legal Aid Intake Process	Nick Goodson, Rongfei Lu et.al.	2311.13281v1	null
2023-11-22	Enhancing Logical Reasoning in Large Language Models to Facilitate Legal Applications	Ha-Thanh Nguyen, Wachara Fungwacharakorn, Ken Satoh et.al.	2311.13095v1	null
2023-11-21	Development of a Legal Document AI-Chatbot	Pranav Nataraj Devaraj, Rakesh Teja P V, Aaryav Gangrade, Manoj Kumar R et.al.	2311.12719v1	null
2023-11-20	Multi-Task Faces (MTF) Data Set: A Legally and Ethically Compliant Collection of Face Images for Various Classification Tasks	Rami Haffar, David Sánchez, Josep Domingo-Ferrer et.al.	2311.11882v1	link
2023-10-19	Proceedings of the 3rd International Workshop on Mining and Learning in the Legal Domain (MLLD-23)	Masoud Makrehchi, Dell Zhang, Alina Petrova, John Armour et.al.	2311.10733v1	null
2024-02-28	BLT: Can Large Language Models Handle Basic Legal Text?	Andrew Blair-Stanek, Nils Holzenberger, Benjamin Van Durme et.al.	2311.09693v2	link
2023-11-15	Explainable Text Classification Techniques in Legal Document Review: Locating Rationales without Using Human Annotated Training Text Snippets	Christian Mahoney, Peter Gronvall, Nathaniel Huber-Fliflet, Jianping Zhang et.al.	2311.09133v1	null
2023-11-15	Large Language Models are legal but they are not: Making the case for a powerful LegalLLM	Thanmay Jayakumar, Fauzan Farooqui, Luqman Farooqui et.al.	2311.08890v1	null
2023-11-14	Exploring Semi-supervised Hierarchical Stacked Encoder for Legal Judgement Prediction	Nishchal Prasad, Mohand Boughanem, Taoufiq Dkaki et.al.	2311.08103v1	link
2024-03-02	Translating Legalese: Enhancing Public Understanding of Court Opinions with Legal Summarizers	Elliott Ash, Aniket Kesari, Suresh Naidu, Lena Song, Dominik Stammbach et.al.	2311.06534v2	null
2023-11-10	Citation Recommendation on Scholarly Legal Articles	Doğukan Arslan, Saadet Sena Erdoğan, Gülşen Eryiğit et.al.	2311.05902v1	link
2023-11-09	Legal-HNet: Mixing Legal Long-Context Tokens with Hartley Transform	Daniele Giofré, Sneha Ghantasala et.al.	2311.05089v1	null
2023-11-01	From Text to Structure: Using Large Language Models to Support the Development of Legal Expert Systems	Samyar Janatian, Hannes Westermann, Jinzhe Tan, Jaromir Savelka, Karim Benyekhlef et.al.	2311.04911v1	link
2024-02-05	An energy-based comparative analysis of common approaches to text classification in the Legal domain	Sinan Gultekin, Achille Globo, Andrea Zugarini, Marco Ernandes, Leonardo Rigutini et.al.	2311.01256v2	null
2024-01-02	Caseformer: Pre-training for Legal Case Retrieval Based on Inter-Case Distinctions	Weihang Su, Qingyao Ai, Yueyue Wu, Yixiao Ma, Haitao Li, Yiqun Liu, Zhijing Wu, Min Zhang et.al.	2311.00333v2	link
2023-10-28	Using Large Language Models to Support Thematic Analysis in Empirical Legal Studies	Jakub Drápal, Hannes Westermann, Jaromir Savelka et.al.	2310.18729v1	null
2023-10-28	MILDSum: A Novel Benchmark Dataset for Multilingual Summarization of Indian Legal Case Judgments	Debtanu Datta, Shubham Soni, Rajdeep Mukherjee, Saptarshi Ghosh et.al.	2310.18600v1	link
2023-10-27	Modeling Legal Reasoning: LM Annotation at the Edge of Human Agreement	Rosamond Thalken, Edward H. Stiglitz, David Mimno, Matthew Wilkens et.al.	2310.18440v1	link
2023-10-26	LeCaRDv2: A Large-Scale Chinese Legal Case Retrieval Dataset	Haitao Li, Yunqiu Shao, Yueyue Wu, Qingyao Ai, Yixiao Ma, Yiqun Liu et.al.	2310.17609v1	null
2023-10-26	Harnessing GPT-3.5-turbo for Rhetorical Role Prediction in Legal Cases	Anas Belfathi, Nicolas Hernandez, Laura Monceaux et.al.	2310.17413v1	null
2023-10-25	Human-centred explanation of rule-based decision-making systems in the legal domain	Suzan Zuurmond, AnneMarie Borg, Matthijs van Kempen, Remi Wieten et.al.	2310.16704v1	null
2023-10-24	DALE: Generative Data Augmentation for Low-Resource Legal NLP	Sreyan Ghosh, Chandra Kiran Evuru, Sonal Kumar, S Ramaneswaran, S Sakshi, Utkarsh Tyagi, Dinesh Manocha et.al.	2310.15799v1	link
2023-10-24	Navigating ICT In-House Procurement in Finland: Evaluating Legal Frameworks and Practical Challenges	Reetta Ghezzi, Minnamaria Korhonen, Hannu Vilpponen, Tommi Mikkonen et.al.	2310.15643v1	null
2023-11-03	Can ChatGPT Perform Reasoning Using the IRAC Method in Analyzing Legal Scenarios Like a Lawyer?	Xiaoxi Kang, Lizhen Qu, Lay-Ki Soon, Adnan Trakic, Terry Yue Zhuo, Patrick Charles Emerton, Genevieve Grant et.al.	2310.14880v2	link
2023-10-19	Do Language Models Learn about Legal Entity Types during Pretraining?	Claire Barale, Michael Rovatsos, Nehal Bhuta et.al.	2310.13092v1	link
2023-10-19	Exploring Graph Neural Networks for Indian Legal Judgment Prediction	Mann Khatri, Mirza Yusuf, Yaman Kumar, Rajiv Ratn Shah, Ponnurangam Kumaraguru et.al.	2310.12800v1	null
2023-10-19	Transformer-based Entity Legal Form Classification	Alexander Arimond, Mauro Molteni, Dominik Jany, Zornitsa Manolova, Damian Borth, Andreas G. F. Hoepner et.al.	2310.12766v1	link
2023-10-18	Automated Attribute Extraction from Legal Proceedings	Subinay Adhikary, Sagnik Das, Sagnik Saha, Procheta Sen, Dwaipayan Roy, Kripabandhu Ghosh et.al.	2310.12131v1	null
2023-10-18	A Comprehensive Evaluation of Large Language Models on Legal Judgment Prediction	Ruihao Shui, Yixin Cao, Xiang Wang, Tat-Seng Chua et.al.	2310.11761v1	link
2023-10-17	Nonet at SemEval-2023 Task 6: Methodologies for Legal Evaluation	Shubham Kumar Nigam, Aniket Deroy, Noel Shallum, Ayush Kumar Mishra, Anup Roy, Shubham Kumar Mishra, Arnab Bhattacharya, Saptarshi Ghosh, Kripabandhu Ghosh et.al.	2310.11049v1	link
2023-10-25	Legal NLP Meets MiCAR: Advancing the Analysis of Crypto White Papers	Carolina Camassa et.al.	2310.10333v3	null
2023-10-16	Prediction of Arabic Legal Rulings using Large Language Models	Adel Ammar, Anis Koubaa, Bilel Benjdira, Omar Najar, Serry Sibaee et.al.	2310.10260v1	null
2023-10-15	Improving Access to Justice for the Indian Population: A Benchmark for Evaluating Translation of Legal Text to Indian Languages	Sayan Mahapatra, Debtanu Datta, Shubham Soni, Adrijit Goswami, Saptarshi Ghosh et.al.	2310.09765v1	null
2023-10-13	Precedent-Enhanced Legal Judgment Prediction with LLM and Domain-Model Collaboration	Yiquan Wu, Siying Zhou, Yifei Liu, Weiming Lu, Xiaozhong Liu, Yating Zhang, Changlong Sun, Fei Wu, Kun Kuang et.al.	2310.09241v1	null
2023-10-11	Empirical Analysis of the Impact of Legal Tender Digital Currency on Monetary Policy -Based on China's Data	Ruimin Song, TIntian Zhao, Chunhui Zhou et.al.	2310.07326v1	null
2023-10-12	Automated Argument Generation from Legal Facts	Oscar Tuvey, Procheta Sen et.al.	2310.05680v3	null
2024-02-18	LAiW: A Chinese Legal Large Language Models Benchmark	Yongfu Dai, Duanyu Feng, Jimin Huang, Haochen Jia, Qianqian Xie, Yifang Zhang, Weiguang Han, Wei Tian, Hao Wang et.al.	2310.05620v2	link
2023-10-08	Enhancing Pre-Trained Language Models with Sentence Position Embeddings for Rhetorical Roles Recognition in Legal Opinions	Anas Belfathi, Nicolas Hernandez, Laura Monceaux et.al.	2310.05276v1	null
2023-10-07	Investigating the Influence of Legal Case Retrieval Systems on Users' Decision Process	Beining Wang, Ruizhe Zhang, Yueyue Wu, Qingyao Ai, Min Zhang, Yiqun Liu et.al.	2310.04735v1	null
2023-10-06	Marketing to Children Through Online Targeted Advertising: Targeting Mechanisms and Legal Aspects	Tinhinane Medjkoune, Oana Goga, Juliette Senechal et.al.	2310.04104v1	null
2023-10-10	LEEC: A Legal Element Extraction Dataset with an Extensive Domain-Specific Label System	Xue Zongyue, Liu Huanghai, Hu Yiran, Kong Kangle, Wang Chenlu, Liu Yun, Shen Weixing et.al.	2310.01271v2	link
2023-10-02	Comparative Analysis of Technical and Legal Frameworks of Various National Digial Identity Solutions	Montassar Naghmouchi, Maryline Laurent, Claire Levallois-Barth, Nesrine Kaaniche et.al.	2310.01006v1	null
2023-09-29	STRONG -- Structure Controllable Legal Opinion Summary Generation	Yang Zhong, Diane Litman et.al.	2309.17280v1	link
2023-09-29	Interpretable Long-Form Legal Question Answering with Retrieval-Augmented Large Language Models	Antoine Louis, Gijs van Dijck, Gerasimos Spanakis et.al.	2309.17050v1	link
2023-09-28	LawBench: Benchmarking Legal Knowledge of Large Language Models	Zhiwei Fei, Xiaoyu Shen, Dawei Zhu, Fengzhe Zhou, Zhuo Han, Songyang Zhang, Kai Chen, Zongwen Shen, Jidong Ge et.al.	2309.16289v1	link
2023-12-18	Question-Answering Approach to Evaluating Legal Summaries	Huihui Xu, Kevin Ashley et.al.	2309.15016v2	link
2023-10-16	Legal Question-Answering in the Indian Context: Efficacy, Challenges, and Potential of Modern AI Models	Shubham Kumar Nigam, Shubham Kumar Mishra, Ayush Kumar Mishra, Noel Shallum, Arnab Bhattacharya et.al.	2309.14735v2	null
2024-01-01	The Cambridge Law Corpus: A Dataset for Legal AI Research	Andreas Östling, Holli Sargeant, Huiyuan Xie, Ludwig Bull, Alexander Terenin, Leif Jonsson, Måns Magnusson, Felix Steffek et.al.	2309.12269v4	null
2023-10-13	Legitimate Interest is the New Consent -- Large-Scale Measurement and Legal Compliance of IAB Europe TCF Paywalls	Victor Morel, Cristiana Santos, Viktor Fredholm, Adam Thunberg et.al.	2309.11625v3	null
2023-09-23	DISC-LawLLM: Fine-tuning Large Language Models for Intelligent Legal Services	Shengbin Yue, Wei Chen, Siyuan Wang, Bingxuan Li, Chenchen Shen, Shujun Liu, Yuxuan Zhou, Yao Xiao, Song Yun, Xuanjing Huang, Zhongyu Wei et.al.	2309.11325v2	link
2023-09-25	A Hierarchical Neural Framework for Classification and its Explanation in Large Unstructured Legal Documents	Nishchal Prasad, Mohand Boughanem, Taoufik Dkaki et.al.	2309.10563v2	null
2023-09-16	NOWJ1@ALQAC 2023: Enhancing Legal Task Performance with Classic Statistical Models and Pre-trained Language Models	Tan-Minh Nguyen, Xuan-Hoa Nguyen, Ngoc-Duy Mai, Minh-Quan Hoang, Van-Huan Nguyen, Hoang-Viet Nguyen, Ha-Thanh Nguyen, Thi-Hai-Yen Vuong et.al.	2309.09070v1	null
2023-09-16	Constructing a Knowledge Graph for Vietnamese Legal Cases with Heterogeneous Graphs	Thi-Hai-Yen Vuong, Minh-Quan Hoang, Tan-Minh Nguyen, Hoang-Trung Nguyen, Ha-Thanh Nguyen et.al.	2309.09069v1	null
2023-09-15	Resolving Legalese: A Multilingual Exploration of Negation Scope Resolution in Legal Documents	Ramona Christen, Anastassia Shaitarova, Matthias Stürmer, Joel Niklaus et.al.	2309.08695v1	link
2023-09-15	Encoded Summarization: Summarizing Documents into Continuous Vector Space for Legal Case Retrieval	Vu Tran, Minh Le Nguyen, Satoshi Tojo, Ken Satoh et.al.	2309.08187v1	null
2023-12-21	FedJudge: Federated Legal Large Language Model	Linan Yue, Qi Liu, Yichao Du, Weibo Gao, Ye Liu, Fangzhou Yao et.al.	2309.08173v2	link
2023-08-11	India's Progress in Space Exploration and International Legal Challenges in Meeting Goals within International Space Boundaries: A Review	Jayanthi Vajiram, Utkarsh Maurya, Negha Senthil et.al.	2309.06560v1	null
2023-09-11	Black-Box Analysis: GPTs Across Time in Legal Textual Entailment Task	Ha-Thanh Nguyen, Randy Goebel, Francesca Toni, Kostas Stathis, Ken Satoh et.al.	2309.05501v1	null
2023-09-11	NeCo@ALQAC 2023: Legal Domain Knowledge Acquisition for Low-Resource Languages through Data Enrichment	Hai-Long Nguyen, Dieu-Quynh Nguyen, Hoang-Trung Nguyen, Thu-Trang Pham, Huu-Dong Nguyen, Thach-Anh Nguyen, Thi-Hai-Yen Vuong, Ha-Thanh Nguyen et.al.	2309.05500v1	null
2024-02-05	NESTLE: a No-Code Tool for Statistical Analysis of Legal Corpus	Kyoungyeon Cho, Seungkum Han, Young Rok Choi, Wonseok Hwang et.al.	2309.04146v2	null
2023-09-06	Prompt-based Effective Input Reformulation for Legal Case Retrieval	Yanran Tang, Ruihong Qiu, Xue Li et.al.	2309.02962v1	link
2023-09-01	ALJP: An Arabic Legal Judgment Prediction in Personal Status Cases Using Machine Learning Models	Salwa Abbara, Mona Hafez, Aya Kazzaz, Areej Alhothali, Alhanouf Alsolami et.al.	2309.00238v1	null
2023-09-05	Is the U.S. Legal System Ready for AI's Challenges to Human Values?	Inyoung Cheong, Aylin Caliskan, Tadayoshi Kohno et.al.	2308.15906v3	null
2023-08-20	LegalBench: A Collaboratively Built Benchmark for Measuring Legal Reasoning in Large Language Models	Neel Guha, Julian Nyarko, Daniel E. Ho, Christopher Ré, Adam Chilton, Aditya Narayana, Alex Chohlas-Wood, Austin Peters, Brandon Waldon, Daniel N. Rockmore, Diego Zambrano, Dmitry Talisman, Enam Hoque, Faiz Surani, Frank Fagan, Galit Sarfaty, Gregory M. Dickinson, Haggai Porat, Jason Hegland, Jessica Wu, Joe Nudell, Joel Niklaus, John Nay, Jonathan H. Choi, Kevin Tobia, Margaret Hagan, Megan Ma, Michael Livermore, Nikon Rasumov-Rahe, Nils Holzenberger, Noam Kolt, Peter Henderson, Sean Rehaag, Sharad Goel, Shang Gao, Spencer Williams, Sunny Gandhi, Tom Zur, Varun Iyer, Zehua Li et.al.	2308.11462v1	link
2023-08-08	SILO Language Models: Isolating Legal Risk In a Nonparametric Datastore	Sewon Min, Suchin Gururangan, Eric Wallace, Hannaneh Hajishirzi, Noah A. Smith, Luke Zettlemoyer et.al.	2308.04430v1	link
2023-08-04	Legal Summarisation through LLMs: The PRODIGIT Project	Thiago Dal Pont, Federico Galli, Andrea Loreggia, Giuseppe Pisano, Riccardo Rovatti, Giovanni Sartor et.al.	2308.04416v1	null
2023-08-08	Large Language Model Prompt Chaining for Long Legal Document Classification	Dietrich Trautmann et.al.	2308.04138v1	null
2023-08-02	Exploring the psychology of GPT-4's Moral and Legal Reasoning	Guilherme F. C. F. Almeida, José Luiz Nunes, Neele Engelmann, Alex Wiegmann, Marcelo de Araújo et.al.	2308.01264v1	null
2023-07-31	Adversarially Robust Neural Legal Judgement Systems	Rohit Raj, V Susheela Devi et.al.	2308.00165v1	null
2023-07-27	Exploration of legal implications of air and space travel for international and domestic travel and the Environment	Jayanthi Vajiram, Negha Senthil, Nean Adhith. P, Ritikaa. VN et.al.	2307.14661v1	null
2023-07-25	An Intent Taxonomy of Legal Case Retrieval	Yunqiu Shao, Haitao Li, Yueyue Wu, Yiqun Liu, Qingyao Ai, Jiaxin Mao, Yixiao Ma, Shaoping Ma et.al.	2307.13298v1	null
2023-07-17	Legal Syllogism Prompting: Teaching Large Language Models for Legal Judgment Prediction	Cong Jiang, Xiaolei Yang et.al.	2307.08321v1	link
2023-07-11	Argumentative Segmentation Enhancement for Legal Summarization	Huihui Xu, Kevin Ashley et.al.	2307.05081v1	null
2023-07-10	Legal Decision-making for Highway Automated Driving	Xiaohan Ma, Wenhao Yu, Chengxiang Zhao, Changjun Wang, Wenhui Zhou, Guangming Zhao, Mingyue Ma, Weida Wang, Lin Yang, Rui Mu, Hong Wang, Jun Li et.al.	2307.04327v1	null
2023-07-07	Specification, Validation and Verification of Social, Legal, Ethical, Empathetic and Cultural Requirements for Autonomous Agents	Sinem Getir Yaman, Ana Cavalcanti, Radu Calinescu, Colin Paterson, Pedro Ribeiro, Beverley Townsend et.al.	2307.03697v1	null
2024-01-18	Towards Open Federated Learning Platforms: Survey and Vision from Technical and Legal Perspectives	Moming Duan et.al.	2307.02140v2	link
2023-07-04	Racial Bias Trends in the Text of US Legal Opinions	Rohan Jinturkar et.al.	2307.01693v1	null
2023-06-29	Towards Grammatical Tagging for the Legal Language of Cybersecurity	Gianpietro Castiglione, Giampaolo Bella, Daniele Francesco Santamaria et.al.	2306.17042v1	null
2023-06-29	Beyond Logic Programming for Legal Reasoning	Ha-Thanh Nguyen, Francesca Toni, Kostas Stathis, Ken Satoh et.al.	2306.16632v1	null
2023-06-28	ChatLaw: Open-Source Legal Large Language Model with Integrated External Knowledge Bases	Jiaxi Cui, Zongjian Li, Yang Yan, Bohua Chen, Li Yuan et.al.	2306.16092v1	link
2023-06-09	Legal and ethical considerations regarding the use of ChatGPT in education	Fereniki Panagopoulou, Christina Parpoula, Kostas Karpouzis et.al.	2306.10037v1	null
2023-06-22	Explaining Legal Concepts with Augmented Large Language Models (GPT-4)	Jaromir Savelka, Kevin D. Ashley, Morgan A. Gray, Hannes Westermann, Huihui Xu et.al.	2306.09525v2	null
2023-06-12	Large Language Models as Tax Attorneys: A Case Study in Legal Capabilities Emergence	John J. Nay, David Karamardian, Sarah B. Lawsky, Wenting Tao, Meghana Bhat, Raghav Jain, Aaron Travis Lee, Jonathan H. Choi, Jungo Kasai et.al.	2306.07075v1	null
2023-06-09	Towards the Exploitation of LLM-based Chatbot for Providing Legal Support to Palestinian Cooperatives	Rabee Qasem, Banan Tantour, Mohammed Maree et.al.	2306.05827v1	null
2023-06-08	NOWJ at COLIEE 2023 -- Multi-Task and Ensemble Approaches in Legal Information Processing	Thi-Hai-Yen Vuong, Hai-Long Nguyen, Tan-Minh Nguyen, Hoang-Trung Nguyen, Thai-Binh Nguyen, Ha-Thanh Nguyen et.al.	2306.04903v1	null
2023-06-08	Improving Vietnamese Legal Question--Answering System based on Automatic Data Enrichment	Thi-Hai-Yen Vuong, Ha-Thanh Nguyen, Quang-Huy Nguyen, Le-Minh Nguyen, Xuan-Hieu Phan et.al.	2306.04841v1	null
2023-06-03	FlairNLP at SemEval-2023 Task 6b: Extraction of Legal Named Entities from Legal Texts using Contextual String Embeddings	Vinay N Ramesh, Rohan Eswara et.al.	2306.02182v1	link
2023-06-03	TransDocAnalyser: A Framework for Offline Semi-structured Handwritten Document Analysis in the Legal Domain	Sagar Chakraborty, Gaurav Harit, Saptarshi Ghosh et.al.	2306.02142v1	link
2023-06-06	MultiLegalPile: A 689GB Multilingual Legal Corpus	Joel Niklaus, Veton Matoshi, Matthias Stürmer, Ilias Chalkidis, Daniel E. Ho et.al.	2306.02069v2	null
2023-06-14	How Ready are Pre-trained Abstractive Models and LLMs for Legal Case Judgement Summarization?	Aniket Deroy, Kripabandhu Ghosh, Saptarshi Ghosh et.al.	2306.01248v2	null
2023-06-01	Towards Argument-Aware Abstractive Summarization of Long Legal Opinions with Summary Reranking	Mohamed Elaraby, Yang Zhong, Diane Litman et.al.	2306.00672v1	null
2023-05-29	Datasets for Portuguese Legal Semantic Textual Similarity: Comparing weak supervision and an annotation process approaches	Daniel da Silva Junior, Paulo Roberto dos S. Corval, Aline Paes, Daniel de Oliveira et.al.	2306.00007v1	null
2023-05-09	Stronger Together: on the Articulation of Ethical Charters, Legal Tools, and Technical Documentation in ML	Giada Pistilli, Carlos Munoz Ferrandis, Yacine Jernite, Margaret Mitchell et.al.	2305.18615v1	null
2023-05-20	CDJUR-BR -- A Golden Collection of Legal Document from Brazilian Justice with Fine-Grained Named Entities	Antonio Mauricio, Vladia Pinheiro, Vasco Furtado, João Araújo Monteiro Neto, Francisco das Chagas Jucá Bomfim, André Câmara Ferreira da Costa, Raquel Silveira, Nilsiton Aragão et.al.	2305.18315v1	null
2023-05-25	Prototype-Based Interpretability for Legal Citation Prediction	Chu Fei Luo, Rohan Bhambhoria, Samuel Dahan, Xiaodan Zhu et.al.	2305.16490v1	null
2023-05-24	Automated Refugee Case Analysis: An NLP Pipeline for Supporting Legal Practitioners	Claire Barale, Michael Rovatsos, Nehal Bhuta et.al.	2305.15533v1	link
2023-05-23	Adversarial Machine Learning and Cybersecurity: Risks, Challenges, and Legal Implications	Micah Musser, Andrew Lohn, James X. Dempsey, Jonathan Spring, Ram Shankar Siva Kumar, Brenda Leong, Christina Liaghati, Cindy Martinez, Crystal D. Grant, Daniel Rohrer, Heather Frase, Jonathan Elliott, John Bansemer, Mikel Rodriguez, Mitt Regan, Rumman Chowdhury, Stefan Hermanek et.al.	2305.14553v1	null
2023-11-01	Towards Legally Enforceable Hate Speech Detection for Public Forums	Chu Fei Luo, Rohan Bhambhoria, Xiaodan Zhu, Samuel Dahan et.al.	2305.13677v2	link
2023-05-20	Proceedings of the International Workshop on Methodologies for Translating Legal Norms into Formal Representations (LN2FR 2022) in association with 35th International Conference on Legal Knowledge and Information Systems (JURIX 2022)	Georg Borges, Ken Satoh, Erich Schweighofer et.al.	2305.12203v1	null
2023-05-04	Late-Binding Scholarship in the Age of AI: Navigating Legal and Normative Challenges of a New Form of Knowledge Production	Bill Tomlinson, Andrew W. Torrance, Rebecca W. Black, Donald J. Patterson et.al.	2305.11058v1	null
2023-05-15	Legal Extractive Summarization of U.S. Court Opinions	Emmanuel Bauer, Dominik Stammbach, Nianlong Gu, Elliott Ash et.al.	2305.08428v1	link
2023-05-22	LeXFiles and LegalLAMA: Facilitating English Multinational Legal Language Model Development	Ilias Chalkidis, Nicolas Garneau, Catalina Goanta, Daniel Martin Katz, Anders Søgaard et.al.	2305.07507v2	link
2023-05-11	THUIR@COLIEE 2023: More Parameters and Legal Knowledge for Legal Case Entailment	Haitao Li, Changyue Wang, Weihang Su, Yueyue Wu, Qingyao Ai, Yiqun Liu et.al.	2305.06817v1	link
2023-05-11	THUIR@COLIEE 2023: Incorporating Structural Knowledge into Pre-trained Language Models for Legal Case Retrieval	Haitao Li, Weihang Su, Changyue Wang, Yueyue Wu, Qingyao Ai, Yiqun Liu et.al.	2305.06812v1	link
2023-05-10	Extracting Complex Named Entities in Legal Documents via Weakly Supervised Object Detection	Hsiu-Wei Yang, Abhinav Agrawal et.al.	2305.05836v1	null
2023-05-09	An Exploration of Encoder-Decoder Approaches to Multi-Label Classification for Legal and Biomedical Text	Yova Kementchedjhieva, Ilias Chalkidis et.al.	2305.05627v1	link
2023-05-09	CaseEncoder: A Knowledge-enhanced Pre-trained Model for Legal Case Encoding	Yixiao Ma, Yueyue Wu, Weihang Su, Qingyao Ai, Yiqun Liu et.al.	2305.05393v1	null
2023-05-08	Unlocking Practical Applications in Legal Domain: Evaluation of GPT for Zero-Shot Semantic Annotation of Legal Texts	Jaromir Savelka et.al.	2305.04417v1	null
2023-05-06	Rhetorical Role Labeling of Legal Documents using Transformers and Graph Neural Networks	Anshika Gupta, Shaz Furniturewala, Vijay Kumari, Yashvardhan Sharma et.al.	2305.04100v1	null
2023-05-04	ChatGPT and Works Scholarly: Best Practices and Legal Pitfalls in Writing with AI	Bill Tomlinson, Andrew W. Torrance, Rebecca W. Black et.al.	2305.03722v1	null
2023-05-03	CiteCaseLAW: Citation Worthiness Detection in Caselaw for Legal Assistive Writing	Mann Khatri, Pritish Wadhwa, Gitansh Satija, Reshma Sheik, Yaman Kumar, Rajiv Ratn Shah, Ponnurangam Kumaraguru et.al.	2305.03508v1	null
2023-05-04	Analyzing Hong Kong's Legal Judgments from a Computational Linguistics point-of-view	Sankalok Sen et.al.	2305.02558v1	null
2023-05-02	MultiLegalSBD: A Multilingual Legal Sentence Boundary Detection Dataset	Tobias Brugger, Matthias Stürmer, Joel Niklaus et.al.	2305.01211v1	link
2023-04-27	Analyzing Vietnamese Legal Questions Using Deep Neural Networks with Biaffine Classifiers	Nguyen Anh Tu, Hoang Thi Thu Uyen, Tu Minh Phuong, Ngo Xuan Bach et.al.	2304.14447v1	null
2023-04-21	The Dark Side of ChatGPT: Legal and Ethical Challenges from Stochastic Parrots and Hallucination	Zihao Li et.al.	2304.14347v1	null
2023-04-22	SAILER: Structure-aware Pre-trained Language Model for Legal Case Retrieval	Haitao Li, Qingyao Ai, Jia Chen, Qian Dong, Yueyue Wu, Yiqun Liu, Chong Chen, Qi Tian et.al.	2304.11370v1	link
2023-05-01	SemEval 2023 Task 6: LegalEval - Understanding Legal Texts	Ashutosh Modi, Prathamesh Kalamkar, Saurabh Karn, Aman Tiwari, Abhinav Joshi, Sai Kiran Tanikella, Shouvik Kumar Guha, Sachin Malhan, Vivek Raghavan et.al.	2304.09548v3	null
2023-06-29	How well do SOTA legal reasoning models support abductive reasoning?	Ha-Thanh Nguyen, Randy Goebel, Francesca Toni, Kostas Stathis, Ken Satoh et.al.	2304.06912v2	null
2023-09-15	Exploring the State of the Art in Legal QA Systems	Abdelrahman Abdallah, Bhawna Piryani, Adam Jatowt et.al.	2304.06623v3	link
2023-04-12	FALQU: Finding Answers to Legal Questions	Behrooz Mansouri, Ricardo Campos et.al.	2304.05611v1	link
2023-04-25	Context-Aware Classification of Legal Document Pages	Pavlos Fragkogiannis, Martina Forster, Grace E. Lee, Dell Zhang et.al.	2304.02787v2	null
2023-03-25	(Legal Design) Research through Litigation	Reuben Kirkham et.al.	2303.14336v1	null
2023-07-19	Understand Legal Documents with Contextualized Large Language Models	Xin Jin, Yuchen Wang et.al.	2303.12135v4	null
2023-03-16	A Short Survey of Viewing Large Language Models in Legal Aspect	Zhongxiang Sun et.al.	2303.09136v1	link
2023-03-14	Are Models Trained on Indian Legal Data Fair?	Sahil Girhepuje, Anmol Goel, Gokul S Krishnan, Shreya Goyal, Satyendra Pandey, Ponnurangam Kumaraguru, Balaraman Ravindran et.al.	2303.07247v2	null
2023-08-04	Meaningful human command: Advance control directives as a method to enable moral and legal responsibility for autonomous weapons systems	Susannah Kate Devitt et.al.	2303.06813v3	null
2023-03-07	German BERT Model for Legal Named Entity Recognition	Harshil Darji, Jelena Mitrović, Michael Granitzer et.al.	2303.05388v1	null
2023-03-08	Automatic Detection of Industry Sectors in Legal Articles Using Machine Learning Approaches	Hui Yang, Stella Hadjiantoni, Yunfei Long, Ruta Petraityte, Berthold Lausen et.al.	2303.05387v1	null
2023-02-23	Natural Language Processing in the Legal Domain	Daniel Martin Katz, Dirk Hartung, Lauritz Gerlach, Abhik Jana, Michael J. Bommarito II et.al.	2302.12039v1	null
2023-02-21	Combining Blockchain and Biometrics: A Survey on Technical Aspects and a First Legal Analysis	Mahdi Ghafourian, Bilgesu Sumer, Ruben Vera-Rodriguez, Julian Fierrez, Ruben Tolosana, Aythami Moralez, Els Kindt et.al.	2302.10883v1	null
2023-02-12	AIDA: Legal Judgment Predictions for Non-Professional Fact Descriptions via Partial-and-Imbalanced Domain Adaptation	Guangyi Xiao, Xinlong Liu, Hao Chen, Jingzhi Guo, Zhiguo Gong et.al.	2302.07728v1	null
2023-02-13	Joint Span Segmentation and Rhetorical Role Labeling with Data Augmentation for Legal Documents	T. Y. S. S. Santosh, Philipp Bock, Matthias Grabmair et.al.	2302.06448v1	null
2023-03-20	Minding rights: Mapping ethical and legal foundations of 'neurorights'	Sjors Ligthart, Marcello Ienca, Gerben Meynen, Fruzsina Molnar-Gabor, Roberto Andorno, Christoph Bublitz, Paul Catley, Lisa Claydon, Thomas Douglas, Nita Farahany, Joseph J. Fins, Sara Goering, Pim Haselager, Fabrice Jotterand, Andrea Lavazza, Allan McCay, Abel Wajnerman Paz, Stephen Rainey, Jesper Ryberg, Philipp Kellmeyer et.al.	2302.06281v2	null
2023-02-14	A Brief Report on LawGPT 1.0: A Virtual Legal Assistant Based on GPT-3	Ha-Thanh Nguyen et.al.	2302.05729v2	null
2023-02-03	Leveraging task dependency and contrastive learning for Legal Judgement Prediction on the European Court of Human Rights	T. Y. S. S Santosh, Marcel Perez San Blas, Phillip Kemper, Matthias Grabmair et.al.	2302.00768v2	null
2023-02-13	Zero-shot Transfer of Article-aware Legal Outcome Classification for European Court of Human Rights Cases	T. Y. S. S Santosh, Oana Ichim, Matthias Grabmair et.al.	2302.00609v3	null
2023-01-30	LEXTREME: A Multi-Lingual and Multi-Task Benchmark for the Legal Domain	Joel Niklaus, Veton Matoshi, Pooja Rani, Andrea Galassi, Matthias Stürmer, Ilias Chalkidis et.al.	2301.13126v1	link
2023-01-29	Diverse legal case search	Ruizhe Zhang, Qingyao Ai, Yueyue Wu, Yixiao Ma, Yiqun Liu et.al.	2301.12504v1	null
2023-01-30	Large Language Models as Fiduciaries: A Case Study Toward Robustly Communicating With Artificial Intelligence Through Legal Standards	John J. Nay et.al.	2301.10095v2	null
2023-07-15	On left legal semigroups	Attila Nagy et.al.	2301.08793v2	null
2023-01-19	Legal Obligation and Ethical Best Practice: Towards Meaningful Verbal Consent for Voice Assistants	William Seymour, Mark Cote, Jose Such et.al.	2301.08091v1	null
2023-01-07	Graph-based Keyword Planning for Legal Clause Generation from Topics	Sagar Joshi, Sumanth Balaji, Aparna Garimella, Vasudeva Varma et.al.	2301.06901v1	link
2023-01-06	MAUD: An Expert-Annotated Legal NLP Dataset for Merger Agreement Understanding	Steven H. Wang, Antoine Scardigli, Leonard Tang, Wei Chen, Dimitry Levkin, Anya Chen, Spencer Ball, Thomas Woodside, Oliver Zhang, Dan Hendrycks et.al.	2301.00876v2	link
2022-12-13	Attentive Deep Neural Networks for Legal Document Retrieval	Ha-Thanh Nguyen, Manh-Kien Phi, Xuan-Bach Ngo, Vu Tran, Le-Minh Nguyen, Minh-Phuong Tu et.al.	2212.13899v1	null
2022-12-19	What to Read in a Contract? Party-Specific Summarization of Important Obligations, Entitlements, and Prohibitions in Legal Documents	Abhilasha Sancheti, Aparna Garimella, Balaji Vasan Srinivasan, Rachel Rudinger et.al.	2212.09825v1	null
2022-12-19	E-NER -- An Annotated Named Entity Recognition Corpus of Legal Text	Ting Wai Terence Au, Ingemar J. Cox, Vasileios Lampos et.al.	2212.09306v1	link
2022-12-16	Law to Binary Tree -- An Formal Interpretation of Legal Natural Language	Ha-Thanh Nguyen, Vu Tran, Ngoc-Cam Le, Thi-Thuy Le, Quang-Huy Nguyen, Le-Minh Nguyen, Ken Satoh et.al.	2212.08335v1	null
2022-12-16	LegalRelectra: Mixed-domain Language Modeling for Long-range Legal Text Comprehension	Wenyue Hua, Yuchen Zhang, Zhe Chen, Josie Li, Melanie Weber et.al.	2212.08204v1	null
2023-06-01	No driver, No Regulation? --Online Legal Driving Behavior Monitoring for Self-driving Vehicles	Wenhao Yu, Chengxiang Zhao, Jiaxin Liu, Yingkai Yang, Xiaohan Ma, Jun Li, Weida Wang, Hong Wang, Ding Zhao, Xiaosong Hu et.al.	2212.04156v3	null
2022-12-06	Formal Modeling and Analysis of Legal Contracts using ContractCheck	Alan Khoja, Martin Kölbl, Stefan Leue, Rüdiger Wilhelmi et.al.	2212.03349v1	null
2022-12-05	Legal Prompt Engineering for Multilingual Legal Judgement Prediction	Dietrich Trautmann, Alina Petrova, Frank Schilder et.al.	2212.02199v1	null
2022-12-08	Legal Prompting: Teaching a Language Model to Think Like a Lawyer	Fangyi Yu, Lee Quartey, Frank Schilder et.al.	2212.01326v2	null
2022-11-30	BudgetLongformer: Can we Cheaply Pretrain a SotA Legal Language Model From Scratch?	Joel Niklaus, Daniele Giofré et.al.	2211.17135v1	null
2022-11-15	DeepParliament: A Legal domain Benchmark & Dataset for Parliament Bills Prediction	Ankit Pal et.al.	2211.15424v1	link
2022-11-23	Agent-Specific Deontic Modality Detection in Legal Language	Abhilasha Sancheti, Aparna Garimella, Balaji Vasan Srinivasan, Rachel Rudinger et.al.	2211.12752v1	null
2022-11-21	Legal and Political Stance Detection of SCOTUS Language	Noah Bergam, Emily Allaway, Kathleen McKeown et.al.	2211.11724v1	link
2022-11-15	Exploiting Contrastive Learning and Numerical Evidence for Improving Confusing Legal Judgment Prediction	Leilei Gan, Baokui Li, Kun Kuang, Yi Yang, Fei Wu et.al.	2211.08238v1	null
2022-11-15	An Efficient Active Learning Pipeline for Legal Text Classification	Sepideh Mamooler, Rémi Lebret, Stéphane Massonnet, Karl Aberer et.al.	2211.08112v1	null
2022-11-06	Computing and Exploiting Document Structure to Improve Unsupervised Extractive Summarization of Legal Case Decisions	Yang Zhong, Diane Litman et.al.	2211.03229v1	link
2023-04-18	Knowledge is Power: Understanding Causality Makes Legal judgment Prediction Models More Generalizable and Robust	Haotian Chen, Lingwei Zhang, Yiran Liu, Fanchao Chen, Yang Yu et.al.	2211.03046v2	null
2022-11-05	Privacy-Preserving Models for Legal Natural Language Processing	Ying Yin, Ivan Habernal et.al.	2211.02956v1	link
2022-11-05	The Legal Argument Reasoning Task in Civil Procedure	Leonard Bongard, Lena Held, Ivan Habernal et.al.	2211.02950v1	link
2022-11-04	Miko Team: Deep Learning Approach for Legal Question Answering in ALQAC 2022	Hieu Nguyen Van, Dat Nguyen, Phuong Minh Nguyen, Minh Le Nguyen et.al.	2211.02200v1	null
2022-11-03	Data-efficient End-to-end Information Extraction for Statistical Legal Analysis	Wonseok Hwang, Saehee Eom, Hanuhl Lee, Hai Jin Park, Minjoon Seo et.al.	2211.01692v1	null
2022-11-10	Processing Long Legal Documents with Pre-trained Transformers: Modding LegalBERT and Longformer	Dimitris Mamakas, Petros Tsotsi, Ion Androutsopoulos, Ilias Chalkidis et.al.	2211.00974v2	null
2022-11-01	ClassActionPrediction: A Challenging Benchmark for Legal Judgment Prediction of Class Action Cases in the US	Gil Semo, Dor Bernsohn, Ben Hagag, Gila Hayat, Joel Niklaus et.al.	2211.00582v1	link
2022-10-31	Do Charge Prediction Models Learn Legal Theory?	Zhenwei An, Quzhe Huang, Cong Jiang, Yansong Feng, Dongyan Zhao et.al.	2210.17108v1	link
2022-10-30	Validity Assessment of Legal Will Statements as Natural Language Inference	Alice Saebom Kwak, Jacob O. Israelsen, Clayton T. Morrison, Derek E. Bambauer, Mihai Surdeanu et.al.	2210.16989v1	link
2022-10-25	Deconfounding Legal Judgment Prediction for European Court of Human Rights Cases Towards Better Alignment with Experts	T. Y. S. S Santosh, Shanshan Xu, Oana Ichim, Matthias Grabmair et.al.	2210.13836v1	link
2022-11-04	Parameter-Efficient Legal Domain Adaptation	Jonathan Li, Rohan Bhambhoria, Xiaodan Zhu et.al.	2210.13712v2	null
2022-10-24	Toward an Intelligent Tutoring System for Argument Mining in Legal Texts	Hannes Westermann, Jaromir Savelka, Vern R. Walker, Kevin D. Ashley, Karim Benyekhlef et.al.	2210.13635v1	null
2022-10-24	EUR-Lex-Sum: A Multi- and Cross-lingual Dataset for Long-form Summarization in the Legal Domain	Dennis Aumiller, Ashish Chouhan, Michael Gertz et.al.	2210.13448v1	link
2022-10-24	Legal-Tech Open Diaries: Lesson learned on how to develop and deploy light-weight models in the era of humongous Language Models	Stelios Maroudas, Sotiris Legkas, Prodromos Malakasiotis, Ilias Chalkidis et.al.	2210.13086v1	null
2022-10-22	Extractive Summarization of Legal Decisions using Multi-task Learning and Maximal Marginal Relevance	Abhishek Agarwal, Shanshan Xu, Matthias Grabmair et.al.	2210.12437v1	null
2022-12-08	Modelling and Explaining Legal Case-based Reasoners through Classifiers	Xinghan Liu, Emiliano Lorini, Antonino Rotolo, Giovanni Sartor et.al.	2210.11217v2	null
2023-04-26	Law Article-Enhanced Legal Case Matching: a Causal Learning Approach	Zhongxiang Sun, Jun Xu, Xiao Zhang, Zhenhua Dong, Ji-Rong Wen et.al.	2210.11012v2	link
2022-10-19	Multi-granularity Argument Mining in Legal Texts	Huihui Xu, Kevin Ashley et.al.	2210.09472v2	null
2023-04-05	Conversion of Legal Agreements into Smart Legal Contracts using NLP	Eason Chen, Niall Roche, Yuen-Hsien Tseng, Walter Hernandez, Jiangbo Shangguan, Alastair Moore et.al.	2210.08954v2	null
2022-10-15	AraLegal-BERT: A pretrained language model for Arabic Legal text	Muhammad AL-Qurishi, Sarah AlQaseemi, Riad Soussi et.al.	2210.08284v1	null
2022-10-14	Legal Case Document Summarization: Extractive and Abstractive Methods and their Evaluation	Abhay Shukla, Paheli Bhattacharya, Soham Poddar, Rajdeep Mukherjee, Kripabandhu Ghosh, Pawan Goyal, Saptarshi Ghosh et.al.	2210.07544v1	link
2022-10-11	Legal Element-oriented Modeling with Multi-view Contrastive Learning for Legal Case Retrieval	Zhaowei Wang et.al.	2210.05188v1	null
2022-10-01	Using Argumentation Schemes to Model Legal Reasoning	Trevor Bench-Capon, Katie Atkinson et.al.	2210.00315v1	null
2022-11-12	Multi-stage Information Retrieval for Vietnamese Legal Texts	Nhat-Minh Pham, Ha-Thanh Nguyen, Trong-Hop Do et.al.	2209.14494v2	null
2023-05-16	Law Informs Code: A Legal Informatics Approach to Aligning Artificial Intelligence with Humans	John J. Nay et.al.	2209.13020v14	null
2022-09-26	Legal Case Document Similarity: You Need Both Network and Text	Paheli Bhattacharya, Kripabandhu Ghosh, Arindam Pal, Saptarshi Ghosh et.al.	2209.12474v1	link
2022-09-25	An Empirical Study on Cross-X Transfer for Legal Judgment Prediction	Joel Niklaus, Matthias Stürmer, Ilias Chalkidis et.al.	2209.12325v1	link
2022-09-13	LegalBench: Prototyping a Collaborative Benchmark for Legal Reasoning	Neel Guha, Daniel E. Ho, Julian Nyarko, Christopher Ré et.al.	2209.06120v1	link
2023-05-15	Pre-trained Language Models for the Legal Domain: A Case Study on Indian Law	Shounak Paul, Arpan Mandal, Pawan Goyal, Saptarshi Ghosh et.al.	2209.06049v5	null
2022-08-29	Bias Impact Analysis of AI in Consumer Mobile Health Technologies: Legal, Technical, and Policy	Kristine Gloria, Nidhi Rastogi, Stevie DeGroff et.al.	2209.05440v1	null
2022-09-11	Eiger: Auditable, executable, flexible legal regulations	Alexander Bernauer, Richard A. Eisenberg et.al.	2209.04939v1	null
2023-05-28	Early Verification of Legal Compliance via Bounded Satisfiability Checking	Nick Feng, Lina Marsso, Mehrdad Sabetzadeh, Marsha Chechik et.al.	2209.04052v3	link
2022-09-18	An Argumentation-Based Legal Reasoning Approach for DL-Ontology	Zhe Yu, Yiwei Lu et.al.	2209.03070v2	null
2022-09-06	From Legal Contracts to Legal Calculi: the code-driven normativity	Silvia Crafa et.al.	2209.02353v1	null
2022-09-20	ArgLegalSumm: Improving Abstractive Summarization of Legal Documents with Argument Mining	Mohamed Elaraby, Diane Litman et.al.	2209.01650v2	link
2022-09-02	Entity Graph Extraction from Legal Acts -- a Prototype for a Use Case in Policy Design Analysis	Anna Wróblewska, Bartosz Pieliński, Karolina Seweryn, Karol Saputa, Aleksandra Wichrowska, Sylwia Sysko-Romańczuk, Hanna Schreiber et.al.	2209.00944v1	null
2022-09-01	Unsupervised Simplification of Legal Texts	Mert Cemri, Tolga Çukur, Aykut Koç et.al.	2209.00557v1	null
2022-10-06	On the Role of Negative Precedent in Legal Outcome Prediction	Josef Valvoda, Ryan Cotterell, Simone Teufel et.al.	2208.08225v2	link
2023-05-17	Mining Legal Arguments in Court Decisions	Ivan Habernal, Daniel Faber, Nicola Recchia, Sebastian Bretthauer, Iryna Gurevych, Indra Spiecker genannt Döhmann, Christoph Burchard et.al.	2208.06178v2	link
2022-08-08	Valid Widgets Contain Legal Subwidgets	Nathan Donagi et.al.	2208.03866v1	null
2022-08-06	Preventing or Mitigating Adversarial Supply Chain Attacks; a legal analysis	Kaspar Rosager Ludvigsen, Shishir Nagaraja, Angela Daly et.al.	2208.03466v1	null
2022-09-01	Upgrading the protection of children from manipulative and addictive strategies in online games: Legal and technical solutions beyond privacy regulation	Tommaso Crepax, Jan Tobias Muehlberg et.al.	2207.09928v2	null
2022-07-10	Developing an NLP-based Recommender System for the Ethical, Legal, and Social Implications of Synthetic Biology	Damien Dablain, Lilian Huang, Brandon Sepulvado et.al.	2207.06360v1	null
2022-07-09	Explainable Legal Case Matching via Inverse Optimal Transport-based Rationale Extraction	Weijie Yu, Zhongxiang Sun, Jun Xu, Zhenhua Dong, Xu Chen, Hongteng Xu, Ji-Rong Wen et.al.	2207.04182v1	link
2022-07-15	Sequence-aware multimodal page classification of Brazilian legal documents	Pedro H. Luz de Araujo, Ana Paula G. S. de Almeida, Fabricio A. Braz, Nilton C. da Silva, Flavio de Barros Vidal, Teofilo E. de Campos et.al.	2207.00748v2	link
2022-11-29	Pile of Law: Learning Responsible Data Filtering from the Law and a 256GB Open-Source Legal Dataset	Peter Henderson, Mark S. Krass, Lucia Zheng, Neel Guha, Christopher D. Manning, Dan Jurafsky, Daniel E. Ho et.al.	2207.00220v2	link
2022-07-20	Cybersecurity Law: Legal Jurisdiction and Authority	Feras A. Batarseh et.al.	2206.09465v3	null
2022-06-15	Legal Provocations for HCI in the Design and Development of Trustworthy Autonomous Systems	Lachlan D. Urquhart, Glenn McGarry, Andy Crabtree et.al.	2206.07506v1	null
2022-09-13	Indian Legal Text Summarization: A Text Normalisation-based Approach	Satyajit Ghosh, Mousumi Dutta, Tanaya Das et.al.	2206.06238v2	null
2022-06-13	Tackling Algorithmic Disability Discrimination in the Hiring Process: An Ethical, Legal and Technical Analysis	Maarten Buyl, Christina Cociancig, Cristina Frattone, Nele Roekens et.al.	2206.06149v1	null
2022-10-05	A Multi-Task Benchmark for Korean Legal Language Understanding and Judgement Prediction	Wonseok Hwang, Dongjun Lee, Kyoungyeon Cho, Hanuhl Lee, Minjoon Seo et.al.	2206.05224v2	link
2022-06-08	Realistic Zero-Shot Cross-Lingual Transfer in Legal Topic Classification	Stratos Xenouleas, Alexia Tsoukara, Giannis Panagiotakis, Ilias Chalkidis, Ion Androutsopoulos et.al.	2206.03785v1	null
2022-05-30	Billions of Parameters Are Worth More Than In-domain Training Data: A case study in the Legal Case Entailment Task	Guilherme Moraes Rosa, Luiz Bonifacio, Vitor Jeronymo, Hugo Abonizio, Roberto Lotufo, Rodrigo Nogueira et.al.	2205.15172v1	link
2022-05-17	An Evaluation Framework for Legal Document Summarization	Ankan Mullick, Abhilash Nandy, Manav Nitin Kapadnis, Sohan Patnaik, R Raghav, Roshni Kar et.al.	2205.08478v1	link
2022-05-15	Regulating Facial Processing Technologies: Tensions Between Legal and Technical Considerations in the Application of Illinois BIPA	Rui-Jie Yew, Alice Xiang et.al.	2205.07299v1	null
2022-05-13	The Case for a Legal Compliance API for the Enforcement of the EU's Digital Services Act on Social Media Platforms	Catalina Goanta, Thales Bertaglia, Adriana Iamnitchi et.al.	2205.06666v1	null
2022-05-06	Fine-grained Intent Classification in the Legal Domain	Ankan Mullick, Abhilash Nandy, Manav Nitin Kapadnis, Sohan Patnaik, R Raghav et.al.	2205.03509v1	null
2022-04-19	Sharing and Caring: Creating a Culture of Constructive Criticism in Computational Legal Studies	Corinna Coupette, Dirk Hartung et.al.	2205.01071v1	null
2022-04-16	nigam@COLIEE-22: Legal Case Retrieval and Entailment using Cascading of Lexical and Semantic-based models	Shubham Kumar Nigam, Navansh Goel et.al.	2204.07853v1	link
2022-03-10	State of the Art in Artificial Intelligence applied to the Legal Domain	João Dias, Pedro A. Santos, Nuno Cordeiro, Ana Antunes, Bruno Martins, Jorge Baptista, Carlos Gonçalves et.al.	2204.07047v1	null
2022-04-11	A Survey on Legal Judgment Prediction: Datasets, Metrics, Models and Challenges	Junyun Cui, Xiaoyu Shen, Feiping Nie, Zheng Wang, Jinglong Wang, Yulong Chen et.al.	2204.04859v1	null
2022-04-02	Recordism: A social-scientific prospect of blockchain from social, legal, financial, and technological perspectives	Zihao Li, Hao Xu, Yang Fang, Boyuan Zhao, Lei Zhang et.al.	2204.00823v1	null
2022-04-02	HLDC: Hindi Legal Documents Corpus	Arnav Kapoor, Mudit Dhawan, Anmol Goel, T. H. Arjun, Akshala Bhatnagar, Vibhu Agrawal, Amul Agrawal, Arnab Bhattacharya, Ponnurangam Kumaraguru, Ashutosh Modi et.al.	2204.00806v1	link
2022-03-29	An Evaluation Dataset for Legal Word Embedding: A Case Study On Chinese Codex	Chun-Hsien Lin, Pu-Jen Cheng et.al.	2203.15173v1	link
2022-04-12	Gender and Racial Stereotype Detection in Legal Opinion Word Embeddings	Sean Matthews, John Hudzina, Dawn Sepehr et.al.	2203.13369v2	null
2022-03-16	LEVEN: A Large-Scale Chinese Legal Event Detection Dataset	Feng Yao, Chaojun Xiao, Xiaozhi Wang, Zhiyuan Liu, Lei Hou, Cunchao Tu, Juanzi Li, Yun Liu, Weixing Shen, Maosong Sun et.al.	2203.08556v1	link
2022-03-15	Toward Improving Attentive Neural Networks in Legal Text Processing	Ha-Thanh Nguyen et.al.	2203.08244v1	null
2022-03-14	FairLex: A Multilingual Benchmark for Evaluating Fairness in Legal Text Processing	Ilias Chalkidis, Tommaso Pasini, Sheng Zhang, Letizia Tomada, Sebastian Felix Schwemer, Anders Søgaard et.al.	2203.07228v1	link
2022-03-08	An Uncommon Task: Participatory Design in Legal AI	Fernando Delgado, Solon Barocas, Karen Levy et.al.	2203.06246v1	null
2022-03-05	Prediction of terrorism pattern accompanied by cyber-terrorism and the development direction of corresponding legal systems	Daegeon Kim et.al.	2203.03620v1	null
2022-03-04	Information retrieval and structural complexity of legal trees	Yanik-Pascal Förster, Alessia Annibale, Luca Gamberi, Evan Tzanis, Pierpaolo Vivo et.al.	2203.02259v1	null
2022-03-03	LegalVis: Exploring and Inferring Precedent Citations in Legal Documents	Lucas E. Resck, Jean R. Ponciano, Luis Gustavo Nonato, Jorge Poco et.al.	2203.02001v1	null
2022-04-06	Enhancing Legal Argument Mining with Domain Pre-training and Neural Networks	Gechuan Zhang, Paul Nulty, David Lillis et.al.	2202.13457v2	link
2022-02-25	Measuring Shocks to Central Bank Independence using Legal Rulings	Stefan Griller, Florian Huber, Michael Pfarrhofer et.al.	2202.12695v1	null
2022-02-13	Transformer-based Approaches for Legal Text Processing	Ha-Thanh Nguyen, Minh-Phuong Nguyen, Thi-Hai-Yen Vuong, Minh-Quan Bui, Minh-Chau Nguyen, Tran-Binh Dang, Vu Tran, Le-Minh Nguyen, Ken Satoh et.al.	2202.06397v1	null
2022-02-07	To Tune or Not To Tune? Zero-shot Models for Legal Case Entailment	Guilherme Moraes Rosa, Ruan Chaves Rodrigues, Roberto de Alencar Lotufo, Rodrigo Nogueira et.al.	2202.03120v1	link
2022-02-05	Classification on Sentence Embeddings for Legal Assistance	Arka Mitra et.al.	2202.02639v1	null
2022-01-31	Bankruptcy Shocks and Legal Labor Markets: Evidence from the Court Competition Era	Chad Brown, Jeronimo Carballo, Alessandro Peri et.al.	2202.00044v1	null
2022-01-31	Don't let Ricci v. DeStefano Hold You Back: A Bias-Aware Legal Solution to the Hiring Paradox	Jad Salem, Deven R. Desai, Swati Gupta et.al.	2201.13367v1	null
2022-01-31	Guided Semi-Supervised Non-negative Matrix Factorization on Legal Documents	Pengyu Li, Christine Tseng, Yaxuan Zheng, Joyce A. Chew, Longxiu Huang, Benjamin Jarman, Deanna Needell et.al.	2201.13324v1	null
2022-09-19	Corpus for Automatic Structuring of Legal Documents	Prathamesh Kalamkar, Aman Tiwari, Astha Agarwal, Saurabh Karn, Smita Gupta, Vivek Raghavan, Ashutosh Modi et.al.	2201.13125v2	null
2022-04-19	Expert Finding in Legal Community Question Answering	Arian Askari, Suzan Verberne, Gabriella Pasi et.al.	2201.07667v3	link
2022-01-17	Data-Centric Machine Learning in the Legal Domain	Hannes Westermann, Jaromir Savelka, Vern R. Walker, Kevin D. Ashley, Karim Benyekhlef et.al.	2201.06653v1	null
2022-01-14	Sequence-to-Sequence Models for Extracting Information from Registration and Legal Documents	Ramon Pires, Fábio C. de Souza, Guilherme Rosa, Roberto A. Lotufo, Rodrigo Nogueira et.al.	2201.05658v1	link
2022-01-01	Interpretable Low-Resource Legal Decision Making	Rohan Bhambhoria, Hui Liu, Samuel Dahan, Xiaodan Zhu et.al.	2201.01164v1	null
2021-12-29	LeSICiN: A Heterogeneous Graph-based Approach for Automatic Legal Statute Identification from Indian Legal Documents	Shounak Paul, Pawan Goyal, Saptarshi Ghosh et.al.	2112.14731v1	link
2021-12-21	Sentence Embeddings and High-speed Similarity Search for Fast Computer Assisted Annotation of Legal Documents	Hannes Westermann, Jaromir Savelka, Vern R. Walker, Kevin D. Ashley, Karim Benyekhlef et.al.	2112.11494v1	null
2021-12-15	Lex Rosetta: Transfer of Predictive Models Across Languages, Jurisdictions, and Legal Domains	Jaromir Savelka, Hannes Westermann, Karim Benyekhlef, Charlotte S. Alexander, Jayla C. Grant, David Restrepo Amariles, Rajaa El Hamdani, Sébastien Meeùs, Michał Araszkiewicz, Kevin D. Ashley, Alexandra Ashley, Karl Branting, Mattia Falduti, Matthias Grabmair, Jakub Harašta, Tereza Novotná, Elizabeth Tippett, Shiwanni Johnson et.al.	2112.07882v1	link
2021-12-15	Cross-Domain Generalization and Knowledge Transfer in Transformers Trained on Legal Data	Jaromir Savelka, Hannes Westermann, Karim Benyekhlef et.al.	2112.07870v1	null
2021-12-14	Discovering Explanatory Sentences in Legal Case Decisions Using Pre-trained Language Models	Jaromir Savelka, Kevin D. Ashley et.al.	2112.07165v1	link
2021-12-23	Ergo -- a programming language for Smart Legal Contracts	Niall Roche, Walter Hernandez, Eason Chen, Jérôme Siméon, Dan Selman et.al.	2112.07064v2	null
2021-12-13	Dependency Learning for Legal Judgment Prediction with a Unified Text-to-Text Transformer	Yunyun Huang, Xiaoyu Shen, Chuanyi Li, Jidong Ge, Bin Luo et.al.	2112.06370v1	link
2021-12-10	Computer-Assisted Creation of Boolean Search Rules for Text Classification in the Legal Domain	Hannes Westermann, Jaromir Savelka, Vern R. Walker, Kevin D. Ashley, Karim Benyekhlef et.al.	2112.05807v1	null
2022-11-07	Semantic Segmentation of Legal Documents via Rhetorical Roles	Vijit Malik, Rishabh Sanjay, Shouvik Kumar Guha, Angshuman Hazarika, Shubham Nigam, Arnab Bhattacharya, Ashutosh Modi et.al.	2112.01836v2	link
2021-12-11	Zero-Shot Cross-Lingual Transfer in Legal Domain Using Transformer Models	Zein Shaheen, Gerhard Wohlgenannt, Dmitry Mouromtsev et.al.	2111.14192v2	null
2021-11-05	From impact refugees to deterritorialized states: foresighting extreme legal-policy cases in asteroid impact scenarios	Elisa Simó-Soler, Eloy Peña-Asensio et.al.	2111.13643v1	null
2021-11-23	Robust Deep Reinforcement Learning for Extractive Legal Summarization	Duy-Hung Nguyen, Bao-Sinh Nguyen, Nguyen Viet Dung Nghiem, Dung Tien Le, Mim Amina Khatun, Minh-Tien Nguyen, Hung Le et.al.	2111.07158v2	null
2021-11-14	Critical Sentence Identification in Legal Cases Using Multi-Class Classification	Sahan Jayasinghe, Lakith Rambukkanage, Ashan Silva, Nisansa de Silva, Amal Shehan Perera et.al.	2111.05721v2	null
2021-11-03	Building Legal Datasets	Jerrold Soh et.al.	2111.02034v1	null
2021-10-05	LegalNLP -- Natural Language Processing methods for the Brazilian Legal Language	Felipe Maia Polo, Gabriel Caiaffa Floriano Mendonça, Kauê Capellato J. Parreira, Lucka Gianvechio, Peterson Cordeiro, Jonathan Batista Ferreira, Leticia Maria Paz de Lima, Antônio Carlos do Amaral Maia, Renato Vicente et.al.	2110.15709v1	link
2021-10-15	Law Smells: Defining and Detecting Problematic Patterns in Legal Drafting	Corinna Coupette, Dirk Hartung, Janis Beckedorf, Maximilian Böther, Daniel Martin Katz et.al.	2110.11984v1	null
2021-10-21	Pacta sunt servanda: legal contracts in Stipula	Silvia Crafa, Cosimo Laneve, Giovanni Sartor et.al.	2110.11069v1	null
2021-10-12	A Survey on Legal Question Answering Systems	Jorge Martinez-Gil et.al.	2110.07333v1	null
2021-10-09	Dynamic Logic of Legal Competences	Huimin Dong, Olivier Roy et.al.	2110.04454v1	null
2021-10-07	Cookie Banners, What's the Purpose? Analyzing Cookie Banner Text Through a Legal Lens	Cristiana Santos, Arianna Rossi, Lorena Sánchez Chamorro, Kerstin Bongard-Blanchy, Ruba Abu-Salma et.al.	2110.02597v2	null

Speech Recognition

Publish Date	Title	Authors	PDF	Code
2024-08-15	Enhancing Large Language Model-based Speech Recognition by Contextualization for Rare and Ambiguous Words	Kento Nozawa, Takashi Masuko, Toru Taniguchi et.al.	2408.08027v1	null
2024-08-12	Enhancing Dialogue Speech Recognition with Robust Contextual Awareness via Noise Representation Learning	Wonjun Lee, San Kim, Gary Geunbae Lee et.al.	2408.06043v1	null
2024-08-11	LI-TTA: Language Informed Test-Time Adaptation for Automatic Speech Recognition	Eunseop Yoon, Hee Suk Yoon, John Harvill, Mark Hasegawa-Johnson, Chang D. Yoo et.al.	2408.05769v1	null
2024-08-09	MooER: LLM-based Speech Recognition and Translation Models from Moore Threads	Junhao Xu, Zhenlin Liang, Yi Liu, Yichao Hu, Jian Li, Yajun Zheng, Meng Cai, Hua Wang et.al.	2408.05101v1	link
2024-08-05	Clustering and Mining Accented Speech for Inclusive and Fair Speech Recognition	Jaeyoung Kim, Han Lu, Soheil Khorram, Anshuman Tripathi, Qian Zhang, Hasim Sak et.al.	2408.02582v1	null
2024-08-08	The NPU-ASLP System Description for Visual Speech Recognition in CNVSRC 2024	He Wang, Lei Xie et.al.	2408.02369v2	link
2024-08-01	SynesLM: A Unified Approach for Audio-visual Speech Recognition and Translation via Language Model and Synthetic Data	Yichen Lu, Jiaqi Song, Xuankai Chang, Hengwei Bian, Soumi Maiti, Shinji Watanabe et.al.	2408.00624v1	link
2024-07-18	Handling Numeric Expressions in Automatic Speech Recognition	Christian Huber, Alexander Waibel et.al.	2408.00004v1	null
2024-07-31	On the Problem of Text-To-Speech Model Selection for Synthetic Data Generation in Automatic Speech Recognition	Nick Rossenbach, Ralf Schlüter, Sakriani Sakti et.al.	2407.21476v1	null
2024-07-30	Self-Supervised Models in Automatic Whispered Speech Recognition	Aref Farhadipour, Homa Asadi, Volker Dellwo et.al.	2407.21211v1	null
2024-07-10	Dynamic Encoder Size Based on Data-Driven Layer-wise Pruning for Speech Recognition	Jingjing Xu, Wei Zhou, Zijian Yang, Eugen Beck, Ralf Schlueter et.al.	2407.18930v1	null
2024-08-07	Dynamic Language Group-Based MoE: Enhancing Code-Switching Speech Recognition with Hierarchical Routing	Hukai Huang, Shenghui Lu, Yahui Shan, He Qu, Wenhao Guan, Qingyang Hong, Lin Li et.al.	2407.18581v2	link
2024-07-26	Enhancing Dysarthric Speech Recognition for Unseen Speakers via Prototype-Based Adaptation	Shiyao Wang, Shiwan Zhao, Jiaming Zhou, Aobo Kong, Yong Qin et.al.	2407.18461v1	link
2024-07-25	On the Effect of Purely Synthetic Training Data for Different Automatic Speech Recognition Architectures	Nick Rossenbach, Benedikt Hilmes, Ralf Schlüter et.al.	2407.17997v1	null
2024-07-25	Scaling A Simple Approach to Zero-Shot Speech Recognition	Jinming Zhao, Vineel Pratap, Michael Auli et.al.	2407.17852v1	link
2024-07-24	A Comparative Analysis of Bilingual and Trilingual Wav2Vec Models for Automatic Speech Recognition in Multilingual Oral History Archives	Jan Lehečka, Josef V. Psutka, Luboš Šmídl, Pavel Ircing, Josef Psutka et.al.	2407.17160v1	null
2024-07-23	Quantifying the Role of Textual Predictability in Automatic Speech Recognition	Sean Robertson, Gerald Penn, Ewan Dunbar et.al.	2407.16537v1	null
2024-07-23	The CHiME-8 DASR Challenge for Generalizable and Array Agnostic Distant Automatic Speech Recognition and Diarization	Samuele Cornell, Taejin Park, Steve Huang, Christoph Boeddeker, Xuankai Chang, Matthew Maciejewski, Matthew Wiesner, Paola Garcia, Shinji Watanabe et.al.	2407.16447v1	null
2024-07-07	Morse Code-Enabled Speech Recognition for Individuals with Visual and Hearing Impairments	Ritabrata Roy Choudhury et.al.	2407.14525v1	null
2024-07-19	Reexamining Racial Disparities in Automatic Speech Recognition Performance: The Role of Confounding by Provenance	Changye Li, Trevor Cohen, Serguei Pakhomov et.al.	2407.13982v1	null
2024-07-03	Self-supervised ASR Models and Features For Dysarthric and Elderly Speech Recognition	Shujie Hu, Xurong Xie, Mengzhe Geng, Zengrui Jin, Jiajun Deng, Guinan Li, Yi Wang, Mingyu Cui, Tianzi Wang, Helen Meng, Xunying Liu et.al.	2407.13782v1	null
2024-07-18	Low-Resourced Speech Recognition for Iu Mien Language via Weakly-Supervised Phoneme-based Multilingual Pre-training	Lukuan Dong, Donghong Qin, Fengbo Bai, Fanhua Song, Yan Liu, Chen Xu, Zhijian Ou et.al.	2407.13292v1	null
2024-06-29	Error Correction by Paying Attention to Both Acoustic and Confidence References for Automatic Speech Recognition	Yuchun Shu, Bo Hu, Yifeng He, Hao Shi, Longbiao Wang, Jianwu Dang et.al.	2407.12817v1	null
2024-07-14	Improving Neural Biasing for Contextual Speech Recognition by Early Context Injection and Text Perturbation	Ruizhe Huang, Mahsa Yarmohammadi, Sanjeev Khudanpur, Daniel Povey et.al.	2407.10303v1	null
2024-07-13	Empowering Whisper as a Joint Multi-Talker and Target-Talker Speech Recognition System	Lingwei Meng, Jiawen Kang, Yuejiao Wang, Zengrui Jin, Xixin Wu, Xunying Liu, Helen Meng et.al.	2407.09817v1	null
2024-07-13	A Streaming Multi-Channel End-to-End Speech Recognition System with Realistic Evaluations	Xiangzhu Kong, Tianqi Ning, Hao Huang, Zhijian Ou et.al.	2407.09807v1	link
2024-07-09	Tailored Design of Audio-Visual Speech Recognition Models using Branchformers	David Gimeno-Gómez, Carlos-D. Martínez-Hinarejos et.al.	2407.06606v1	link
2024-07-10	Seed-ASR: Understanding Diverse Speech and Contexts with LLM-based Speech Recognition	Ye Bai, Jingping Chen, Jitong Chen, Wei Chen, Zhuo Chen, Chuang Ding, Linhao Dong, Qianqian Dong, Yujiao Du, Kepan Gao, Lu Gao, Yi Guo, Minglun Han, Ting Han, Wenchao Hu, Xinying Hu, Yuxiang Hu, Deyu Hua, Lu Huang, Mingkun Huang, Youjia Huang, Jishuo Jin, Fanliu Kong, Zongwei Lan, Tianyu Li, Xiaoyang Li, Zeyang Li, Zehua Lin, Rui Liu, Shouda Liu, Lu Lu, Yizhou Lu, Jingting Ma, Shengtao Ma, Yulin Pei, Chen Shen, Tian Tan, Xiaogang Tian, Ming Tu, Bo Wang, Hao Wang, Yuping Wang, Yuxuan Wang, Hanzhang Xia, Rui Xia, Shuangyi Xie, Hongmin Xu, Meng Yang, Bihong Zhang, Jun Zhang, Wanyi Zhang, Yang Zhang, Yawei Zhang, Yijie Zheng, Ming Zou et.al.	2407.04675v2	null
2024-07-05	Speculative Speech Recognition by Audio-Prefixed Low-Rank Adaptation of Language Models	Bolaji Yusuf, Murali Karthick Baskar, Andrew Rosenberg, Bhuvana Ramabhadran et.al.	2407.04641v1	null
2024-07-04	Improving Accented Speech Recognition using Data Augmentation based on Unsupervised Text-to-Speech Synthesis	Cong-Thanh Do, Shuhei Imai, Rama Doddipatla, Thomas Hain et.al.	2407.04047v1	null
2024-07-04	Learning Video Temporal Dynamics with Cross-Modal Attention for Robust Audio-Visual Speech Recognition	Sungnyun Kim, Kangwook Jang, Sangmin Bae, Hoirin Kim, Se-Young Yun et.al.	2407.03563v1	null
2024-07-03	Codec-ASR: Training Performant Automatic Speech Recognition Systems with Discrete Speech Representations	Kunal Dhawan, Nithin Rao Koluguri, Ante Jukić, Ryan Langman, Jagadeesh Balam, Boris Ginsburg et.al.	2407.03495v1	null
2024-07-03	Qifusion-Net: Layer-adapted Stream/Non-stream Model for End-to-End Multi-Accent Speech Recognition	Jinming Chen, Jingyi Fang, Yuanzhong Zheng, Yaoxuan Wang, Haojun Fei et.al.	2407.03026v1	null
2024-07-02	Pinyin Regularization in Error Correction for Chinese Speech Recognition with Large Language Models	Zhiyuan Tang, Dong Wang, Shen Huang, Shidong Shang et.al.	2407.01909v1	link
2024-06-28	Less is More: Accurate Speech Recognition & Translation without Web-Scale Data	Krishna C. Puvvada, Piotr Żelasko, He Huang, Oleksii Hrinchuk, Nithin Rao Koluguri, Kunal Dhawan, Somshubra Majumdar, Elena Rastorgueva, Zhehuai Chen, Vitaly Lavrukhin, Jagadeesh Balam, Boris Ginsburg et.al.	2406.19674v1	null
2024-06-27	Zero-Query Adversarial Attack on Black-box Automatic Speech Recognition Systems	Zheng Fang, Tao Wang, Lingchen Zhao, Shenyi Zhang, Bowen Li, Yunjie Ge, Qi Li, Chao Shen, Qian Wang et.al.	2406.19311v1	null
2024-06-27	Streaming Decoder-Only Automatic Speech Recognition with Discrete Speech Units: A Pilot Study	Peikun Chen, Sining Sun, Changhao Shan, Qing Yang, Lei Xie et.al.	2406.18862v1	link
2024-06-26	Dynamic Data Pruning for Automatic Speech Recognition	Qiao Xiao, Pingchuan Ma, Adriana Fernandez-Lopez, Boqian Wu, Lu Yin, Stavros Petridis, Mykola Pechenizkiy, Maja Pantic, Decebal Constantin Mocanu, Shiwei Liu et.al.	2406.18373v1	null
2024-06-26	MSR-86K: An Evolving, Multilingual Corpus with 86,300 Hours of Transcribed Audio for Speech Recognition Research	Song Li, Yongbin You, Xuezhi Wang, Zhengkun Tian, Ke Ding, Guanglu Wan et.al.	2406.18301v1	null
2024-06-26	Automatic Speech Recognition for Hindi	Anish Saha, A. G. Ramakrishnan et.al.	2406.18135v1	null
2024-07-12	ArzEn-LLM: Code-Switched Egyptian Arabic-English Translation and Speech Recognition Using LLMs	Ahmed Heakl, Youssef Zaghloul, Mennatullah Ali, Rania Hossam, Walid Gomaa et.al.	2406.18120v2	link
2024-06-25	Sequential Editing for Lifelong Training of Speech Recognition Models	Devang Kulshreshtha, Saket Dingliwal, Brady Houston, Nikolaos Pappas, Srikanth Ronanki et.al.	2406.17935v1	null
2024-06-25	Automatic speech recognition for the Nepali language using CNN, bidirectional LSTM and ResNet	Manish Dhakal, Arman Chhetri, Aman Kumar Gupta, Prabin Lamichhane, Suraj Pandey, Subarna Shakya et.al.	2406.17825v1	link
2024-06-25	MSRS: Training Multimodal Speech Recognition Models from Scratch with Sparse Mask Optimization	Adriana Fernandez-Lopez, Honglie Chen, Pingchuan Ma, Lu Yin, Qiao Xiao, Stavros Petridis, Shiwei Liu, Maja Pantic et.al.	2406.17614v1	null
2024-06-23	Contextualized End-to-end Automatic Speech Recognition with Intermediate Biasing Loss	Muhammad Shakeel, Yui Sudo, Yifan Peng, Shinji Watanabe et.al.	2406.16120v1	null
2024-08-01	Decoder-only Architecture for Streaming End-to-end Speech Recognition	Emiru Tsunoo, Hayato Futami, Yosuke Kashiwagi, Siddhant Arora, Shinji Watanabe et.al.	2406.16107v2	null
2024-06-21	Perception of Phonological Assimilation by Neural Speech Recognition Models	Charlotte Pouw, Marianne de Heer Kloots, Afra Alishahi, Willem Zuidema et.al.	2406.15265v1	null
2024-06-19	Joint vs Sequential Speaker-Role Detection and Automatic Speech Recognition for Air-traffic Control	Alexander Blatt, Aravind Krishnan, Dietrich Klakow et.al.	2406.13842v1	null
2024-06-24	Children's Speech Recognition through Discrete Token Enhancement	Vrunda N. Sukhadia, Shammur Absar Chowdhury et.al.	2406.13431v2	null
2024-06-16	Automatic Speech Recognition for Biomedical Data in Bengali Language	Shariar Kabir, Nazmun Nahar, Shyamasree Saha, Mamunur Rashid et.al.	2406.12931v1	null
2024-06-18	Bridging the Gap: Integrating Pre-trained Speech Enhancement and Recognition Models for Robust Speech Recognition	Kuan-Chen Wang, You-Jin Li, Wei-Lun Chen, Yu-Wen Chen, Yi-Ching Wang, Ping-Cheng Yeh, Chao Zhang, Yu Tsao et.al.	2406.12699v1	null
2024-06-18	Rapid Language Adaptation for Multilingual E2E Speech Recognition Using Encoder Prompting	Yosuke Kashiwagi, Hayato Futami, Emiru Tsunoo, Siddhant Arora, Shinji Watanabe et.al.	2406.12611v1	null
2024-06-18	Unsupervised Online Continual Learning for Automatic Speech Recognition	Steven Vander Eeckt, Hugo Van hamme et.al.	2406.12503v1	link
2024-06-18	SyncVSR: Data-Efficient Visual Speech Recognition with End-to-End Crossmodal Audio Token Synchronization	Young Jin Ahn, Jungwoo Park, Sangha Park, Jonghyun Choi, Kee-Eung Kim et.al.	2406.12233v1	link
2024-06-16	Continual Test-time Adaptation for End-to-end Speech Recognition on Noisy Speech	Guan-Ting Lin, Wei-Ping Huang, Hung-yi Lee et.al.	2406.11064v1	null
2024-06-16	Imperceptible Rhythm Backdoor Attacks: Exploring Rhythm Transformation for Embedding Undetectable Vulnerabilities on Speech Recognition	Wenhan Yao, Jiangkun Yang, Yongqiang He, Jia Liu, Weiping Wen et.al.	2406.10932v1	null
2024-06-14	CNVSRC 2023: The First Chinese Continuous Visual Speech Recognition Challenge	Chen Chen, Zehua Liu, Xiaolou Li, Lantian Li, Dong Wang et.al.	2406.10313v1	null
2024-06-12	Improving child speech recognition with augmented child-like speech	Yuanyuan Zhang, Zhengjun Yue, Tanvina Patel, Odette Scharenborg et.al.	2406.10284v1	null
2024-06-14	Whisper-Flamingo: Integrating Visual Features into Whisper for Audio-Visual Speech Recognition and Translation	Andrew Rouditchenko, Yuan Gong, Samuel Thomas, Leonid Karlinsky, Hilde Kuehne, Rogerio Feris, James Glass et.al.	2406.10082v1	link
2024-06-14	An efficient text augmentation approach for contextualized Mandarin speech recognition	Naijun Zheng, Xucheng Wan, Kai Liu, Ziqing Du, Zhou Huan et.al.	2406.09950v1	null
2024-06-14	Perceiver-Prompt: Flexible Speaker Adaptation in Whisper for Chinese Disordered Speech Recognition	Yicong Jiang, Tianzi Wang, Xurong Xie, Juan Liu, Wei Sun, Nan Yan, Hui Chen, Lan Wang, Xunying Liu, Feng Tian et.al.	2406.09873v1	null
2024-06-13	Multi-Modal Retrieval For Large Language Model Based Speech Recognition	Jari Kolehmainen, Aditya Gourav, Prashanth Gurunath Shivakumar, Yile Gu, Ankur Gandhe, Ariya Rastrow, Grant Strimel, Ivan Bulyko et.al.	2406.09618v1	null
2024-06-13	Speech ReaLLM -- Real-time Streaming Speech Recognition with Multimodal LLMs by Teaching the Flow of Time	Frank Seide, Morrie Doulaty, Yangyang Shi, Yashesh Gaur, Junteng Jia, Chunyang Wu et.al.	2406.09569v1	null
2024-06-13	Language Complexity and Speech Recognition Accuracy: Orthographic Complexity Hurts, Phonological Complexity Doesn't	Chihiro Taguchi, David Chiang et.al.	2406.09202v1	link
2024-06-13	Transcription-Free Fine-Tuning of Speech Separation Models for Noisy and Reverberant Multi-Speaker Automatic Speech Recognition	William Ravenscroft, George Close, Stefan Goetze, Thomas Hain, Mohammad Soleymanpour, Anurag Chowdhury, Mark C. Fuhs et.al.	2406.08914v1	null
2024-06-13	A Single-Step Non-Autoregressive Automatic Speech Recognition Architecture with High Accuracy and Inference Speed	Ziyang Zhuang, Chenfeng Miao, Kun Zou, Shuai Gong, Ming Fang, Tao Wei, Zijian Li, Wei Hu, Shaojun Wang, Jing Xiao et.al.	2406.08835v1	null
2024-06-12	Training Data Augmentation for Dysarthric Automatic Speech Recognition by Text-to-Dysarthric-Speech Synthesis	Wing-Zin Leung, Mattias Cross, Anton Ragni, Stefan Goetze et.al.	2406.08568v1	null
2024-06-12	Neural Blind Source Separation and Diarization for Distant Speech Recognition	Yoshiaki Bando, Tomohiko Nakamura, Shinji Watanabe et.al.	2406.08396v1	null
2024-06-12	Towards Unsupervised Speech Recognition Without Pronunciation Models	Junrui Ni, Liming Wang, Yang Zhang, Kaizhi Qian, Heting Gao, Mark Hasegawa-Johnson, Chang D. Yoo et.al.	2406.08380v1	null
2024-06-11	Tag and correct: high precision post-editing approach to correction of speech recognition errors	Tomasz Ziętkiewicz et.al.	2406.07589v1	null
2024-06-11	AS-70: A Mandarin stuttered speech dataset for automatic speech recognition and stuttering event detection	Rong Gong, Hongfei Xue, Lezhi Wang, Xin Xu, Qisheng Li, Lei Xie, Hui Bu, Shaomei Wu, Jiaming Zhou, Yong Qin, Binbin Zhang, Jun Du, Jia Bin, Ming Li et.al.	2406.07256v1	null
2024-06-11	Reading Miscue Detection in Primary School through Automatic Speech Recognition	Lingyun Gao, Cristian Tejedor-Garcia, Helmer Strik, Catia Cucchiarini et.al.	2406.07060v1	null
2024-06-06	LipGER: Visually-Conditioned Generative Error Correction for Robust Automatic Speech Recognition	Sreyan Ghosh, Sonal Kumar, Ashish Seth, Purva Chiniya, Utkarsh Tyagi, Ramani Duraiswami, Dinesh Manocha et.al.	2406.04432v1	link
2024-06-06	Speed of Light Exact Greedy Decoding for RNN-T Speech Recognition Models on GPU	Daniel Galvez, Vladimir Bataev, Hainan Xu, Tim Kaldewey et.al.	2406.03791v1	null
2024-06-11	Enhancing CTC-based speech recognition with diverse modeling units	Shiyi Han, Zhihong Lei, Mingbin Xu, Xingyu Na, Zhen Huang et.al.	2406.03274v2	null
2024-06-05	Error-preserving Automatic Speech Recognition of Young English Learners' Language	Janick Michot, Manuela Hürlimann, Jan Deriu, Luzia Sauer, Katsiaryna Mlynchyk, Mark Cieliebak et.al.	2406.03235v1	link
2024-06-15	Task Arithmetic can Mitigate Synthetic-to-Real Gap in Automatic Speech Recognition	Hsuan Su, Hua Farn, Fan-Yun Sun, Shang-Tse Chen, Hung-yi Lee et.al.	2406.02925v2	null
2024-06-04	Keyword-Guided Adaptation of Automatic Speech Recognition	Aviv Shamsian, Aviv Navon, Neta Glazer, Gill Hetz, Joseph Keshet et.al.	2406.02649v1	null
2024-05-03	Combining X-Vectors and Bayesian Batch Active Learning: Two-Stage Active Learning Pipeline for Speech Recognition	Ognjen Kundacina, Vladimir Vincan, Dragisa Miskovic et.al.	2406.02566v1	null
2024-04-24	Gated Low-rank Adaptation for personalized Code-Switching Automatic Speech Recognition on the low-spec devices	Gwantae Kim, Bokyeung Lee, Donghyeon Kim, Hanseok Ko et.al.	2406.02562v1	null
2024-04-23	Breaking Walls: Pioneering Automatic Speech Recognition for Central Kurdish: End-to-End Transformer Paradigm	Abdulhady Abas Abdullah, Hadi Veisi, Tarik Rashid et.al.	2406.02561v1	null
2024-03-27	PhoWhisper: Automatic Speech Recognition for Vietnamese	Thanh-Thien Le, Linh The Nguyen, Dat Quoc Nguyen et.al.	2406.02555v1	link
2024-06-04	Whistle: Data-Efficient Multilingual and Crosslingual Speech Recognition via Weakly Phonetic Supervision	Saierdaer Yusuyin, Te Ma, Hao Huang, Wenbo Zhao, Zhijian Ou et.al.	2406.02166v1	link
2024-05-27	ViSpeR: Multilingual Audio-Visual Speech Recognition	Sanath Narayan, Yasser Abdelaziz Dahou Djilali, Ankit Singh, Eustache Le Bihan, Hakim Hacid et.al.	2406.00038v1	null
2024-05-27	Federating Dynamic Models using Early-Exit Architectures for Automatic Speech Recognition on Heterogeneous Clients	Mohamed Nabih Ali, Alessio Brutti, Daniele Falavigna et.al.	2405.17376v1	null
2024-05-24	Denoising LM: Pushing the Limits of Error Correction Models for Speech Recognition	Zijin Gu, Tatiana Likhomanenko, He Bai, Erik McDermott, Ronan Collobert, Navdeep Jaitly et.al.	2405.15216v1	null
2024-05-22	Joint Optimization of Streaming and Non-Streaming Automatic Speech Recognition with Multi-Decoder and Knowledge Distillation	Muhammad Shakeel, Yui Sudo, Yifan Peng, Shinji Watanabe et.al.	2405.13514v1	null
2024-05-22	Contextualized Automatic Speech Recognition with Dynamic Vocabulary	Yui Sudo, Yosuke Fukumoto, Muhammad Shakeel, Yifan Peng, Shinji Watanabe et.al.	2405.13344v1	null
2024-05-28	FairLENS: Assessing Fairness in Law Enforcement Speech Recognition	Yicheng Wang, Mark Cusick, Mohamed Laila, Kate Puech, Zhengping Ji, Xia Hu, Michael Wilson, Noah Spitzer-Williams, Bryan Wheeler, Yasser Ibrahim et.al.	2405.13166v2	null
2024-05-15	Continued Pretraining for Domain Adaptation of Wav2vec2.0 in Automatic Speech Recognition for Elementary Math Classroom Settings	Ahmed Adel Attia, Dorottya Demszky, Tolulope Ogunremi, Jing Liu, Carol Espy-Wilson et.al.	2405.13018v1	null
2024-03-14	Multilingual Audio-Visual Speech Recognition with Hybrid CTC/RNN-T Fast Conformer	Maxime Burchi, Krishna C. Puvvada, Jagadeesh Balam, Boris Ginsburg, Radu Timofte et.al.	2405.12983v1	null
2024-05-17	Acoustic modeling for Overlapping Speech Recognition: JHU Chime-5 Challenge System	Vimal Manohar, Szu-Jui Chen, Zhiqi Wang, Yusuke Fujita, Shinji Watanabe, Sanjeev Khudanpur et.al.	2405.11078v1	link
2024-05-16	Listen Again and Choose the Right Answer: A New Paradigm for Automatic Speech Recognition with Large Language Models	Yuchen Hu, Chen Chen, Chengwei Qin, Qiushi Zhu, Eng Siong Chng, Ruizhe Li et.al.	2405.10025v1	null
2024-05-15	Towards Evaluating the Robustness of Automatic Speech Recognition Systems via Audio Style Transfer	Weifei Jin, Yuxin Cao, Junjie Su, Qi Shen, Kai Ye, Derui Wang, Jie Hao, Ziyao Liu et.al.	2405.09470v1	null
2024-05-10	Lost in Transcription: Identifying and Quantifying the Accuracy Biases of Automatic Speech Recognition Systems Against Disfluent Speech	Dena Mujtaba, Nihar R. Mahapatra, Megan Arney, J. Scott Yaruss, Hope Gerlach-Houck, Caryn Herring, Jia Bin et.al.	2405.06150v1	null
2024-05-09	The RoyalFlush Automatic Speech Diarization and Recognition System for In-Car Multi-Channel Automatic Speech Recognition Challenge	Jingguang Tian, Shuaishuai Ye, Shunfei Chen, Yang Xiang, Zhaohui Yin, Xinhui Hu, Xinkang Xu et.al.	2405.05498v1	null
2024-05-06	MMGER: Multi-modal and Multi-granularity Generative Error Correction with LLM for Joint Accent and Speech Recognition	Bingshen Mu, Yangze Li, Qijie Shao, Kun Wei, Xucheng Wan, Naijun Zheng, Huan Zhou, Lei Xie et.al.	2405.03152v1	null
2024-05-02	Low-resource speech recognition and dialect identification of Irish in a multi-task framework	Liam Lonergan, Mengjie Qian, Neasa Ní Chiaráin, Christer Gobl, Ailbhe Ní Chasaide et.al.	2405.01293v1	null
2024-05-02	Deep Learning Models in Speech Recognition: Measuring GPU Energy Consumption, Impact of Noise and Model Quantization for Edge Deployment	Aditya Chakravarty et.al.	2405.01004v1	link
2024-07-24	Confides: A Visual Analytics Solution for Automated Speech Recognition Analysis and Exploration	Sunwoo Ha, Chaehun Lim, R. Jordan Crouser, Alvitta Ottley et.al.	2405.00223v2	null
2024-04-30	EfficientASR: Speech Recognition Network Compression via Attention Redundancy and Chunk-Level FFN Optimization	Jianzong Wang, Ziqi Liang, Xulong Zhang, Ning Cheng, Jing Xiao et.al.	2404.19214v1	null
2024-04-26	Child Speech Recognition in Human-Robot Interaction: Problem Solved?	Ruben Janssens, Eva Verhelst, Giulio Antonio Abbo, Qiaoqiao Ren, Maria Jose Pinto Bernal, Tony Belpaeme et.al.	2404.17394v1	null
2024-04-26	Automatic Speech Recognition System-Independent Word Error Rate Estimation	Chanho Park, Mingjie Chen, Thomas Hain et.al.	2404.16743v2	null
2024-04-25	Developing Acoustic Models for Automatic Speech Recognition in Swedish	Giampiero Salvi et.al.	2404.16547v1	null
2024-04-23	Killkan: The Automatic Speech Recognition Dataset for Kichwa with Morphosyntactic Information	Chihiro Taguchi, Jefferson Saransig, Dayana Velásquez, David Chiang et.al.	2404.15501v1	link
2024-04-23	Rethinking Processing Distortions: Disentangling the Impact of Speech Enhancement Errors on Speech Recognition Performance	Tsubasa Ochiai, Kazuma Iwamoto, Marc Delcroix, Rintaro Ikeshita, Hiroshi Sato, Shoko Araki, Shigeru Katagiri et.al.	2404.14860v1	null
2024-04-20	Semantically Corrected Amharic Automatic Speech Recognition	Samuael Adnew, Paul Pu Liang et.al.	2404.13362v1	link
2024-04-19	Efficient infusion of self-supervised representations in Automatic Speech Recognition	Darshan Prabhu, Sai Ganesh Mirishkar, Pankaj Wasnik et.al.	2404.12628v1	null
2024-07-26	Automatic Speech Recognition Advancements for Indigenous Languages of the Americas	Monica Romero, Sandra Gomez, Ivan G. Torre et.al.	2404.08368v2	null
2024-05-28	VietMed: A Dataset and Benchmark for Automatic Speech Recognition of Vietnamese in the Medical Domain	Khai Le-Duc et.al.	2404.05659v2	link
2024-04-04	Transducers with Pronunciation-aware Embeddings for Automatic Speech Recognition	Hainan Xu, Zhehuai Chen, Fei Jia, Boris Ginsburg et.al.	2404.04295v1	null
2024-04-03	Mai Ho'omāuna i ka 'Ai: Language Models Improve Automatic Speech Recognition in Hawaiian	Kaavya Chaparala, Guido Zarrella, Bruce Torres Fischer, Larry Kimura, Oiwi Parker Jones et.al.	2404.03073v1	null
2024-04-02	BRAVEn: Improving Self-Supervised Pre-training for Visual and Auditory Speech Recognition	Alexandros Haliassos, Andreas Zinonos, Rodrigo Mira, Stavros Petridis, Maja Pantic et.al.	2404.02098v1	link
2024-03-28	Multi-Stage Multi-Modal Pre-Training for Automatic Speech Recognition	Yash Jain, David Chan, Pranav Dheram, Aparna Khare, Olabanji Shonibare, Venkatesh Ravichandran, Shalini Ghosh et.al.	2403.19822v1	null
2024-03-04	JEP-KD: Joint-Embedding Predictive Architecture Based Knowledge Distillation for Visual Speech Recognition	Chang Sun, Hong Yang, Bo Qin et.al.	2403.18843v1	null
2024-04-11	DANCER: Entity Description Augmented Named Entity Corrector for Automatic Speech Recognition	Yi-Cheng Wang, Hsin-Wei Wang, Bi-Cheng Yan, Chi-Han Lin, Berlin Chen et.al.	2403.17645v3	null
2024-03-20	Advanced Long-Content Speech Recognition With Factorized Neural Transducer	Xun Gong, Yu Wu, Jinyu Li, Shujie Liu, Rui Zhao, Xie Chen, Yanmin Qian et.al.	2403.13423v1	null
2024-03-18	AdaMER-CTC: Connectionist Temporal Classification with Adaptive Maximum Entropy Regularization for Automatic Speech Recognition	SooHwan Eom, Eunseop Yoon, Hee Suk Yoon, Chanwoo Kim, Mark Hasegawa-Johnson, Chang D. Yoo et.al.	2403.11578v1	null
2024-03-14	More than words: Advancements and challenges in speech recognition for singing	Anna Kruspe et.al.	2403.09298v1	null
2024-05-21	Skipformer: A Skip-and-Recover Strategy for Efficient Speech Recognition	Wenjing Zhu, Sining Sun, Changhao Shan, Peng Fan, Qing Yang et.al.	2403.08258v2	null
2024-03-13	SpeechColab Leaderboard: An Open-Source Platform for Automatic Speech Recognition Evaluation	Jiayu Du, Jinpeng Li, Guoguo Chen, Wei-Qiang Zhang et.al.	2403.08196v1	link
2024-03-13	Automatic Speech Recognition (ASR) for the Diagnosis of pronunciation of Speech Sound Disorders in Korean children	Taekyung Ahn, Yeonjung Hong, Younggon Im, Do Hyung Kim, Dayoung Kang, Joo Won Jeong, Jae Won Kim, Min Jung Kim, Ah-ra Cho, Dae-Hyun Jang, Hosung Nam et.al.	2403.08187v1	null
2024-03-12	Gujarati-English Code-Switching Speech Recognition using ensemble prediction of spoken language	Yash Sharma, Basil Abraham, Preethi Jyothi et.al.	2403.08011v1	null
2024-03-11	The evaluation of a code-switched Sepedi-English automatic speech recognition system	Amanda Phaladi, Thipe Modipa et.al.	2403.07947v1	null
2024-03-08	Speech Robust Bench: A Robustness Benchmark For Speech Recognition	Muhammad A. Shah, David Solans Noguero, Mikko A. Heikkila, Nicolas Kourtellis et.al.	2403.07937v1	null
2024-03-12	Beyond the Labels: Unveiling Text-Dependency in Paralinguistic Speech Recognition Datasets	Jan Pešán, Santosh Kesiraju, Lukáš Burget, Jan "Honza" Černocký et.al.	2403.07767v1	null
2024-03-09	Aligning Speech to Languages to Enhance Code-switching Speech Recognition	Hexin Liu, Xiangyu Zhang, Leibny Paola Garcia, Andy W. H. Khong, Eng Siong Chng, Shinji Watanabe et.al.	2403.05887v1	null
2024-05-30	A New Benchmark for Evaluating Automatic Speech Recognition in the Arabic Call Domain	Qusai Abo Obaidah, Muhy Eddin Za'ter, Adnan Jaljuli, Ali Mahboub, Asma Hakouz, Bashar Al-Rfooh, Yazan Estaitia et.al.	2403.04280v2	null
2024-03-07	A Study of Dropout-Induced Modality Bias on Robustness to Missing Video Frames for Audio-Visual Speech Recognition	Yusheng Dai, Hang Chen, Jun Du, Ruoyu Wang, Shihao Chen, Jiefeng Ma, Haotian Wang, Chin-Hui Lee et.al.	2403.04245v1	link
2024-03-05	AIx Speed: Playback Speed Optimization Using Listening Comprehension of Speech Recognition Models	Kazuki Kawamura, Jun Rekimoto et.al.	2403.02938v1	null
2024-04-18	Automatic Speech Recognition using Advanced Deep Learning Approaches: A survey	Hamza Kheddar, Mustapha Hemis, Yassine Himeur et.al.	2403.01255v2	null
2024-03-01	Post-decoder Biasing for End-to-End Speech Recognition of Multi-turn Medical Interview	Heyang Liu, Yu Wang, Yanfeng Wang et.al.	2403.00370v1	null
2024-02-29	Probing the Information Encoded in Neural-based Acoustic Models of Automatic Speech Recognition Systems	Quentin Raymondaud, Mickael Rouvier, Richard Dufour et.al.	2402.19443v1	null
2024-02-29	Inappropriate Pause Detection In Dysarthric Speech Using Large-Scale Speech Recognition	Jeehyun Lee, Yerin Choi, Tae-Jin Song, Myoung-Wan Koo et.al.	2402.18923v1	null
2024-06-04	Exploration of Adapter for Noise Robust Automatic Speech Recognition	Hao Shi, Tatsuya Kawahara et.al.	2402.18275v3	null
2024-06-19	Twists, Humps, and Pebbles: Multilingual Speech Recognition Models Exhibit Gender Performance Gaps	Giuseppe Attanasio, Beatrice Savoldi, Dennis Fucci, Dirk Hovy et.al.	2402.17954v2	link
2024-02-27	An Effective Mixture-Of-Experts Approach For Code-Switching Speech Recognition Leveraging Encoder Disentanglement	Tzu-Ting Yang, Hsin-Wei Wang, Yi-Cheng Wang, Chi-Han Lin, Berlin Chen et.al.	2402.17189v1	null
2024-04-01	ArEEG_Chars: Dataset for Envisioned Speech Recognition using EEG for Arabic Characters	Hazem Darwish, Abdalrahman Al Malah, Khloud Al Jallad, Nada Ghneim et.al.	2402.15733v2	null
2024-02-20	How do Hyenas deal with Human Speech? Speech Recognition and Translation with ConfHyena	Marco Gaido, Sara Papi, Matteo Negri, Luisa Bentivogli et.al.	2402.13208v1	link
2024-02-20	Not All Weights Are Created Equal: Enhancing Energy Efficiency in On-Device Streaming Speech Recognition	Yang Li, Yuan Shangguan, Yuhao Wang, Liangzhen Lai, Ernie Chang, Changsheng Zhao, Yangyang Shi, Vikas Chandra et.al.	2402.13076v1	null
2024-02-20	Comparison of Conventional Hybrid and CTC/Attention Decoders for Continuous Visual Speech Recognition	David Gimeno-Gómez, Carlos-D. Martínez-Hinarejos et.al.	2402.13004v1	null
2024-06-16	OWSM-CTC: An Open Encoder-Only Speech Foundation Model for Speech Recognition, Translation, and Language Identification	Yifan Peng, Yui Sudo, Muhammad Shakeel, Shinji Watanabe et.al.	2402.12654v2	null
2024-01-04	AntiDeepFake: AI for Deep Fake Speech Recognition	Enkhtogtokh Togootogtokh, Christian Klasen et.al.	2402.10218v1	null
2024-02-09	Self-consistent context aware conformer transducer for speech recognition	Konstantin Kolokolov, Pavel Pekichev, Karthik Raghunathan et.al.	2402.06592v1	null
2024-02-08	It's Never Too Late: Fusing Acoustic Information into Large Language Models for Automatic Speech Recognition	Chen Chen, Ruizhe Li, Yuchen Hu, Sabato Marco Siniscalchi, Pin-Yu Chen, Ensiong Chng, Chao-Han Huck Yang et.al.	2402.05457v1	null
2023-10-15	Large Vocabulary Spontaneous Speech Recognition for Tigrigna	Ataklti Kahsu, Solomon Teferra et.al.	2402.04254v1	null
2024-02-05	A Comprehensive Study of the Current State-of-the-Art in Nepali Automatic Speech Recognition Systems	Rupak Raj Ghimire, Bal Krishna Bal, Prakash Poudyal et.al.	2402.03050v1	null
2024-02-03	Predicting positive transfer for improved low-resource speech recognition using acoustic pseudo-tokens	Nay San, Georgios Paraskevopoulos, Aryaman Arora, Xiluo He, Prabhjot Kaur, Oliver Adams, Dan Jurafsky et.al.	2402.02302v1	null
2024-02-01	Introduction to speech recognition	Gabriel Dauphin et.al.	2402.01778v1	null
2024-01-31	Exploring the limits of decoder-only models trained on public speech recognition corpora	Ankit Gupta, George Saon, Brian Kingsbury et.al.	2402.00235v1	null
2024-02-08	Computation and Parameter Efficient Multi-Modal Fusion Transformer for Cued Speech Recognition	Lei Liu, Li Liu, Haizhou Li et.al.	2401.17604v2	null
2024-01-28	Byte Pair Encoding Is All You Need For Automatic Bengali Speech Recognition	Ahnaf Mozib Samin et.al.	2401.15532v1	null
2024-01-26	Toward Practical Automatic Speech Recognition and Post-Processing: a Call for Explainable Error Benchmark Guideline	Seonmin Koo, Chanjun Park, Jinsung Kim, Jaehyung Seo, Sugyeong Eo, Hyeonseok Moon, Heuiseok Lim et.al.	2401.14625v1	null
2024-01-19	Contextualized Automatic Speech Recognition with Attention-Based Bias Phrase Boosted Beam Search	Yui Sudo, Muhammad Shakeel, Yosuke Fukumoto, Yifan Peng, Shinji Watanabe et.al.	2401.10449v1	null
2024-01-19	Investigating Training Strategies and Model Robustness of Low-Rank Adaptation for Language Modeling in Speech Recognition	Yu Yu, Chao-Han Huck Yang, Tuan Dinh, Sungho Ryu, Jari Kolehmainen, Roger Ren, Denis Filimonov, Prashanth G. Shivakumar, Ankur Gandhe, Ariya Rastrow, Jia Xu, Ivan Bulyko, Andreas Stolcke et.al.	2401.10447v1	null
2024-01-19	Large Language Models are Efficient Learners of Noise-Robust Speech Recognition	Yuchen Hu, Chen Chen, Chao-Han Huck Yang, Ruizhe Li, Chao Zhang, Pin-Yu Chen, EnSiong Chng et.al.	2401.10446v1	link
2024-01-18	AGADIR: Towards Array-Geometry Agnostic Directional Speech Recognition	Ju Lin, Niko Moritz, Yiteng Huang, Ruiming Xie, Ming Sun, Christian Fuegen, Frank Seide et.al.	2401.10411v1	null
2024-01-18	Multilingual Visual Speech Recognition with a Single Model by Learning with Discrete Visual Speech Units	Minsu Kim, Jeong Hun Yeo, Jeongsoo Choi, Se Jin Park, Yong Man Ro et.al.	2401.09802v1	null
2024-01-18	SlideAVSR: A Dataset of Paper Explanation Videos for Audio-Visual Speech Recognition	Hao Wang, Shuhei Kurita, Shuichiro Shimizu, Daisuke Kawahara et.al.	2401.09759v1	null
2024-01-17	Two-pass Endpoint Detection for Speech Recognition	Anirudh Raju, Aparna Khare, Di He, Ilya Sklyar, Long Chen, Sam Alptekin, Viet Anh Trinh, Zhe Zhang, Colin Vaz, Venkatesh Ravichandran, Roland Maas, Ariya Rastrow et.al.	2401.08916v1	null
2024-01-15	SeMaScore: a new evaluation metric for automatic speech recognition tasks	Zitha Sasindran, Harsha Yelchuri, T. V. Prabhakar et.al.	2401.07506v1	null
2024-01-13	Joint Unsupervised and Supervised Training for Automatic Speech Recognition via Bilevel Optimization	A F M Saif, Xiaodong Cui, Han Shen, Songtao Lu, Brian Kingsbury, Tianyi Chen et.al.	2401.06980v1	link
2024-02-29	The NPU-ASLP-LiAuto System Description for Visual Speech Recognition in CNVSRC 2023	He Wang, Pengcheng Guo, Wei Chen, Pan Zhou, Lei Xie et.al.	2401.06788v2	link
2024-01-12	Dynamic Behaviour of Connectionist Speech Recognition with Strong Latency Constraints	Giampiero Salvi et.al.	2401.06588v1	null
2024-01-12	LCB-net: Long-Context Biasing for Audio-Visual Speech Recognition	Fan Yu, Haoxu Wang, Xian Shi, Shiliang Zhang et.al.	2401.06390v1	link
2024-01-11	UCorrect: An Unsupervised Framework for Automatic Speech Recognition Error Correction	Jiaxin Guo, Minghan Wang, Xiaosong Qiao, Daimeng Wei, Hengchao Shang, Zongyao Li, Zhengzhe Yu, Yinglu Li, Chang Su, Min Zhang, Shimin Tao, Hao Yang et.al.	2401.05689v1	null
2024-01-10	Useful Blunders: Can Automated Speech Recognition Errors Improve Downstream Dementia Classification?	Changye Li, Weizhe Xu, Trevor Cohen, Serguei Pakhomov et.al.	2401.05551v1	null
2024-01-09	Continuously Learning New Words in Automatic Speech Recognition	Christian Huber, Alexander Waibel et.al.	2401.04482v1	null
2024-01-08	Cross-Speaker Encoding Network for Multi-Talker Speech Recognition	Jiawen Kang, Lingwei Meng, Mingyu Cui, Haohan Guo, Xixin Wu, Xunying Liu, Helen Meng et.al.	2401.04152v1	null
2024-02-21	ICMC-ASR: The ICASSP 2024 In-Car Multi-Channel Automatic Speech Recognition Challenge	He Wang, Pengcheng Guo, Yue Li, Ao Zhang, Jiayao Sun, Lei Xie, Wei Chen, Pan Zhou, Hui Bu, Xin Xu, Binbin Zhang, Zhuo Chen, Jian Wu, Longbiao Wang, Eng Siong Chng, Sun Li et.al.	2401.03473v3	null
2024-04-08	MLCA-AVSR: Multi-Layer Cross Attention Fusion based Audio-Visual Speech Recognition	He Wang, Pengcheng Guo, Pan Zhou, Lei Xie et.al.	2401.03424v3	null
2024-01-05	A unified multichannel far-field speech recognition system: combining neural beamforming with attention based end-to-end model	Dongdi Zhao, Jianbo Ma, Lu Lu, Jinke Li, Xuan Ji, Lei Zhu, Fuming Fang, Ming Liu, Feijun Jiang et.al.	2401.02673v1	null
2024-01-04	Task Oriented Dialogue as a Catalyst for Self-Supervised Automatic Speech Recognition	David M. Chan, Shalini Ghosh, Hitesh Tulsiani, Ariya Rastrow, Björn Hoffmeister et.al.	2401.02417v1	link
2024-01-04	CTC Blank Triggered Dynamic Layer-Skipping for Efficient CTC-based Speech Recognition	Junfeng Hou, Peiyao Wang, Jincheng Zhang, Meng Yang, Minwei Feng, Jingcheng Yin et.al.	2401.02046v1	null
2024-01-03	Hallucinations in Neural Automatic Speech Recognition: Identifying Errors and Hallucinatory Models	Rita Frieske, Bertram E. Shi et.al.	2401.01572v1	null
2024-01-01	Enhancing Pre-trained ASR System Fine-tuning for Dysarthric Speech Recognition using Adversarial Data Augmentation	Huimeng Wang, Zengrui Jin, Mengzhe Geng, Shujie Hu, Guinan Li, Tianzi Wang, Haoning Xu, Xunying Liu et.al.	2401.00662v1	null
2024-05-02	Stateful Conformer with Cache-based Inference for Streaming Automatic Speech Recognition	Vahid Noroozi, Somshubra Majumdar, Ankur Kumar, Jagadeesh Balam, Boris Ginsburg et.al.	2312.17279v3	null
2023-12-22	BLSTM-Based Confidence Estimation for End-to-End Speech Recognition	Atsunori Ogawa, Naohiro Tawara, Takatomo Kano, Marc Delcroix et.al.	2312.14609v1	null
2024-02-09	Multimodal Attention Merging for Improved Speech Recognition and Audio Event Classification	Anirudh S. Sundar, Chao-Han Huck Yang, David M. Chan, Shalini Ghosh, Venkatesh Ravichandran, Phani Sankar Nidadavolu et.al.	2312.14378v2	null
2023-12-21	BANSpEmo: A Bangla Emotional Speech Recognition Dataset	Md Gulzar Hussain, Mahmuda Rahman, Babe Sultana, Ye Shiren et.al.	2312.14020v1	null
2023-12-20	Stable Distillation: Regularizing Continued Pre-training for Low-Resource Automatic Speech Recognition	Ashish Seth, Sreyan Ghosh, S. Umesh, Dinesh Manocha et.al.	2312.12783v1	link
2024-01-11	Automated speech audiometry: Can it work using open-source pre-trained Kaldi-NL automatic speech recognition?	Gloria Araiza-Illan, Luke Meyer, Khiet P. Truong, Deniz Baskent et.al.	2312.12269v2	null
2023-12-18	Improved Long-Form Speech Recognition by Jointly Modeling the Primary and Non-primary Speakers	Guru Prakash Arumugam, Shuo-yiin Chang, Tara N. Sainath, Rohit Prabhavalkar, Quan Wang, Shaan Bijwadia et.al.	2312.11123v1	null
2023-12-18	Speaker Mask Transformer for Multi-talker Overlapped Speech Recognition	Peng Shen, Xugang Lu, Hisashi Kawai et.al.	2312.10959v1	null
2024-05-13	Conformer-Based Speech Recognition On Extreme Edge-Computing Devices	Mingbin Xu, Alex Jin, Sicheng Wang, Mu Su, Tim Ng, Henry Mason, Shiyi Han, Zhihong Lei, Yaqiao Deng, Zhen Huang, Mahesh Krishnamoorthy et.al.	2312.10359v3	null
2023-12-19	On Robustness to Missing Video for Audiovisual Speech Recognition	Oscar Chang, Otavio Braga, Hank Liao, Dmitriy Serdyuk, Olivier Siohan et.al.	2312.10088v2	null
2023-12-19	Revisiting the Entropy Semiring for Neural Speech Recognition	Oscar Chang, Dongseong Hwang, Olivier Siohan et.al.	2312.10087v2	null
2023-12-15	On the compression of shallow non-causal ASR models using knowledge distillation and tied-and-reduced decoder for low-latency on-device speech recognition	Nagaraj Adiga, Jinhwan Park, Chintigari Shiva Kumar, Shatrughan Singh, Kyungmin Lee, Chanwoo Kim, Dhananjaya Gowda et.al.	2312.09842v1	null
2023-12-15	Automatic channel selection and spatial feature integration for multi-channel speech recognition across various array topologies	Bingshen Mu, Pengcheng Guo, Dake Guo, Pan Zhou, Wei Chen, Lei Xie et.al.	2312.09746v1	null
2023-12-15	LiteVSR: Efficient Visual Speech Recognition by Learning from Speech Representations of Unlabeled Data	Hendrik Laux, Emil Mededovic, Ahmed Hallawa, Lukas Martin, Arne Peine, Anke Schmeink et.al.	2312.09727v1	null
2023-12-15	Leveraging Language ID to Calculate Intermediate CTC Loss for Enhanced Code-Switching Speech Recognition	Tzu-Ting Yang, Hsin-Wei Wang, Berlin Chen et.al.	2312.09583v1	null
2023-12-15	IR-UWB Radar-Based Contactless Silent Speech Recognition of Vowels, Consonants, Words, and Phrases	Sunghwa Lee, Younghoon Shin, Myungjong Kim, Jiwon Seo et.al.	2312.09572v1	null
2024-01-12	Attention-Guided Adaptation for Code-Switching Speech Recognition	Bobbi Aditya, Mahdin Rohmatillah, Liang-Hsuan Tai, Jen-Tzung Chien et.al.	2312.08856v2	null
2023-12-14	Hourglass-AVSR: Down-Up Sampling-based Computational Efficiency Model for Audio-Visual Speech Recognition	Fan Yu, Haoxu Wang, Ziyang Ma, Shiliang Zhang et.al.	2312.08850v1	null
2023-12-14	Towards Automatic Data Augmentation for Disordered Speech Recognition	Zengrui Jin, Xurong Xie, Tianzi Wang, Mengzhe Geng, Jiajun Deng, Guinan Li, Shujie Hu, Xunying Liu et.al.	2312.08641v1	null
2023-12-13	PhasePerturbation: Speech Data Augmentation via Phase Perturbation for Automatic Speech Recognition	Chengxi Lei, Satwinder Singh, Feng Hou, Xiaoyun Jia, Ruili Wang et.al.	2312.08571v1	null
2024-01-16	USM-Lite: Quantization and Sparsity Aware Fine-tuning for Speech Recognition with Universal Speech Models	Shaojin Ding, David Qiu, David Rim, Yanzhang He, Oleg Rybakov, Bo Li, Rohit Prabhavalkar, Weiran Wang, Tara N. Sainath, Zhonglin Han, Jian Li, Amir Yazdanbakhsh, Shivani Agrawal et.al.	2312.08553v3	null
2023-12-11	Deep Photonic Reservoir Computer for Speech Recognition	Enrico Picco, Alessandro Lupo, Serge Massar et.al.	2312.06558v1	null
2023-12-06	An Integration of Pre-Trained Speech and Language Models for End-to-End Speech Recognition	Yukiya Hono, Koh Mitsuda, Tianyu Zhao, Kentaro Mitsui, Toshiaki Wakatsuki, Kei Sawada et.al.	2312.03668v1	null
2023-11-29	FAT-HuBERT: Front-end Adaptive Training of Hidden-unit BERT for Distortion-Invariant Robust Speech Recognition	Dongning Yang, Wei Wang, Yanmin Qian et.al.	2311.17790v1	null
2023-11-29	Adapting OpenAI's Whisper for Speech Recognition on Code-Switch Mandarin-English SEAME and ASRU2019 Datasets	Yuhang Yang, Yizhou Peng, Xionghu Zhong, Hao Huang, Eng Siong Chng et.al.	2311.17382v1	null
2023-11-25	Multilingual self-supervised speech representations improve the speech recognition of low-resource African languages with codeswitching	Tolúlopé Ògúnrèmí, Christopher D. Manning, Dan Jurafsky et.al.	2311.15077v1	null
2023-11-21	Speaker-Adapted End-to-End Visual Speech Recognition for Continuous Spanish	David Gimeno-Gómez, Carlos-D. Martínez-Hinarejos et.al.	2311.12480v1	null
2023-11-20	How does end-to-end speech recognition training impact speech enhancement artifacts?	Kazuma Iwamoto, Tsubasa Ochiai, Marc Delcroix, Rintaro Ikeshita, Hiroshi Sato, Shoko Araki, Shigeru Katagiri et.al.	2311.11599v1	null
2023-11-19	Label-Synchronous Neural Transducer for Adaptable Online E2E Speech Recognition	Keqi Deng, Philip C. Woodland et.al.	2311.11353v1	null
2023-11-17	GhostVec: A New Threat to Speaker Privacy of End-to-End Speech Recognition System	Xiaojiao Chen, Sheng Li, Jiyi Li, Hao Huang, Yang Cao, Liang He et.al.	2311.10689v1	null
2023-11-09	Improving Whispered Speech Recognition Performance using Pseudo-whispered based Data Augmentation	Zhaofeng Lin, Tanvina Patel, Odette Scharenborg et.al.	2311.05179v1	link
2023-11-08	GPU-Accelerated WFST Beam Search Decoder for CTC-based Speech Recognition	Daniel Galvez, Tim Kaldewey et.al.	2311.04996v1	link
2023-11-07	A comparative analysis between Conformer-Transducer, Whisper, and wav2vec2 for improving the child speech recognition	Andrei Barcovschi, Rishabh Jain, Peter Corcoran et.al.	2311.04936v1	link
2023-11-07	Fine-tuning convergence model in Bengali speech recognition	Zhu Ruiying, Shen Meng et.al.	2311.04122v1	null
2023-11-06	Pseudo-Labeling for Domain-Agnostic Bangla Automatic Speech Recognition	Rabindra Nath Nandi, Mehadi Hasan Menon, Tareq Al Muntasir, Sagor Sarker, Quazi Sarwar Muhtaseem, Md. Tariqul Islam, Shammur Absar Chowdhury, Firoj Alam et.al.	2311.03196v1	link
2023-10-20	Intelligibility prediction with a pretrained noise-robust automatic speech recognition model	Zehai Tu, Ning Ma, Jon Barker et.al.	2310.19817v1	null
2023-10-29	MUST: A Multilingual Student-Teacher Learning approach for low-resource speech recognition	Muhammad Umar Farooq, Rehan Ahmad, Thomas Hain et.al.	2310.18865v1	null
2023-10-27	MixRep: Hidden Representation Mixup for Low-Resource Speech Recognition	Jiamin Xie, John H. L. Hansen et.al.	2310.18450v1	link
2023-10-27	TorchAudio 2.1: Advancing speech recognition, self-supervised learning, and audio processing components for PyTorch	Jeff Hwang, Moto Hira, Caroline Chen, Xiaohui Zhang, Zhaoheng Ni, Guangzhi Sun, Pingchuan Ma, Ruizhe Huang, Vineel Pratap, Yuekai Zhang, Anurag Kumar, Chin-Yun Yu, Chuang Zhu, Chunxi Liu, Jacob Kahn, Mirco Ravanelli, Peng Sun, Shinji Watanabe, Yangyang Shi, Yumeng Tao, Robin Scheibler, Samuele Cornell, Sean Kim, Stavros Petridis et.al.	2310.17864v1	link
2023-10-25	Back Transcription as a Method for Evaluating Robustness of Natural Language Understanding Models to Speech Recognition Errors	Marek Kubis, Paweł Skórzewski, Marcin Sowański, Tomasz Ziętkiewicz et.al.	2310.16609v1	link
2023-10-27	Accented Speech Recognition With Accent-specific Codebooks	Darshan Prabhu, Preethi Jyothi, Sriram Ganapathy, Vinit Unni et.al.	2310.15970v3	link
2023-10-28	Key Frame Mechanism For Efficient Conformer Based End-to-end Speech Recognition	Peng Fan, Changhao Shan, Sining Sun, Qing Yang, Jianwei Zhang et.al.	2310.14954v2	link
2023-10-23	Intuitive Multilingual Audio-Visual Speech Recognition with a Single-Trained Model	Joanna Hong, Se Jin Park, Yong Man Ro et.al.	2310.14946v1	null
2023-10-22	Conversational Speech Recognition by Learning Audio-textual Cross-modal Contextual Representation	Kun Wei, Bei Li, Hang Lv, Quan Lu, Ning Jiang, Lei Xie et.al.	2310.14278v1	null
2023-10-17	Audio-AdapterFusion: A Task-ID-free Approach for Efficient and Non-Destructive Multi-task Speech Recognition	Hillary Ngai, Rohan Agrawal, Neeraj Gaur, Ronny Huang, Parisa Haghani, Pedro Moreno Mengibar et.al.	2310.13015v1	null
2023-10-17	Generative error correction for code-switching speech recognition using large language models	Chen Chen, Yuchen Hu, Chao-Han Huck Yang, Hexin Liu, Sabato Marco Siniscalchi, Eng Siong Chng et.al.	2310.13013v1	null
2023-10-17	Multi-stage Large Language Model Correction for Speech Recognition	Jie Pu, Thai-Son Nguyen, Sebastian Stüker et.al.	2310.11532v1	null
2024-03-05	Zipformer: A faster and better encoder for automatic speech recognition	Zengwei Yao, Liyong Guo, Xiaoyu Yang, Wei Kang, Fangjun Kuang, Yifan Yang, Zengrui Jin, Long Lin, Daniel Povey et.al.	2310.11230v3	link
2023-10-27	VoxArabica: A Robust Dialect-Aware Arabic Speech Recognition System	Abdul Waheed, Bashar Talafha, Peter Sullivan, AbdelRahim Elmadany, Muhammad Abdul-Mageed et.al.	2310.11069v4	null
2023-10-17	Iterative Shallow Fusion of Backward Language Model for End-to-End Speech Recognition	Atsunori Ogawa, Takafumi Moriya, Naoyuki Kamo, Naohiro Tawara, Marc Delcroix et.al.	2310.11010v1	null
2023-10-17	Advanced accent/dialect identification and accentedness assessment with multi-embedding models and automatic speech recognition	Shahram Ghorbani, John H. L. Hansen et.al.	2310.11004v1	null
2023-10-17	Correction Focused Language Model Training for Speech Recognition	Yingyi Ma, Zhe Liu, Ozlem Kalinli et.al.	2310.11003v1	null
2023-10-16	Personalization of CTC-based End-to-End Speech Recognition Using Pronunciation-Driven Subword Tokenization	Zhihong Lei, Ernest Pusateri, Shiyi Han, Leo Liu, Mingbin Xu, Tim Ng, Ruchir Travadi, Youyuan Zhang, Mirko Hannemann, Man-Hung Siu, Zhen Huang et.al.	2310.09988v1	null
2024-03-04	Improved Contextual Recognition In Automatic Speech Recognition Systems By Semantic Lattice Rescoring	Ankitha Sudarshan, Vinay Samuel, Parth Patwa, Ibtihel Amara, Aman Chadha et.al.	2310.09680v4	null
2023-10-13	SALM: Speech-augmented Language Model with In-context Learning for Speech Recognition and Translation	Zhehuai Chen, He Huang, Andrei Andrusenko, Oleksii Hrinchuk, Krishna C. Puvvada, Jason Li, Subhankar Ghosh, Jagadeesh Balam, Boris Ginsburg et.al.	2310.09424v1	link
2023-10-12	On the Relevance of Phoneme Duration Variability of Synthesized Training Data for Automatic Speech Recognition	Nick Rossenbach, Benedikt Hilmes, Ralf Schlüter et.al.	2310.08132v1	null
2023-10-10	Acoustic Model Fusion for End-to-end Speech Recognition	Zhihong Lei, Mingbin Xu, Shiyi Han, Leo Liu, Zhen Huang, Tim Ng, Yuanyuan Zhang, Ernest Pusateri, Mirko Hannemann, Yaqiao Deng, Man-Hung Siu et.al.	2310.07062v1	null
2023-10-10	No Pitch Left Behind: Addressing Gender Unbalance in Automatic Speech Recognition through Pitch Manipulation	Dennis Fucci, Marco Gaido, Matteo Negri, Mauro Cettolo, Luisa Bentivogli et.al.	2310.06590v1	link
2023-10-16	Whispering LLaMA: A Cross-Modal Generative Error Correction Framework for Speech Recognition	Srijith Radhakrishnan, Chao-Han Huck Yang, Sumeer Ahmad Khan, Rohit Kumar, Narsis A. Kiani, David Gomez-Cabrero, Jesper N. Tegner et.al.	2310.06434v2	link
2023-10-10	Discriminative Speech Recognition Rescoring with Pre-trained Language Models	Prashanth Gurunath Shivakumar, Jari Kolehmainen, Yile Gu, Ankur Gandhe, Ariya Rastrow, Ivan Bulyko et.al.	2310.06248v1	null
2023-10-07	Spike-Triggered Contextual Biasing for End-to-End Mandarin Speech Recognition	Kaixun Huang, Ao Zhang, Binbin Zhang, Tianyi Xu, Xingchen Song, Lei Xie et.al.	2310.04657v1	null
2023-12-15	Dementia Assessment Using Mandarin Speech with an Attention-based Speech Recognition Encoder	Zih-Jyun Lin, Yi-Ju Chen, Po-Chih Kuo, Likai Huang, Chaur-Jong Hu, Cheng-Yu Chen et.al.	2310.03985v2	link
2023-10-06	The North System for Formosa Speech Recognition Challenge 2023	Li-Wei Chen, Kai-Chen Cheng, Hung-Shin Lee et.al.	2310.03443v2	null
2023-10-05	Neural Language Model Pruning for Automatic Speech Recognition	Leonardo Emili, Thiago Fraga-Silva, Ernest Pusateri, Markus Nußbaum-Thom, Youssef Oualil et.al.	2310.03424v1	null
2023-10-08	BA-MoE: Boundary-Aware Mixture-of-Experts Adapter for Code-Switching Speech Recognition	Peikun Chen, Fan Yu, Yuhao Lian, Hongfei Xue, Xucheng Wan, Naijun Zheng, Huan Zhou, Lei Xie et.al.	2310.02629v2	null
2023-10-03	Unsupervised Speech Recognition with N-Skipgram and Positional Unigram Matching	Liming Wang, Mark Hasegawa-Johnson, Chang D. Yoo et.al.	2310.02382v1	link
2023-10-02	One model to rule them all? Towards End-to-End Joint Speaker Diarization and Speech Recognition	Samuele Cornell, Jee-weon Jung, Shinji Watanabe, Stefano Squartini et.al.	2310.01688v1	null
2023-09-29	Federated Learning with Differential Privacy for End-to-End Speech Recognition	Martin Pelikan, Sheikh Shams Azam, Vitaly Feldman, Jan "Honza" Silovsky, Kunal Talwar, Tatiana Likhomanenko et.al.	2310.00098v1	null
2023-09-29	AV-CPL: Continuous Pseudo-Labeling for Audio-Visual Speech Recognition	Andrew Rouditchenko, Ronan Collobert, Tatiana Likhomanenko et.al.	2309.17395v1	null
2023-09-29	Enhancing Code-switching Speech Recognition with Interactive Language Biases	Hexin Liu, Leibny Paola Garcia, Xiangyu Zhang, Andy W. H. Khong, Sanjeev Khudanpur et.al.	2309.16953v1	null
2023-09-29	SSHR: Leveraging Self-supervised Hierarchical Representations for Multilingual Automatic Speech Recognition	Hongfei Xue, Qijie Shao, Kaixun Huang, Peikun Chen, Lei Xie, Jie Liu et.al.	2309.16937v1	null
2023-09-26	Unsupervised Pre-Training for Vietnamese Automatic Speech Recognition in the HYKIST Project	Khai Le-Duc et.al.	2309.15869v1	null
2023-09-27	Exploring Speech Recognition, Translation, and Understanding with Discrete Speech Units: A Comparative Study	Xuankai Chang, Brian Yan, Kwanghee Choi, Jeeweon Jung, Yichen Lu, Soumi Maiti, Roshan Sharma, Jiatong Shi, Jinchuan Tian, Shinji Watanabe, Yuya Fujita, Takashi Maekaku, Pengcheng Guo, Yao-Fei Cheng, Pavel Denisov, Kohei Saijo, Hsiu-Hsuan Wang et.al.	2309.15800v1	null
2023-09-26	Learning from Flawed Data: Weakly Supervised Automatic Speech Recognition	Dongji Gao, Hainan Xu, Desh Raj, Leibny Paola Garcia Perera, Daniel Povey, Sanjeev Khudanpur et.al.	2309.15796v1	link
2023-10-16	HyPoradise: An Open Baseline for Generative Speech Recognition with Large Language Models	Chen Chen, Yuchen Hu, Chao-Han Huck Yang, Sabato Marco Siniscalchi, Pin-Yu Chen, Eng Siong Chng et.al.	2309.15701v2	link
2023-10-10	Generative Speech Recognition Error Correction with Large Language Models and Task-Activating Prompting	Chao-Han Huck Yang, Yile Gu, Yi-Chieh Liu, Shalini Ghosh, Ivan Bulyko, Andreas Stolcke et.al.	2309.15649v2	null
2023-10-10	Low-rank Adaptation of Large Language Model Rescoring for Parameter-Efficient Speech Recognition	Yu Yu, Chao-Han Huck Yang, Jari Kolehmainen, Prashanth G. Shivakumar, Yile Gu, Sungho Ryu, Roger Ren, Qi Luo, Aditya Gourav, I-Fan Chen, Yi-Chieh Liu, Tuan Dinh, Ankur Gandhe, Denis Filimonov, Shalini Ghosh, Andreas Stolcke, Ariya Rastrow, Ivan Bulyko et.al.	2309.15223v2	null
2023-09-26	Updated Corpora and Benchmarks for Long-Form Speech Recognition	Jennifer Drexler Fox, Desh Raj, Natalie Delworth, Quinn McNamara, Corey Miller, Migüel Jetté et.al.	2309.15013v1	link
2023-09-25	On the Impact of Quantization and Pruning of Self-Supervised Speech Models for Downstream Speech Recognition Tasks "In-the-Wild"	Arthur Pimentel, Heitor Guimarães, Anderson R. Avila, Mehdi Rezagholizadeh, Tiago H. Falk et.al.	2309.14462v1	null
2023-09-21	Bridging the Gaps of Both Modality and Language: Synchronous Bilingual CTC for Speech Translation and Speech Recognition	Chen Xu, Xiaoqian Liu, Erfeng He, Yuhao Zhang, Qianqian Dong, Tong Xiao, Jingbo Zhu, Dapeng Man, Wu Yang et.al.	2309.12234v1	link
2024-01-08	Sparsely Shared LoRA on Whisper for Child Speech Recognition	Wei Liu, Ying Qin, Zhiyuan Peng, Tan Lee et.al.	2309.11756v2	null
2023-09-20	AudioFool: Fast, Universal and synchronization-free Cross-Domain Attack on Speech Recognition	Mohamad Fakih, Rouwaida Kanj, Fadi Kurdahi, Mohammed E. Fouda et.al.	2309.11462v1	null
2023-09-25	Leveraging Data Collection and Unsupervised Learning for Code-switched Tunisian Arabic Automatic Speech Recognition	Ahmed Amine Ben Abdallah, Ata Kabboudi, Amir Kanoun, Salah Zaiem et.al.	2309.11327v2	null
2023-09-20	Directional Source Separation for Robust Speech Recognition on Smart Glasses	Tiantian Feng, Ju Lin, Yiteng Huang, Weipeng He, Kaustubh Kalgaonkar, Niko Moritz, Li Wan, Xin Lei, Ming Sun, Frank Seide et.al.	2309.10993v1	null
2023-09-19	Discrete Audio Representation as an Alternative to Mel-Spectrograms for Speaker and Speech Recognition	Krishna C. Puvvada, Nithin Rao Koluguri, Kunal Dhawan, Jagadeesh Balam, Boris Ginsburg et.al.	2309.10922v1	null
2023-09-19	End-to-End Speech Recognition Contextualization with Large Language Models	Egor Lakomkin, Chunyang Wu, Yassir Fathullah, Ozlem Kalinli, Michael L. Seltzer, Christian Fuegen et.al.	2309.10917v1	null
2023-09-19	Harnessing the Zero-Shot Power of Instruction-Tuned Large Language Model in End-to-End Speech Recognition	Yosuke Higuchi, Tetsuji Ogawa, Tetsunori Kobayashi et.al.	2309.10524v1	null
2023-09-16	Improving Speech Recognition for African American English With Audio Classification	Shefali Garg, Zhouyuan Huo, Khe Chai Sim, Suzan Schwartz, Mason Chua, Alëna Aksënova, Tsendsuren Munkhdalai, Levi King, Darryl Wright, Zion Mengesha, Dongseong Hwang, Tara Sainath, Françoise Beaufays, Pedro Moreno Mengibar et.al.	2309.09996v1	null
2023-09-18	Instruction-Following Speech Recognition	Cheng-I Jeff Lai, Zhiyun Lu, Liangliang Cao, Ruoming Pang et.al.	2309.09843v1	null
2023-09-18	Training dynamic models using early exits for automatic speech recognition on resource-constrained devices	George August Wright, Umberto Cappellazzo, Salah Zaiem, Desh Raj, Lucas Ondel Yang, Daniele Falavigna, Alessio Brutti et.al.	2309.09546v1	null
2023-09-19	Enhancing Multilingual Speech Recognition through Language Prompt Tuning and Frame-Level Language Adapter	Song Li, Yongbin You, Xuezhi Wang, Ke Ding, Guanglu Wan et.al.	2309.09443v2	null
2023-09-18	Are Soft Prompts Good Zero-shot Learners for Speech Recognition?	Dianwen Ng, Chong Zhang, Ruixi Zhang, Yukun Ma, Fabian Ritter-Gutierrez, Trung Hieu Nguyen, Chongjia Ni, Shengkui Zhao, Eng Siong Chng, Bin Ma et.al.	2309.09413v1	null
2023-09-16	Decoder-only Architecture for Speech Recognition with CTC Prompts and Text Data Augmentation	Emiru Tsunoo, Hayato Futami, Yosuke Kashiwagi, Siddhant Arora, Shinji Watanabe et.al.	2309.08876v1	null
2023-12-27	Augmenting conformers with structured state-space sequence models for online speech recognition	Haozhe Shan, Albert Gu, Zhong Meng, Weiran Wang, Krzysztof Choromanski, Tara Sainath et.al.	2309.08551v2	null
2023-09-15	Visual Speech Recognition for Low-resource Languages with Automatic Labels From Whisper Model	Jeong Hun Yeo, Minsu Kim, Shinji Watanabe, Yong Man Ro et.al.	2309.08535v1	link
2023-09-15	Chunked Attention-based Encoder-Decoder Model for Streaming Speech Recognition	Mohammad Zeineldeen, Albert Zeyer, Ralf Schlüter, Hermann Ney et.al.	2309.08436v1	null
2023-09-15	Unimodal Aggregation for CTC-based Speech Recognition	Ying Fang, Xiaofei Li et.al.	2309.08150v1	link
2023-09-21	Folding Attention: Memory and Power Optimization for On-Device Transformer-based Streaming Speech Recognition	Yang Li, Liangzhen Lai, Yuan Shangguan, Forrest N. Iandola, Ernie Chang, Yangyang Shi, Vikas Chandra et.al.	2309.07988v2	null
2023-09-18	Voxtlm: unified decoder-only models for consolidating speech recognition/synthesis and speech/text continuation tasks	Soumi Maiti, Yifan Peng, Shukjae Choi, Jee-weon Jung, Xuankai Chang, Shinji Watanabe et.al.	2309.07937v2	null
2023-09-18	Kid-Whisper: Towards Bridging the Performance Gap in Automatic Speech Recognition for Children VS. Adults	Ahmed Adel Attia, Jing Liu, Wei Ai, Dorottya Demszky, Carol Espy-Wilson et.al.	2309.07927v2	null
2023-09-21	CPPF: A contextual and post-processing-free model for automatic speech recognition	Lei Zhang, Zhengkun Tian, Xiang Chen, Jiaming Sun, Hongyu Xiang, Ke Ding, Guanglu Wan et.al.	2309.07413v2	null
2023-09-09	Mask-CTC-based Encoder Pre-training for Streaming End-to-End Speech Recognition	Huaibo Zhao, Yosuke Higuchi, Yusuke Kida, Tetsuji Ogawa, Tetsunori Kobayashi et.al.	2309.04654v1	null
2023-09-08	End-to-End Speech Recognition and Disfluency Removal with Acoustic Language Model Pretraining	Saksham Bassi, Giulio Duregon, Siddhartha Jalagam, David Roth et.al.	2309.04516v1	link
2023-10-07	Text-Only Domain Adaptation for End-to-End Speech Recognition through Down-Sampling Acoustic Representation	Jiaxu Zhu, Weinan Tong, Yaoxun Xu, Changhe Song, Zhiyong Wu, Zhao You, Dan Su, Dong Yu, Helen Meng et.al.	2309.02459v2	null
2023-09-05	Bring the Noise: Introducing Noise Robustness to Pretrained Automatic Speech Recognition	Patrick Eickhoff, Matthias Möller, Theresa Pekarek Rosin, Johannes Twiefel, Stefan Wermter et.al.	2309.02145v1	null
2023-10-07	SememeASR: Boosting Performance of End-to-End Speech Recognition against Domain and Long-Tailed Data Shift with Sememe Semantic Knowledge	Jiaxu Zhu, Changhe Song, Zhiyong Wu, Helen Meng et.al.	2309.01437v2	null
2023-09-01	OpenIns3D: Snap and Lookup for 3D Open-vocabulary Instance Segmentation	Zhening Huang, Xiaoyang Wu, Xi Chen, Hengshuang Zhao, Lei Zhu, Joan Lasenby et.al.	2309.00616v1	link
2023-09-01	Point-Bind & Point-LLM: Aligning Point Cloud with Multi-modality for 3D Understanding, Generation, and Instruction Following	Ziyu Guo, Renrui Zhang, Xiangyang Zhu, Yiwen Tang, Xianzheng Ma, Jiaming Han, Kexin Chen, Peng Gao, Xianzhi Li, Hongsheng Li, Pheng-Ann Heng et.al.	2309.00615v1	link
2023-09-01	Iterative Multi-granular Image Editing using Diffusion Models	K J Joseph, Prateksha Udhayanan, Tripti Shukla, Aishwarya Agarwal, Srikrishna Karanam, Koustava Goswami, Balaji Vasan Srinivasan et.al.	2309.00613v1	null
2023-09-01	CityDreamer: Compositional Generative Model of Unbounded 3D Cities	Haozhe Xie, Zhaoxi Chen, Fangzhou Hong, Ziwei Liu et.al.	2309.00610v1	null
2023-09-01	Time Series Analysis of Urban Liveability	Alex Levering, Diego Marcos, Devis Tuia et.al.	2309.00594v1	null
2023-09-01	Discrete Morphological Neural Networks	Diego Marcondes, Junior Barrera et.al.	2309.00588v1	link
2023-09-01	Mechanism of feature learning in convolutional neural networks	Daniel Beaglehole, Adityanarayanan Radhakrishnan, Parthe Pandit, Mikhail Belkin et.al.	2309.00570v1	link
2023-09-01	Amyloid-Beta Axial Plane PET Synthesis from Structural MRI: An Image Translation Approach for Screening Alzheimer's Disease	Fernando Vega, Abdoljalil Addeh, M. Ethan MacDonald et.al.	2309.00569v1	null
2023-09-01	Impact of Image Context for Single Deep Learning Face Morphing Attack Detection	Joana Pimenta, Iurii Medvedev, Nuno Gonçalves et.al.	2309.00549v1	null
2023-09-01	Trust your Good Friends: Source-free Domain Adaptation by Reciprocal Neighborhood Clustering	Shiqi Yang, Yaxing Wang, Joost van de Weijer, Luis Herranz, Shangling Jui, Jian Yang et.al.	2309.00528v1	null
2023-09-01	SQLdepth: Generalizable Self-Supervised Fine-Structured Monocular Depth Estimation	Youhong Wang, Yunji Liang, Hao Xu, Shaohui Jiao, Hongkai Yu et.al.	2309.00526v1	null
2023-09-01	A Machine Vision Method for Correction of Eccentric Error: Based on Adaptive Enhancement Algorithm	Fanyi Wang, Pin Cao, Yihui Zhang, Haotian Hu, Yongying Yang et.al.	2309.00514v1	null
2023-09-01	Multi-stage Deep Learning Artifact Reduction for Computed Tomography	Jiayang Shi, Daniel M. Pelt, K. Joost Batenburg et.al.	2309.00494v1	null
2023-09-01	Asymmetric double-winged multi-view clustering network for exploring Diverse and Consistent Information	Qun Zheng, Xihong Yang, Siwei Wang, Xinru An, Qi Liu et.al.	2309.00474v1	null
2023-09-01	General and Practical Tuning Method for Off-the-Shelf Graph-Based Index: SISAP Indexing Challenge Report by Team UTokyo	Yutaro Oguri, Yusuke Matsui et.al.	2309.00472v1	link
2023-09-01	An Improved Encoder-Decoder Framework for Food Energy Estimation	Jack Ma, Jiangpeng He, Fengqing Zhu et.al.	2309.00468v1	null
2023-09-01	A Theoretical and Practical Framework for Evaluating Uncertainty Calibration in Object Detection	Pedro Conde, Rui L. Lopes, Cristiano Premebida et.al.	2309.00464v1	link
2023-09-01	dacl10k: Benchmark for Semantic Bridge Damage Segmentation	Johannes Flotzinger, Philipp J. Rösch, Thomas Braml et.al.	2309.00460v1	null
2023-09-01	Unsupervised bias discovery in medical image segmentation	Nicolás Gaggion, Rodrigo Echeveste, Lucas Mansilla, Diego H. Milone, Enzo Ferrante et.al.	2309.00451v1	null
2023-09-01	Improving the matching of deformable objects by learning to detect keypoints	Felipe Cadar, Welerson, Vaishnavi Kanagasabapathi, Guilherme Potje, Renato Martins, Erickson R. Nascimento et.al.	2309.00434v1	null
2023-09-01	CPSP: Learning Speech Concepts From Phoneme Supervision	Chunyu Qiang, Hao Li, Yixin Tian, Ruibo Fu, Tao Wang, Longbiao Wang, Jianwu Dang et.al.	2309.00424v1	null
2023-09-01	Selective Scene Text Removal	Hayato Mitani, Akisato Kimura, Seiichi Uchida et.al.	2309.00410v1	null
2023-09-01	Fine-grained Recognition with Learnable Semantic Data Augmentation	Yifan Pu, Yizeng Han, Yulin Wang, Junlan Feng, Chao Deng, Gao Huang et.al.	2309.00399v1	null
2023-09-01	VideoGen: A Reference-Guided Latent Diffusion Approach for High Definition Text-to-Video Generation	Xin Li, Wenqing Chu, Ye Wu, Weihang Yuan, Fanglong Liu, Qi Zhang, Fu Li, Haocheng Feng, Errui Ding, Jingdong Wang et.al.	2309.00398v1	null
2023-09-01	Dense Voxel 3D Reconstruction Using a Monocular Event Camera	Haodong Chen, Vera Chung, Li Tan, Xiaoming Chen et.al.	2309.00385v1	null
2023-09-01	Long-Term Memorability On Advertisements	Harini S I, Somesh Singh, Yaman K Singla, Aanisha Bhattacharyya, Veeky Baths, Changyou Chen, Rajiv Ratn Shah, Balaji Krishnamurthy et.al.	2309.00378v1	null
2023-09-01	On the Localization of Ultrasound Image Slices within Point Distribution Models	Lennart Bastian, Vincent Bürgin, Ha Young Kim, Alexander Baumann, Benjamin Busam, Mahdi Saleh, Nassir Navab et.al.	2309.00372v1	null
2023-09-01	Large Content And Behavior Models To Understand, Simulate, And Optimize Content And Behavior	Ashmit Khandelwal, Aditya Agrawal, Aanisha Bhattacharyya, Yaman K Singla, Somesh Singh, Uttaran Bhattacharya, Ishita Dasgupta, Stefano Petrangeli, Rajiv Ratn Shah, Changyou Chen, Balaji Krishnamurthy et.al.	2309.00359v1	null
2023-09-01	How You Split Matters: Data Leakage and Subject Characteristics Studies in Longitudinal Brain MRI Analysis	Dewinda Julianensi Rumala et.al.	2309.00350v1	null
2023-09-01	MuraNet: Multi-task Floor Plan Recognition with Relation Attention	Lingxiao Huang, Jung-Hsuan Wu, Chiching Wei, Wilson Li et.al.	2309.00348v1	null
2023-09-01	Towards Contrastive Learning in Music Video Domain	Karel Veldkamp, Mariya Hendriksen, Zoltán Szlávik, Alexander Keijser et.al.	2309.00347v1	null
2023-09-01	Robust Point Cloud Processing through Positional Embedding	Jianqiao Zheng, Xueqian Li, Sameera Ramasinghe, Simon Lucey et.al.	2309.00339v1	null
2023-09-01	Human trajectory prediction using LSTM with Attention mechanism	Amin Manafi Soltan Ahmadi, Samaneh Hoseini Semnani et.al.	2309.00331v1	null
2023-09-01	Mi-Go: Test Framework which uses YouTube as Data Source for Evaluating Speech Recognition Models like OpenAI's Whisper	Tomasz Wojnar, Jaroslaw Hryszko, Adam Roman et.al.	2309.00329v1	null
2023-09-01	ARFA: An Asymmetric Receptive Field Autoencoder Model for Spatiotemporal Prediction	Wenxuan Zhang, Xuechao Zou, Li Wu, Jianqiang Huang, Xiaoying Wang et.al.	2309.00314v1	null
2023-09-01	Fusing Monocular Images and Sparse IMU Signals for Real-time Human Motion Capture	Shaohua Pan, Qi Ma, Xinyu Yi, Weifeng Hu, Xiong Wang, Xingkang Zhou, Jijunnan Li, Feng Xu et.al.	2309.00310v1	link
2023-09-01	Efficient Surrogate Models for Materials Science Simulations: Machine Learning-based Prediction of Microstructure Properties	Binh Duong Nguyen, Pavlo Potapenko, Aytekin Dermici, Kishan Govinda, Stefan Sandfeld et.al.	2309.00305v1	null
2023-09-01	Fine-Grained Spatiotemporal Motion Alignment for Contrastive Video Representation Learning	Minghao Zhu, Xiao Lin, Ronghao Dang, Chengju Liu, Qijun Chen et.al.	2309.00297v1	null
2023-09-01	Fast Diffusion EM: a diffusion model for blind inverse problems with application to deconvolution	Charles Laroche, Andrés Almansa, Eva Coupete et.al.	2309.00287v1	null
2023-09-01	SparseSat-NeRF: Dense Depth Supervised Neural Radiance Fields for Sparse Satellite Images	Lulin Zhang, Ewelina Rupnik et.al.	2309.00277v1	link
2023-09-01	Application of Machine Learning in Melanoma Detection and the Identification of 'Ugly Duckling' and Suspicious Naevi: A Review	Fatima Al Zegair, Nathasha Naranpanawa, Brigid Betz-Stablein, Monika Janda, H. Peter Soyer, Shekhar S. Chandra et.al.	2309.00265v1	null
2023-09-01	Interpretable Medical Imagery Diagnosis with Self-Attentive Transformers: A Review of Explainable AI for Health Care	Tin Lai et.al.	2309.00252v1	null
2023-09-01	MIMOCrypt: Multi-User Privacy-Preserving Wi-Fi Sensing via MIMO Encryption	Jun Luo, Hangcheng Cao, Hongbo Jiang, Yanbing Yang, Zhe Chen et.al.	2309.00250v1	null
2023-09-01	DiffuGen: Adaptable Approach for Generating Labeled Image Datasets using Stable Diffusion Models	Michael Shenoda, Edward Kim et.al.	2309.00248v1	link
2023-09-01	Object-Centric Multiple Object Tracking	Zixu Zhao, Jiaze Wang, Max Horn, Yizhuo Ding, Tong He, Zechen Bai, Dominik Zietlow, Carl-Johann Simon-Gabriel, Bing Shuai, Zhuowen Tu, Thomas Brox, Bernt Schiele, Yanwei Fu, Francesco Locatello, Zheng Zhang, Tianjun Xiao et.al.	2309.00233v1	null
2023-09-01	What Makes Good Open-Vocabulary Detector: A Disassembling Perspective	Jincheng Li, Chunyu Xie, Xiaoyu Wu, Bin Wang, Dawei Leng et.al.	2309.00227v1	null
2023-09-01	Human-Inspired Facial Sketch Synthesis with Dynamic Adaptation	Fei Gao, Yifan Zhu, Chang Jiang, Nannan Wang et.al.	2309.00216v1	link
2023-09-01	Towards Addressing the Misalignment of Object Proposal Evaluation for Vision-Language Tasks via Semantic Grounding	Joshua Feinglass, Yezhou Yang et.al.	2309.00215v1	null
2023-09-01	Gap and Overlap Detection in Automated Fiber Placement	Assef Ghamisi, Homayoun Najjaran et.al.	2309.00206v1	null
2023-09-01	Diffusion Model with Clustering-based Conditioning for Food Image Generation	Yue Han, Jiangpeng He, Mridul Gupta, Edward J. Delp, Fengqing Zhu et.al.	2309.00199v1	null
2023-09-01	DARC: Distribution-Aware Re-Coloring Model for Generalizable Nucleus Segmentation	Shengcong Chen, Changxing Ding, Dacheng Tao, Hao Chen et.al.	2309.00188v1	null
2023-09-01	Vision-aided nonlinear control framework for shake table tests	Zhongwei Chen, T. Y. Yang, Yifei Xiao, Xiao Pan, Wanyan Yang et.al.	2309.00187v1	null
2023-08-31	Typing on Any Surface: A Deep Learning-based Method for Real-Time Keystroke Detection in Augmented Reality	Xingyu Fu, Mingze Xi et.al.	2309.00174v1	null
2023-08-31	RepCodec: A Speech Representation Codec for Speech Tokenization	Zhichao Huang, Chutong Meng, Tom Ko et.al.	2309.00169v1	link
2023-08-31	Pose-Graph Attentional Graph Neural Network for Lidar Place Recognition	Milad Ramezani, Liang Wang, Joshua Knights, Zhibin Li, Pauline Pounds, Peyman Moghadam et.al.	2309.00168v1	null
2023-08-31	BuilDiff: 3D Building Shape Generation using Single-Image Conditional Point Cloud Diffusion Models	Yao Wei, George Vosselman, Michael Ying Yang et.al.	2309.00158v1	null
2023-08-31	Optimized Deep Feature Selection for Pneumonia Detection: A Novel RegNet and XOR-Based PSO Approach	Fatemehsadat Ghanadi Ladani, Samaneh Hosseini Semnani et.al.	2309.00147v1	null
2023-08-31	Self-supervised Semantic Segmentation: Consistency over Transformation	Sanaz Karimijafarbigloo, Reza Azad, Amirhossein Kazerouni, Yury Velichko, Ulas Bagci, Dorit Merhof et.al.	2309.00143v1	link
2023-08-31	Improving vision-inspired keyword spotting using dynamic module skipping in streaming conformer encoder	Alexandre Bittar, Paul Dixon, Mohammad Samragh, Kumari Nishu, Devang Naik et.al.	2309.00140v1	null
2023-08-31	Fuzzy Approach for Audio-Video Emotion Recognition in Computer Games for Children	Pavel Kozlov, Alisher Akram, Pakizar Shamoi et.al.	2309.00138v1	null
2023-08-31	Distraction-free Embeddings for Robust VQA	Atharvan Dogra, Deeksha Varshney, Ashwin Kalyan, Ameet Deshpande, Neeraj Kumar et.al.	2309.00133v1	null
2023-08-31	QS-TTS: Towards Semi-Supervised Text-to-Speech Synthesis via Vector-Quantized Self-Supervised Speech Representation Learning	Haohan Guo, Fenglong Xie, Jiawen Kang, Yujia Xiao, Xixin Wu, Helen Meng et.al.	2309.00126v1	null
2023-08-31	Segmentação e contagem de troncos de madeira utilizando deep learning e processamento de imagens	João V. C. Mazzochin, Gustavo Tiecker, Erick O. Rodrigues et.al.	2309.00123v1	null
2023-08-31	Beyond Self-Attention: Deformable Large Kernel Attention for Medical Image Segmentation	Reza Azad, Leon Niggemeier, Michael Huttemann, Amirhossein Kazerouni, Ehsan Khodapanah Aghdam, Yury Velichko, Ulas Bagci, Dorit Merhof et.al.	2309.00121v1	null
2023-08-31	Laplacian-Former: Overcoming the Limitations of Vision Transformers in Local Texture Detection	Reza Azad, Amirhossein Kazerouni, Babak Azad, Ehsan Khodapanah Aghdam, Yury Velichko, Ulas Bagci, Dorit Merhof et.al.	2309.00108v1	null
2023-08-31	Unsupervised evaluation of GAN sample quality: Introducing the TTJac Score	Egor Sevriugov, Ivan Oseledets et.al.	2309.00107v1	null
2023-08-31	Open-Vocabulary Semantic Segmentation via Attribute Decomposition-Aggregation	Chaofan Ma, Yuhuan Yang, Chen Ju, Fei Zhang, Ya Zhang, Yanfeng Wang et.al.	2309.00096v1	null
2023-08-31	Few-shot Diagnosis of Chest x-rays Using an Ensemble of Random Discriminative Subspaces	Kshitiz, Garvit Garg, Angshuman Paul et.al.	2309.00081v1	link
2023-08-31	SoDaCam: Software-defined Cameras via Single-Photon Imaging	Varun Sundar, Andrei Ardelean, Tristan Swedish, Claudio Brusschini, Edoardo Charbon, Mohit Gupta et.al.	2309.00066v1	null
2023-08-31	STint: Self-supervised Temporal Interpolation for Geospatial Data	Nidhin Harilal, Bri-Mathias Hodge, Aneesh Subramanian, Claire Monteleoni et.al.	2309.00059v1	null
2023-08-31	Bellybutton: Accessible and Customizable Deep-Learning Image Segmentation	Sam Dillavou, Jesse M. Hanlan, Anthony T. Chieco, Hongyi Xiao, Sage Fulco, Kevin T. Turner, Douglas J. Durian et.al.	2309.00058v1	null
2023-08-31	FACET: Fairness in Computer Vision Evaluation Benchmark	Laura Gustafson, Chloe Rolland, Nikhila Ravi, Quentin Duval, Aaron Adcock, Cheng-Yang Fu, Melissa Hall, Candace Ross et.al.	2309.00035v1	null
2023-08-31	Audio-Driven Dubbing for User Generated Contents via Style-Aware Semi-Parametric Synthesis	Linsen Song, Wayne Wu, Chaoyou Fu, Chen Change Loy, Ran He et.al.	2309.00030v1	null
2023-08-31	Vision-Based Cranberry Crop Ripening Assessment	Faith Johnson, Jack Lowry, Kristin Dana, Peter Oudemans et.al.	2309.00028v1	null
2023-08-31	A Sequential Framework for Detection and Classification of Abnormal Teeth in Panoramic X-rays	Tudor Dascalu, Shaqayeq Ramezanzade, Azam Bakhshandeh, Lars Bjorndal, Bulat Ibragimov et.al.	2309.00027v1	link
2023-08-31	PointLLM: Empowering Large Language Models to Understand Point Clouds	Runsen Xu, Xiaolong Wang, Tai Wang, Yilun Chen, Jiangmiao Pang, Dahua Lin et.al.	2308.16911v1	link
2023-08-31	StyleInV: A Temporal Style Modulated Inversion Network for Unconditional Video Generation	Yuhan Wang, Liming Jiang, Chen Change Loy et.al.	2308.16909v1	link
2023-08-31	Fine-Grained Cross-View Geo-Localization Using a Correlation-Aware Homography Estimator	Xiaolong Wang, Runsen Xu, Zuofan Cui, Zeyu Wan, Yu Zhang et.al.	2308.16906v1	link
2023-08-31	InterDiff: Generating 3D Human-Object Interactions with Physics-Informed Diffusion	Sirui Xu, Zhengyuan Li, Yu-Xiong Wang, Liang-Yan Gui et.al.	2308.16905v1	link
2023-08-31	PointOcc: Cylindrical Tri-Perspective View for Point-based 3D Semantic Occupancy Prediction	Sicheng Zuo, Wenzhao Zheng, Yuanhui Huang, Jie Zhou, Jiwen Lu et.al.	2308.16896v1	link
2023-08-31	EMDB: The Electromagnetic Database of Global 3D Human Pose and Shape in the Wild	Manuel Kaufmann, Jie Song, Chen Guo, Kaiyue Shen, Tianjian Jiang, Chengcheng Tang, Juan Zarate, Otmar Hilliges et.al.	2308.16894v1	link
2023-08-31	Language-Conditioned Path Planning	Amber Xie, Youngwoon Lee, Pieter Abbeel, Stephen James et.al.	2308.16893v1	null
2023-09-01	GNFactor: Multi-Task Real Robot Learning with Generalizable Neural Feature Fields	Yanjie Ze, Ge Yan, Yueh-Hua Wu, Annabella Macaluso, Yuying Ge, Jianglong Ye, Nicklas Hansen, Li Erran Li, Xiaolong Wang et.al.	2308.16891v2	link
2023-08-31	TouchStone: Evaluating Vision-Language Models by Language Models	Shuai Bai, Shusheng Yang, Jinze Bai, Peng Wang, Xingxuan Zhang, Junyang Lin, Xinggang Wang, Chang Zhou, Jingren Zhou et.al.	2308.16890v1	null
2023-08-31	Text2Scene: Text-driven Indoor Scene Stylization with Part-aware Details	Inwoo Hwang, Hyeonwoo Kim, Young Min Kim et.al.	2308.16880v1	null
2023-08-31	SportsSloMo: A New Benchmark and Baselines for Human-centric Video Frame Interpolation	Jiaben Chen, Huaizu Jiang et.al.	2308.16876v1	null
2023-08-31	Holistic Processing of Colour Images Using Novel Quaternion-Valued Wavelets on the Plane	Neil D. Dizon, Jeffrey A. Hogan et.al.	2308.16875v1	null
2023-08-31	Self-pruning Graph Neural Network for Predicting Inflammatory Disease Activity in Multiple Sclerosis from Brain MR Images	Chinmay Prabhakar, Hongwei Bran Li, Johannes C. Paetzold, Timo Loehr, Chen Niu, Mark Mühlau, Daniel Rueckert, Benedikt Wiestler, Bjoern Menze et.al.	2308.16863v1	link
2023-08-31	Diffusion Models for Interferometric Satellite Aperture Radar	Alexandre Tuel, Thomas Kerdreux, Claudia Hulbert, Bertrand Rouet-Leduc et.al.	2308.16847v1	null
2023-08-31	Machine learning of microscopic structure-dynamics relationships in complex molecular systems	Martina Crippa, Annalisa Cardellini, Matteo Cioni, Gábor Csányi, Giovanni M. Pavan et.al.	2308.16829v1	link
2023-08-31	Coarse-to-Fine Amodal Segmentation with Shape Prior	Jianxiong Gao, Xuelin Qian, Yikai Wang, Tianjun Xiao, Tong He, Zheng Zhang, Yanwei Fu et.al.	2308.16825v1	null
2023-08-31	BTSeg: Barlow Twins Regularization for Domain Adaptation in Semantic Segmentation	Johannes Künzel, Anna Hilsmann, Peter Eisert et.al.	2308.16819v1	null
2023-08-31	Multiscale Residual Learning of Graph Convolutional Sequence Chunks for Human Motion Prediction	Mohsen Zand, Ali Etemad, Michael Greenspan et.al.	2308.16801v1	null
2023-09-01	Ref-Diff: Zero-shot Referring Image Segmentation with Generative Models	Minheng Ni, Yabo Zhang, Kailai Feng, Xiaoming Li, Yiwen Guo, Wangmeng Zuo et.al.	2308.16777v2	null
2023-08-31	Towards High-Fidelity Text-Guided 3D Face Generation and Manipulation Using only Images	Cuican Yu, Guansong Lu, Yihan Zeng, Jian Sun, Xiaodan Liang, Huibin Li, Zongben Xu, Songcen Xu, Wei Zhang, Hang Xu et.al.	2308.16758v1	null
2023-08-31	Unsupervised CT Metal Artifact Reduction by Plugging Diffusion Priors in Dual Domains	Xuan Liu, Yaoqin Xie, Songhui Diao, Shan Tan, Xiaokun Liang et.al.	2308.16742v1	link
2023-08-31	Socratis: Are large multimodal models emotionally aware?	Katherine Deng, Arijit Ray, Reuben Tan, Saadia Gabriel, Bryan A. Plummer, Kate Saenko et.al.	2308.16741v1	null
2023-08-31	Parsing is All You Need for Accurate Gait Recognition in the Wild	Jinkai Zheng, Xinchen Liu, Shuai Wang, Lihao Wang, Chenggang Yan, Wu Liu et.al.	2308.16739v1	link
2023-08-31	US-SFNet: A Spatial-Frequency Domain-based Multi-branch Network for Cervical Lymph Node Lesions Diagnoses in Ultrasound Images	Yubiao Yue, Jun Xue, Haihua Liang, Bingchun Luo, Zhenzhang Li et.al.	2308.16738v1	null
2023-08-31	Post-Deployment Adaptation with Access to Source Data via Federated Learning and Source-Target Remote Gradient Alignment	Felix Wagner, Zeju Li, Pramit Saha, Konstantinos Kamnitsas et.al.	2308.16735v1	link
2023-08-30	ASTER: Automatic Speech Recognition System Accessibility Testing for Stutterers	Yi Liu, Yuekang Li, Gelei Deng, Felix Juefei-Xu, Yao Du, Cen Zhang, Chengwei Liu, Yeting Li, Lei Ma, Yang Liu et.al.	2308.15742v1	null
2023-08-28	Unsupervised Active Learning: Optimizing Labeling Cost-Effectiveness for Automatic Speech Recognition	Zhisheng Zheng, Ziyang Ma, Yu Wang, Xie Chen et.al.	2308.14814v1	null
2023-08-23	KinSPEAK: Improving speech recognition for Kinyarwanda via semi-supervised learning methods	Antoine Nzeyimana et.al.	2308.11863v1	null
2023-09-05	Identifying depression-related topics in smartphone-collected free-response speech recordings using an automatic speech recognition system and a deep learning topic model	Yuezhou Zhang, Amos A Folarin, Judith Dineley, Pauline Conde, Valeria de Angel, Shaoxiong Sun, Yatharth Ranjan, Zulqarnain Rashid, Callum Stewart, Petroula Laiou, Heet Sankesara, Linglong Qian, Faith Matcham, Katie M White, Carolin Oetzmann, Femke Lamers, Sara Siddi, Sara Simblett, Björn W. Schuller, Srinivasan Vairavan, Til Wykes, Josep Maria Haro, Brenda WJH Penninx, Vaibhav A Narayan, Matthew Hotopf, Richard JB Dobson, Nicholas Cummins, RADAR-CNS consortium et.al.	2308.11773v2	null
2023-08-20	Indonesian Automatic Speech Recognition with XLSR-53	Panji Arisaputra, Amalia Zahra et.al.	2308.11589v1	null
2023-08-22	Convoifilter: A case study of doing cocktail party speech recognition	Thai-Binh Nguyen, Alexander Waibel et.al.	2308.11380v1	null
2023-08-14	Improving Audio-Visual Speech Recognition by Lip-Subword Correlation Based Visual Pre-training and Cross-Modal Fusion Encoder	Yusheng Dai, Hang Chen, Jun Du, Xiaofei Ding, Ning Ding, Feijun Jiang, Chin-Hui Lee et.al.	2308.08488v1	link
2023-08-16	Radio2Text: Streaming Speech Recognition Using mmWave Radio Signals	Running Zhao, Jiangtao Yu, Hang Zhao, Edith C. H. Ngai et.al.	2308.08125v1	null
2023-08-15	AKVSR: Audio Knowledge Empowered Visual Speech Recognition by Compressing Audio Knowledge of a Pretrained Model	Jeong Hun Yeo, Minsu Kim, Jeongsoo Choi, Dae Hoe Kim, Yong Man Ro et.al.	2308.07593v1	null
2023-08-14	Integrating Emotion Recognition with Speech Recognition and Speaker Diarisation for Conversations	Wen Wu, Chao Zhang, Philip C. Woodland et.al.	2308.07145v1	link
2023-08-12	Alternative Pseudo-Labeling for Semi-Supervised Automatic Speech Recognition	Han Zhu, Dongji Gao, Gaofeng Cheng, Daniel Povey, Pengyuan Zhang, Yonghong Yan et.al.	2308.06547v1	null
2023-08-11	Lip2Vec: Efficient and Robust Visual Speech Recognition via Latent-to-Latent Visual to Audio Representation Mapping	Yasser Abdelaziz Dahou Djilali, Sanath Narayan, Haithem Boussaid, Ebtessam Almazrouei, Merouane Debbah et.al.	2308.06112v1	null
2023-08-10	A Novel Self-training Approach for Low-resource Speech Recognition	Satwinder Singh, Feng Hou, Ruili Wang et.al.	2308.05269v1	null
2023-08-09	Conformer-based Target-Speaker Automatic Speech Recognition for Single-Channel Audio	Yang Zhang, Krishna C. Puvvada, Vitaly Lavrukhin, Boris Ginsburg et.al.	2308.05218v1	link
2023-08-07	Cuing Without Sharing: A Federated Cued Speech Recognition Framework via Mutual Knowledge Distillation	Yuxuan Zhang, Lei Liu, Li Liu et.al.	2308.03432v1	link
2023-08-07	Federated Representation Learning for Automatic Speech Recognition	Guruprasad V Ramesh, Gopinath Chennupati, Milind Rao, Anit Kumar Sahu, Ariya Rastrow, Jasha Droppo et.al.	2308.02013v2	null
2023-08-02	Careful Whisper -- leveraging advances in automatic speech recognition for robust and interpretable aphasia subtype classification	Laurin Wagner, Mario Zusag, Theresa Bloder et.al.	2308.01327v1	null
2023-07-28	The timing bottleneck: Why timing and overlap are mission-critical for conversational user interfaces, speech recognition and dialogue systems	Andreas Liesenfeld, Alianda Lopez, Mark Dingemanse et.al.	2307.15493v1	null
2023-07-27	Say Goodbye to RNN-T Loss: A Novel CIF-based Transducer Architecture for Automatic Speech Recognition	Tian-Hao Zhang, Dinghao Zhou, Guiping Zhong, Baoxiang Li et.al.	2307.14132v2	null
2023-07-24	Adaptation of Whisper models to child speech recognition	Rishabh Jain, Andrei Barcovschi, Mariam Yiwere, Peter Corcoran, Horia Cucu et.al.	2307.13008v1	link
2023-07-24	Integration of Frame- and Label-synchronous Beam Search for Streaming Encoder-decoder Speech Recognition	Emiru Tsunoo, Hayato Futami, Yosuke Kashiwagi, Siddhant Arora, Shinji Watanabe et.al.	2307.12767v1	null
2023-07-24	Robust Automatic Speech Recognition via WavAugment Guided Phoneme Adversarial Training	Gege Qi, Yuefeng Chen, Xiaofeng Mao, Xiaojun Jia, Ranjie Duan, Rong Zhang, Hui Xue et.al.	2307.12498v1	null
2023-07-23	A meta learning scheme for fast accent domain expansion in Mandarin speech recognition	Ziwei Zhu, Changhao Shan, Bihong Zhang, Jian Yu et.al.	2307.12262v1	null
2023-07-21	Prompting Large Language Models with Speech Recognition Abilities	Yassir Fathullah, Chunyang Wu, Egor Lakomkin, Junteng Jia, Yuan Shangguan, Ke Li, Jinxi Guo, Wenhan Xiong, Jay Mahadeokar, Ozlem Kalinli, Christian Fuegen, Mike Seltzer et.al.	2307.11795v1	null
2023-07-20	Transsion TSUP's speech recognition system for ASRU 2023 MADASR Challenge	Xiaoxiao Li, Gaosheng Zhang, An Zhu, Weiyong Li, Shuming Fang, Xiaoyue Yang, Jianchao Zhu et.al.	2307.11778v1	null
2023-07-20	Globally Normalising the Transducer for Streaming Speech Recognition	Rogier van Dalen et.al.	2307.10975v1	null
2023-10-06	Zero-shot Domain-sensitive Speech Recognition with Prompt-conditioning Fine-tuning	Feng-Ting Liao, Yung-Chieh Chan, Yi-Chang Chen, Chan-Jan Hsu, Da-shan Shiu et.al.	2307.10274v2	link
2023-07-17	TST: Time-Sparse Transducer for Automatic Speech Recognition	Xiaohui Zhang, Mangui Liang, Zhengkun Tian, Jiangyan Yi, Jianhua Tao et.al.	2307.08323v1	null
2023-08-03	Adapting Large Language Model with Speech for Fully Formatted End-to-End Speech Recognition	Shaoshi Ling, Yuxuan Hu, Shuangbei Qian, Guoli Ye, Yao Qian, Yifan Gong, Ed Lin, Michael Zeng et.al.	2307.08234v2	link
2023-07-17	Towards Stealthy Backdoor Attacks against Speech Recognition via Elements of Sound	Hanbo Cai, Pengcheng Zhang, Hai Dong, Yan Xiao, Stefanos Koffas, Yiming Li et.al.	2307.08208v1	link
2023-07-12	Sumformer: A Linear-Complexity Alternative to Self-Attention for Speech Recognition	Titouan Parcollet, Rogier van Dalen, Shucong Zhang, Sourav Bhattacharya et.al.	2307.07421v1	null
2023-10-18	Replay to Remember: Continual Layer-Specific Fine-tuning for German Speech Recognition	Theresa Pekarek Rosin, Stefan Wermter et.al.	2307.07280v2	null
2023-07-13	Personalization for BERT-based Discriminative Speech Recognition Rescoring	Jari Kolehmainen, Yile Gu, Aditya Gourav, Prashanth Gurunath Shivakumar, Ankur Gandhe, Ariya Rastrow, Ivan Bulyko et.al.	2307.06832v1	null
2023-07-13	Exploring the Integration of Large Language Models into Automatic Speech Recognition Systems: An Empirical Study	Zeping Min, Jinbo Wang et.al.	2307.06530v1	null
2023-07-14	Language-Routing Mixture of Experts for Multilingual and Code-Switching Speech Recognition	Wenxuan Wang, Guodong Ma, Yuke Li, Binbin Du et.al.	2307.05956v2	null
2023-07-10	SparseVSR: Lightweight and Noise Robust Visual Speech Recognition	Adriana Fernandez-Lopez, Honglie Chen, Pingchuan Ma, Alexandros Haliassos, Stavros Petridis, Maja Pantic et.al.	2307.04552v1	null
2023-07-06	Gammatonegram Representation for End-to-End Dysarthric Speech Processing Tasks: Speech Recognition, Speaker Identification, and Intelligibility Assessment	Aref Farhadipour, Hadi Veisi et.al.	2307.03296v1	link
2023-07-05	Online Hybrid CTC/Attention End-to-End Automatic Speech Recognition Architecture	Haoran Miao, Gaofeng Cheng, Pengyuan Zhang, Yonghong Yan et.al.	2307.02351v1	null
2023-07-05	Using Data Augmentations and VTLN to Reduce Bias in Dutch End-to-End Speech Recognition Systems	Tanvina Patel, Odette Scharenborg et.al.	2307.02009v1	null
2023-07-04	Boosting Norwegian Automatic Speech Recognition	Javier de la Rosa, Rolv-Arild Braaten, Per Egil Kummervold, Freddy Wetjen, Svein Arne Brygfjeld et.al.	2307.01672v1	null
2023-06-29	Automatic Speech Recognition of Non-Native Child Speech for Language Learning Applications	Simone Wills, Yu Bai, Cristian Tejedor-Garcia, Catia Cucchiarini, Helmer Strik et.al.	2306.16710v1	null
2023-06-28	Prompting Large Language Models for Zero-Shot Domain Adaptation in Speech Recognition	Yuang Li, Yu Wu, Jinyu Li, Shujie Liu et.al.	2306.16007v1	null
2023-06-27	Confidence-based Ensembles of End-to-End Speech Recognition Models	Igor Gitman, Vitaly Lavrukhin, Aleksandr Laptev, Boris Ginsburg et.al.	2306.15824v1	null
2023-06-27	Scaling Laws for Discriminative Speech Recognition Rescoring Models	Yile Gu, Prashanth Gurunath Shivakumar, Jari Kolehmainen, Ankur Gandhe, Ariya Rastrow, Ivan Bulyko et.al.	2306.15815v1	null
2023-06-27	Hyper-parameter Adaptation of Conformer ASR Systems for Elderly and Dysarthric Speech Recognition	Tianzi Wang, Shoukang Hu, Jiajun Deng, Zengrui Jin, Mengzhe Geng, Yi Wang, Helen Meng, Xunying Liu et.al.	2306.15265v1	null
2023-06-26	Factorised Speaker-environment Adaptive Training of Conformer Speech Recognition Systems	Jiajun Deng, Guinan Li, Xurong Xie, Zengrui Jin, Mingyu Cui, Tianzi Wang, Shujie Hu, Mengzhe Geng, Xunying Liu et.al.	2306.14608v1	null
2023-06-24	An Analysis of Personalized Speech Recognition System Development for the Deaf and Hard-of-Hearing	Lester Phillip Violeta, Tomoki Toda et.al.	2306.13953v1	null
2023-06-26	Towards Effective and Compact Contextual Representation for Conformer Transducer Speech Recognition Systems	Mingyu Cui, Jiawen Kang, Jiajun Deng, Xi Yin, Yutao Xie, Xie Chen, Xunying Liu et.al.	2306.13307v2	null
2023-06-21	A Reference-less Quality Metric for Automatic Speech Recognition via Contrastive-Learning of a Multi-Language Model with Self-Supervision	Kamer Ali Yuksel, Thiago Ferreira, Ahmet Gunduz, Mohamed Al-Badrashiny, Golara Javadi et.al.	2306.13114v1	link
2023-06-21	NoRefER: a Referenceless Quality Metric for Automatic Speech Recognition via Semi-Supervised Language Model Fine-Tuning with Contrastive Learning	Kamer Ali Yuksel, Thiago Ferreira, Golara Javadi, Mohamed El-Badrashiny, Ahmet Gunduz et.al.	2306.12577v1	link
2023-06-21	Federated Self-Learning with Weak Supervision for Speech Recognition	Milind Rao, Gopinath Chennupati, Gautam Tiwari, Anit Kumar Sahu, Anirudh Raju, Ariya Rastrow, Jasha Droppo et.al.	2306.12015v1	null
2023-06-20	Multi-pass Training and Cross-information Fusion for Low-resource End-to-end Accented Speech Recognition	Xuefei Wang, Yanhua Long, Yijie Li, Haoran Wei et.al.	2306.11309v1	null
2023-06-19	Rehearsal-Free Online Continual Learning for Automatic Speech Recognition	Steven Vander Eeckt, Hugo Van hamme et.al.	2306.10860v1	link
2023-06-18	MIR-GAN: Refining Frame-Level Modality-Invariant Representations with Adversarial Network for Audio-Visual Speech Recognition	Yuchen Hu, Chen Chen, Ruizhe Li, Heqing Zou, Eng Siong Chng et.al.	2306.10567v1	link
2023-06-18	Hearing Lips in Noise: Universal Viseme-Phoneme Mapping and Transfer for Robust Audio-Visual Speech Recognition	Yuchen Hu, Ruizhe Li, Chen Chen, Chengwei Qin, Qiushi Zhu, Eng Siong Chng et.al.	2306.10563v1	link
2023-09-19	SURT 2.0: Advances in Transducer-based Multi-talker Speech Recognition	Desh Raj, Daniel Povey, Sanjeev Khudanpur et.al.	2306.10559v2	link
2023-06-15	Distillation Strategies for Discriminative Speech Recognition Rescoring	Prashanth Gurunath Shivakumar, Jari Kolehmainen, Yile Gu, Ankur Gandhe, Ariya Rastrow, Ivan Bulyko et.al.	2306.09452v1	null
2023-06-15	MobileASR: A resource-aware on-device personalisation framework for automatic speech recognition in mobile phones	Zitha Sasindran, Harsha Yelchuri, Pooja Rao, T. V. Prabhakar et.al.	2306.09384v1	null
2023-09-16	Unified model for code-switching speech recognition and language identification based on a concatenated tokenizer	Kunal Dhawan, Dima Rekesh, Boris Ginsburg et.al.	2306.08753v3	link
2023-06-14	Learning Cross-lingual Mappings for Data Augmentation to Improve Low-Resource Speech Recognition	Muhammad Umar Farooq, Thomas Hain et.al.	2306.08577v1	null
2023-06-14	Research on an improved Conformer end-to-end Speech Recognition Model with R-Drop Structure	Weidong Ji, Shijie Zan, Guohui Zhou, Xu Wang et.al.	2306.08329v1	null
2023-06-14	Automated Speaker Independent Visual Speech Recognition: A Comprehensive Survey	Praneeth Nemani, G. Sai Krishna, Supriya Kundrapu et.al.	2306.08314v1	null
2023-06-09	Improving Frame-level Classifier for Word Timings with Non-peaky CTC in End-to-End Automatic Speech Recognition	Xianzhao Chen, Yist Y. Lin, Kang Wang, Yi He, Zejun Ma et.al.	2306.07949v1	null
2023-06-09	A Theory of Unsupervised Speech Recognition	Liming Wang, Mark Hasegawa-Johnson, Chang D. Yoo et.al.	2306.07926v1	link
2023-06-13	Statistical Beamformer Exploiting Non-stationarity and Sparsity with Spatially Constrained ICA for Robust Speech Recognition	Ui-Hyeop Shin, Hyung-Min Park et.al.	2306.07562v1	null
2023-06-12	Parameter-efficient Dysarthric Speech Recognition Using Adapter Fusion and Householder Transformation	Jinzi Qi, Hugo Van hamme et.al.	2306.07090v1	null
2023-06-12	Multi-View Frequency-Attention Alternative to CNN Frontends for Automatic Speech Recognition	Belen Alastruey, Lukas Drude, Jahn Heymann, Simon Wiesler et.al.	2306.06954v1	null
2023-06-10	OpenSR: Open-Modality Speech Recognition via Maintaining Multi-Modality Alignment	Xize Cheng, Tao Jin, Linjun Li, Wang Lin, Xinyu Duan, Zhou Zhao et.al.	2306.06410v1	link
2023-06-06	Improving Fairness and Robustness in End-to-End Speech Recognition through unsupervised clustering	Irina-Elena Veliche, Pascale Fung et.al.	2306.06083v1	null
2023-06-08	Language-specific Acoustic Boundary Learning for Mandarin-English Code-switching Speech Recognition	Zhiyun Fan, Linhao Dong, Chen Shen, Zhenlin Liang, Jun Zhang, Lu Lu, Zejun Ma et.al.	2306.05279v1	null
2023-06-07	Lenient Evaluation of Japanese Speech Recognition: Modeling Naturally Occurring Spelling Inconsistency	Shigeki Karita, Richard Sproat, Haruko Ishikawa et.al.	2306.04530v1	null
2023-06-07	Transfer Learning of Transformer-based Speech Recognition Models from Czech to Slovak	Jan Lehečka, Josef V. Psutka, Josef Psutka et.al.	2306.04399v1	null
2023-06-07	Arabic Dysarthric Speech Recognition Using Adversarial and Signal-Based Augmentation	Massa Baali, Ibrahim Almakky, Shady Shehata, Fakhri Karray et.al.	2306.04368v1	link
2023-09-12	RescueSpeech: A German Corpus for Speech Recognition in Search and Rescue Domain	Sangeet Sagar, Mirco Ravanelli, Bernd Kiefer, Ivana Kruijff Korbayova, Josef van Genabith et.al.	2306.04054v2	null
2023-06-02	Streaming Speech-to-Confusion Network Speech Recognition	Denis Filimonov, Prabhat Pandey, Ariya Rastrow, Ankur Gandhe, Andreas Stolcke et.al.	2306.03778v1	null
2023-06-01	Some voices are too common: Building fair speech recognition systems using the Common Voice dataset	Lucas Maison, Yannick Estève et.al.	2306.03773v1	null
2023-06-05	N-Shot Benchmarking of Whisper on Diverse Arabic Speech Recognition	Bashar Talafha, Abdul Waheed, Muhammad Abdul-Mageed et.al.	2306.02902v1	null
2023-06-05	OTF: Optimal Transport based Fusion of Supervised and Self-Supervised Learning Models for Automatic Speech Recognition	Li Fu, Siqi Li, Qingtao Li, Fangzhu Li, Liping Deng, Lu Fan, Meng Chen, Youzheng Wu, Xiaodong He et.al.	2306.02541v1	null
2023-06-05	Incorporating L2 Phonemes Using Articulatory Features for Robust Speech Recognition	Jisung Wang, Haram Lee, Myungwoo Oh et.al.	2306.02534v1	null
2023-06-21	SGEM: Test-Time Adaptation for Automatic Speech Recognition via Sequential-Level Generalized Entropy Minimization	Changhun Kim, Joonhyung Park, Hajin Shim, Eunho Yang et.al.	2306.01981v4	link
2023-06-02	Improved Training for End-to-End Streaming Automatic Speech Recognition Model with Punctuation	Hanbyul Kim, Seunghyun Seo, Lukas Lee, Seolki Baek et.al.	2306.01296v1	null
2023-06-01	Bypass Temporal Classification: Weakly Supervised Automatic Speech Recognition with Imperfect Transcripts	Dongji Gao, Matthew Wiesner, Hainan Xu, Leibny Paola Garcia, Daniel Povey, Sanjeev Khudanpur et.al.	2306.01031v1	null
2023-08-15	Adaptive Contextual Biasing for Transducer Based Streaming Speech Recognition	Tianyi Xu, Zhanheng Yang, Kaixun Huang, Pengcheng Guo, Ao Zhang, Biao Li, Changru Chen, Chao Li, Lei Xie et.al.	2306.00804v3	null
2023-06-01	SlothSpeech: Denial-of-service Attack Against Speech Recognition Models	Mirazul Haque, Rutvij Shah, Simin Chen, Berrak Şişman, Cong Liu, Wei Yang et.al.	2306.00794v1	link
2023-06-01	Adaptation and Optimization of Automatic Speech Recognition (ASR) for the Maritime Domain in the Field of VHF Communication	Emin Cagatay Nakilcioglu, Maximilian Reimann, Ole John et.al.	2306.00614v1	null
2023-05-31	ViLaS: Integrating Vision and Language into Automatic Speech Recognition	Minglun Han, Feilong Chen, Ziyi Ni, Linghui Meng, Jing Shi, Shuang Xu, Bo Xu et.al.	2305.19972v1	null
2023-05-31	Accurate and Structured Pruning for Efficient Automatic Speech Recognition	Huiqiang Jiang, Li Lyna Zhang, Yuang Li, Yu Wu, Shijie Cao, Ting Cao, Yuqing Yang, Jinyu Li, Mao Yang, Lili Qiu et.al.	2305.19549v1	null
2023-05-29	HyperConformer: Multi-head HyperMixer for Efficient Speech Recognition	Florian Mai, Juan Zuluaga-Gomez, Titouan Parcollet, Petr Motlicek et.al.	2305.18281v1	link
2023-05-30	speech and noise dual-stream spectrogram refine network with speech distortion loss for robust speech recognition	Haoyu Lu, Nan Li, Tongtong Song, Longbiao Wang, Jianwu Dang, Xiaobao Wang, Shiliang Zhang et.al.	2305.17860v2	link
2023-05-28	RASR2: The RWTH ASR Toolkit for Generic Sequence-to-sequence Speech Recognition	Wei Zhou, Eugen Beck, Simon Berger, Ralf Schlüter, Hermann Ney et.al.	2305.17782v1	null
2023-07-19	Synthesizing Speech Test Cases with Text-to-Speech? An Empirical Study on the False Alarms in Automated Speech Recognition Testing	Julia Kaiwen Lau, Kelvin Kai Wen Kong, Julian Hao Yong, Per Hoong Tan, Zhou Yang, Zi Qian Yong, Joshua Chern Wey Low, Chun Yong Chong, Mei Kuan Lim, David Lo et.al.	2305.17445v3	link
2023-05-26	2-bit Conformer quantization for automatic speech recognition	Oleg Rybakov, Phoenix Meadowlark, Shaojin Ding, David Qiu, Jian Li, David Rim, Yanzhang He et.al.	2305.16619v1	null
2023-05-25	INTapt: Information-Theoretic Adversarial Prompt Tuning for Enhanced Non-Native Speech Recognition	Eunseop Yoon, Hee Suk Yoon, John Harvill, Mark Hasegawa-Johnson, Chang D. Yoo et.al.	2305.16371v1	null
2023-05-29	InterFormer: Interactive Local and Global Features Fusion for Automatic Speech Recognition	Zhi-Hao Lai, Tian-Hao Zhang, Qi Liu, Xinyuan Qian, Li-Fang Wei, Song-Lu Chen, Feng Chen, Xu-Cheng Yin et.al.	2305.16342v2	null
2023-06-29	Weakly-Supervised Speech Pre-training: A Case Study on Target Speech Recognition	Wangyou Zhang, Yanmin Qian et.al.	2305.16286v2	null
2023-05-25	Unified Modeling of Multi-Talker Overlapped Speech Recognition and Diarization with a Sidecar Separator	Lingwei Meng, Jiawen Kang, Mingyu Cui, Haibin Wu, Xixin Wu, Helen Meng et.al.	2305.16263v1	null
2023-05-25	VioLA: Unified Codec Language Models for Speech Recognition, Synthesis, and Translation	Tianrui Wang, Long Zhou, Ziqiang Zhang, Yu Wu, Shujie Liu, Yashesh Gaur, Zhuo Chen, Jinyu Li, Furu Wei et.al.	2305.16107v1	null
2023-05-24	Iteratively Improving Speech Recognition and Voice Conversion	Mayank Kumar Singh, Naoya Takahashi, Onoe Naoyuki et.al.	2305.15055v1	null
2023-05-23	Improving the Gap in Visual Speech Recognition Between Normal and Silent Speech Based on Metric Learning	Sara Kashiwagi, Keitaro Tanaka, Qi Feng, Shigeo Morishima et.al.	2305.14203v1	null

(back to top)

Audio Forensics

Publish Date	Title	Authors	PDF	Code
2022-11-29	Synthetic Voice Detection and Audio Splicing Detection using SE-Res2Net-Conformer Architecture	Lei Wang, Benedict Yeoh, Jun Wah Ng et.al.	2210.03581v2	null
2024-05-03	Towards Unconstrained Audio Splicing Detection and Localization with Neural Networks	Denise Moussa, Germans Hirsch, Christian Riess et.al.	2207.14682v4	null
2014-11-26	Audio Splicing Detection and Localization Using Environmental Signature	Hong Zhao, Yifan Chen, Rui Wang, Hafiz Malik et.al.	1411.7084v1	null

(back to top)
