youngjetduan/cv-arxiv-daily

Updated on 2024.09.11

Usage instructions: here

Table of Contents

LLM
Early Stopping

LLM

Publish Date	Title	Authors	PDF	Code
2024-09-09	DFabric: Scaling Out Data Parallel Applications with CXL-Ethernet Hybrid Interconnects	Xu Zhang et.al.	2409.05404	null
2024-09-08	InstInfer: In-Storage Attention Offloading for Cost-Effective Long-Context LLM Inference	Xiurui Pan et.al.	2409.04992	null
2024-09-04	Accelerating Large Language Model Training with Hybrid GPU-based Compression	Lang Xu et.al.	2409.02423	null
2024-09-03	Contemporary Model Compression on Large Language Models Inference	Dong Liu et.al.	2409.01990	null
2024-09-03	On-chain Validation of Tracking Data Messages (TDM) Using Distributed Deep Learning on a Proof of Stake (PoS) Blockchain	Yasir Latif et.al.	2409.01614	null
2024-09-02	LuWu: An End-to-End In-Network Out-of-Core Optimizer for 100B-Scale Model-in-Network Data-Parallel Training on Distributed GPUs	Mo Sun et.al.	2409.00918	null
2024-08-26	Model Parallel Training and Transfer Learning for Convolutional Neural Networks by Domain Decomposition	Axel Klawonn et.al.	2408.14442	null
2024-08-23	Network-Offloaded Bandwidth-Optimal Broadcast and Allgather for Distributed AI	Mikhail Khalilov et.al.	2408.13356	null
2024-08-22	LCM-SVC: Latent Diffusion Model Based Singing Voice Conversion with Inference Acceleration via Latent Consistency Distillation	Shihao Chen et.al.	2408.12354	null
2024-08-23	MagicDec: Breaking the Latency-Throughput Tradeoff for Long Context Generation with Speculative Decoding	Jian Chen et.al.	2408.11049	link
2024-08-20	Security Assessment of Hierarchical Federated Deep Learning	D Alqattan et.al.	2408.10752	link
2024-08-20	Pluto and Charon: A Time and Memory Efficient Collaborative Edge AI Framework for Personal LLMs Fine-Tuning	Bei Ouyang et.al.	2408.10746	null
2024-08-21	LongVILA: Scaling Long-Context Visual Language Models for Long Videos	Fuzhao Xue et.al.	2408.10188	link
2024-08-17	RepControlNet: ControlNet Reparameterization	Zhaoli Deng et.al.	2408.09240	null
2024-08-17	Atlas: Hierarchical Partitioning for Quantum Circuit Simulation on GPUs (Extended Version)	Mingkuan Xu et.al.	2408.09055	null
2024-08-23	ABQ-LLM: Arbitrary-Bit Quantized Inference Acceleration for Large Language Models	Chao Zeng et.al.	2408.08554	link
2024-08-16	Context-Aware Assistant Selection for Improved Inference Acceleration with Large Language Models	Jerry Huang et.al.	2408.08470	null
2024-08-15	Asteroid: Resource-Efficient Hybrid Pipeline Parallelism for Collaborative DNN Training on Heterogeneous Edge Devices	Shengyuan Ye et.al.	2408.08015	null
2024-08-17	Kraken: Inherently Parallel Transformers For Efficient Multi-Device Inference	Rohan Baskar Prabhakar et.al.	2408.07802	null
2024-08-18	Post-Training Sparse Attention with Double Sparsity	Shuo Yang et.al.	2408.07092	link
2024-08-12	LUT Tensor Core: Lookup Table Enables Efficient Low-Bit LLM Inference Acceleration	Zhiwen Mo et.al.	2408.06003	null
2024-08-10	Eigen Attention: Attention in Low-Rank Space for KV Cache Compression	Utkarsh Saxena et.al.	2408.05646	null
2024-08-05	SLO-aware GPU Frequency Scaling for Energy Efficient LLM Inference Serving	Andreas Kosmas Kakolyris et.al.	2408.05235	null
2024-08-08	Partial Experts Checkpoint: Efficient Fault Tolerance for Sparse Mixture-of-Experts Model Training	Weilin Cai et.al.	2408.04307	null
2024-08-07	Zero-Delay QKV Compression for Mitigating KV Cache and Network Bottlenecks in LLM Inference	Zeyu Zhang et.al.	2408.04107	null
2024-08-08	NACL: A General and Effective KV Cache Eviction Framework for LLMs at Inference Time	Yilong Chen et.al.	2408.03675	link
2024-08-04	Cross-layer Attention Sharing for Large Language Models	Yongyu Mu et.al.	2408.01890	null
2024-08-01	Intermittent Semi-working Mask: A New Masking Paradigm for LLMs	Mingcong Lu et.al.	2408.00539	null
2024-08-13	Finch: Prompt-guided Key-Value Cache Compression	Giulio Corallo et.al.	2408.00167	null
2024-07-31	EdgeLLM: A Highly Efficient CPU-FPGA Heterogeneous Edge Accelerator for Large Language Models	Mingqiang Huang et.al.	2407.21325	null
2024-07-30	Palu: Compressing KV-Cache with Low-Rank Projection	Chi-Chih Chang et.al.	2407.21118	null
2024-07-30	ThinK: Thinner Key Cache by Query-Driven Pruning	Yuhui Xu et.al.	2407.21018	null
2024-07-31	A2SF: Accumulative Attention Scoring with Forgetting Factor for Token Pruning in Transformer Decoder	Hyun-rae Jo et.al.	2407.20485	null
2024-07-25	An Efficient Inference Framework for Early-exit Large Language Models	Ruijie Miao et.al.	2407.20272	null
2024-07-29	When to Stop? Towards Efficient Code Generation in LLMs with Excess Token Prevention	Lianghong Guo et.al.	2407.20042	link
2024-07-29	Inference acceleration for large language models using "stairs" assisted greedy generation	Domas Grigaliūnas et.al.	2407.19947	null
2024-07-29	Rina: Enhancing Ring-AllReduce with In-network Aggregation in Distributed Model Training	Zixuan Chen et.al.	2407.19721	null
2024-07-25	Efficient Inference of Vision Instruction-Following Models with Elastic Cache	Zuyan Liu et.al.	2407.18121	link
2024-07-28	Keep the Cost Down: A Review on Methods to Optimize LLM's KV-Cache Consumption	Luohe Shi et.al.	2407.18003	null
2024-07-25	Efficient LLM Training and Serving with Heterogeneous Context Sharding among Attention Heads	Xihui Lin et.al.	2407.17678	null
2024-07-23	A deeper look at depth pruning of LLMs	Shoaib Ahmed Siddiqui et.al.	2407.16286	link
2024-07-22	RazorAttention: Efficient KV Cache Compression Through Retrieval Heads	Hanlin Tang et.al.	2407.15891	null
2024-07-22	AutoAD-Zero: A Training-Free Framework for Zero-Shot Audio Description	Junyu Xie et.al.	2407.15850	link
2024-07-22	LLMmap: Fingerprinting For Large Language Models	Dario Pasquini et.al.	2407.15847	null
2024-07-22	CarFormer: Self-Driving with Learned Object-Centric Representations	Shadi Hamdan et.al.	2407.15843	null
2024-07-22	SlowFast-LLaVA: A Strong Training-Free Baseline for Video Large Language Models	Mingze Xu et.al.	2407.15841	null
2024-07-22	MMInstruct: A High-Quality Multi-Modal Instruction Tuning Dataset with Extensive Diversity	Yangzhou Liu et.al.	2407.15838	link
2024-07-22	dMel: Speech Tokenization made Simple	He Bai et.al.	2407.15835	null
2024-07-22	Accelerating Pre-training of Multimodal LLMs via Chain-of-Sight	Ziyuan Huang et.al.	2407.15819	null
2024-07-23	A simple and fast C++ thread pool implementation capable of running task graphs	Dmytro Puyda et.al.	2407.15805	link
2024-07-22	Robust Facial Reactions Generation: An Emotion-Aware Framework with Modality Compensation	Guanyu Hu et.al.	2407.15798	null
2024-07-22	Extracting Structured Insights from Financial News: An Augmented LLM Driven Approach	Rian Dolphin et.al.	2407.15788	null
2024-07-22	Parallel Split Learning with Global Sampling	Mohammad Kohankhaki et.al.	2407.15738	null
2024-07-22	vTensor: Flexible Virtual Tensor Management for Efficient LLM Serving	Jiale Xu et.al.	2407.15309	link
2024-07-19	Performance Modeling and Workload Analysis of Distributed Large Language Model Training and Inference	Joyjit Kundu et.al.	2407.14645	null
2024-07-19	Internal Consistency and Self-Feedback in Large Language Models: A Survey	Xun Liang et.al.	2407.14507	link
2024-07-19	On Pre-training of Multimodal Language Models Customized for Chart Understanding	Wan-Cyuan Fan et.al.	2407.14506	null
2024-07-19	PD-TPE: Parallel Decoder with Text-guided Position Encoding for 3D Visual Grounding	Chenshu Hou et.al.	2407.14491	null
2024-07-19	Evaluating the Reliability of Self-Explanations in Large Language Models	Korbinian Randl et.al.	2407.14487	link
2024-07-19	Contrastive Learning with Counterfactual Explanations for Radiology Report Generation	Mingjie Li et.al.	2407.14474	null
2024-07-19	Check-Eval: A Checklist-based Approach for Evaluating Text Quality	Jayr Pereira et.al.	2407.14467	null
2024-07-19	AttentNet: Fully Convolutional 3D Attention for Lung Nodule Detection	Majedaldein Almahasneh et.al.	2407.14464	null
2024-07-19	PolyFormer: Scalable Node-wise Filters via Polynomial Graph Transformer	Jiahong Ma et.al.	2407.14459	link
2024-07-19	Undermining Mental Proof: How AI Can Make Cooperation Harder by Making Thinking Easier	Zachary Wojtowicz et.al.	2407.14452	null
2024-07-19	From Instruction to Insight: Exploring the Functional and Semantic Roles of Text in Interactive Dashboards	Nicole Sultanum et.al.	2407.14451	null
2024-07-19	LoAS: Fully Temporal-Parallel Dataflow for Dual-Sparse Spiking Neural Networks	Ruokai Yin et.al.	2407.14073	link
2024-07-19	LazyLLM: Dynamic Token Pruning for Efficient Long Context LLM Inference	Qichen Fu et.al.	2407.14057	null
2024-07-18	SegPoint: Segment Any Point Cloud via Large Language Model	Shuting He et.al.	2407.13761	null
2024-07-18	Black-Box Opinion Manipulation Attacks to Retrieval-Augmented Generation of Large Language Models	Zhuo Chen et.al.	2407.13757	null
2024-07-18	CellularLint: A Systematic Approach to Identify Inconsistent Behavior in Cellular Network Specifications	Mirza Masfiqur Rahman et.al.	2407.13742	null
2024-07-18	Baba Is AI: Break the Rules to Beat the Benchmark	Nathan Cloos et.al.	2407.13729	null
2024-07-18	Compressing Structured Tensor Algebra	Mahdi Ghorbani et.al.	2407.13726	null
2024-07-18	CoDefeater: Using LLMs To Find Defeaters in Assurance Cases	Usman Gohar et.al.	2407.13717	link
2024-07-18	Attention Based Simple Primitives for Open World Compositional Zero-Shot Learning	Ans Munir et.al.	2407.13715	link
2024-07-18	Understanding Reference Policies in Direct Preference Optimization	Yixin Liu et.al.	2407.13709	link
2024-07-18	ANHALTEN: Cross-Lingual Transfer for German Token-Level Reference-Free Hallucination Detection	Janek Herrlein et.al.	2407.13702	link
2024-07-18	Cross-Task Attack: A Self-Supervision Generative Framework Based on Attention Shift	Qingyuan Zeng et.al.	2407.13700	null
2024-07-17	Analysis of Crab X-ray Polarization using Deeper IXPE Observations	Josephine Wong et.al.	2407.12779	null
2024-07-17	The BRST quantisation of chiral BMS-like field theories	José Figueroa-O'Farrill et.al.	2407.12778	null
2024-07-17	Jigsaw Game: Federated Clustering	Jinxuan Xu et.al.	2407.12764	null
2024-07-17	LookupViT: Compressing visual information to a limited number of tokens	Rajat Koner et.al.	2407.12753	null
2024-07-17	CHOSEN: Compilation to Hardware Optimization Stack for Efficient Vision Transformer Inference	Mohammad Erfan Sadeghi et.al.	2407.12736	null
2024-07-17	EchoSight: Advancing Visual-Language Models with Wiki Knowledge	Yibin Yan et.al.	2407.12735	null
2024-07-17	FlexFL: Heterogeneous Federated Learning via APoZ-Guided Flexible Pruning in Uncertain Scenarios	Zekai Chen et.al.	2407.12729	null
2024-07-17	Exploring the interplay of individual traits and interaction dynamics in preschool social networks	Gülşah Akçakır et.al.	2407.12728	null
2024-07-17	NL2Contact: Natural Language Guided 3D Hand-Object Contact Modeling with Diffusion Model	Zhongqun Zhang et.al.	2407.12727	null
2024-07-17	Is Sarcasm Detection A Step-by-Step Reasoning Process in Large Language Models?	Ben Yao et.al.	2407.12725	null
2024-07-16	GoldFinch: High Performance RWKV/Transformer Hybrid with Linear Pre-Fill and Extreme KV-Cache Compression	Daniel Goldstein et.al.	2407.12077	link
2024-07-16	Hydra: Brokering Cloud and HPC Resources to Support the Execution of Heterogeneous Workloads at Scale	Aymen Alsaadi et.al.	2407.11967	null
2024-07-16	UrbanWorld: An Urban World Model for 3D City Generation	Yu Shang et.al.	2407.11965	null
2024-07-16	NeedleBench: Can LLMs Do Retrieval and Reasoning in 1 Million Context Window?	Mo Li et.al.	2407.11963	link
2024-07-17	Hierarchical Separable Video Transformer for Snapshot Compressive Imaging	Ping Wang et.al.	2407.11946	link
2024-07-16	Min-max theory and existence of H-spheres with arbitrary codimensions	Rui Gao et.al.	2407.11945	null
2024-07-16	Beyond Spatial Explanations: Explainable Face Recognition in the Frequency Domain	Marco Huber et.al.	2407.11941	null
2024-07-16	Generalized Difference-in-Differences	Yiqing Xu et.al.	2407.11937	null
2024-07-16	Learning Multi-view Anomaly Detection	Haoyang He et.al.	2407.11935	null
2024-07-16	Code Documentation and Analysis to Secure Software Development	Paul Attie et.al.	2407.11934	null
2024-07-16	What's Wrong? Refining Meeting Summaries with LLM Feedback	Frederic Kirstein et.al.	2407.11919	null
2024-07-16	PipeInfer: Accelerating LLM Inference using Asynchronous Pipelined Speculation	Branden Butler et.al.	2407.11798	null
2024-07-21	Ada-KV: Optimizing KV Cache Eviction by Adaptive Budget Allocation for Efficient LLM Inference	Yuan Feng et.al.	2407.11550	link
2024-07-15	VGBench: Evaluating Large Language Models on Vector Graphics Understanding and Generation	Bocheng Zou et.al.	2407.10972	link
2024-07-15	Q-Sparse: All Large Language Models can be Fully Sparsely-Activated	Hongyu Wang et.al.	2407.10969	null
2024-07-15	Induction of non-Fermi liquids by critical cavity photons at the onset of superradiance	Ipsita Mandal et.al.	2407.10963	null
2024-07-15	Fast Matrix Multiplications for Lookup Table-Quantized LLMs	Han Guo et.al.	2407.10960	link
2024-07-15	InVi: Object Insertion In Videos Using Off-the-Shelf Diffusion Models	Nirat Saini et.al.	2407.10958	null
2024-07-15	MMM: Multilingual Mutual Reinforcement Effect Mix Datasets & Test with Open-domain Information Extraction Large Language Models	Chengguang Gan et.al.	2407.10953	null
2024-07-15	The infamous 95 GeV $\rm b \bar b$ excess at LEP: Two b or not two b?	Patrick Janot et.al.	2407.10948	null
2024-07-15	Can Textual Semantics Mitigate Sounding Object Segmentation Preference?	Yaoting Wang et.al.	2407.10947	link
2024-07-15	GRUtopia: Dream General Robots in a City at Scale	Hanqing Wang et.al.	2407.10943	link
2024-07-15	IDOL: Unified Dual-Modal Latent Diffusion for Human-Centric Joint Video-Depth Generation	Yuanhao Zhai et.al.	2407.10937	link
2024-07-12	FairyLandAI: Personalized Fairy Tales utilizing ChatGPT and DALLE-3	Georgios Makridis et.al.	2407.09467	null
2024-07-12	Human-like Episodic Memory for Infinite Context LLMs	Zafeirios Fountas et.al.	2407.09450	null
2024-07-12	ASTPrompter: Weakly Supervised Automated Language Model Red-Teaming to Identify Likely Toxic Prompts	Amelia F. Hardy et.al.	2407.09447	link
2024-07-12	MUSCLE: A Model Update Strategy for Compatible LLM Evolution	Jessica Echterhoff et.al.	2407.09435	null
2024-07-12	Open (Clinical) LLMs are Sensitive to Instruction Phrasings	Alberto Mario Ceballos Arroyo et.al.	2407.09429	link
2024-07-12	TelecomGPT: A Framework to Build Telecom-Specific Large Language Models	Hang Zou et.al.	2407.09424	null
2024-07-12	Mitigating Entity-Level Hallucination in Large Language Models	Weihang Su et.al.	2407.09417	link
2024-07-12	SPIQA: A Dataset for Multimodal Question Answering on Scientific Papers	Shraman Pramanick et.al.	2407.09413	link
2024-07-12	Thunderbolt: Causal Concurrent Consensus and Execution	Junchao Chen et.al.	2407.09409	null
2024-07-12	PersonaRAG: Enhancing Retrieval-Augmented Generation Systems with User-Centric Agents	Saber Zerhoudi et.al.	2407.09394	link
2024-07-11	MAVIS: Mathematical Visual Instruction Tuning	Renrui Zhang et.al.	2407.08739	link
2024-07-11	Real-Time Anomaly Detection and Reactive Planning with Large Language Models	Rohan Sinha et.al.	2407.08735	null
2024-07-11	Is Your Model Really A Good Math Reasoner? Evaluating Mathematical Reasoning with Checklist	Zihao Zhou et.al.	2407.08733	null
2024-07-11	Planar decomposition of the HOMFLY polynomial for bipartite knots and links	A. Anokhina et.al.	2407.08724	null
2024-07-11	A Taxonomy for Data Contamination in Large Language Models	Medha Palavalli et.al.	2407.08716	null
2024-07-11	GTA: A Benchmark for General Tool Agents	Jize Wang et.al.	2407.08713	link
2024-07-11	Live2Diff: Live Stream Translation via Uni-directional Attention in Video Diffusion Models	Zhening Xing et.al.	2407.08701	null
2024-07-11	Flex-TPU: A Flexible TPU with Runtime Reconfigurable Dataflow Architecture	Mohammed Elbtity et.al.	2407.08700	null
2024-07-11	Mitigating Catastrophic Forgetting in Language Transfer via Model Merging	Anton Alexandrov et.al.	2407.08699	null
2024-07-11	Patterns of link reciprocity in directed, signed networks	Anna Gallo et.al.	2407.08697	null
2024-07-10	Training on the Test Task Confounds Evaluation and Emergence	Ricardo Dominguez-Olmedo et.al.	2407.07890	link
2024-07-10	Towards Robust Alignment of Language Models: Distributionally Robustifying Direct Preference Optimization	Junkang Wu et.al.	2407.07880	link
2024-07-10	Bound States in Continuum via Singular Transfer Matrices	Ovidiu-Zeno Lipan et.al.	2407.07879	null
2024-07-10	FACTS About Building Retrieval Augmented Generation-based Chatbots	Rama Akkiraju et.al.	2407.07858	null
2024-07-10	OpenDiLoCo: An Open-Source Framework for Globally Distributed Low-Communication Training	Sami Jaghouar et.al.	2407.07852	link
2024-07-10	Harnessing Integrated CPU-GPU System Memory for HPC: a first look into Grace Hopper	Gabin Schieffer et.al.	2407.07850	null
2024-07-10	Natural Language Mechanisms via Self-Resolution with Foundation Models	Nicolas Della Penna et.al.	2407.07845	null
2024-07-10	Study on Aspect Ratio Variability toward Robustness of Vision Transformer-based Vehicle Re-identification	Mei Qiu et.al.	2407.07842	null
2024-07-10	Transformer Alignment in Large Language Models	Murdock Aubry et.al.	2407.07810	null
2024-07-10	Attribute or Abstain: Large Language Models as Long Document Assistants	Jan Buchmann et.al.	2407.07799	link
2024-07-09	AnyTaskTune: Advanced Domain-Specific Solutions through Task-Fine-Tuning	Jiaxi Cui et.al.	2407.07094	link
2024-07-09	FBI-LLM: Scaling Up Fully Binarized LLMs from Scratch via Autoregressive Distillation	Liqun Ma et.al.	2407.07093	link
2024-07-09	Fine-Tuning Linear Layers Only Is a Simple yet Effective Way for Task Arithmetic	Ruochen Jin et.al.	2407.07089	link
2024-07-09	Hypothetical Minds: Scaffolding Theory of Mind for Multi-Agent Tasks with Large Language Models	Logan Cross et.al.	2407.07086	link
2024-07-09	Adapting LLMs to Hebrew: Unveiling DictaLM 2.0 with Enhanced Vocabulary and Instruction Capabilities	Shaltiel Shmidman et.al.	2407.07080	null
2024-07-09	ConceptExpress: Harnessing Diffusion Models for Single-image Unsupervised Concept Extraction	Shaozhe Hao et.al.	2407.07077	link
2024-07-09	Lookback Lens: Detecting and Mitigating Contextual Hallucinations in Large Language Models Using Only Attention Maps	Yung-Sung Chuang et.al.	2407.07071	link
2024-07-09	Prompting Techniques for Secure Code Generation: A Systematic Investigation	Catherine Tony et.al.	2407.07064	null
2024-07-09	Internet of Agents: Weaving a Web of Heterogeneous Agents for Collaborative Intelligence	Weize Chen et.al.	2407.07061	link
2024-07-09	CAPformer: Compression-Aware Pre-trained Transformer for Low-Light Image Enhancement	Wang Wei et.al.	2407.07056	null
2024-07-08	Video-STaR: Self-Training Enables Video Instruction Tuning with Any Supervision	Orr Zohar et.al.	2407.06189	link
2024-07-08	CrowdMoGen: Zero-Shot Text-Driven Collective Motion Generation	Xinying Guo et.al.	2407.06188	null
2024-07-08	Left-Linear Rewriting in Adhesive Categories	Paolo Baldan et.al.	2407.06181	null
2024-07-08	The Tug-of-War Between Deepfake Generation and Detection	Hannah Lee et.al.	2407.06174	null
2024-07-08	On Speeding Up Language Model Evaluation	Jin Peng Zhou et.al.	2407.06172	null
2024-07-08	Inevitable Endgame of Comet Tsuchinshan-ATLAS (C/2023 A3)	Zdenek Sekanina et.al.	2407.06166	null
2024-07-08	What's Wrong with Your Code Generated by Large Language Models? An Extensive Study	Shihan Dou et.al.	2407.06153	null
2024-07-08	WIBACong: An Argument-centric Framework for Understanding US Congressional Hearings	Arman Irani et.al.	2407.06149	null
2024-07-08	Using Grammar Masking to Ensure Syntactic Validity in LLM-based Modeling Tasks	Lukas Netz et.al.	2407.06146	null
2024-07-08	ANOLE: An Open, Autoregressive, Native Large Multimodal Models for Interleaved Image-Text Generation	Ethan Chern et.al.	2407.06135	link
2024-07-05	LaRa: Efficient Large-Baseline Radiance Fields	Anpei Chen et.al.	2407.04699	null
2024-07-05	Me, Myself, and AI: The Situational Awareness Dataset (SAD) for LLMs	Rudolf Laine et.al.	2407.04694	link
2024-07-05	ANAH-v2: Scaling Analytical Hallucination Annotation of Large Language Models	Yuzhe Gu et.al.	2407.04693	link
2024-07-05	Rethinking Visual Prompting for Multimodal Large Language Models with External Knowledge	Yuanze Lin et.al.	2407.04681	null
2024-07-05	Seed-ASR: Understanding Diverse Speech and Contexts with LLM-based Speech Recognition	Ye Bai et.al.	2407.04675	null
2024-07-05	Lazarus: Resilient and Elastic Training of Mixture-of-Experts Models with Adaptive Expert Placement	Yongji Wu et.al.	2407.04656	null
2024-07-05	Entity Decomposition with Filtering: A Zero-Shot Clinical Named Entity Recognition Framework	Reza Averly et.al.	2407.04629	null
2024-07-05	On scalable oversight with weak LLMs judging strong LLMs	Zachary Kenton et.al.	2407.04622	null
2024-07-08	OneRestore: A Universal Restoration Framework for Composite Degradation	Yu Guo et.al.	2407.04621	link
2024-07-05	Learning to (Learn at Test Time): RNNs with Expressive Hidden States	Yu Sun et.al.	2407.04620	link
2024-07-03	Universal Length Generalization with Turing Programs	Kaiying Hou et.al.	2407.03310	null
2024-07-03	Eyes on the Game: Deciphering Implicit Human Signals to Infer Human Proficiency, Trust, and Intent	Nikhil Hulle et.al.	2407.03298	null
2024-07-03	Large Language Models for JSON Schema Discovery	Michael J. Mior et.al.	2407.03286	null
2024-07-03	LLM Internal States Reveal Hallucination Risk Faced With a Query	Ziwei Ji et.al.	2407.03282	null
2024-07-03	Cooperative Multi-Agent Deep Reinforcement Learning Methods for UAV-aided Mobile Edge Computing Networks	Mintae Kim et.al.	2407.03280	null
2024-07-03	Nesterov's Accelerated Jacobi-Type Methods for Large-scale Symmetric Positive Semidefinite Linear Systems	Ling Liang et.al.	2407.03272	null
2024-07-03	STF: Sentence Transformer Fine-Tuning For Topic Categorization With Limited Data	Kheir Eddine Daouadi et.al.	2407.03253	null
2024-07-03	ACTRESS: Active Retraining for Semi-supervised Visual Grounding	Weitai Kang et.al.	2407.03251	null
2024-07-04	When big data actually are low-rank, or entrywise approximation of certain function-generated matrices	Stanislav Budzinskiy et.al.	2407.03250	link
2024-07-03	Bridging Model Heterogeneity in Federated Learning via Uncertainty-based Asymmetrical Reciprocity Learning	Jiaqi Wang et.al.	2407.03247	link
2024-07-02	MInference 1.0: Accelerating Pre-filling for Long-Context LLMs via Dynamic Sparse Attention	Huiqiang Jiang et.al.	2407.02490	link
2024-07-02	Neurocache: Efficient Vector Retrieval for Long-range Language Modeling	Ali Safaya et.al.	2407.02486	link
2024-07-02	RankRAG: Unifying Context Ranking with Retrieval-Augmented Generation in LLMs	Yue Yu et.al.	2407.02485	null
2024-07-02	Characterizing the Interpretability of Attention Maps in Digital Pathology	Tomé Albuquerque et.al.	2407.02484	null
2024-07-02	MMedAgent: Learning to Use Medical Tools with Multi-modal Agent	Binxu Li et.al.	2407.02483	null
2024-07-02	Understanding Alignment in Multimodal LLMs: A Comprehensive Study	Elmira Amirloo et.al.	2407.02477	null
2024-07-02	Open Scene Graphs for Open World Object-Goal Navigation	Joel Loo et.al.	2407.02473	null
2024-07-02	Reliable Confidence Intervals for Information Retrieval Evaluation Using Generative A.I	Harrie Oosterhuis et.al.	2407.02464	null
2024-07-02	Decentralized Intelligence Network (DIN)	Abraham Nash et.al.	2407.02461	null
2024-07-02	Revisión de Métodos de Planificación de Camino de Cobertura para Entornos Agrícolas	Ismael Ait et.al.	2407.02449	null

(back to top)

Early Stopping

Publish Date	Title	Authors	PDF	Code
2024-09-09	Early-exit Convolutional Neural Networks	Edanur Demir et.al.	2409.05336	null
2024-09-08	Attention-Based Efficient Breath Sound Removal in Studio Audio Recordings	Nidula Elgiriyewithana et.al.	2409.04949	null
2024-09-01	RTop-K: Ultra-Fast Row-Wise Top-K Algorithm and GPU Implementation for Neural Networks	Xi Xie et.al.	2409.00822	null
2024-08-30	Dynamic Self-Consistency: Leveraging Reasoning Paths for Efficient LLM Sampling	Guangya Wan et.al.	2408.17017	null
2024-08-24	Inferring the shape of a solid inside a draining tank from its liquid level dynamics	Gbenga Fabusola et.al.	2408.14503	null
2024-08-26	Re-Mix: Optimizing Data Mixtures for Large Scale Imitation Learning	Joey Hejna et.al.	2408.14037	link
2024-08-24	Make Every Penny Count: Difficulty-Adaptive Self-Consistency for Cost-Efficient Reasoning	Xinglin Wang et.al.	2408.13457	null
2024-08-24	Face Clustering via Early Stopping and Edge Recall	Junjie Liu et.al.	2408.13431	link
2024-08-21	Critique-out-Loud Reward Models	Zachary Ankner et.al.	2408.11791	link
2024-08-21	EEG-Defender: Defending against Jailbreak through Early Exit Generation of Large Language Models	Chongwen Zhao et.al.	2408.11308	null
2024-08-20	Inferring Underwater Topography with FINN	Coşku Can Horuz et.al.	2408.10649	null
2024-08-15	An Efficient Continuous Control Perspective for Reinforcement-Learning-based Sequential Recommendation	Jun Wang et.al.	2408.08047	null
2024-08-14	Rethinking the Key Factors for the Generalization of Remote Sensing Stereo Matching Networks	Liting Jiang et.al.	2408.07613	null
2024-08-12	HeLiMOS: A Dataset for Moving Object Segmentation in 3D Point Clouds From Heterogeneous LiDAR Sensors	Hyungtae Lim et.al.	2408.06328	null
2024-08-12	Transfer learning of state-based potential games for process optimization in decentralized manufacturing systems	Steve Yuwono et.al.	2408.05992	null
2024-08-12	A Simple Early Exiting Framework for Accelerated Sampling in Diffusion Models	Taehong Moon et.al.	2408.05927	link
2024-08-08	Early-Exit meets Model-Distributed Inference at Edge Networks	Marco Colocrese et.al.	2408.05247	null
2024-08-09	PriPHiT: Privacy-Preserving Hierarchical Training of Deep Neural Networks	Yamin Sepehri et.al.	2408.05092	null
2024-08-09	Early Exit Strategies for Approximate k-NN Search in Dense Retrieval	Francesco Busolin et.al.	2408.04981	null
2024-08-07	Openstory++: A Large-scale Dataset and Benchmark for Instance-aware Open-domain Visual Storytelling	Zilyu Ye et.al.	2408.03695	null
2024-08-03	Advancing Green AI: Efficient and Accurate Lightweight CNNs for Rice Leaf Disease Identification	Khairun Saddami et.al.	2408.01752	null
2024-08-01	Early Stopping Based on Repeated Significance	Eric Bax et.al.	2408.00908	null
2024-07-31	Automated Sperm Morphology Analysis Based on Instance-Aware Part Segmentation	Wenyuan Chen et.al.	2408.00112	null
2024-07-30	Accelerating Large Language Model Inference with Self-Supervised Early Exits	Florian Valade et.al.	2407.21082	null
2024-07-25	An Efficient Inference Framework for Early-exit Large Language Models	Ruijie Miao et.al.	2407.20272	null
2024-07-26	Topology Optimization of Random Memristors for Input-Aware Dynamic SNN	Bo Wang et.al.	2407.18625	null
2024-07-25	Superior Scoring Rules for Probabilistic Evaluation of Single-Label Multi-Class Classification Tasks	Rouhollah Ahmadian et.al.	2407.17697	null
2024-07-23	Can Large Language Models Automatically Jailbreak GPT-4V?	Yuanwei Wu et.al.	2407.16686	null
2024-07-22	WTS: A Pedestrian-Centric Traffic Video Dataset for Fine-grained Spatial-Temporal Understanding	Quan Kong et.al.	2407.15350	null
2024-07-19	Joint or Disjoint: Mixing Training Regimes for Early-Exit Models	Bartłomiej Krzepkowski et.al.	2407.14320	link
2024-07-19	BERTer: The Efficient One	Pradyumna Saligram et.al.	2407.14039	null
2024-07-18	On the consistency of rotation curves and spatially integrated HI flux profiles	Tariq Yasin et.al.	2407.13754	null
2024-07-19	Revisiting Adaptive Cellular Recognition Under Domain Shifts: A Contextual Correspondence View	Jianan Fan et.al.	2407.12870	link
2024-07-17	Hallucination Index: An Image Quality Metric for Generative Reconstruction Models	Matthew Tivnan et.al.	2407.12780	null
2024-07-16	Subject-driven Text-to-Image Generation via Preference-based Reinforcement Learning	Yanting Miao et.al.	2407.12164	null
2024-07-16	Enhancing Split Computing and Early Exit Applications through Predefined Sparsity	Luigi Capogrosso et.al.	2407.11763	link
2024-07-16	Preconditioned Gradient Descent Finds Over-Parameterized Neural Networks with Sharp Generalization for Nonparametric Regression	Yingzhen Yang et.al.	2407.11353	null
2024-07-10	Exploring the Boundaries of On-Device Inference: When Tiny Falls Short, Go Hierarchical	Adarsh Prasad Behera et.al.	2407.11061	null
2024-07-15	Multilingual Contrastive Decoding via Language-Agnostic Layers Skipping	Wenhao Zhu et.al.	2407.10795	link
2024-07-13	Towards understanding epoch-wise double descent in two-layer linear neural networks	Amanda Olmin et.al.	2407.09845	null
2024-07-11	Sensor-Aware Classifiers for Energy-Efficient Time Series Applications on IoT Devices	Dina Hussein et.al.	2407.08715	null
2024-07-07	Learning Motion Blur Robust Vision Transformers with Dynamic Early Exit for Real-Time UAV Tracking	You Wu et.al.	2407.05383	null
2024-07-04	Unsupervised speech enhancement with spectral kurtosis and double deep priors	Hien Ohnaka et.al.	2407.03887	null
2024-07-02	Advancing Compressed Video Action Recognition through Progressive Knowledge Distillation	Efstathia Soufleri et.al.	2407.02713	link
2024-07-02	Zero-shot Video Restoration and Enhancement Using Pre-Trained Image Diffusion Model	Cong Cao et.al.	2407.01960	null
2024-07-01	Exact statistical analysis for response-adaptive clinical trials: a general and computationally tractable approach	Stef Baas et.al.	2407.01055	null
2024-07-01	SOOD++: Leveraging Unlabeled Data to Boost Oriented Object Detection	Dingkang Liang et.al.	2407.01016	null
2024-06-27	Adaptive Stochastic Weight Averaging	Caglar Demir et.al.	2406.19092	link
2024-06-26	An Order Theory Framework of Recurrence Equations for Static Cost Analysis $-$ Dynamic Inference of Non-Linear Inequality Invariants	Louis Rustenholz et.al.	2406.18260	null
2024-06-24	SegNet4D: Effective and Efficient 4D LiDAR Semantic Segmentation in Autonomous Driving Environments	Neng Wang et.al.	2406.16279	link
2024-06-21	Micro-power spoken keyword spotting on Xylo Audio 2	Hannah Bos et.al.	2406.15112	null
2024-06-21	Early stopping for conjugate gradients in statistical inverse problems	Laura Hucker et.al.	2406.15001	null
2024-06-21	Cost-Effective RF Fingerprinting Based on Hybrid CVNN-RF Classifier with Automated Multi-Dimensional Early-Exit Strategy	Jiayan Gan et.al.	2406.14869	null
2024-06-20	On Layer-wise Representation Similarity: Application for Multi-Exit Models with a Single Classifier	Jiachen Jiang et.al.	2406.14479	null

(back to top)
