youngjetduan / cv-arxiv-daily

🎓 Automatically updates papers of interest daily using GitHub Actions (runs every 12 hours)

Home Page: https://youngjetduan.github.io/cv-arxiv-daily/

Updated on 2024.09.11

Usage instructions: here

Table of Contents
  1. LLM
  2. Early Stopping

LLM

Publish Date Title Authors PDF Code
2024-09-09 DFabric: Scaling Out Data Parallel Applications with CXL-Ethernet Hybrid Interconnects Xu Zhang et.al. 2409.05404 null
2024-09-08 InstInfer: In-Storage Attention Offloading for Cost-Effective Long-Context LLM Inference Xiurui Pan et.al. 2409.04992 null
2024-09-04 Accelerating Large Language Model Training with Hybrid GPU-based Compression Lang Xu et.al. 2409.02423 null
2024-09-03 Contemporary Model Compression on Large Language Models Inference Dong Liu et.al. 2409.01990 null
2024-09-03 On-chain Validation of Tracking Data Messages (TDM) Using Distributed Deep Learning on a Proof of Stake (PoS) Blockchain Yasir Latif et.al. 2409.01614 null
2024-09-02 LuWu: An End-to-End In-Network Out-of-Core Optimizer for 100B-Scale Model-in-Network Data-Parallel Training on Distributed GPUs Mo Sun et.al. 2409.00918 null
2024-08-26 Model Parallel Training and Transfer Learning for Convolutional Neural Networks by Domain Decomposition Axel Klawonn et.al. 2408.14442 null
2024-08-23 Network-Offloaded Bandwidth-Optimal Broadcast and Allgather for Distributed AI Mikhail Khalilov et.al. 2408.13356 null
2024-08-22 LCM-SVC: Latent Diffusion Model Based Singing Voice Conversion with Inference Acceleration via Latent Consistency Distillation Shihao Chen et.al. 2408.12354 null
2024-08-23 MagicDec: Breaking the Latency-Throughput Tradeoff for Long Context Generation with Speculative Decoding Jian Chen et.al. 2408.11049 link
2024-08-20 Security Assessment of Hierarchical Federated Deep Learning D Alqattan et.al. 2408.10752 link
2024-08-20 Pluto and Charon: A Time and Memory Efficient Collaborative Edge AI Framework for Personal LLMs Fine-Tuning Bei Ouyang et.al. 2408.10746 null
2024-08-21 LongVILA: Scaling Long-Context Visual Language Models for Long Videos Fuzhao Xue et.al. 2408.10188 link
2024-08-17 RepControlNet: ControlNet Reparameterization Zhaoli Deng et.al. 2408.09240 null
2024-08-17 Atlas: Hierarchical Partitioning for Quantum Circuit Simulation on GPUs (Extended Version) Mingkuan Xu et.al. 2408.09055 null
2024-08-23 ABQ-LLM: Arbitrary-Bit Quantized Inference Acceleration for Large Language Models Chao Zeng et.al. 2408.08554 link
2024-08-16 Context-Aware Assistant Selection for Improved Inference Acceleration with Large Language Models Jerry Huang et.al. 2408.08470 null
2024-08-15 Asteroid: Resource-Efficient Hybrid Pipeline Parallelism for Collaborative DNN Training on Heterogeneous Edge Devices Shengyuan Ye et.al. 2408.08015 null
2024-08-17 Kraken: Inherently Parallel Transformers For Efficient Multi-Device Inference Rohan Baskar Prabhakar et.al. 2408.07802 null
2024-08-18 Post-Training Sparse Attention with Double Sparsity Shuo Yang et.al. 2408.07092 link
2024-08-12 LUT Tensor Core: Lookup Table Enables Efficient Low-Bit LLM Inference Acceleration Zhiwen Mo et.al. 2408.06003 null
2024-08-10 Eigen Attention: Attention in Low-Rank Space for KV Cache Compression Utkarsh Saxena et.al. 2408.05646 null
2024-08-05 SLO-aware GPU Frequency Scaling for Energy Efficient LLM Inference Serving Andreas Kosmas Kakolyris et.al. 2408.05235 null
2024-08-08 Partial Experts Checkpoint: Efficient Fault Tolerance for Sparse Mixture-of-Experts Model Training Weilin Cai et.al. 2408.04307 null
2024-08-07 Zero-Delay QKV Compression for Mitigating KV Cache and Network Bottlenecks in LLM Inference Zeyu Zhang et.al. 2408.04107 null
2024-08-08 NACL: A General and Effective KV Cache Eviction Framework for LLMs at Inference Time Yilong Chen et.al. 2408.03675 link
2024-08-04 Cross-layer Attention Sharing for Large Language Models Yongyu Mu et.al. 2408.01890 null
2024-08-01 Intermittent Semi-working Mask: A New Masking Paradigm for LLMs Mingcong Lu et.al. 2408.00539 null
2024-08-13 Finch: Prompt-guided Key-Value Cache Compression Giulio Corallo et.al. 2408.00167 null
2024-07-31 EdgeLLM: A Highly Efficient CPU-FPGA Heterogeneous Edge Accelerator for Large Language Models Mingqiang Huang et.al. 2407.21325 null
2024-07-30 Palu: Compressing KV-Cache with Low-Rank Projection Chi-Chih Chang et.al. 2407.21118 null
2024-07-30 ThinK: Thinner Key Cache by Query-Driven Pruning Yuhui Xu et.al. 2407.21018 null
2024-07-31 A2SF: Accumulative Attention Scoring with Forgetting Factor for Token Pruning in Transformer Decoder Hyun-rae Jo et.al. 2407.20485 null
2024-07-25 An Efficient Inference Framework for Early-exit Large Language Models Ruijie Miao et.al. 2407.20272 null
2024-07-29 When to Stop? Towards Efficient Code Generation in LLMs with Excess Token Prevention Lianghong Guo et.al. 2407.20042 link
2024-07-29 Inference acceleration for large language models using "stairs" assisted greedy generation Domas Grigaliūnas et.al. 2407.19947 null
2024-07-29 Rina: Enhancing Ring-AllReduce with In-network Aggregation in Distributed Model Training Zixuan Chen et.al. 2407.19721 null
2024-07-25 Efficient Inference of Vision Instruction-Following Models with Elastic Cache Zuyan Liu et.al. 2407.18121 link
2024-07-28 Keep the Cost Down: A Review on Methods to Optimize LLM's KV-Cache Consumption Luohe Shi et.al. 2407.18003 null
2024-07-25 Efficient LLM Training and Serving with Heterogeneous Context Sharding among Attention Heads Xihui Lin et.al. 2407.17678 null
2024-07-23 A deeper look at depth pruning of LLMs Shoaib Ahmed Siddiqui et.al. 2407.16286 link
2024-07-22 RazorAttention: Efficient KV Cache Compression Through Retrieval Heads Hanlin Tang et.al. 2407.15891 null
2024-07-22 AutoAD-Zero: A Training-Free Framework for Zero-Shot Audio Description Junyu Xie et.al. 2407.15850 link
2024-07-22 LLMmap: Fingerprinting For Large Language Models Dario Pasquini et.al. 2407.15847 null
2024-07-22 CarFormer: Self-Driving with Learned Object-Centric Representations Shadi Hamdan et.al. 2407.15843 null
2024-07-22 SlowFast-LLaVA: A Strong Training-Free Baseline for Video Large Language Models Mingze Xu et.al. 2407.15841 null
2024-07-22 MMInstruct: A High-Quality Multi-Modal Instruction Tuning Dataset with Extensive Diversity Yangzhou Liu et.al. 2407.15838 link
2024-07-22 dMel: Speech Tokenization made Simple He Bai et.al. 2407.15835 null
2024-07-22 Accelerating Pre-training of Multimodal LLMs via Chain-of-Sight Ziyuan Huang et.al. 2407.15819 null
2024-07-23 A simple and fast C++ thread pool implementation capable of running task graphs Dmytro Puyda et.al. 2407.15805 link
2024-07-22 Robust Facial Reactions Generation: An Emotion-Aware Framework with Modality Compensation Guanyu Hu et.al. 2407.15798 null
2024-07-22 Extracting Structured Insights from Financial News: An Augmented LLM Driven Approach Rian Dolphin et.al. 2407.15788 null
2024-07-22 Parallel Split Learning with Global Sampling Mohammad Kohankhaki et.al. 2407.15738 null
2024-07-22 vTensor: Flexible Virtual Tensor Management for Efficient LLM Serving Jiale Xu et.al. 2407.15309 link
2024-07-19 Performance Modeling and Workload Analysis of Distributed Large Language Model Training and Inference Joyjit Kundu et.al. 2407.14645 null
2024-07-19 Internal Consistency and Self-Feedback in Large Language Models: A Survey Xun Liang et.al. 2407.14507 link
2024-07-19 On Pre-training of Multimodal Language Models Customized for Chart Understanding Wan-Cyuan Fan et.al. 2407.14506 null
2024-07-19 PD-TPE: Parallel Decoder with Text-guided Position Encoding for 3D Visual Grounding Chenshu Hou et.al. 2407.14491 null
2024-07-19 Evaluating the Reliability of Self-Explanations in Large Language Models Korbinian Randl et.al. 2407.14487 link
2024-07-19 Contrastive Learning with Counterfactual Explanations for Radiology Report Generation Mingjie Li et.al. 2407.14474 null
2024-07-19 Check-Eval: A Checklist-based Approach for Evaluating Text Quality Jayr Pereira et.al. 2407.14467 null
2024-07-19 AttentNet: Fully Convolutional 3D Attention for Lung Nodule Detection Majedaldein Almahasneh et.al. 2407.14464 null
2024-07-19 PolyFormer: Scalable Node-wise Filters via Polynomial Graph Transformer Jiahong Ma et.al. 2407.14459 link
2024-07-19 Undermining Mental Proof: How AI Can Make Cooperation Harder by Making Thinking Easier Zachary Wojtowicz et.al. 2407.14452 null
2024-07-19 From Instruction to Insight: Exploring the Functional and Semantic Roles of Text in Interactive Dashboards Nicole Sultanum et.al. 2407.14451 null
2024-07-19 LoAS: Fully Temporal-Parallel Dataflow for Dual-Sparse Spiking Neural Networks Ruokai Yin et.al. 2407.14073 link
2024-07-19 LazyLLM: Dynamic Token Pruning for Efficient Long Context LLM Inference Qichen Fu et.al. 2407.14057 null
2024-07-18 SegPoint: Segment Any Point Cloud via Large Language Model Shuting He et.al. 2407.13761 null
2024-07-18 Black-Box Opinion Manipulation Attacks to Retrieval-Augmented Generation of Large Language Models Zhuo Chen et.al. 2407.13757 null
2024-07-18 CellularLint: A Systematic Approach to Identify Inconsistent Behavior in Cellular Network Specifications Mirza Masfiqur Rahman et.al. 2407.13742 null
2024-07-18 Baba Is AI: Break the Rules to Beat the Benchmark Nathan Cloos et.al. 2407.13729 null
2024-07-18 Compressing Structured Tensor Algebra Mahdi Ghorbani et.al. 2407.13726 null
2024-07-18 CoDefeater: Using LLMs To Find Defeaters in Assurance Cases Usman Gohar et.al. 2407.13717 link
2024-07-18 Attention Based Simple Primitives for Open World Compositional Zero-Shot Learning Ans Munir et.al. 2407.13715 link
2024-07-18 Understanding Reference Policies in Direct Preference Optimization Yixin Liu et.al. 2407.13709 link
2024-07-18 ANHALTEN: Cross-Lingual Transfer for German Token-Level Reference-Free Hallucination Detection Janek Herrlein et.al. 2407.13702 link
2024-07-18 Cross-Task Attack: A Self-Supervision Generative Framework Based on Attention Shift Qingyuan Zeng et.al. 2407.13700 null
2024-07-17 Analysis of Crab X-ray Polarization using Deeper IXPE Observations Josephine Wong et.al. 2407.12779 null
2024-07-17 The BRST quantisation of chiral BMS-like field theories José Figueroa-O'Farrill et.al. 2407.12778 null
2024-07-17 Jigsaw Game: Federated Clustering Jinxuan Xu et.al. 2407.12764 null
2024-07-17 LookupViT: Compressing visual information to a limited number of tokens Rajat Koner et.al. 2407.12753 null
2024-07-17 CHOSEN: Compilation to Hardware Optimization Stack for Efficient Vision Transformer Inference Mohammad Erfan Sadeghi et.al. 2407.12736 null
2024-07-17 EchoSight: Advancing Visual-Language Models with Wiki Knowledge Yibin Yan et.al. 2407.12735 null
2024-07-17 FlexFL: Heterogeneous Federated Learning via APoZ-Guided Flexible Pruning in Uncertain Scenarios Zekai Chen et.al. 2407.12729 null
2024-07-17 Exploring the interplay of individual traits and interaction dynamics in preschool social networks Gülşah Akçakır et.al. 2407.12728 null
2024-07-17 NL2Contact: Natural Language Guided 3D Hand-Object Contact Modeling with Diffusion Model Zhongqun Zhang et.al. 2407.12727 null
2024-07-17 Is Sarcasm Detection A Step-by-Step Reasoning Process in Large Language Models? Ben Yao et.al. 2407.12725 null
2024-07-16 GoldFinch: High Performance RWKV/Transformer Hybrid with Linear Pre-Fill and Extreme KV-Cache Compression Daniel Goldstein et.al. 2407.12077 link
2024-07-16 Hydra: Brokering Cloud and HPC Resources to Support the Execution of Heterogeneous Workloads at Scale Aymen Alsaadi et.al. 2407.11967 null
2024-07-16 UrbanWorld: An Urban World Model for 3D City Generation Yu Shang et.al. 2407.11965 null
2024-07-16 NeedleBench: Can LLMs Do Retrieval and Reasoning in 1 Million Context Window? Mo Li et.al. 2407.11963 link
2024-07-17 Hierarchical Separable Video Transformer for Snapshot Compressive Imaging Ping Wang et.al. 2407.11946 link
2024-07-16 Min-max theory and existence of H-spheres with arbitrary codimensions Rui Gao et.al. 2407.11945 null
2024-07-16 Beyond Spatial Explanations: Explainable Face Recognition in the Frequency Domain Marco Huber et.al. 2407.11941 null
2024-07-16 Generalized Difference-in-Differences Yiqing Xu et.al. 2407.11937 null
2024-07-16 Learning Multi-view Anomaly Detection Haoyang He et.al. 2407.11935 null
2024-07-16 Code Documentation and Analysis to Secure Software Development Paul Attie et.al. 2407.11934 null
2024-07-16 What's Wrong? Refining Meeting Summaries with LLM Feedback Frederic Kirstein et.al. 2407.11919 null
2024-07-16 PipeInfer: Accelerating LLM Inference using Asynchronous Pipelined Speculation Branden Butler et.al. 2407.11798 null
2024-07-21 Ada-KV: Optimizing KV Cache Eviction by Adaptive Budget Allocation for Efficient LLM Inference Yuan Feng et.al. 2407.11550 link
2024-07-15 VGBench: Evaluating Large Language Models on Vector Graphics Understanding and Generation Bocheng Zou et.al. 2407.10972 link
2024-07-15 Q-Sparse: All Large Language Models can be Fully Sparsely-Activated Hongyu Wang et.al. 2407.10969 null
2024-07-15 Induction of non-Fermi liquids by critical cavity photons at the onset of superradiance Ipsita Mandal et.al. 2407.10963 null
2024-07-15 Fast Matrix Multiplications for Lookup Table-Quantized LLMs Han Guo et.al. 2407.10960 link
2024-07-15 InVi: Object Insertion In Videos Using Off-the-Shelf Diffusion Models Nirat Saini et.al. 2407.10958 null
2024-07-15 MMM: Multilingual Mutual Reinforcement Effect Mix Datasets & Test with Open-domain Information Extraction Large Language Models Chengguang Gan et.al. 2407.10953 null
2024-07-15 The infamous 95 GeV $\rm b \bar b$ excess at LEP: Two b or not two b? Patrick Janot et.al. 2407.10948 null
2024-07-15 Can Textual Semantics Mitigate Sounding Object Segmentation Preference? Yaoting Wang et.al. 2407.10947 link
2024-07-15 GRUtopia: Dream General Robots in a City at Scale Hanqing Wang et.al. 2407.10943 link
2024-07-15 IDOL: Unified Dual-Modal Latent Diffusion for Human-Centric Joint Video-Depth Generation Yuanhao Zhai et.al. 2407.10937 link
2024-07-12 FairyLandAI: Personalized Fairy Tales utilizing ChatGPT and DALLE-3 Georgios Makridis et.al. 2407.09467 null
2024-07-12 Human-like Episodic Memory for Infinite Context LLMs Zafeirios Fountas et.al. 2407.09450 null
2024-07-12 ASTPrompter: Weakly Supervised Automated Language Model Red-Teaming to Identify Likely Toxic Prompts Amelia F. Hardy et.al. 2407.09447 link
2024-07-12 MUSCLE: A Model Update Strategy for Compatible LLM Evolution Jessica Echterhoff et.al. 2407.09435 null
2024-07-12 Open (Clinical) LLMs are Sensitive to Instruction Phrasings Alberto Mario Ceballos Arroyo et.al. 2407.09429 link
2024-07-12 TelecomGPT: A Framework to Build Telecom-Specific Large Language Models Hang Zou et.al. 2407.09424 null
2024-07-12 Mitigating Entity-Level Hallucination in Large Language Models Weihang Su et.al. 2407.09417 link
2024-07-12 SPIQA: A Dataset for Multimodal Question Answering on Scientific Papers Shraman Pramanick et.al. 2407.09413 link
2024-07-12 Thunderbolt: Causal Concurrent Consensus and Execution Junchao Chen et.al. 2407.09409 null
2024-07-12 PersonaRAG: Enhancing Retrieval-Augmented Generation Systems with User-Centric Agents Saber Zerhoudi et.al. 2407.09394 link
2024-07-11 MAVIS: Mathematical Visual Instruction Tuning Renrui Zhang et.al. 2407.08739 link
2024-07-11 Real-Time Anomaly Detection and Reactive Planning with Large Language Models Rohan Sinha et.al. 2407.08735 null
2024-07-11 Is Your Model Really A Good Math Reasoner? Evaluating Mathematical Reasoning with Checklist Zihao Zhou et.al. 2407.08733 null
2024-07-11 Planar decomposition of the HOMFLY polynomial for bipartite knots and links A. Anokhina et.al. 2407.08724 null
2024-07-11 A Taxonomy for Data Contamination in Large Language Models Medha Palavalli et.al. 2407.08716 null
2024-07-11 GTA: A Benchmark for General Tool Agents Jize Wang et.al. 2407.08713 link
2024-07-11 Live2Diff: Live Stream Translation via Uni-directional Attention in Video Diffusion Models Zhening Xing et.al. 2407.08701 null
2024-07-11 Flex-TPU: A Flexible TPU with Runtime Reconfigurable Dataflow Architecture Mohammed Elbtity et.al. 2407.08700 null
2024-07-11 Mitigating Catastrophic Forgetting in Language Transfer via Model Merging Anton Alexandrov et.al. 2407.08699 null
2024-07-11 Patterns of link reciprocity in directed, signed networks Anna Gallo et.al. 2407.08697 null
2024-07-10 Training on the Test Task Confounds Evaluation and Emergence Ricardo Dominguez-Olmedo et.al. 2407.07890 link
2024-07-10 Towards Robust Alignment of Language Models: Distributionally Robustifying Direct Preference Optimization Junkang Wu et.al. 2407.07880 link
2024-07-10 Bound States in Continuum via Singular Transfer Matrices Ovidiu-Zeno Lipan et.al. 2407.07879 null
2024-07-10 FACTS About Building Retrieval Augmented Generation-based Chatbots Rama Akkiraju et.al. 2407.07858 null
2024-07-10 OpenDiLoCo: An Open-Source Framework for Globally Distributed Low-Communication Training Sami Jaghouar et.al. 2407.07852 link
2024-07-10 Harnessing Integrated CPU-GPU System Memory for HPC: a first look into Grace Hopper Gabin Schieffer et.al. 2407.07850 null
2024-07-10 Natural Language Mechanisms via Self-Resolution with Foundation Models Nicolas Della Penna et.al. 2407.07845 null
2024-07-10 Study on Aspect Ratio Variability toward Robustness of Vision Transformer-based Vehicle Re-identification Mei Qiu et.al. 2407.07842 null
2024-07-10 Transformer Alignment in Large Language Models Murdock Aubry et.al. 2407.07810 null
2024-07-10 Attribute or Abstain: Large Language Models as Long Document Assistants Jan Buchmann et.al. 2407.07799 link
2024-07-09 AnyTaskTune: Advanced Domain-Specific Solutions through Task-Fine-Tuning Jiaxi Cui et.al. 2407.07094 link
2024-07-09 FBI-LLM: Scaling Up Fully Binarized LLMs from Scratch via Autoregressive Distillation Liqun Ma et.al. 2407.07093 link
2024-07-09 Fine-Tuning Linear Layers Only Is a Simple yet Effective Way for Task Arithmetic Ruochen Jin et.al. 2407.07089 link
2024-07-09 Hypothetical Minds: Scaffolding Theory of Mind for Multi-Agent Tasks with Large Language Models Logan Cross et.al. 2407.07086 link
2024-07-09 Adapting LLMs to Hebrew: Unveiling DictaLM 2.0 with Enhanced Vocabulary and Instruction Capabilities Shaltiel Shmidman et.al. 2407.07080 null
2024-07-09 ConceptExpress: Harnessing Diffusion Models for Single-image Unsupervised Concept Extraction Shaozhe Hao et.al. 2407.07077 link
2024-07-09 Lookback Lens: Detecting and Mitigating Contextual Hallucinations in Large Language Models Using Only Attention Maps Yung-Sung Chuang et.al. 2407.07071 link
2024-07-09 Prompting Techniques for Secure Code Generation: A Systematic Investigation Catherine Tony et.al. 2407.07064 null
2024-07-09 Internet of Agents: Weaving a Web of Heterogeneous Agents for Collaborative Intelligence Weize Chen et.al. 2407.07061 link
2024-07-09 CAPformer: Compression-Aware Pre-trained Transformer for Low-Light Image Enhancement Wang Wei et.al. 2407.07056 null
2024-07-08 Video-STaR: Self-Training Enables Video Instruction Tuning with Any Supervision Orr Zohar et.al. 2407.06189 link
2024-07-08 CrowdMoGen: Zero-Shot Text-Driven Collective Motion Generation Xinying Guo et.al. 2407.06188 null
2024-07-08 Left-Linear Rewriting in Adhesive Categories Paolo Baldan et.al. 2407.06181 null
2024-07-08 The Tug-of-War Between Deepfake Generation and Detection Hannah Lee et.al. 2407.06174 null
2024-07-08 On Speeding Up Language Model Evaluation Jin Peng Zhou et.al. 2407.06172 null
2024-07-08 Inevitable Endgame of Comet Tsuchinshan-ATLAS (C/2023 A3) Zdenek Sekanina et.al. 2407.06166 null
2024-07-08 What's Wrong with Your Code Generated by Large Language Models? An Extensive Study Shihan Dou et.al. 2407.06153 null
2024-07-08 WIBACong: An Argument-centric Framework for Understanding US Congressional Hearings Arman Irani et.al. 2407.06149 null
2024-07-08 Using Grammar Masking to Ensure Syntactic Validity in LLM-based Modeling Tasks Lukas Netz et.al. 2407.06146 null
2024-07-08 ANOLE: An Open, Autoregressive, Native Large Multimodal Models for Interleaved Image-Text Generation Ethan Chern et.al. 2407.06135 link
2024-07-05 LaRa: Efficient Large-Baseline Radiance Fields Anpei Chen et.al. 2407.04699 null
2024-07-05 Me, Myself, and AI: The Situational Awareness Dataset (SAD) for LLMs Rudolf Laine et.al. 2407.04694 link
2024-07-05 ANAH-v2: Scaling Analytical Hallucination Annotation of Large Language Models Yuzhe Gu et.al. 2407.04693 link
2024-07-05 Rethinking Visual Prompting for Multimodal Large Language Models with External Knowledge Yuanze Lin et.al. 2407.04681 null
2024-07-05 Seed-ASR: Understanding Diverse Speech and Contexts with LLM-based Speech Recognition Ye Bai et.al. 2407.04675 null
2024-07-05 Lazarus: Resilient and Elastic Training of Mixture-of-Experts Models with Adaptive Expert Placement Yongji Wu et.al. 2407.04656 null
2024-07-05 Entity Decomposition with Filtering: A Zero-Shot Clinical Named Entity Recognition Framework Reza Averly et.al. 2407.04629 null
2024-07-05 On scalable oversight with weak LLMs judging strong LLMs Zachary Kenton et.al. 2407.04622 null
2024-07-08 OneRestore: A Universal Restoration Framework for Composite Degradation Yu Guo et.al. 2407.04621 link
2024-07-05 Learning to (Learn at Test Time): RNNs with Expressive Hidden States Yu Sun et.al. 2407.04620 link
2024-07-03 Universal Length Generalization with Turing Programs Kaiying Hou et.al. 2407.03310 null
2024-07-03 Eyes on the Game: Deciphering Implicit Human Signals to Infer Human Proficiency, Trust, and Intent Nikhil Hulle et.al. 2407.03298 null
2024-07-03 Large Language Models for JSON Schema Discovery Michael J. Mior et.al. 2407.03286 null
2024-07-03 LLM Internal States Reveal Hallucination Risk Faced With a Query Ziwei Ji et.al. 2407.03282 null
2024-07-03 Cooperative Multi-Agent Deep Reinforcement Learning Methods for UAV-aided Mobile Edge Computing Networks Mintae Kim et.al. 2407.03280 null
2024-07-03 Nesterov's Accelerated Jacobi-Type Methods for Large-scale Symmetric Positive Semidefinite Linear Systems Ling Liang et.al. 2407.03272 null
2024-07-03 STF: Sentence Transformer Fine-Tuning For Topic Categorization With Limited Data Kheir Eddine Daouadi et.al. 2407.03253 null
2024-07-03 ACTRESS: Active Retraining for Semi-supervised Visual Grounding Weitai Kang et.al. 2407.03251 null
2024-07-04 When big data actually are low-rank, or entrywise approximation of certain function-generated matrices Stanislav Budzinskiy et.al. 2407.03250 link
2024-07-03 Bridging Model Heterogeneity in Federated Learning via Uncertainty-based Asymmetrical Reciprocity Learning Jiaqi Wang et.al. 2407.03247 link
2024-07-02 MInference 1.0: Accelerating Pre-filling for Long-Context LLMs via Dynamic Sparse Attention Huiqiang Jiang et.al. 2407.02490 link
2024-07-02 Neurocache: Efficient Vector Retrieval for Long-range Language Modeling Ali Safaya et.al. 2407.02486 link
2024-07-02 RankRAG: Unifying Context Ranking with Retrieval-Augmented Generation in LLMs Yue Yu et.al. 2407.02485 null
2024-07-02 Characterizing the Interpretability of Attention Maps in Digital Pathology Tomé Albuquerque et.al. 2407.02484 null
2024-07-02 MMedAgent: Learning to Use Medical Tools with Multi-modal Agent Binxu Li et.al. 2407.02483 null
2024-07-02 Understanding Alignment in Multimodal LLMs: A Comprehensive Study Elmira Amirloo et.al. 2407.02477 null
2024-07-02 Open Scene Graphs for Open World Object-Goal Navigation Joel Loo et.al. 2407.02473 null
2024-07-02 Reliable Confidence Intervals for Information Retrieval Evaluation Using Generative A.I Harrie Oosterhuis et.al. 2407.02464 null
2024-07-02 Decentralized Intelligence Network (DIN) Abraham Nash et.al. 2407.02461 null
2024-07-02 Revisión de Métodos de Planificación de Camino de Cobertura para Entornos Agrícolas Ismael Ait et.al. 2407.02449 null

Early Stopping

Publish Date Title Authors PDF Code
2024-09-09 Early-exit Convolutional Neural Networks Edanur Demir et.al. 2409.05336 null
2024-09-08 Attention-Based Efficient Breath Sound Removal in Studio Audio Recordings Nidula Elgiriyewithana et.al. 2409.04949 null
2024-09-01 RTop-K: Ultra-Fast Row-Wise Top-K Algorithm and GPU Implementation for Neural Networks Xi Xie et.al. 2409.00822 null
2024-08-30 Dynamic Self-Consistency: Leveraging Reasoning Paths for Efficient LLM Sampling Guangya Wan et.al. 2408.17017 null
2024-08-24 Inferring the shape of a solid inside a draining tank from its liquid level dynamics Gbenga Fabusola et.al. 2408.14503 null
2024-08-26 Re-Mix: Optimizing Data Mixtures for Large Scale Imitation Learning Joey Hejna et.al. 2408.14037 link
2024-08-24 Make Every Penny Count: Difficulty-Adaptive Self-Consistency for Cost-Efficient Reasoning Xinglin Wang et.al. 2408.13457 null
2024-08-24 Face Clustering via Early Stopping and Edge Recall Junjie Liu et.al. 2408.13431 link
2024-08-21 Critique-out-Loud Reward Models Zachary Ankner et.al. 2408.11791 link
2024-08-21 EEG-Defender: Defending against Jailbreak through Early Exit Generation of Large Language Models Chongwen Zhao et.al. 2408.11308 null
2024-08-20 Inferring Underwater Topography with FINN Coşku Can Horuz et.al. 2408.10649 null
2024-08-15 An Efficient Continuous Control Perspective for Reinforcement-Learning-based Sequential Recommendation Jun Wang et.al. 2408.08047 null
2024-08-14 Rethinking the Key Factors for the Generalization of Remote Sensing Stereo Matching Networks Liting Jiang et.al. 2408.07613 null
2024-08-12 HeLiMOS: A Dataset for Moving Object Segmentation in 3D Point Clouds From Heterogeneous LiDAR Sensors Hyungtae Lim et.al. 2408.06328 null
2024-08-12 Transfer learning of state-based potential games for process optimization in decentralized manufacturing systems Steve Yuwono et.al. 2408.05992 null
2024-08-12 A Simple Early Exiting Framework for Accelerated Sampling in Diffusion Models Taehong Moon et.al. 2408.05927 link
2024-08-08 Early-Exit meets Model-Distributed Inference at Edge Networks Marco Colocrese et.al. 2408.05247 null
2024-08-09 PriPHiT: Privacy-Preserving Hierarchical Training of Deep Neural Networks Yamin Sepehri et.al. 2408.05092 null
2024-08-09 Early Exit Strategies for Approximate k-NN Search in Dense Retrieval Francesco Busolin et.al. 2408.04981 null
2024-08-07 Openstory++: A Large-scale Dataset and Benchmark for Instance-aware Open-domain Visual Storytelling Zilyu Ye et.al. 2408.03695 null
2024-08-03 Advancing Green AI: Efficient and Accurate Lightweight CNNs for Rice Leaf Disease Identification Khairun Saddami et.al. 2408.01752 null
2024-08-01 Early Stopping Based on Repeated Significance Eric Bax et.al. 2408.00908 null
2024-07-31 Automated Sperm Morphology Analysis Based on Instance-Aware Part Segmentation Wenyuan Chen et.al. 2408.00112 null
2024-07-30 Accelerating Large Language Model Inference with Self-Supervised Early Exits Florian Valade et.al. 2407.21082 null
2024-07-25 An Efficient Inference Framework for Early-exit Large Language Models Ruijie Miao et.al. 2407.20272 null
2024-07-26 Topology Optimization of Random Memristors for Input-Aware Dynamic SNN Bo Wang et.al. 2407.18625 null
2024-07-25 Superior Scoring Rules for Probabilistic Evaluation of Single-Label Multi-Class Classification Tasks Rouhollah Ahmadian et.al. 2407.17697 null
2024-07-23 Can Large Language Models Automatically Jailbreak GPT-4V? Yuanwei Wu et.al. 2407.16686 null
2024-07-22 WTS: A Pedestrian-Centric Traffic Video Dataset for Fine-grained Spatial-Temporal Understanding Quan Kong et.al. 2407.15350 null
2024-07-19 Joint or Disjoint: Mixing Training Regimes for Early-Exit Models Bartłomiej Krzepkowski et.al. 2407.14320 link
2024-07-19 BERTer: The Efficient One Pradyumna Saligram et.al. 2407.14039 null
2024-07-18 On the consistency of rotation curves and spatially integrated HI flux profiles Tariq Yasin et.al. 2407.13754 null
2024-07-19 Revisiting Adaptive Cellular Recognition Under Domain Shifts: A Contextual Correspondence View Jianan Fan et.al. 2407.12870 link
2024-07-17 Hallucination Index: An Image Quality Metric for Generative Reconstruction Models Matthew Tivnan et.al. 2407.12780 null
2024-07-16 Subject-driven Text-to-Image Generation via Preference-based Reinforcement Learning Yanting Miao et.al. 2407.12164 null
2024-07-16 Enhancing Split Computing and Early Exit Applications through Predefined Sparsity Luigi Capogrosso et.al. 2407.11763 link
2024-07-16 Preconditioned Gradient Descent Finds Over-Parameterized Neural Networks with Sharp Generalization for Nonparametric Regression Yingzhen Yang et.al. 2407.11353 null
2024-07-10 Exploring the Boundaries of On-Device Inference: When Tiny Falls Short, Go Hierarchical Adarsh Prasad Behera et.al. 2407.11061 null
2024-07-15 Multilingual Contrastive Decoding via Language-Agnostic Layers Skipping Wenhao Zhu et.al. 2407.10795 link
2024-07-13 Towards understanding epoch-wise double descent in two-layer linear neural networks Amanda Olmin et.al. 2407.09845 null
2024-07-11 Sensor-Aware Classifiers for Energy-Efficient Time Series Applications on IoT Devices Dina Hussein et.al. 2407.08715 null
2024-07-07 Learning Motion Blur Robust Vision Transformers with Dynamic Early Exit for Real-Time UAV Tracking You Wu et.al. 2407.05383 null
2024-07-04 Unsupervised speech enhancement with spectral kurtosis and double deep priors Hien Ohnaka et.al. 2407.03887 null
2024-07-02 Advancing Compressed Video Action Recognition through Progressive Knowledge Distillation Efstathia Soufleri et.al. 2407.02713 link
2024-07-02 Zero-shot Video Restoration and Enhancement Using Pre-Trained Image Diffusion Model Cong Cao et.al. 2407.01960 null
2024-07-01 Exact statistical analysis for response-adaptive clinical trials: a general and computationally tractable approach Stef Baas et.al. 2407.01055 null
2024-07-01 SOOD++: Leveraging Unlabeled Data to Boost Oriented Object Detection Dingkang Liang et.al. 2407.01016 null
2024-06-27 Adaptive Stochastic Weight Averaging Caglar Demir et.al. 2406.19092 link
2024-06-26 An Order Theory Framework of Recurrence Equations for Static Cost Analysis $-$ Dynamic Inference of Non-Linear Inequality Invariants Louis Rustenholz et.al. 2406.18260 null
2024-06-24 SegNet4D: Effective and Efficient 4D LiDAR Semantic Segmentation in Autonomous Driving Environments Neng Wang et.al. 2406.16279 link
2024-06-21 Micro-power spoken keyword spotting on Xylo Audio 2 Hannah Bos et.al. 2406.15112 null
2024-06-21 Early stopping for conjugate gradients in statistical inverse problems Laura Hucker et.al. 2406.15001 null
2024-06-21 Cost-Effective RF Fingerprinting Based on Hybrid CVNN-RF Classifier with Automated Multi-Dimensional Early-Exit Strategy Jiayan Gan et.al. 2406.14869 null
2024-06-20 On Layer-wise Representation Similarity: Application for Multi-Exit Models with a Single Classifier Jiachen Jiang et.al. 2406.14479 null

About

License: Apache License 2.0


Languages

Language: Python 100.0%