[![Contributors][contributors-shield]][contributors-url] [![Forks][forks-shield]][forks-url] [![Stargazers][stars-shield]][stars-url] [![Issues][issues-shield]][issues-url]
Usage instructions: here
Table of Contents
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2023-03-15 | RefiNeRF: Modelling dynamic neural radiance fields with inconsistent or missing camera parameters | Shuja Khalid et.al. | 2303.08695 | null |
2023-03-09 | Revisiting Rotation Averaging: Uncertainties and Robust Losses | Ganlin Zhang et.al. | 2303.05195 | link |
2023-02-28 | Nonlinear Intensity, Scale and Rotation Invariant Matching for Multimodal Images | Zhongli Fan et.al. | 2302.14239 | link |
2023-02-27 | BaLi-RF: Bandlimited Radiance Fields for Dynamic Scene Modeling | Sameera Ramasinghe et.al. | 2302.13543 | null |
2023-02-21 | EC-SfM: Efficient Covisibility-based Structure-from-Motion for Both Sequential and Unordered Images | Zhichao Ye et.al. | 2302.10544 | link |
2023-02-18 | Bridge Damage Cause Estimation Using Multiple Images Based on Visual Question Answering | Tatsuro Yamane et.al. | 2302.09208 | null |
2023-02-12 | Uncertainty-Driven Dense Two-View Structure from Motion | Weirong Chen et.al. | 2302.00523 | null |
2023-01-28 | AdaSfM: From Coarse Global to Fine Incremental Adaptive Structure from Motion | Yu Chen et.al. | 2301.12135 | null |
2023-01-20 | A vision-based autonomous UAV inspection framework for unknown tunnel construction sites with dynamic obstacles | Zhefan Xu et.al. | 2301.08422 | null |
2023-01-05 | Robust Dynamic Radiance Fields | Yu-Lun Liu et.al. | 2301.02239 | null |
2022-12-24 | Polarimetric Multi-View Inverse Rendering | Jinyu Zhao et.al. | 2212.12721 | null |
2022-12-13 | Accidental Turntables: Learning 3D Pose by Watching Objects Turn | Zezhou Cheng et.al. | 2212.06300 | null |
2022-12-04 | 3D Object Aided Self-Supervised Monocular Depth Estimation | Songlin Wei et.al. | 2212.01768 | null |
2023-03-15 | High-Res Facial Appearance Capture from Polarized Smartphone Images | Dejan Azinović et.al. | 2212.01160 | null |
2022-11-28 | FeatureBooster: Boosting Feature Descriptors with a Lightweight Neural Network | Xinjiang Wang et.al. | 2211.15069 | null |
2022-11-24 | JigsawPlan: Room Layout Jigsaw Puzzle Extreme Structure from Motion using Diffusion Models | Sepidehsadat Hosseini et.al. | 2211.13785 | null |
2022-11-24 | SfM-TTR: Using Structure from Motion for Test-Time Refinement of Single-View Depth Networks | Sergio Izquierdo et.al. | 2211.13551 | null |
2022-11-22 | Level-S | Yuxi Xiao et.al. | 2211.12018 | null |
2022-11-21 | Towards Live 3D Reconstruction from Wearable Video: An Evaluation of V-SLAM, NeRF, and Videogrammetry Techniques | David Ramirez et.al. | 2211.11836 | null |
2022-11-14 | Controllable GAN Synthesis Using Non-Rigid Structure-from-Motion | René Haas et.al. | 2211.07195 | null |

Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2024-05-21 | Spatial-aware Attention Generative Adversarial Network for Semi-supervised Anomaly Detection in Medical Image | Zerui Zhang et.al. | 2405.12872 | null |
2024-05-21 | A Dataset and Baselines for Measuring and Predicting the Music Piece Memorability | Li-Yang Tseng et.al. | 2405.12847 | null |
2024-05-20 | Paired Conditional Generative Adversarial Network for Highly Accelerated Liver 4D MRI | Di Xu et.al. | 2405.12357 | null |
2024-05-20 | EGAN: Evolutional GAN for Ransomware Evasion | Daniel Commey et.al. | 2405.12266 | null |
2024-05-19 | Nickel and Diming Your GAN: A Dual-Method Approach to Enhancing GAN Efficiency via Knowledge Distillation | Sangyeop Yeo et.al. | 2405.11614 | null |
2024-05-21 | A GAN-Based Data Poisoning Attack Against Federated Learning Systems and Its Countermeasure | Wei Sun et.al. | 2405.11440 | null |
2024-05-18 | Few-Shot API Attack Detection: Overcoming Data Scarcity with GAN-Inspired Learning | Udi Aharon et.al. | 2405.11258 | null |
2024-05-16 | An Autoencoder and Generative Adversarial Networks Approach for Multi-Omics Data Imbalanced Class Handling and Classification | Ibrahim Al-Hurani et.al. | 2405.09756 | null |
2024-05-15 | Towards Evaluating the Robustness of Automatic Speech Recognition Systems via Audio Style Transfer | Weifei Jin et.al. | 2405.09470 | null |
2024-05-15 | Deep Learning in Earthquake Engineering: A Comprehensive Review | Yazhou Xie et.al. | 2405.09021 | null |
2024-05-13 | RATLIP: Generative Adversarial CLIP Text-to-Image Synthesis Based on Recurrent Affine Transformations | Chengde Lin et.al. | 2405.08114 | link |
2024-05-13 | SAR Image Synthesis with Diffusion Models | Denisa Qosja et.al. | 2405.07776 | null |
2024-05-12 | Semantic Loss Functions for Neuro-Symbolic Structured Prediction | Kareem Ahmed et.al. | 2405.07387 | null |
2024-05-12 | PotatoGANs: Utilizing Generative Adversarial Networks, Instance Segmentation, and Explainable AI for Enhanced Potato Disease Identification and Classification | Mohammad Shafiul Alam et.al. | 2405.07332 | link |
2024-05-10 | Deep MMD Gradient Flow without adversarial training | Alexandre Galashov et.al. | 2405.06780 | null |
2024-05-09 | Photonic quantum generative adversarial networks for classical data | Tigran Sedrakyan et.al. | 2405.06023 | null |
2024-05-13 | Characteristic Learning for Provable One Step Generation | Zhao Ding et.al. | 2405.05512 | link |
2024-05-08 | Cross-Modality Translation with Generative Adversarial Networks to Unveil Alzheimer's Disease Biomarkers | Reihaneh Hassanzadeh et.al. | 2405.05462 | null |
2024-05-08 | StyleMamba : State Space Model for Efficient Text-driven Image Style Transfer | Zijia Wang et.al. | 2405.05027 | null |
2024-05-08 | Improving Long Text Understanding with Knowledge Distilled from Summarization Model | Yan Liu et.al. | 2405.04955 | null |
2024-05-08 | HAGAN: Hybrid Augmented Generative Adversarial Network for Medical Image Synthesis | Zhihan Ju et.al. | 2405.04902 | null |
2024-05-07 | SingIt! Singer Voice Transformation | Amit Eliav et.al. | 2405.04627 | null |
2024-05-07 | Data augmentation experiments with style-based quantum generative adversarial networks on trapped-ion and superconducting-qubit technologies | Julien Baglio et.al. | 2405.04401 | null |
2024-05-07 | Diffusion-driven GAN Inversion for Multi-Modal Face Image Generation | Jihyun Kim et.al. | 2405.04356 | null |
2024-05-07 | Improving Offline Reinforcement Learning with Inaccurate Simulators | Yiwen Hou et.al. | 2405.04307 | null |
2024-05-07 | Bidirectional Adversarial Autoencoders for the design of Plasmonic Metasurfaces | Yuansan Liu et.al. | 2405.04056 | link |
2024-05-06 | Generative adversarial learning with optimal input dimension and its adaptive generator architecture | Zhiyao Tan et.al. | 2405.03723 | null |
2024-05-06 | CCDM: Continuous Conditional Diffusion Models for Image Generation | Xin Ding et.al. | 2405.03546 | link |
2024-05-06 | GLIP: Electromagnetic Field Exposure Map Completion by Deep Generative Networks | Mohammed Mallik et.al. | 2405.03384 | null |
2024-05-05 | AnoGAN for Tabular Data: A Novel Approach to Anomaly Detection | Aditya Singh et.al. | 2405.03075 | null |
2024-05-12 | Boundary-aware Decoupled Flow Networks for Realistic Extreme Rescaling | Jinmin Li et.al. | 2405.02941 | null |
2024-05-05 | SMCD: High Realism Motion Style Transfer via Mamba-based Diffusion | Ziyun Qian et.al. | 2405.02844 | null |
2024-05-03 | Reconstructing the mid-infrared spectra of galaxies using ultraviolet to submillimeter photometry and Deep Generative Networks | Agapi Rissaki et.al. | 2405.02153 | null |
2024-05-03 | Three-Dimensional Amyloid-Beta PET Synthesis from Structural MRI with Conditional Generative Adversarial Networks | Fernando Vega et.al. | 2405.02109 | null |
2024-05-03 | Report on the AAPM Grand Challenge on deep generative modeling for learning medical image statistics | Rucha Deshpande et.al. | 2405.01822 | null |
2024-05-02 | Investigation on optimal microstructure of dual-phase steel with high strength and ductility by machine learning | Misato Suzuki et.al. | 2405.01689 | null |
2024-05-07 | Towards Inclusive Face Recognition Through Synthetic Ethnicity Alteration | Praveen Kumar Chandaliya et.al. | 2405.01273 | null |
2024-05-01 | UWAFA-GAN: Ultra-Wide-Angle Fluorescein Angiography Transformation via Multi-scale Generation and Registration Enhancement | Ruiquan Ge et.al. | 2405.00542 | link |
2024-05-01 | Beamforming Inferring by Conditional WGAN-GP for Holographic Antenna Arrays | Fenghao Zhu et.al. | 2405.00391 | null |
2024-04-30 | IgCONDA-PET: Implicitly-Guided Counterfactual Diffusion for Detecting Anomalies in PET Images | Shadab Ahamed et.al. | 2405.00239 | link |
2024-04-30 | SwipeGANSpace: Swipe-to-Compare Image Generation via Efficient Latent Space Exploration | Yuto Nakashima et.al. | 2404.19693 | null |
2024-04-30 | Seeing Through the Clouds: Cloud Gap Imputation with Prithvi Foundation Model | Denys Godwin et.al. | 2404.19609 | null |
2024-05-01 | Mapping New Realities: Ground Truth Image Creation with Pix2Pix Image-to-Image Translation | Zhenglin Li et.al. | 2404.19265 | null |
2024-04-29 | Socially Adaptive Path Planning Based on Generative Adversarial Network | Yao Wang et.al. | 2404.18687 | null |
2024-04-26 | Generative Dataset Distillation: Balancing Global Structure and Local Details | Longzhen Li et.al. | 2404.17732 | null |
2024-05-01 | Federated Transfer Component Analysis Towards Effective VNF Profiling | Xunzheng Zhang et.al. | 2404.17553 | null |
2024-04-26 | DPGAN: A Dual-Path Generative Adversarial Network for Missing Data Imputation in Graphs | Xindi Zheng et.al. | 2404.17164 | null |
2024-04-26 | An Investigation of Time-Frequency Representation Discriminators for High-Fidelity Vocoder | Yicheng Gu et.al. | 2404.17161 | null |
2024-04-26 | Synthesizing Iris Images using Generative Adversarial Networks: Survey and Comparative Analysis | Shivangi Yadav et.al. | 2404.17105 | null |
2024-04-25 | Channel Modeling for FR3 Upper Mid-band via Generative Adversarial Networks | Yaqi Hu et.al. | 2404.17069 | null |
2024-04-25 | DE-CGAN: Boosting rTMS Treatment Prediction with Diversity Enhancing Conditional Generative Adversarial Networks | Matthew Squires et.al. | 2404.16913 | null |
2024-04-26 | Guardians of the Quantum GAN | Archisman Ghosh et.al. | 2404.16156 | null |
2024-04-24 | Quantitative Characterization of Retinal Features in Translated OCTA | Rashadul Hasan Badhon et.al. | 2404.16133 | null |
2024-04-24 | HDDGAN: A Heterogeneous Dual-Discriminator Generative Adversarial Network for Infrared and Visible Image Fusion | Guosheng Lu et.al. | 2404.15992 | null |
2024-04-24 | Toward Physics-Aware Deep Learning Architectures for LiDAR Intensity Simulation | Vivek Anand et.al. | 2404.15774 | null |
2024-04-24 | SRAGAN: Saliency Regularized and Attended Generative Adversarial Network for Chinese Ink-wash Painting Generation | Xiang Gao et.al. | 2404.15743 | null |
2024-04-24 | Site-Specific Ground Motion Generative Model for Crustal Earthquakes in Japan Based on Generative Adversarial Networks | Yuma Matsumoto et.al. | 2404.15640 | link |
2024-04-24 | Security Analysis of WiFi-based Sensing Systems: Threats from Perturbation Attacks | Hangcheng Cao et.al. | 2404.15587 | null |
2024-04-23 | CoARF: Controllable 3D Artistic Style Transfer for Radiance Fields | Deheng Zhang et.al. | 2404.14967 | null |
2024-04-23 | Music Style Transfer With Diffusion Model | Hong Huang et.al. | 2404.14771 | null |
2024-04-23 | Skip the Benchmark: Generating System-Level High-Level Synthesis Data using Generative Machine Learning | Yuchao Liao et.al. | 2404.14754 | link |
2024-04-20 | Generative Subspace Adversarial Active Learning for Outlier Detection in Multiple Views of High-dimensional Data | Jose Cribeiro-Ramallo et.al. | 2404.14451 | null |
2024-04-24 | Regional Style and Color Transfer | Zhicheng Ding et.al. | 2404.13880 | null |
2024-04-22 | Distributional Black-Box Model Inversion Attack with Multi-Agent Reinforcement Learning | Huan Bao et.al. | 2404.13860 | null |
2024-04-28 | A Comparative Study on Enhancing Prediction in Social Network Advertisement through Data Augmentation | Qikai Yang et.al. | 2404.13812 | null |
2024-04-21 | Counterfactual Reasoning Using Predicted Latent Personality Dimensions for Optimizing Persuasion Outcome | Donghuo Zeng et.al. | 2404.13792 | null |
2024-04-21 | Towards General Conceptual Model Editing via Adversarial Representation Engineering | Yihao Zhang et.al. | 2404.13752 | link |
2024-04-23 | A Dataset and Model for Realistic License Plate Deblurring | Haoyan Gong et.al. | 2404.13677 | link |
2024-04-26 | Bt-GAN: Generating Fair Synthetic Healthdata via Bias-transforming Generative Adversarial Networks | Resmi Ramachandranpillai et.al. | 2404.13634 | null |
2024-04-21 | Rethink Arbitrary Style Transfer with Transformer and Contrastive Learning | Zhanjie Zhang et.al. | 2404.13584 | null |
2024-04-21 | Exploring Diverse Methods in Visual Question Answering | Panfeng Li et.al. | 2404.13565 | null |
2024-04-21 | Generalized Regression with Conditional GANs | Deddy Jobson et.al. | 2404.13500 | link |
2024-04-19 | DensePANet: An improved generative adversarial network for photoacoustic tomography image reconstruction from sparse data | Hesam Hakimnejad et.al. | 2404.13101 | null |
2024-04-19 | RadRotator: 3D Rotation of Radiographs with Diffusion Models | Pouria Rouzrokh et.al. | 2404.13000 | null |
2024-04-19 | Explainable Deepfake Video Detection using Convolutional Neural Network and CapsuleNet | Gazi Hasin Ishrak et.al. | 2404.12841 | null |
2024-04-19 | PATE-TripleGAN: Privacy-Preserving Image Synthesis with Gaussian Differential Privacy | Zepeng Jiang et.al. | 2404.12730 | null |
2024-04-19 | MLSD-GAN -- Generating Strong High Quality Face Morphing Attacks using Latent Semantic Disentanglement | Aravinda Reddy PN et.al. | 2404.12679 | null |
2024-04-19 | F2FLDM: Latent Diffusion Models with Histopathology Pre-Trained Embeddings for Unpaired Frozen Section to FFPE Translation | Man M. Ho et.al. | 2404.12650 | null |
2024-04-18 | Alleviating Catastrophic Forgetting in Facial Expression Recognition with Emotion-Centered Models | Israel A. Laurensi et.al. | 2404.12260 | null |
2024-04-18 | Generating synthetic electroretinogram waveforms using Artificial Intelligence to improve classification of retinal conditions in under-represented populations | Mikhail Kulyabin et.al. | 2404.11842 | null |
2024-04-18 | Tailoring Generative Adversarial Networks for Smooth Airfoil Design | Joyjit Chattoraj et.al. | 2404.11816 | null |
2024-04-17 | Towards Highly Realistic Artistic Style Transfer via Stable Diffusion with Step-aware and Layer-aware Prompt | Zhanjie Zhang et.al. | 2404.11474 | link |
2024-04-17 | What-if Analysis Framework for Digital Twins in 6G Wireless Network Management | Elif Ak et.al. | 2404.11394 | null |
2024-04-19 | KI-GAN: Knowledge-Informed Generative Adversarial Networks for Enhanced Multi-Vehicle Trajectory Forecasting at Signalized Intersections | Chuheng Wei et.al. | 2404.11181 | link |
2024-04-16 | AV-GAN: Attention-Based Varifocal Generative Adversarial Network for Uneven Medical Image Translation | Zexin Li et.al. | 2404.10714 | null |
2024-04-16 | Gaussian Splatting Decoder for 3D-aware Generative Adversarial Networks | Florian Barthel et.al. | 2404.10625 | null |
2024-04-15 | Multi-objective evolutionary GAN for tabular data synthesis | Nian Ran et.al. | 2404.10176 | link |
2024-04-15 | AIGeN: An Adversarial Approach for Instruction Generation in VLN | Niyati Rawal et.al. | 2404.10054 | null |
2024-04-15 | VFLGAN: Vertical Federated Learning-based Generative Adversarial Network for Vertically Partitioned Data Publication | Xun Yuan et.al. | 2404.09722 | null |
2024-04-15 | Text-Driven Diverse Facial Texture Generation via Progressive Latent-Space Refinement | Chi Wang et.al. | 2404.09540 | null |
2024-04-15 | Improved Object-Based Style Transfer with Single Deep Network | Harshmohan Kulkarni et.al. | 2404.09461 | null |
2024-04-14 | Counteracting Concept Drift by Learning with Future Malware Predictions | Branislav Bosansky et.al. | 2404.09352 | null |
2024-04-12 | Single-image driven 3d viewpoint training data augmentation for effective wine label recognition | Yueh-Cheng Huang et.al. | 2404.08820 | null |
2024-04-12 | Multi-Branch Generative Models for Multichannel Imaging with an Application to PET/CT Joint Reconstruction | Noel Jeffrey Pinton et.al. | 2404.08748 | null |
2024-04-12 | Synthesis of Through-Wall Micro-Doppler Signatures of Human Motions Using Generative Adversarial Networks | Kainat Yasmeen Shobha Sundar Ram et.al. | 2404.08739 | null |
2024-04-11 | Synthetic Brain Images: Bridging the Gap in Brain Mapping With Generative Adversarial Model | Drici Mourad et.al. | 2404.08703 | null |
2024-04-12 | An improved tabular data generator with VAE-GMM integration | Patricia A. Apellániz et.al. | 2404.08434 | null |
2024-04-11 | GAN-based iterative motion estimation in HASTE MRI | Mathias S. Feinler et.al. | 2404.07576 | null |
2024-04-11 | ObjBlur: A Curriculum Learning Approach With Progressive Object-Level Blurring for Improved Layout-to-Image Generation | Stanislav Frolov et.al. | 2404.07564 | null |
2024-04-11 | Enhancing Network Intrusion Detection Performance using Generative Adversarial Networks | Xinxing Zhao et.al. | 2404.07464 | null |
2024-04-11 | Privacy preserving layer partitioning for Deep Neural Network models | Kishore Rajasekar et.al. | 2404.07437 | null |
2024-04-10 | Improving Multi-Center Generalizability of GAN-Based Fat Suppression using Federated Learning | Pranav Kulkarni et.al. | 2404.07374 | null |
2024-04-10 | Differentially Private GANs for Generating Synthetic Indoor Location Data | Vahideh Moghtadaiee et.al. | 2404.07366 | null |
2024-04-10 | GANsemble for Small and Imbalanced Data Sets: A Baseline for Synthetic Microplastics Data | Daniel Platnick et.al. | 2404.07356 | link |
2024-04-10 | A Gauss-Newton Approach for Min-Max Optimization in Generative Adversarial Networks | Neel Mishra et.al. | 2404.07172 | link |
2024-04-10 | Implicit Multi-Spectral Transformer: An Lightweight and Effective Visible to Infrared Image Translation Model | Yijia Chen et.al. | 2404.07072 | link |
2024-04-10 | Tuning-Free Adaptive Style Incorporation for Structure-Consistent Text-Driven Style Transfer | Yanqi Ge et.al. | 2404.06835 | null |
2024-04-10 | CryinGAN: Design and evaluation of point-cloud-based generative adversarial networks using disordered materials | Adrian Xiao Bin Yong et.al. | 2404.06734 | link |
2024-04-09 | Onboard Processing of Hyperspectral Imagery: Deep Learning Advancements, Methodologies, Challenges, and Emerging Trends | Nafiseh Ghasemi et.al. | 2404.06526 | null |
2024-04-09 | Fortifying Fully Convolutional Generative Adversarial Networks for Image Super-Resolution Using Divergence Measures | Arkaprabha Basu et.al. | 2404.06294 | null |
2024-04-09 | Greedy-DiM: Greedy Algorithms for Unreasonably Effective Face Morphs | Zander W. Blasingame et.al. | 2404.06025 | null |
2024-04-11 | Boosting Digital Safeguards: Blending Cryptography and Steganography | Anamitra Maiti et.al. | 2404.05985 | null |
2024-04-09 | Quantum Generative Adversarial Networks in a Silicon Photonic Chip with Maximum Expressibility | Haoran Ma et.al. | 2404.05921 | null |
2024-04-08 | Learning 3D-Aware GANs from Unposed Images with Template Feature Field | Xinya Chen et.al. | 2404.05705 | null |
2024-04-08 | SphereHead: Stable 3D Full-head Synthesis with Spherical Tri-plane Representation | Heyuan Li et.al. | 2404.05680 | null |
2024-04-08 | Stylizing Sparse-View 3D Scenes with Hierarchical Neural Representation | Y. Wang et.al. | 2404.05236 | null |
2024-04-08 | StylizedGS: Controllable Stylization for 3D Gaussian Splatting | Dingxi Zhang et.al. | 2404.05220 | null |
2024-04-08 | A secure and private ensemble matcher using multi-vault obfuscated templates | Babak Poorebrahim Gilkalaye et.al. | 2404.05205 | null |
2024-04-07 | Reconstructing Retinal Visual Images from 3T fMRI Data Enhanced by Unsupervised Learning | Yujian Xiong et.al. | 2404.05107 | null |
2024-04-07 | Data Conditioning for Subsurface Models with Single-Image Generative Adversarial Network (SinGAN) | Lei Liu et.al. | 2404.05068 | null |
2024-04-06 | Power-Efficient Image Storage: Leveraging Super Resolution Generative Adversarial Network for Sustainable Compression and Reduced Carbon Footprint | Ashok Mondal et.al. | 2404.04642 | null |
2024-04-06 | Frequency Decomposition-Driven Unsupervised Domain Adaptation for Remote Sensing Image Semantic Segmentation | Xianping Ma et.al. | 2404.04531 | link |
2024-04-04 | Mitigating analytical variability in fMRI results with style transfer | Elodie Germani et.al. | 2404.03703 | null |
2024-04-07 | RaFE: Generative Radiance Fields Restoration | Zhongkai Wu et.al. | 2404.03654 | null |
2024-04-04 | Reference-Based 3D-Aware Image Editing with Triplane | Bahri Batuhan Bilecen et.al. | 2404.03632 | null |
2024-04-04 | Integrating Generative AI into Financial Market Prediction for Improved Decision Making | Chang Che et.al. | 2404.03523 | null |
2024-04-04 | Knowledge Distillation-Based Model Extraction Attack using Private Counterfactual Explanations | Fatima Ezzeddine et.al. | 2404.03348 | null |
2024-04-03 | MeshBrush: Painting the Anatomical Mesh with Neural Stylization for Endoscopy | John J. Han et.al. | 2404.02999 | null |
2024-04-03 | Deep Generative Models through the Lens of the Manifold Hypothesis: A Survey and New Connections | Gabriel Loaiza-Ganem et.al. | 2404.02954 | null |
2024-03-31 | An Unsupervised Adversarial Autoencoder for Cyber Attack Detection in Power Distribution Grids | Mehdi Jabbari Zideh et.al. | 2404.02923 | null |
2024-04-03 | Deep Privacy Funnel Model: From a Discriminative to a Generative Approach with an Application to Face Recognition | Behrooz Razeghi et.al. | 2404.02696 | null |
2024-04-03 | Designing a Photonic Physically Unclonable Function Having Resilience to Machine Learning Attacks | Elena R. Henderson et.al. | 2404.02440 | null |
2024-04-02 | A Generative Deep Learning Approach for Crash Severity Modeling with Imbalanced Data | Junlan Chen et.al. | 2404.02187 | null |
2024-04-01 | Exploring Quantum-Enhanced Machine Learning for Computer Vision: Applications and Insights on Noisy Intermediate-Scale Quantum Devices | Purnachandra Mandadapu et.al. | 2404.02177 | null |
2024-04-02 | Red-Teaming Segment Anything Model | Krzysztof Jankowski et.al. | 2404.02067 | link |
2024-04-02 | MultiParaDetox: Extending Text Detoxification with Parallel Data to New Languages | Daryna Dementieva et.al. | 2404.02037 | null |
2024-04-04 | Enhancing Portfolio Optimization with Transformer-GAN Integration: A Novel Approach in the Black-Litterman Framework | Enmin Zhu et.al. | 2404.02029 | null |
2024-04-07 | Bi-LORA: A Vision-Language Approach for Synthetic Image Detection | Mamadou Keita et.al. | 2404.01959 | null |
2024-03-31 | Privacy Re-identification Attacks on Tabular GANs | Abdallah Alshantti et.al. | 2404.00696 | null |
2024-03-31 | GAN with Skip Patch Discriminator for Biological Electron Microscopy Image Generation | Nishith Ranjon Roy et.al. | 2404.00558 | null |
2024-03-31 | Creating synthetic energy meter data using conditional diffusion and building metadata | Chun Fu et.al. | 2404.00525 | link |
2024-04-07 | CHAIN: Enhancing Generalization in Data-Efficient GANs via lipsCHitz continuity constrAIned Normalization | Yao Ni et.al. | 2404.00521 | null |
2024-03-29 | Deepfake Sentry: Harnessing Ensemble Intelligence for Resilient Detection and Generalisation | Liviu-Daniel Ştefan et.al. | 2404.00114 | null |
2024-03-29 | Molecular Generative Adversarial Network with Multi-Property Optimization | Huidong Tang et.al. | 2404.00081 | null |
2024-03-28 | GANTASTIC: GAN-based Transfer of Interpretable Directions for Disentangled Image Editing in Text-to-Image Diffusion Models | Yusuf Dalva et.al. | 2403.19645 | null |
2024-03-28 | Lane-Change in Dense Traffic with Model Predictive Control and Neural Networks | Sangjae Bae et.al. | 2403.19633 | link |
2024-03-28 | Collaborative Interactive Evolution of Art in the Latent Space of Deep Generative Models | Ole Hall et.al. | 2403.19620 | null |
2024-03-28 | Synthetic Medical Imaging Generation with Generative Adversarial Networks For Plain Radiographs | John R. McNulty et.al. | 2403.19107 | null |
2024-03-27 | Lift3D: Zero-Shot Lifting of Any 2D Vision Model to 3D | Mukund Varma T et.al. | 2403.18922 | null |
2024-03-27 | DiffStyler: Diffusion-based Localized Image Style Transfer | Shaoxu Li et.al. | 2403.18461 | null |
2024-03-27 | Colour and Brush Stroke Pattern Recognition in Abstract Art using Modified Deep Convolutional Generative Adversarial Networks | Srinitish Srinivasan et.al. | 2403.18397 | link |
2024-03-27 | DSF-GAN: DownStream Feedback Generative Adversarial Network | Oriel Perets et.al. | 2403.18267 | link |
2024-03-26 | Cross-system biological image quality enhancement based on the generative adversarial network as a foundation for establishing a multi-institute microscopy cooperative network | Dominik Panek et.al. | 2403.18026 | null |
2024-03-26 | FaultGuard: A Generative Approach to Resilient Fault Prediction in Smart Electrical Grids | Emad Efatinasab et.al. | 2403.17494 | null |
2024-03-25 | FLIGAN: Enhancing Federated Learning with Incomplete Data using GAN | Paul Joe Maliakel et.al. | 2403.16930 | null |
2024-03-25 | Multi-Scale Texture Loss for CT denoising with GANs | Francesco Di Feola et.al. | 2403.16640 | link |
2024-03-25 | Enhancing Cross-Dataset EEG Emotion Recognition: A Novel Approach with Emotional EEG Style Transfer Network | Yijin Zhou et.al. | 2403.16540 | null |
2024-03-25 | Training Generative Adversarial Network-Based Vocoder with Limited Data Using Augmentation-Conditional Discriminator | Takuhiro Kaneko et.al. | 2403.16464 | null |
2024-03-25 | Illuminating Systematic Trends in Nuclear Data with Generative Machine Learning Models | Jordan M. R. Fox et.al. | 2403.16389 | null |
2024-03-21 | Rolling bearing fault diagnosis method based on generative adversarial enhanced multi-scale convolutional neural network model | Maoxuan Zhou et.al. | 2403.15483 | null |
2024-03-22 | A Wasserstein perspective of Vanilla GANs | Lea Kunkel et.al. | 2403.15312 | null |
2024-03-22 | Robust Utility Optimization via a GAN Approach | Florian Krach et.al. | 2403.15243 | link |
2024-03-25 | Geometric Generative Models based on Morphological Equivariant PDEs and GANs | El Hadji S. Diop et.al. | 2403.14897 | null |
2024-03-21 | Diffusion Attack: Leveraging Stable Diffusion for Naturalistic Image Attacking | Qianyu Guo et.al. | 2403.14778 | null |
2024-03-21 | A task of anomaly detection for a smart satellite Internet of things system | Zilong Shao et.al. | 2403.14738 | null |
2024-03-21 | Implicit Style-Content Separation using B-LoRA | Yarden Frenkel et.al. | 2403.14572 | null |
2024-03-22 | AnyV2V: A Plug-and-Play Framework For Any Video-to-Video Editing Tasks | Max Ku et.al. | 2403.14468 | null |
2024-03-20 | Enhancing Fingerprint Image Synthesis with GANs, Diffusion Models, and Style Transfer Techniques | W. Tang et.al. | 2403.13916 | null |
2024-03-20 | The Bid Picture: Auction-Inspired Multi-player Generative Adversarial Networks Training | Joo Yong Shim et.al. | 2403.13866 | null |
2024-03-20 | IIDM: Image-to-Image Diffusion Model for Semantic Image Synthesis | Feng Liu et.al. | 2403.13378 | link |
2024-03-19 | NSGAN: A Non-Dominant Sorting Optimisation-Based Generative Adversarial Design Framework for Alloy Discovery | Zhipeng Li et.al. | 2403.12495 | null |
2024-03-18 | E2F-Net: Eyes-to-Face Inpainting via StyleGAN Latent Space | Ahmad Hassanpour et.al. | 2403.12197 | link |
2024-03-19 | Leveraging Spatial and Semantic Feature Extraction for Skin Cancer Diagnosis with Capsule Networks and Graph Neural Networks | K. P. Santoso et.al. | 2403.12009 | null |
2024-03-18 | LayerDiff: Exploring Text-guided Multi-layered Composable Image Synthesis via Layer-Collaborative Diffusion Model | Runhui Huang et.al. | 2403.11929 | null |
2024-03-18 | LocalStyleFool: Regional Video Style Transfer Attack Using Segment Anything Model | Yuxin Cao et.al. | 2403.11656 | null |
2024-03-18 | VmambaIR: Visual State Space Model for Image Restoration | Yuan Shi et.al. | 2403.11423 | link |
2024-03-17 | Forging the Forger: An Attempt to Improve Authorship Verification via Data Augmentation | Silvia Corbara et.al. | 2403.11265 | null |
2024-03-16 | Exploiting Topological Prior for Boosting Point Cloud Generation | Baiyuan Chen et.al. | 2403.10962 | null |
2024-03-16 | Could We Generate Cytology Images from Histopathology Images? An Empirical Study | Soumyajyoti Dey et.al. | 2403.10885 | null |
2024-03-16 | Efficient Domain Adaptation for Endoscopic Visual Odometry | Junyang Wu et.al. | 2403.10860 | null |
2024-03-15 | A General Method to Incorporate Spatial Information into Loss Functions for GAN-based Super-resolution Models | Xijun Wang et.al. | 2403.10589 | null |
2024-03-20 | MusicHiFi: Fast High-Fidelity Stereo Vocoding | Ge Zhu et.al. | 2403.10493 | null |
2024-03-15 | Synthesizing impurity clustering in the edge plasma of tokamaks using neural networks | Zetao Lin et.al. | 2403.10219 | null |
2024-03-18 | A survey of synthetic data augmentation methods in computer vision | Alhassan Mumuni et.al. | 2403.10075 | null |
2024-03-14 | StainFuser: Controlling Diffusion for Faster Neural Style Transfer in Multi-Gigapixel Histology Images | Robert Jewsbury et.al. | 2403.09302 | link |
2024-03-13 | LMStyle Benchmark: Evaluating Text Style Transfer for Chatbots | Jianlin Chen et.al. | 2403.08943 | null |
2024-03-13 | Gaussian Splatting in Style | Abhishek Saroha et.al. | 2403.08498 | null |
2024-03-13 | StyleDyRF: Zero-shot 4D Style Transfer for Dynamic Neural Radiance Fields | Hongbin Xu et.al. | 2403.08310 | null |
2024-03-13 | Attack Deterministic Conditional Image Generative Models for Diverse and Controllable Generation | Tianyi Chu et.al. | 2403.08294 | null |
2024-03-13 | CoroNetGAN: Controlled Pruning of GANs via Hypernetworks | Aman Kumar et.al. | 2403.08261 | null |
2024-03-13 | Point Cloud Compression via Constrained Optimal Transport | Zezeng Li et.al. | 2403.08236 | link |
2024-03-13 | ShadowRemovalNet: Efficient Real-Time Shadow Removal | Alzayat Saleh et.al. | 2403.08142 | null |
2024-03-12 | Authorship Style Transfer with Policy Optimization | Shuai Liu et.al. | 2403.08043 | link |
2024-03-12 | Quantifying and Mitigating Privacy Risks for Tabular Generative Models | Chaoyi Zhu et.al. | 2403.07842 | null |
2024-03-12 | StyleGaussian: Instant 3D Style Transfer with Gaussian Splatting | Kunhao Liu et.al. | 2403.07807 | null |
2024-03-13 | Towards Model Extraction Attacks in GAN-Based Image Translation via Domain Shift Mitigation | Di Mi et.al. | 2403.07673 | null |
2024-03-15 | Gender-ambiguous voice generation through feminine speaking style transfer in male voices | Maria Koutsogiannaki et.al. | 2403.07661 | null |
2024-03-12 | Auxiliary CycleGAN-guidance for Task-Aware Domain Translation from Duplex to Monoplex IHC Images | Nicolas Brieu et.al. | 2403.07389 | null |
2024-03-11 | Data-Independent Operator: A Training-Free Artifact Representation Extractor for Generalizable Deepfake Detection | Chuangchuang Tan et.al. | 2403.06803 | link |
2024-03-11 | Galaxy Morphologies Revealed with Subaru HSC and Super-Resolution Techniques II: Environmental Dependence of Galaxy Mergers at z~2-5 | Takatoshi Shibuya et.al. | 2403.06729 | null |
2024-03-11 | 3D-aware Image Generation and Editing with Multi-modal Conditions | Bo Li et.al. | 2403.06470 | null |
2024-03-11 | A Zero Trust Framework for Realization and Defense Against Generative AI Attacks in Power Grid | Md. Shirajum Munir et.al. | 2403.06388 | null |
2024-03-10 | Fast-Track of F-18 Positron paths simulations | Youness Mellak et.al. | 2403.06307 | link |
2024-03-10 | MoST: Motion Style Transformer between Diverse Action Contents | Boeun Kim et.al. | 2403.06225 | link |
2024-03-13 | S-DyRF: Reference-Based Stylized Radiance Fields for Dynamic Scenes | Xingyi Li et.al. | 2403.06205 | null |
2024-03-10 | On depth prediction for autonomous driving using self-supervised learning | Houssem Boulahbal et.al. | 2403.06194 | null |
2024-03-09 | Large Generative Model Assisted 3D Semantic Communication | Feibo Jiang et.al. | 2403.05783 | null |
2024-03-08 | A Data Augmentation Pipeline to Generate Synthetic Labeled Datasets of 3D Echocardiography Images using a GAN | Cristiana Tiago et.al. | 2403.05384 | null |
2024-03-08 | Federated Learning Method for Preserving Privacy in Face Recognition System | Enoch Solomon et.al. | 2403.05344 | null |
2024-03-08 | GAN-based Massive MIMO Channel Model Trained on Measured Data | Florian Euchner et.al. | 2403.05321 | link |
2024-03-08 | An Efficient Quasi-Random Sampling for Copulas | Sumin Wang et.al. | 2403.05281 | null |
2024-03-08 | Robust Semantic Communications for Speech-to-Text Translation | Zhenzi Weng et.al. | 2403.05187 | link |
2024-03-08 | Spectrum Translation for Refinement of Image Generation (STIG) Based on Contrastive Learning and Spectral Filter Profile | Seokjun Lee et.al. | 2403.05093 | link |
2024-03-08 | Quantifying Manifolds: Do the manifolds learned by Generative Adversarial Networks converge to the real data manifold | Anupam Chaudhuri et.al. | 2403.05033 | null |
2024-03-07 | A spatiotemporal style transfer algorithm for dynamic visual stimulus generation | Antonino Greco et.al. | 2403.04940 | null |
2024-03-05 | (Un)paired signal-to-signal translation with 1D conditional GANs | Eric Easthope et.al. | 2403.04800 | null |
2024-03-07 | A Domain Translation Framework with an Adversarial Denoising Diffusion Model to Generate Synthetic Datasets of Echocardiography Images | Cristiana Tiago et.al. | 2403.04612 | null |
2024-03-07 | DLP-GAN: learning to draw modern Chinese landscape photos with generative adversarial network | Xiangquan Gui et.al. | 2403.03456 | null |
2024-03-05 | Doubly Abductive Counterfactual Inference for Text-based Image Editing | Xue Song et.al. | 2403.02981 | link |
2024-03-05 | Time Weaver: A Conditional Time Series Generation Model | Sai Shankar Narasimhan et.al. | 2403.02682 | null |
2024-03-04 | AFBT GAN: enhanced explainability and diagnostic performance for cognitive decline by counterfactual generative adversarial network | Xiongri Shen et.al. | 2403.01758 | link |
2024-03-02 | A Hybrid Model for Traffic Incident Detection based on Generative Adversarial Networks and Transformer Model | Xinying Lu et.al. | 2403.01147 | null |
2024-03-02 | Distilling Text Style Transfer With Self-Explanation From LLMs | Chiyu Zhang et.al. | 2403.01106 | null |
2024-03-01 | BasedAI: A decentralized P2P network for Zero Knowledge Large Language Models (ZK-LLMs) | Sean Wellington et.al. | 2403.01008 | null |
2024-03-05 | Improving Android Malware Detection Through Data Augmentation Using Wasserstein Generative Adversarial Networks | Kawana Stalin et.al. | 2403.00890 | null |
2024-02-29 | Learning to Find Missing Video Frames with Synthetic Data Augmentation: A General Framework and Application in Generating Thermal Images Using RGB Cameras | Mathias Viborg Andersen et.al. | 2403.00196 | null |
2024-02-29 | SeD: Semantic-Aware Discriminator for Image Super-Resolution | Bingchen Li et.al. | 2402.19387 | null |
2024-02-29 | Memory-Augmented Generative Adversarial Transformers | Stephan Raaijmakers et.al. | 2402.19218 | null |
2024-02-29 | Generative models struggle with kirigami metamaterials | Gerrit Felsch et.al. | 2402.19196 | null |
2024-02-29 | Lotka-Volterra Model with Mutations and Generative Adversarial Networks | S. V. Kozyrev et.al. | 2402.19035 | null |
2024-02-29 | Generating, Reconstructing, and Representing Discrete and Continuous Data: Generalized Diffusion with Learnable Encoding-Decoding | Guangyi Liu et.al. | 2402.19009 | null |
2024-02-29 | BlockEcho: Retaining Long-Range Dependencies for Imputing Block-Wise Missing Data | Qiao Han et.al. | 2402.18800 | null |
2024-02-28 | MambaMIR: An Arbitrary-Masked Mamba for Joint Medical Image Reconstruction and Uncertainty Estimation | Jiahao Huang et.al. | 2402.18451 | null |
2024-02-28 | Misalignment-Robust Frequency Distribution Loss for Image Transformation | Zhangkai Ni et.al. | 2402.18192 | link |
2024-02-28 | Breaking the Black-Box: Confidence-Guided Model Inversion Attack for Distribution Shift | Xinhao Liu et.al. | 2402.18027 | null |
2024-02-27 | How we won BraTS 2023 Adult Glioma challenge? Just faking it! Enhanced Synthetic Data Augmentation and Model Ensemble for brain tumour segmentation | André Ferreira et.al. | 2402.17317 | null |
2024-02-27 | GAN Based Near-Field Channel Estimation for Extremely Large-Scale MIMO Systems | Ming Ye et.al. | 2402.17281 | null |
2024-02-27 | Deep Umbra: A Generative Approach for Sunlight Access Computation in Urban Spaces | Kazi Shahrukh Omar et.al. | 2402.17169 | null |
2024-02-26 | Taming the Tail in Class-Conditional GANs: Knowledge Sharing via Unconditional Training at Lower Resolutions | Saeed Khorram et.al. | 2402.17065 | link |
2024-02-26 | Penalized Generative Variable Selection | Tong Wang et.al. | 2402.16661 | null |
2024-02-26 | Training Implicit Generative Models via an Invariant Statistical Loss | José Manuel de Frutos et.al. | 2402.16435 | link |
2024-02-27 | Attention-GAN for Anomaly Detection: A Cutting-Edge Approach to Cybersecurity Threat Management | Mohammed Abo Sen et.al. | 2402.15945 | null |
2024-02-24 | Sandwich GAN: Image Reconstruction from Phase Mask based Anti-dazzle Imaging | Xiaopeng Peng et.al. | 2402.15919 | null |
2024-02-24 | Enhanced Droplet Analysis Using Generative Adversarial Networks | Tan-Hanh Pham et.al. | 2402.15909 | null |
2024-02-24 | A Generative Machine Learning Model for Material Microstructure 3D Reconstruction and Performance Evaluation | Yilin Zheng et.al. | 2402.15815 | null |
2024-02-28 | IRConStyle: Image Restoration Framework Using Contrastive Learning and Style Transfer | Dongqi Fan et.al. | 2402.15784 | null |
2024-02-24 | Intelligent Director: An Automatic Framework for Dynamic Visual Composition using ChatGPT | Sixiao Zheng et.al. | 2402.15746 | null |
2024-02-28 | Text-guided HuBERT: Self-Supervised Speech Pre-training via Generative Adversarial Networks | Duo Ma et.al. | 2402.15725 | null |
2024-02-23 | Counterfactual Generation with Identifiability Guarantees | Hanqi Yan et.al. | 2402.15309 | link |
2024-02-23 | A Survey of Music Generation in the Context of Interaction | Ismael Agchar et.al. | 2402.15294 | null |
2024-02-23 | Modified CycleGAN for the synthesization of samples for wheat head segmentation | Jaden Myers et.al. | 2402.15135 | null |
2024-02-22 | Deep Generative Model-based Synthesis of Four-bar Linkage Mechanisms with Target Conditions | Sumin Lee et.al. | 2402.14882 | null |
2024-02-22 | Generative Adversarial Network with Soft-Dynamic Time Warping and Parallel Reconstruction for Energy Time Series Anomaly Detection | Hardik Prabhu et.al. | 2402.14384 | link |
2024-02-21 | Generative Adversarial Models for Extreme Downscaling of Climate Datasets | Guiye Li et.al. | 2402.14049 | null |
2024-02-21 | Protect and Extend -- Using GANs for Synthetic Data Generation of Time-Series Medical Records | Navid Ashrafi et.al. | 2402.14042 | null |
2024-02-26 | Deep Generative Models for Offline Policy Learning: Tutorial, Survey, and Perspectives on Future Directions | Jiayu Chen et.al. | 2402.13777 | null |
2024-02-21 | Music Style Transfer with Time-Varying Inversion of Diffusion Models | Sifei Li et.al. | 2402.13763 | null |
2024-02-21 | SRNDiff: Short-term Rainfall Nowcasting with Condition Diffusion Model | Xudong Ling et.al. | 2402.13737 | null |
2024-02-21 | Unsupervised Text Style Transfer via LLMs and Attention Masking with Multi-way Interactions | Lei Pan et.al. | 2402.13647 | null |
2024-02-21 | Generative AI for Secure Physical Layer Communications: A Survey | Changyuan Zhao et.al. | 2402.13553 | null |
2024-02-20 | CLIPping the Deception: Adapting Vision-Language Models for Universal Deepfake Detection | Sohail Ahmed Khan et.al. | 2402.12927 | null |
2024-02-22 | Improving Deep Generative Models on Many-To-One Image-to-Image Translation | Sagar Saxena et.al. | 2402.12531 | null |
2024-02-16 | Toward using GANs in astrophysical Monte-Carlo simulations | Ahab Isaac et.al. | 2402.12396 | null |
2024-02-19 | UnlearnCanvas: A Stylized Image Dataset to Benchmark Machine Unlearning for Diffusion Models | Yihua Zhang et.al. | 2402.11846 | link |
2024-02-18 | MORL-Prompt: An Empirical Analysis of Multi-Objective Reinforcement Learning for Discrete Prompt Optimization | Yasaman Jafari et.al. | 2402.11711 | null |
2024-02-16 | Cosmological multifield emulator | Sambatra Andrianomena et.al. | 2402.10997 | null |
2024-02-16 | GAN-driven Electromagnetic Imaging of 2-D Dielectric Scatterers | Ehtasham Naseer et.al. | 2402.10831 | null |
2024-02-16 | RAGIC: Risk-Aware Generative Adversarial Model for Stock Interval Construction | Jingyi Gu et.al. | 2402.10760 | null |
2024-02-16 | APCodec: A Neural Audio Codec with Parallel Amplitude and Phase Spectrum Encoding and Decoding | Yang Ai et.al. | 2402.10533 | null |
2024-02-16 | Generative Modeling for Tabular Data via Penalized Optimal Transport Network | Wenhui Sophia Lu et.al. | 2402.10456 | null |
2024-02-16 | Recurrent Neural Networks for Multivariate Loss Reserving and Risk Capital Analysis | Pengfei Cai et.al. | 2402.10421 | null |
2024-02-15 | Interpretable Generative Adversarial Imitation Learning | Wenliang Liu et.al. | 2402.10310 | null |
2024-02-15 | Utilizing GANs for Fraud Detection: Model Training with Synthetic Transaction Data | Mengran Zhu et.al. | 2402.09830 | null |
2024-02-16 | Examining Pathological Bias in a Generative Adversarial Network Discriminator: A Case Study on a StyleGAN3 Model | Alvin Grissom II et.al. | 2402.09786 | null |
2024-02-14 | TAI-GAN: A Temporally and Anatomically Informed Generative Adversarial Network for early-to-late frame conversion in dynamic cardiac PET inter-frame motion correction | Xueqi Guo et.al. | 2402.09567 | null |
2024-02-14 | Towards Realistic Landmark-Guided Facial Video Inpainting Based on GANs | Fatemeh Ghorbani Lohesara et.al. | 2402.09100 | null |
2024-02-14 | Review-Incorporated Model-Agnostic Profile Injection Attacks on Recommender Systems | Shiyi Yang et.al. | 2402.09023 | null |
2024-02-13 | Towards the Detection of AI-Synthesized Human Face Images | Yuhang Lu et.al. | 2402.08750 | null |
2024-02-13 | Generative VS non-Generative Models in Engineering Shape Optimization | Muhammad Usama et.al. | 2402.08540 | null |
2024-02-13 | Unrestricted Global Phase Bias-Aware Single-channel Speech Enhancement with Conformer-based Metric GAN | Shiqi Zhang et.al. | 2402.08252 | null |
2024-02-12 | Text Detoxification as Style Transfer in English and Hindi | Sourabrata Mukherjee et.al. | 2402.07767 | null |
2024-02-15 | Re-DiffiNet: Modeling discrepancies loss in tumor segmentation using diffusion models | Tianyi Ren et.al. | 2402.07354 | null |
2024-02-10 | Near-perfect Coverage Manifold Estimation in Cellular Networks via conditional GAN | Washim Uddin Mondal et.al. | 2402.06901 | null |
2024-02-09 | Generative Nowcasting of Marine Fog Visibility in the Grand Banks area and Sable Island in Canada | Eren Gultepe et.al. | 2402.06800 | null |
2024-02-06 | Explainable Adversarial Learning Framework on Physical Layer Secret Keys Combating Malicious Reconfigurable Intelligent Surface | Zhuangkun Wei et.al. | 2402.06663 | null |
2024-02-09 | TimEHR: Image-based Time Series Generation for Electronic Health Records | Hojjat Karami et.al. | 2402.06318 | null |
2024-02-09 | Multisource Semisupervised Adversarial Domain Generalization Network for Cross-Scene Sea–Land Clutter Classification | Xiaoxuan Zhang et.al. | 2402.06315 | null |
2024-02-08 | AvatarMMC: 3D Head Avatar Generation and Editing with Multi-Modal Conditioning | Wamiq Reyaz Para et.al. | 2402.05803 | null |
2024-02-06 | CEHR-GPT: Generating Electronic Health Records with Chronological Patient Timelines | Chao Pang et.al. | 2402.04400 | null |
2024-02-07 | DeMarking: A Defense for Network Flow Watermarking in Real-Time | Yali Yuan et.al. | 2402.03760 | null |
2024-02-06 | Reviewing FID and SID Metrics on Generative Adversarial Networks | Ricardo de Deijn et.al. | 2402.03654 | null |
2024-02-08 | IGUANe: a 3D generalizable CycleGAN for multicenter harmonization of brain MR images | Vincent Roca et.al. | 2402.03227 | link |
2024-02-05 | ToonAging: Face Re-Aging upon Artistic Portrait Style Transfer | Bumsoo Kim et.al. | 2402.02733 | null |
2024-02-05 | Fast and Accurate Cooperative Radio Map Estimation Enabled by GAN | Zezhong Zhang et.al. | 2402.02729 | null |
2024-02-03 | Revisiting Generative Adversarial Networks for Binary Semantic Segmentation on Imbalanced Datasets | Lei Xu et.al. | 2402.02245 | null |
2024-02-03 | Enhancing crop classification accuracy by synthetic SAR-Optical data generation using deep learning | Ali Mirzaei et.al. | 2402.02121 | null |
2024-02-02 | ConRF: Zero-shot Stylization of 3D Scenes with Conditioned Radiation Fields | Xingyu Miao et.al. | 2402.01950 | link |
2024-02-02 | KS-Net: Multi-band joint speech restoration and enhancement network for 2024 ICASSP SSI Challenge | Guochen Yu et.al. | 2402.01808 | null |
2024-02-02 | Variational Quantum Circuits Enhanced Generative Adversarial Network | Runqiu Shu et.al. | 2402.01791 | null |
2024-02-05 | Phrase Grounding-based Style Transfer for Single-Domain Generalized Object Detection | Hao Li et.al. | 2402.01304 | null |
2024-02-02 | Ambient-Pix2PixGAN for Translating Medical Images from Noisy Data | Wentao Chen et.al. | 2402.01186 | null |
2024-02-02 | AmbientCycleGAN for Establishing Interpretable Stochastic Object Models Based on Mathematical Phantoms and Medical Imaging Measurements | Xichen Xu et.al. | 2402.01171 | null |
2024-02-01 | mmID: High-Resolution mmWave Imaging for Human Identification | Sakila S. Jayaweera et.al. | 2402.00996 | null |
2024-02-01 | A Cost-Efficient Approach for Creating Virtual Fitting Room using Generative Adversarial Networks (GANs) | Kirolos Attallah et.al. | 2402.00994 | null |
2024-01-31 | EVA-GAN: Enhanced Various Audio Generation via Scalable Generative Adversarial Networks | Shijia Liao et.al. | 2402.00892 | null |
2024-02-02 | Geometry Transfer for Stylizing Radiance Fields | Hyunyoung Jung et.al. | 2402.00863 | null |
2024-02-01 | Profiling and Modeling of Power Characteristics of Leadership-Scale HPC System Workloads | Ahmad Maroof Karimi et.al. | 2402.00729 | null |
2024-02-01 | Neural Style Transfer with Twin-Delayed DDPG for Shared Control of Robotic Manipulators | Raul Fernandez-Fernandez et.al. | 2402.00722 | null |
2024-02-01 | Neural Policy Style Transfer | Raul Fernandez-Fernandez et.al. | 2402.00677 | null |
2024-02-01 | Transferring human emotions to robot motions using Neural Policy Style Transfer | Raul Fernandez-Fernandez et.al. | 2402.00663 | null |
2024-02-01 | Disentangled Multimodal Brain MR Image Translation via Transformer-based Modality Infuser | Jihoon Cho et.al. | 2402.00375 | null |
2024-02-02 | DARCS: Memory-Efficient Deep Compressed Sensing Reconstruction for Acceleration of 3D Whole-Heart Coronary MR Angiography | Zhihao Xue et.al. | 2402.00320 | null |
2024-01-31 | ViTacTip: Design and Verification of a Novel Biomimetic Physical Vision-Tactile Fusion Sensor | Wen Fan et.al. | 2402.00199 | null |
2024-01-31 | Fully Data-Driven Model for Increasing Sampling Rate Frequency of Seismic Data using Super-Resolution Generative Adversarial Networks | Navid Gholizadeh et.al. | 2402.00153 | null |
2024-01-30 | Anything in Any Scene: Photorealistic Video Object Insertion | Chen Bai et.al. | 2401.17509 | null |
2024-01-30 | Evaluation in Neural Style Transfer: A Review | Eleftherios Ioannou et.al. | 2401.17109 | null |
2024-01-30 | Active Generation Network of Human Skeleton for Action Recognition | Long Liu et.al. | 2401.17086 | null |
2024-01-30 | WGAN-AFL: Seed Generation Augmented Fuzzer with Wasserstein-GAN | Liqun Yang et.al. | 2401.16947 | null |
2024-01-30 | LATENTPATCH: A Non-Parametric Approach for Face Generation and Editing | Benjamin Samuth et.al. | 2401.16830 | null |
2024-01-30 | A Literature Review on Fetus Brain Motion Correction in MRI | Haoran Zhang et.al. | 2401.16782 | null |
2024-01-30 | cDVGAN: One Flexible Model for Multi-class Gravitational Wave Signal and Glitch Generation | Tom Dooney et.al. | 2401.16356 | null |
2024-01-29 | Domain adaptation strategies for 3D reconstruction of the lumbar spine using real fluoroscopy data | Sascha Jecklin et.al. | 2401.16027 | null |
2024-01-28 | On the Statistical Properties of Generative Adversarial Models for Low Intrinsic Data Dimension | Saptarshi Chakraborty et.al. | 2401.15801 | null |
2024-01-28 | UP-CrackNet: Unsupervised Pixel-Wise Road Crack Detection via Adversarial Image Restoration | Nachuan Ma et.al. | 2401.15647 | null |
2024-01-28 | FreeStyle: Free Lunch for Text-guided Style Transfer using Diffusion Models | Feihong He et.al. | 2401.15636 | null |
2024-01-27 | An Implicit Physical Face Model Driven by Expression and Style | Lingchen Yang et.al. | 2401.15414 | null |
2024-01-27 | Face to Cartoon Incremental Super-Resolution using Knowledge Distillation | Trinetra Devkatte et.al. | 2401.15366 | null |
2024-01-26 | Annotated Hands for Generative Models | Yue Yang et.al. | 2401.15075 | link |
2024-01-26 | Additional Look into GAN-based Augmentation for Deep Learning COVID-19 Image Classification | Oleksandr Fedoruk et.al. | 2401.14705 | null |
2024-01-26 | UNIT-DSR: Dysarthric Speech Reconstruction System Using Speech Unit Normalization | Yuejiao Wang et.al. | 2401.14664 | null |
2024-01-26 | Diffusion Stochastic Optimization for Min-Max Problems | Haoyuan Cai et.al. | 2401.14585 | link |
2024-01-25 | Expression-aware video inpainting for HMD removal in XR applications | Fatemeh Ghorbani Lohesara et.al. | 2401.14136 | null |
2024-01-30 | CreativeSynth: Creative Blending and Synthesis of Visual Arts based on Multimodal Diffusion | Nisha Huang et.al. | 2401.14066 | null |
2024-01-24 | Inference Attacks Against Face Recognition Model without Classification Layers | Yuanqing Huang et.al. | 2401.13719 | null |
2024-01-24 | Generating Synthetic Health Sensor Data for Privacy-Preserving Wearable Stress Detection | Lucas Lange et.al. | 2401.13327 | link |
2024-01-23 | CCA: Collaborative Competitive Agents for Image Editing | Tiankai Hang et.al. | 2401.13011 | link |
2024-01-23 | Two-View Topogram-Based Anatomy-Guided CT Reconstruction for Prospective Risk Minimization | Chang Liu et.al. | 2401.12725 | null |
2024-01-22 | ScoreDec: A Phase-preserving High-Fidelity Audio Codec with A Generalized Score-based Diffusion Post-filter | Yi-Chiao Wu et.al. | 2401.12160 | null |
2024-01-22 | Simulating Nighttime Visible Satellite Imagery of Tropical Cyclones Using Conditional Generative Adversarial Networks | Jinghuai Yao et.al. | 2401.11679 | null |
2024-01-19 | Fast Registration of Photorealistic Avatars for VR Facial Animation | Chaitanya Patel et.al. | 2401.11002 | null |
2024-01-12 | GANs for EVT Based Model Parameter Estimation in Real-time Ultra-Reliable Communication | Parmida Valiahdi et.al. | 2401.10280 | null |
2024-01-18 | Image Translation as Diffusion Visual Programmers | Cheng Han et.al. | 2401.09742 | null |
2024-01-18 | Artwork Protection Against Neural Style Transfer Using Locally Adaptive Adversarial Color Attack | Zhongliang Guo et.al. | 2401.09673 | null |
2024-01-17 | MITS-GAN: Safeguarding Medical Imaging from Tampering with Generative Adversarial Networks | Giovanni Pasqualino et.al. | 2401.09624 | link |
2024-01-17 | Efficient generative adversarial networks using linear additive-attention Transformers | Emilio Morales-Juarez et.al. | 2401.09596 | link |
2024-01-23 | Uncertainty-Aware Hardware Trojan Detection Using Multimodal Deep Learning | Rahul Vishwakarma et.al. | 2401.09479 | link |
2024-01-18 | Unsupervised Multiple Domain Translation through Controlled Disentanglement in Variational Autoencoder | Antonio Almudévar et.al. | 2401.09180 | link |
2024-01-17 | ACT-GAN: Radio map construction based on generative adversarial networks with ACT blocks | Chen Qi et.al. | 2401.08976 | null |
2024-01-12 | A Physics-informed machine learning model for time-dependent wave runup prediction | Saeed Saviz Naeini et.al. | 2401.08684 | null |
2024-01-16 | Inpainting Normal Maps for Lightstage data | Hancheng Zuo et.al. | 2401.08099 | null |
2024-01-16 | Adversarial Masking Contrastive Learning for vein recognition | Huafeng Qin et.al. | 2401.08079 | null |
2024-01-15 | Graph Transformer GANs with Graph Masked Modeling for Architectural Layout Generation | Hao Tang et.al. | 2401.07721 | null |
2024-01-15 | Multimodal Crowd Counting with Pix2Pix GANs | Muhammad Asif Khan et.al. | 2401.07591 | null |
2024-01-15 | Cross Domain Early Crop Mapping using CropGAN and CNN Classifier | Yiqun Wang et.al. | 2401.07398 | null |
2024-01-14 | Generation of Synthetic Images for Pedestrian Detection Using a Sequence of GANs | Viktor Seib et.al. | 2401.07370 | null |
2024-01-14 | A Survey on Statistical Theory of Deep Learning: Approximation, Training Dynamics, and Generative Models | Namjoon Suh et.al. | 2401.07187 | null |
2024-01-13 | Quantum Generative Diffusion Model | Chuangtao Chen et.al. | 2401.07039 | null |
2024-01-12 | Causally Aware Generative Adversarial Networks for Light Pollution Control | Yuyao Zhang et.al. | 2401.06453 | link |
2024-01-12 | Towards High-Quality and Efficient Speech Bandwidth Extension with Parallel Amplitude and Phase Prediction | Ye-Xin Lu et.al. | 2401.06387 | null |
2024-01-11 | E | Yifan Gong et.al. | 2401.06127 | null |
2024-01-11 | RAVEN: Rethinking Adversarial Video Generation with Efficient Tri-plane Networks | Partha Ghosh et.al. | 2401.06035 | null |
2024-01-11 | GE-AdvGAN: Improving the transferability of adversarial samples by gradient editing-based adversarial generative model | Zhiyu Zhu et.al. | 2401.06031 | link |
2024-01-11 | HiCAST: Highly Customized Arbitrary Style Transfer with Adapter Enhanced Diffusion Models | Hanzhang Wang et.al. | 2401.05870 | null |
2024-01-11 | Evaluating Data Augmentation Techniques for Coffee Leaf Disease Classification | Adrian Gheorghiu et.al. | 2401.05768 | null |
2024-01-11 | CAT-LLM: Prompting Large Language Models with Text Style Definition for Chinese Article-style Transfer | Zhen Tao et.al. | 2401.05707 | link |
2024-01-11 | Nucleus subtype classification using inter-modality learning | Lucas W. Remedios et.al. | 2401.05602 | null |
2024-01-10 | An Augmented Surprise-guided Sequential Learning Framework for Predicting the Melt Pool Geometry | Ahmed Shoyeb Raihan et.al. | 2401.05579 | null |
2024-01-10 | FPRF: Feed-Forward Photorealistic Style Transfer of Large-Scale 3D Neural Radiance Fields | GeonU Kim et.al. | 2401.05516 | null |
2024-01-10 | Synthesis of pulses from particle detectors with a Generative Adversarial Network (GAN) | Alberto Regadío et.al. | 2401.05295 | null |
2024-01-10 | Application of Deep Learning in Blind Motion Deblurring: Current Status and Future Prospects | Yawen Xiang et.al. | 2401.05055 | link |
2024-01-10 | Latency-aware Road Anomaly Segmentation in Videos: A Photorealistic Dataset and New Metrics | Beiwen Tian et.al. | 2401.04942 | null |
2024-01-09 | Advancing Ante-Hoc Explainable Models through Generative Adversarial Networks | Tanmay Garg et.al. | 2401.04647 | null |
2024-01-09 | Let's Go Shopping (LGS) -- Web-Scale Image-Text Dataset for Visual Concept Understanding | Yatong Bai et.al. | 2401.04575 | null |
2024-01-09 | Zero Shot Audio to Audio Emotion Transfer With Speaker Disentanglement | Soumya Dutta et.al. | 2401.04511 | link |
2024-01-08 | Generative adversarial wavelet neural operator: Application to fault detection and isolation of multivariate time series data | Jyoti Rani et.al. | 2401.04004 | null |
2024-01-07 | Towards a Unified Method for Network Dynamic via Adversarial Weighted Link Prediction | Meng Qin et.al. | 2401.03444 | null |
2024-01-07 | Advancing Noise-Resilient Twist Angle Characterization in Bilayer Graphene through Raman Spectroscopy via GAN-CNN Modeling | Dan Hu et.al. | 2401.03371 | null |
2024-01-04 | An AI-enabled Bias-Free Respiratory Disease Diagnosis Model using Cough Audio: A Case Study for COVID-19 | Tabish Saeed et.al. | 2401.02996 | null |
2024-01-05 | Characteristics and prevalence of fake social media profiles with AI-generated faces | Kai-Cheng Yang et.al. | 2401.02627 | link |
2024-01-04 | What You See is What You GAN: Rendering Every Pixel for High-Fidelity Geometry in 3D GANs | Alex Trevithick et.al. | 2401.02411 | null |
2024-01-04 | CALPAGAN: Calorimetry for Particles using GANs | Anil Dogru et.al. | 2401.02248 | null |
2024-01-03 | Representation Learning of Multivariate Time Series using Attention and Adversarial Training | Leon Scharwächter et.al. | 2401.01987 | null |
2024-01-03 | Can We Generate Realistic Hands Only Using Convolution? | Mehran Hosseini et.al. | 2401.01951 | null |
2024-01-03 | Adversarial Machine Learning-Enabled Anonymization of OpenWiFi Data | Samhita Kuili et.al. | 2401.01542 | null |
2024-01-03 | Automated Segmentation of Large Image Datasets using Artificial Intelligence for Microstructure Characterisation, Damage Analysis and High-Throughput Modelling Input | Setareh Medghalchi et.al. | 2401.01147 | null |
2024-01-02 | Auffusion: Leveraging the Power of Diffusion and Large Language Models for Text-to-Audio Generation | Jinlong Xue et.al. | 2401.01044 | link |
2024-01-01 | An attempt to generate new bridge types from latent space of generative adversarial network | Hongjun Zhang et.al. | 2401.00700 | link |
2023-12-31 | RainSD: Rain Style Diversification Module for Image Synthesis Enhancement using Feature-Level Style Distribution | Hyeonjae Jeon et.al. | 2401.00460 | null |
2024-01-04 | TSGAN: An Optical-to-SAR Dual Conditional GAN for Optical based SAR Temporal Shifting | Moien Rangzan et.al. | 2401.00440 | link |
2023-12-29 | Distance Guided Generative Adversarial Network for Explainable Binary Classifications | Xiangyu Xiong et.al. | 2312.17538 | link |
2023-12-28 | Learning to Generate Text in Arbitrary Writing Styles | Aleem Khan et.al. | 2312.17242 | null |
2023-12-28 | A GAN-based Semantic Communication for Text without CSI | Jin Mao et.al. | 2312.16909 | null |
2023-12-30 | A Survey on Super Resolution for video Enhancement Using GAN | Ankush Maity et.al. | 2312.16471 | null |
2023-12-27 | Active Third-Person Imitation Learning | Timo Klein et.al. | 2312.16365 | null |
2023-12-25 | MetaScript: Few-Shot Handwritten Chinese Content Generation via Generative Adversarial Networks | Xiangyuan Xue et.al. | 2312.16251 | link |
2023-12-25 | MuLA-GAN: Multi-Level Attention GAN for Enhanced Underwater Visibility | Ahsan Baidar Bakht et.al. | 2312.15633 | null |
2023-12-25 | GanFinger: GAN-Based Fingerprint Generation for Deep Neural Network Ownership Verification | Huali Ren et.al. | 2312.15617 | null |
2023-12-23 | AdamL: A fast adaptive gradient method incorporating loss function | Lu Xia et.al. | 2312.15295 | null |
2023-12-23 | IRG: Generating Synthetic Relational Databases using GANs | Jiayu Li et.al. | 2312.15187 | null |
2023-12-23 | Multilingual Bias Detection and Mitigation for Indian Languages | Ankita Maity et.al. | 2312.15181 | null |
2023-12-22 | EGAIN: Extended GAn INversion | Wassim Kabbani et.al. | 2312.15116 | null |
2023-12-22 | Neural network models for preferential concentration of particles in two-dimensional turbulence | Thibault Maurel-Oujia et.al. | 2312.14829 | null |
2023-12-22 | The Effects of Signal-to-Noise Ratio on Generative Adversarial Networks Applied to Marine Bioacoustic Data | Georgia Atkinson et.al. | 2312.14806 | null |
2023-12-22 | The Rate-Distortion-Perception-Classification Tradeoff: Joint Source Coding and Modulation via Inverse-Domain GANs | Junli Fang et.al. | 2312.14792 | null |
2023-12-22 | Compressing Image-to-Image Translation GANs Using Local Density Structures on Their Learned Manifold | Alireza Ganjdanesh et.al. | 2312.14776 | null |
2023-12-22 | Balancing the Style-Content Trade-Off in Sentiment Transfer Using Polarity-Aware Denoising | Sourabrata Mukherjee et.al. | 2312.14708 | link |
2023-12-22 | Self-Supervised Generative Models for Crystal Structures | Fangze Liu et.al. | 2312.14485 | null |
2023-12-22 | AdvCloak: Customized Adversarial Cloak for Privacy Protection | Xuannan Liu et.al. | 2312.14407 | null |
2023-12-21 | Open-Set: ID Card Presentation Attack Detection using Neural Transfer Style | Reuben Markham et.al. | 2312.13993 | null |
2023-12-21 | Adapt & Align: Continual Learning with Generative Models Latent Space Alignment | Kamil Deja et.al. | 2312.13699 | link |
2023-12-21 | Free-Editor: Zero-shot Text-driven 3D Scene Editing | Nazmul Karim et.al. | 2312.13663 | null |
2023-12-21 | A Comprehensive End-to-End Computer Vision Framework for Restoration and Recognition of Low-Quality Engineering Drawings | Lvyang Yang et.al. | 2312.13620 | link |
2023-12-21 | HyperEditor: Achieving Both Authenticity and Cross-Domain Capability in Image Editing via Hypernetworks | Hai Zhang et.al. | 2312.13537 | link |
2023-12-21 | SPDGAN: A Generative Adversarial Network based on SPD Manifold Learning for Automatic Image Colorization | Youssef Mourchid et.al. | 2312.13506 | null |
2023-12-20 | Texture Matching GAN for CT Image Enhancement | Madhuri Nagare et.al. | 2312.13422 | null |
2023-12-20 | A 3D super-resolution of wind fields via physics-informed pixel-wise self-attention generative adversarial network | Takuya Kurihana et.al. | 2312.13212 | null |
2023-12-20 | Neural Stochastic Differential Equations with Change Points: A Generative Adversarial Approach | Zhongchang Sun et.al. | 2312.13152 | null |
2023-12-20 | Pixel-to-Abundance Translation: Conditional Generative Adversarial Networks Based on Patch Transformer for Hyperspectral Unmixing | Li Wang et.al. | 2312.13127 | null |
2023-12-20 | A self-attention-based differentially private tabular GAN with high data utility | Zijian Li et.al. | 2312.13031 | null |
2023-12-19 | Unveiling Spaces: Architecturally meaningful semantic descriptions from images of interior spaces | Demircan Tas et.al. | 2312.12481 | null |
2023-12-19 | Atlantis: Enabling Underwater Depth Estimation with Stable Diffusion | Fan Zhang et.al. | 2312.12471 | link |
2023-12-19 | FontDiffuser: One-Shot Font Generation via Denoising Diffusion with Multi-Scale Content Aggregation and Style Contrastive Learning | Zhenhua Yang et.al. | 2312.12142 | link |
2023-12-19 | Self-supervised Learning for Enhancing Geometrical Modeling in 3D-Aware Generative Adversarial Network | Jiarong Guo et.al. | 2312.11856 | null |
2023-12-18 | Ultrasound Image Enhancement using CycleGAN and Perceptual Loss | Shreeram Athreya et.al. | 2312.11748 | link |
2023-12-17 | COPD-FlowNet: Elevating Non-invasive COPD Diagnosis with CFD Simulations | Aryan Tyagi et.al. | 2312.11561 | null |
2023-12-18 | Multi-scale Reconstruction of Turbulent Rotating Flows with Generative Diffusion Models | Tianyi Li et.al. | 2312.11121 | link |
2023-12-17 | High-Fidelity Face Swapping with Style Blending | Xinyu Yang et.al. | 2312.10843 | null |
2023-12-17 | StyleSinger: Style Transfer for Out-Of-Domain Singing Voice Synthesis | Yu Zhang et.al. | 2312.10741 | null |
2023-12-19 | MM-TTS: Multi-modal Prompt based Style Transfer for Expressive Text-to-Speech Synthesis | Wenhao Guan et.al. | 2312.10687 | null |
2023-12-17 | Bengali Intent Classification with Generative Adversarial BERT | Mehedi Hasan et.al. | 2312.10679 | null |
2023-12-17 | Analisis Eksploratif Dan Augmentasi Data NSL-KDD Menggunakan Deep Generative Adversarial Networks Untuk Meningkatkan Performa Algoritma Extreme Gradient Boosting Dalam Klasifikasi Jenis Serangan Siber | K. P. Santoso et.al. | 2312.10669 | null |
2023-12-16 | Lecture Notes in Probabilistic Diffusion Models | Inga Strümke et.al. | 2312.10393 | null |
2023-12-15 | NM-FlowGAN: Modeling sRGB Noise with a Hybrid Approach based on Normalizing Flows and Generative Adversarial Networks | Young Joo Han et.al. | 2312.10112 | link |
2023-12-15 | Quantum Generative Adversarial Networks: Bridging Classical and Quantum Realms | Sahil Nokhwal et.al. | 2312.09939 | null |
2023-12-15 | LogoStyleFool: Vitiating Video Recognition Systems via Logo Style Transfer | Yuxin Cao et.al. | 2312.09935 | link |
2023-12-15 | Style Generation in Robot Calligraphy with Deep Generative Adversarial Networks | Xiaoming Wang et.al. | 2312.09673 | null |
2023-12-15 | Image Deblurring using GAN | Zhengdong Li et.al. | 2312.09496 | null |
2023-12-11 | Style Injection in Diffusion: A Training-free Approach for Adapting Large-scale Diffusion Models for Style Transfer | Jiwoo Chung et.al. | 2312.09008 | null |
2023-12-12 | Diffusion Cocktail: Fused Generation from Diffusion Models | Haoming Liu et.al. | 2312.08873 | link |
2023-12-14 | CPST: Comprehension-Preserving Style Transfer for Multi-Modal Narratives | Yi-Chun Chen et.al. | 2312.08695 | null |
2023-12-14 | CartoMark: a benchmark dataset for map pattern recognition and map content retrieval with machine intelligence | Xiran Zhou et.al. | 2312.08600 | null |
2023-12-13 | Integrating Particle Flavor into Deep Learning Models for Hadronization | Jay Chan et.al. | 2312.08453 | null |
2023-12-13 | PhenDiff: Revealing Invisible Phenotypes with Conditional Diffusion Models | Anis Bourou et.al. | 2312.08290 | link |
2023-12-13 | A Compact and Semantic Latent Space for Disentangled and Controllable Image Editing | Gwilherm Lesné et.al. | 2312.08256 | null |
2023-12-13 | Towards Better Morphed Face Images without Ghosting Artifacts | Clemens Seibold et.al. | 2312.08111 | null |
2023-12-13 | ClusterDDPM: An EM clustering framework with Denoising Diffusion Probabilistic Models | Jie Yan et.al. | 2312.08029 | null |
2023-12-13 | Artificial Intelligence Studies in Cartography: A Review and Synthesis of Methods, Applications, and Ethics | Yuhao Kang et.al. | 2312.07901 | null |
2023-12-12 | Scalable Motion Style Transfer with Constrained Diffusion Generation | Wenjie Yin et.al. | 2312.07311 | null |
2023-12-12 | Experimental Investigation of Machine Learning based Soft-Failure Management using the Optical Spectrum | Lars E. Kruse et.al. | 2312.07208 | null |
2023-12-12 | Patch-MI: Enhancing Model Inversion Attacks via Patch-Based Reconstruction | Jonggyu Jang et.al. | 2312.07040 | link |
2023-12-12 | Prediction and control of two-dimensional decaying turbulence using generative adversarial networks | Jiyeon Kim et.al. | 2312.07037 | null |
2023-12-11 | Deep Learning based Modeling of Wireless Communication Channel with Fading | Lee Youngmin et.al. | 2312.06849 | null |
2023-12-10 | Class-Prototype Conditional Diffusion Model for Continual Learning with Generative Replay | Khanh Doan et.al. | 2312.06710 | link |
2023-12-10 | Neutral Editing Framework for Diffusion-based Video Editing | Sunjae Yoon et.al. | 2312.06708 | null |
2023-12-11 | A GAN Approach for Node Embedding in Heterogeneous Graphs Using Subgraph Sampling | Hung Chun Hsu et.al. | 2312.06519 | null |
2023-12-11 | Semantic Image Synthesis for Abdominal CT | Yan Zhuang et.al. | 2312.06453 | null |
2023-12-11 | Deep Imbalanced Learning for Multimodal Emotion Recognition in Conversations | Tao Meng et.al. | 2312.06337 | null |
2023-12-11 | ArtBank: Artistic Style Transfer with Pre-trained Diffusion Model and Implicit Style Prompt Bank | Zhanjie Zhang et.al. | 2312.06135 | link |
2023-12-10 | AesFA: An Aesthetic Feature-Aware Arbitrary Neural Style Transfer | Joonwoo Kwon et.al. | 2312.05928 | link |
2023-12-09 | Iterative Token Evaluation and Refinement for Real-World Super-Resolution | Chaofeng Chen et.al. | 2312.05616 | link |
2023-12-09 | BARET: Balanced Attention based Real image Editing driven by Target-text Inversion | Yuming Qiao et.al. | 2312.05482 | null |
2023-12-08 | Multi-view Inversion for 3D-aware Generative Adversarial Networks | Florian Barthel et.al. | 2312.05330 | null |
2023-12-08 | MuVieCAST: Multi-View Consistent Artistic Style Transfer | Nail Ibrahimli et.al. | 2312.05046 | null |
2023-12-08 | Synthesizing Traffic Datasets using Graph Neural Networks | Daniel Rodriguez-Criado et.al. | 2312.05031 | link |
2023-12-08 | Damage GAN: A Generative Model for Imbalanced Data | Ali Anaissi et.al. | 2312.04862 | null |
2023-12-08 | Induced Generative Adversarial Particle Transformers | Anni Li et.al. | 2312.04757 | null |
2023-12-07 | Probabilistic volumetric speckle suppression in OCT using deep learning | Bhaskara Rao Chintada et.al. | 2312.04460 | link |
2023-12-07 | Learning to sample in Cartesian MRI | Thomas Sanchez et.al. | 2312.04327 | null |
2023-12-07 | Towards 4D Human Video Stylization | Tiantian Wang et.al. | 2312.04143 | link |
2023-12-07 | Style Transfer to Calvin and Hobbes comics using Stable Diffusion | Sloke Shrestha et.al. | 2312.03993 | null |
2023-12-06 | Data-driven Crop Growth Simulation on Time-varying Generated Images using Multi-conditional Generative Adversarial Networks | Lukas Drees et.al. | 2312.03443 | link |
2023-12-05 | A study of topological quantities of lattice QCD by a modified DCGAN frame | Lin Gao et.al. | 2312.03023 | null |
2023-12-03 | Cycle-consistent Generative Adversarial Network Synthetic CT for MR-only Adaptive Radiation Therapy on MR-Linac | Gabriel L. Asher et.al. | 2312.02211 | null |
2023-12-04 | ArtAdapter: Text-to-Image Style Transfer using Multi-Level Style Encoder and Explicit Adaptation | Dar-Yen Chen et.al. | 2312.02109 | null |
2023-12-04 | SRTransGAN: Image Super-Resolution using Transformer based Generative Adversarial Network | Neeraj Baghel et.al. | 2312.01999 | null |
2023-12-04 | SEFGAN: Harvesting the Power of Normalizing Flows and GANs for Efficient High-Quality Speech Enhancement | Martin Strauss et.al. | 2312.01744 | null |
2023-12-04 | Multimodality-guided Image Style Transfer using Cross-modal GAN Inversion | Hanyu Wang et.al. | 2312.01671 | null |
2023-12-04 | Learning Channel Capacity with Neural Mutual Information Estimator Based on Message Importance Measure | Zhefan Li et.al. | 2312.01546 | null |
2023-12-02 | SASSL: Enhancing Self-Supervised Learning via Neural Style Transfer | Renan A. Rojas-Gomez et.al. | 2312.01187 | null |
2023-12-02 | Generating Images of the M87 Black Hole Using GANs* | Arya Mohan et.al. | 2312.01005 | link |
2023-12-02 | Convergences for Minimax Optimization Problems over Infinite-Dimensional Spaces Towards Stability in Adversarial Training | Takashi Furuya et.al. | 2312.00991 | null |
2023-12-02 | Deep Generative Attacks and Countermeasures for Data-Driven Offline Signature Verification | An Ngo et.al. | 2312.00987 | null |
2023-12-01 | Adversarial Score Distillation: When score distillation meets GAN | Min Wei et.al. | 2312.00739 | link |
2023-11-30 | S2ST: Image-to-Image Translation in the Seed Space of Latent Diffusion | Or Greenberg et.al. | 2312.00116 | null |
2023-11-30 | Adversarial Attacks and Defenses for Wireless Signal Classifiers using CDI-aware GANs | Sujata Sinha et.al. | 2311.18820 | null |
2023-11-30 | Wasserstein GANs are Minimax Optimal Distribution Estimators | Arthur Stéphanovitch et.al. | 2311.18613 | null |
2023-11-30 | Combining deep generative models with extreme value theory for synthetic hazard simulation: a multivariate and spatially coherent approach | Alison Peard et.al. | 2311.18521 | null |
2023-11-30 | CAT-DM: Controllable Accelerated Virtual Try-on with Diffusion Model | Jianhao Zeng et.al. | 2311.18405 | link |
2023-11-30 | Advances in 3D Neural Stylization: A Survey | Yingshu Chen et.al. | 2311.18328 | link |
2023-11-30 | Deep Reinforcement Learning Based Optimal Energy Management of Multi-energy Microgrids with Uncertainties | Yang Cui et.al. | 2311.18327 | null |
2023-11-30 | Beyond Entropy: Style Transfer Guided Single Image Continual Test-Time Adaptation | Younggeol Cho et.al. | 2311.18270 | null |
2023-11-30 | SMaRt: Improving GANs with Score Matching Regularity | Mengfei Xia et.al. | 2311.18208 | null |
2023-11-29 | Zooming Out on Zooming In: Advancing Super-Resolution for Remote Sensing | Piper Wolters et.al. | 2311.18082 | link |
2023-11-29 | DiffGEPCI: 3D MRI Synthesis from mGRE Signals using 2.5D Diffusion Model | Yuyang Hu et.al. | 2311.18073 | null |
2023-11-29 | GELDA: A generative language annotation framework to reveal visual biases in datasets | Krish Kabra et.al. | 2311.18064 | null |
2023-11-29 | Gaussian Shell Maps for Efficient 3D Human Generation | Rameen Abdal et.al. | 2311.17857 | null |
2023-11-29 | Leveraging Graph Diffusion Models for Network Refinement Tasks | Puja Trivedi et.al. | 2311.17856 | null |
2023-11-29 | SyncTalk: The Devil is in the Synchronization for Talking Head Synthesis | Ziqiao Peng et.al. | 2311.17590 | link |
2023-11-29 | Microstructure reconstruction of 2D/3D random materials via diffusion-based deep generative models | Xianrui Lyu et.al. | 2311.17319 | null |
2023-11-25 | | Yingying Deng et.al. | 2311.16491 | null |
2023-11-28 | Observer study-based evaluation of TGAN architecture used to generate oncological PET images | Roberto Fedrigo et.al. | 2311.16047 | null |
2023-11-27 | A deep learning approach for marine snow synthesis and removal | Fernando Galetto et.al. | 2311.15584 | link |
2023-11-27 | Video-based Visible-Infrared Person Re-Identification with Auxiliary Samples | Yunhao Du et.al. | 2311.15571 | link |
2023-11-27 | ET3D: Efficient Text-to-3D Generation via Multi-View Distillation | Yiming Chen et.al. | 2311.15561 | null |
2023-11-25 | Multi-Scale Sub-Band Constant-Q Transform Discriminator for High-Fidelity Vocoder | Yicheng Gu et.al. | 2311.14957 | null |
2023-11-25 | FreePIH: Training-Free Painterly Image Harmonization with Diffusion Model | Ruibin Li et.al. | 2311.14926 | null |
2023-11-24 | Neural Style Transfer for Computer Games | Eleftherios Ioannou et.al. | 2311.14617 | null |
2023-11-24 | A Parameterized Generative Adversarial Network Using Cyclic Projection for Explainable Medical Image Classification | Xiangyu Xiong et.al. | 2311.14388 | null |
2023-11-23 | Video Anomaly Detection using GAN | Anikeit Sethi et.al. | 2311.14095 | null |
2023-11-23 | Human Machine Co-Creation. A Complementary Cognitive Approach to Creative Character Design Process Using GANs | Mohammad Lataifeh et.al. | 2311.13960 | null |
2023-11-23 | Exploring Methods for Cross-lingual Text Style Transfer: The Case of Text Detoxification | Daryna Dementieva et.al. | 2311.13937 | null |
2023-11-28 | Perceptual Image Compression with Cooperative Cross-Modal Side Information | Shiyu Qin et.al. | 2311.13847 | null |
2023-11-22 | Physics-driven generative adversarial networks empower single-pixel infrared hyperspectral imaging | Dong-Yin Wang et.al. | 2311.13626 | null |
2023-11-29 | Soulstyler: Using Large Language Model to Guide Image Style Transfer for Target Object | Junhao Chen et.al. | 2311.13562 | link |
2023-11-22 | 3D Face Style Transfer with a Hybrid Solution of NeRF and Mesh Rasterization | Jianwei Feng et.al. | 2311.13168 | null |
2023-11-21 | Volatility and irregularity Capturing in stock price indices using time series Generative adversarial networks (TimeGAN) | Leonard Mushunje et.al. | 2311.12987 | null |
2023-11-22 | Creating Temporally Correlated High-Resolution Power Injection Profiles Using Physics-Aware GAN | Hritik Gopal Shah et.al. | 2311.12166 | null |
2023-11-20 | CrackCLF: Automatic Pavement Crack Detection based on Closed-Loop Feedback | Chong Li et.al. | 2311.11815 | null |
2023-11-20 | AdvGen: Physical Adversarial Attack on Face Presentation Attack Detection Systems | Sai Amrit Patnaik et.al. | 2311.11753 | null |
2023-11-18 | Diverse Shape Completion via Style Modulated Generative Adversarial Networks | Wesley Khademi et.al. | 2311.11184 | null |
2023-11-18 | Deep Coherence Learning: An Unsupervised Deep Beamformer for High Quality Single Plane Wave Imaging in Medical Ultrasound | Hyunwoo Cho et.al. | 2311.11169 | null |
2023-11-18 | Compact and Intuitive Airfoil Parameterization Method through Physics-aware Variational Autoencoder | Yu-Eop Kang et.al. | 2311.10921 | null |
2023-11-17 | Pre- to Post-Contrast Breast MRI Synthesis for Enhanced Tumour Segmentation | Richard Osuala et.al. | 2311.10879 | link |
2023-11-17 | A Quadratic Speedup in Finding Nash Equilibria of Quantum Zero-Sum Games | Francisca Vasconcelos et.al. | 2311.10859 | null |
2023-11-17 | Human motion trajectory prediction using the Social Force Model for real-time and low computational cost applications | Oscar Gil et.al. | 2311.10582 | null |
2023-11-17 | Semi-supervised ViT knowledge distillation network with style transfer normalization for colorectal liver metastases survival prediction | Mohamed El Amine Elforaici et.al. | 2311.10305 | null |
2023-11-21 | Advancements in Generative AI: A Comprehensive Review of GANs, GPT, Autoencoders, Diffusion Model, and Transformers | Staphord Bengesi et.al. | 2311.10242 | null |
2023-11-15 | Study of topological quantities of lattice QCD by a modified Wasserstein generative adversarial network | Lin Gao et.al. | 2311.10108 | null |
2023-11-16 | MAM-E: Mammographic synthetic image generation with diffusion models | Ricardo Montoya-del-Angel et.al. | 2311.09822 | link |
2023-11-16 | SynDiffix: More accurate synthetic structured data | Paul Francis et.al. | 2311.09628 | null |
2023-11-15 | Strategic Data Augmentation with CTGAN for Smart Manufacturing: Enhancing Machine Learning Predictions of Paper Breaks in Pulp-and-Paper Production | Hamed Khosravi et.al. | 2311.09333 | null |
2023-11-15 | NormNet: Scale Normalization for 6D Pose Estimation in Stacked Scenarios | En-Te Lin et.al. | 2311.09269 | link |
2023-11-15 | FastBlend: a Powerful Model-Free Toolkit Making Video Stylization Easier | Zhongjie Duan et.al. | 2311.09265 | link |
2023-11-17 | RBPGAN: Recurrent Back-Projection GAN for Video Super Resolution | Israa Fahmy et.al. | 2311.09178 | null |
2023-11-14 | PEMA: Plug-in External Memory Adaptation for Language Models | HyunJin Kim et.al. | 2311.08590 | null |
2023-11-14 | TSST: A Benchmark and Evaluation Models for Text Speech-Style Transfer | Huashan Sun et.al. | 2311.08389 | null |
2023-11-13 | STEER: Unified Style Transfer with Expert Reinforcement | Skyler Hallinan et.al. | 2311.07167 | null |
2023-11-13 | CycleGANAS: Differentiable Neural Architecture Search for CycleGAN | Taegun An et.al. | 2311.07162 | null |
2023-11-16 | In-context Vectors: Making In Context Learning More Effective and Controllable Through Latent Space Steering | Sheng Liu et.al. | 2311.06668 | link |
2023-11-10 | BanglaBait: Semi-Supervised Adversarial Approach for Clickbait Detection on Bangla Clickbait Dataset | Md. Motahar Mahtab et.al. | 2311.06204 | link |
2023-11-09 | 3DGAUnet: 3D generative adversarial networks with a 3D U-Net based generator to achieve the accurate and effective synthesis of clinical tumor image data for pancreatic cancer | Yu Shi et.al. | 2311.05697 | null |
2023-11-09 | L-WaveBlock: A Novel Feature Extractor Leveraging Wavelets for Generative Adversarial Networks | Mirat Shah et.al. | 2311.05548 | null |
2023-11-09 | Robust Retraining-free GAN Fingerprinting via Personalized Normalization | Jianwei Fei et.al. | 2311.05478 | null |
2023-11-09 | Designing ship hull forms using generative adversarial networks | Kazuo Yonekura et.al. | 2311.05470 | null |
2023-11-09 | ControlStyle: Text-Driven Stylized Image Generation Using Diffusion Priors | Jingwen Chen et.al. | 2311.05463 | null |
2023-11-09 | Airfoil generation and feature extraction using the conditional VAE-WGAN-gp | Kazuo Yonekura et.al. | 2311.05445 | null |
2023-11-09 | Dual Pipeline Style Transfer with Input Distribution Differentiation | ShiQi Jiang et.al. | 2311.05432 | null |
2023-11-09 | Few-Shot Recognition and Classification of Jamming Signal via CGAN-Based Fusion CNN Algorithm | Xuhui Ding et.al. | 2311.05273 | null |
2023-11-10 | Let's Get the FACS Straight -- Reconstructing Obstructed Facial Features | Tim Büchner et.al. | 2311.05221 | null |
2023-11-09 | Social Media Bot Detection using Dropout-GAN | Anant Shukla et.al. | 2311.05079 | null |
2023-11-08 | Diff-HierVC: Diffusion-based Hierarchical Voice Conversion with Robust Pitch Generation and Masked Prior for Zero-shot Speaker Adaptation | Ha-Yeong Choi et.al. | 2311.04693 | null |
2023-11-08 | Deep learning as a tool for quantum error reduction in quantum image processing | Krzysztof Werner et.al. | 2311.04575 | null |
2023-11-08 | A 3D generative model of pathological multi-modal MR images and segmentations | Virginia Fernandez et.al. | 2311.04552 | link |
2023-11-07 | Generative Structural Design Integrating BIM and Diffusion Model | Zhili He et.al. | 2311.04052 | null |
2023-11-07 | 3D EAGAN: 3D edge-aware attention generative adversarial network for prostate segmentation in transrectal ultrasound images | Mengqing Liu et.al. | 2311.04049 | null |
2023-11-08 | Improving the Effectiveness of Deep Generative Data | Ruyu Wang et.al. | 2311.03959 | null |
2023-11-07 | MeVGAN: GAN-based Plugin Model for Video Generation with Applications in Colonoscopy | Łukasz Struski et.al. | 2311.03884 | null |
2023-11-07 | SCONE-GAN: Semantic Contrastive learning-based Generative Adversarial Network for an end-to-end image translation | Iman Abbasnejad et.al. | 2311.03866 | null |
2023-11-07 | Unsupervised Video Summarization | Hanqing Li et.al. | 2311.03745 | null |
2023-11-08 | DeepInspect: An AI-Powered Defect Detection for Manufacturing Industries | Arti Kumbhar et.al. | 2311.03725 | null |
2023-11-06 | Multi-Resolution Diffusion for Privacy-Sensitive Recommender Systems | Derek Lilienthal et.al. | 2311.03488 | link |
2023-11-06 | Preserving Privacy in GANs Against Membership Inference Attack | Mohammadhadi Shateri et.al. | 2311.03172 | null |
2023-11-06 | A Two-Stage Generative Model with CycleGAN and Joint Diffusion for MRI-based Brain Tumor Detection | Wenxin Wang et.al. | 2311.03074 | link |
2023-11-08 | Deep Image Semantic Communication Model for Artificial Intelligent Internet of Things | Li Ping Qian et.al. | 2311.02926 | link |
2023-11-06 | Flexible Multi-Generator Model with Fused Spatiotemporal Graph for Trajectory Prediction | Peiyuan Zhu et.al. | 2311.02835 | null |
2023-11-05 | Synthetic Tumor Manipulation: With Radiomics Features | Inye Na et.al. | 2311.02586 | null |
2023-11-04 | A Strictly Bounded Deep Network for Unpaired Cyclic Translation of Medical Images | Swati Rai et.al. | 2311.02480 | null |
2023-11-04 | MTS-DVGAN: Anomaly Detection in Cyber-Physical Systems using a Dual Variational Generative Adversarial Network | Haili Sun et.al. | 2311.02378 | null |
2023-11-03 | Optimal Image Transport on Sparse Dictionaries | Junqing Huang et.al. | 2311.01984 | null |
2023-11-03 | Latent Diffusion Model for Conditional Reservoir Facies Generation | Daesoo Lee et.al. | 2311.01968 | null |
2023-11-06 | An Efficient Detection and Control System for Underwater Docking using Machine Learning and Realistic Simulation: A Comprehensive Approach | Jalil Chavez-Galaviz et.al. | 2311.01522 | null |
2023-11-02 | Ultra-Fast Generation of Air Shower Images for Imaging Air Cherenkov Telescopes using Generative Adversarial Networks | Christian Elflein et.al. | 2311.01385 | null |
2023-11-02 | Monotone Generative Modeling via a Gromov-Monge Embedding | Wonjun Lee et.al. | 2311.01375 | null |
2023-11-02 | Novel View Synthesis from a Single RGBD Image for Indoor Scenes | Congrui Hetang et.al. | 2311.01065 | null |
2023-11-02 | A Chronological Survey of Theoretical Advancements in Generative Adversarial Networks for Computer Vision | Hrishikesh Sharma et.al. | 2311.00995 | null |
2023-11-02 | Stochastic Smoothed Gradient Descent Ascent for Federated Minimax Optimization | Wei Shen et.al. | 2311.00944 | null |
2023-11-01 | Generating HSR Bogie Vibration Signals via Pulse Voltage-Guided Conditional Diffusion Model | Xuan Liu et.al. | 2311.00496 | link |
2023-11-01 | Flooding Regularization for Stable Training of Generative Adversarial Networks | Iu Yahiro et.al. | 2311.00318 | null |
2023-10-31 | Histopathological Image Analysis with Style-Augmented Feature Domain Mixing for Improved Generalization | Vaibhav Khamankar et.al. | 2310.20638 | link |
2023-10-31 | Using Higher-Order Moments to Assess the Quality of GAN-generated Image Features | Lorenzo Luzi et.al. | 2310.20636 | null |
2023-10-31 | A physics-informed GAN Framework based on Model-free Data-Driven Computational Mechanics | Kerem Ciftci et.al. | 2310.20308 | null |
2023-10-31 | An Implementation of Multimodal Fusion System for Intelligent Digital Human Generation | Yingjie Zhou et.al. | 2310.20251 | link |
2023-10-31 | Synthesizing Diabetic Foot Ulcer Images with Diffusion Model | Reza Basiri et.al. | 2310.20140 | null |
2023-10-30 | GOPlan: Goal-conditioned Offline Reinforcement Learning by Planning with Learned Models | Mianchu Wang et.al. | 2310.20025 | null |
2023-10-30 | Generated Distributions Are All You Need for Membership Inference Attacks Against Generative Models | Minxing Zhang et.al. | 2310.19410 | link |
2023-10-30 | A3SA: Advanced Data Augmentation via Adjoint Sensitivity Analysis | Chanik Kang et.al. | 2310.19291 | null |
2023-10-30 | EDiffSR: An Efficient Diffusion Probabilistic Model for Remote Sensing Image Super-Resolution | Yi Xiao et.al. | 2310.19288 | link |
2023-10-29 | Finding Optimal Training Parameters for Quantum Generative Adversarial Networks | C. Strynar et.al. | 2310.19117 | null |
2023-10-29 | Emergence of Shape Bias in Convolutional Neural Networks through Activation Sparsity | Tianqin Li et.al. | 2310.18894 | link |
2023-10-28 | Translating away Translationese without Parallel Data | Rricha Jalota et.al. | 2310.18830 | null |
2023-10-27 | Addressing GAN Training Instabilities via Tunable Classification Losses | Monica Welfert et.al. | 2310.18291 | null |
2023-10-27 | PlantPlotGAN: A Physics-Informed Generative Adversarial Network for Plant Disease Prediction | Felipe A. Lopes et.al. | 2310.18268 | null |
2023-10-26 | MIM-GAN-based Anomaly Detection for Multivariate Time Series Data | Shan Lu et.al. | 2310.18257 | link |
2023-10-30 | Generative AI Model for Artistic Style Transfer Using Convolutional Neural Networks | Jonayet Miah et.al. | 2310.18237 | null |
2023-10-27 | Adversarial Anomaly Detection using Gaussian Priors and Nonlinear Anomaly Scores | Fiete Lüer et.al. | 2310.18091 | link |
2023-10-27 | Towards optimal multimode fiber imaging by leveraging input polarization and conditional generative adversarial networks | Jawaria Maqbool et.al. | 2310.17889 | null |
2023-10-24 | Bayesian imaging inverse problem with SA-Roundtrip prior via HMC-pCN sampler | Jiayu Qian et.al. | 2310.17817 | link |
2023-10-26 | Counterfactual Fairness for Predictions using Generative Adversarial Networks | Yuchen Ma et.al. | 2310.17687 | null |
2023-10-26 | Three-dimensional Bone Image Synthesis with Generative Adversarial Networks | Christoph Angermann et.al. | 2310.17216 | null |
2023-10-26 | Content-based Controls For Music Large Language Modeling | Liwei Lin et.al. | 2310.17162 | null |
2023-10-26 | Single channel speech enhancement by colored spectrograms | Sania Gul et.al. | 2310.17142 | null |
2023-10-26 | Neural style transfer of weak lensing mass maps | Masato Shirasaki et.al. | 2310.17141 | null |
2023-10-26 | Detecting stealthy cyberattacks on adaptive cruise control vehicles: A machine learning approach | Tianyi Li et.al. | 2310.17091 | null |
2023-10-25 | Using Diffusion Models to Generate Synthetic Labelled Data for Medical Image Segmentation | Daniel Saragih et.al. | 2310.16794 | link |
2023-10-25 | Interferometric Neural Networks | Arun Sehrawat et.al. | 2310.16742 | link |
2023-10-25 | GADY: Unsupervised Anomaly Detection on Dynamic Graphs | Shiqi Lou et.al. | 2310.16376 | null |
2023-10-25 | AccoMontage-3: Full-Band Accompaniment Arrangement via Sequential Style Transfer and Multi-Track Function Prior | Jingwei Zhao et.al. | 2310.16334 | link |
2023-10-24 | Nighttime Thermal Infrared Image Colorization with Feedback-based Object Appearance Learning | Fu-Ya Luo et.al. | 2310.15688 | link |
2023-10-24 | Region-controlled Style Transfer | Junjie Kang et.al. | 2310.15658 | null |
2023-10-24 | Mean Teacher DETR with Masked Feature Alignment: A Robust Domain Adaptive Detection Transformer Framework | Weixi Weng et.al. | 2310.15646 | null |
2023-10-24 | PET Synthesis via Self-supervised Adaptive Residual Estimation Generative Adversarial Network | Yuxin Xue et.al. | 2310.15550 | null |
2023-10-23 | Error analysis of generative adversarial network | Mahmud Hasan et.al. | 2310.15387 | null |
2023-10-23 | Prefix-Tuning Based Unsupervised Text Style Transfer | Huiyu Mai et.al. | 2310.14599 | null |
2023-10-23 | Diversify Question Generation with Retrieval-Augmented Style Transfer | Qi Gou et.al. | 2310.14503 | link |
2023-10-23 | Text Fact Transfer | Nishant Balepur et.al. | 2310.14486 | link |
2023-10-22 | Diffusion-Model-Assisted Supervised Learning of Generative Models for Density Estimation | Yanfang Liu et.al. | 2310.14458 | null |
2023-10-21 | Fast Diffusion GAN Model for Symbolic Music Generation Controlled by Emotions | Jincheng Zhang et.al. | 2310.14040 | null |
2023-10-24 | Boosting Generalization with Adaptive Style Techniques for Fingerprint Liveness Detection | Kexin Zhu et.al. | 2310.13573 | null |
2023-10-20 | Stable Nonconvex-Nonconcave Training via Linear Interpolation | Thomas Pethick et.al. | 2310.13459 | null |
2023-10-19 | A Distributed Approach to Meteorological Predictions: Addressing Data Imbalance in Precipitation Prediction Models through Federated Learning and GANs | Elaheh Jafarigol et.al. | 2310.13161 | null |
2023-10-19 | Audio Editing with Non-Rigid Text Prompts | Francesco Paissan et.al. | 2310.12858 | null |
2023-10-18 | Improving SCGAN's Similarity Constraint and Learning a Better Disentangled Representation | Iman Yazdanpanah et.al. | 2310.12262 | null |
2023-10-18 | Black-Box Training Data Identification in GANs via Detector Networks | Lukman Olagoke et.al. | 2310.12063 | null |
2023-10-18 | On the Evaluation of Generative Models in Distributed Learning Tasks | Zixiao Wang et.al. | 2310.11714 | null |
2023-10-18 | Deep learning based on Transformer architecture for power system short-term voltage stability assessment with class imbalance | Yang Li et.al. | 2310.11690 | null |
2023-10-17 | A High Fidelity and Low Complexity Neural Audio Coding | Wenzhe Liu et.al. | 2310.10992 | null |
2023-10-16 | DynVideo-E: Harnessing Dynamic NeRF for Large-Scale Motion- and View-Change Human-Centric Video Editing | Jia-Wei Liu et.al. | 2310.10624 | null |
2023-10-16 | Style transfer between Microscopy and Magnetic Resonance Imaging via Generative Adversarial Network in small sample size settings | Monika Pytlarz et.al. | 2310.10414 | null |
2023-10-16 | Impact of Data Synthesis Strategies for the Classification of Craniosynostosis | Matthias Schaufelberger et.al. | 2310.10199 | link |
2023-10-16 | Outlier Detection Using Generative Models with Theoretical Performance Guarantees | Jirong Yi et.al. | 2310.09999 | null |
2023-10-17 | Chinese Painting Style Transfer Using Deep Generative Models | Weijian Ma et.al. | 2310.09978 | link |
2023-10-15 | LOVECon: Text-driven Training-Free Long Video Editing with ControlNet | Zhenyi Liao et.al. | 2310.09711 | link |
2023-10-13 | Efficient Apple Maturity and Damage Assessment: A Lightweight Detection Model with GAN and Attention Mechanism | Yufei Liu et.al. | 2310.09347 | null |
2023-10-13 | Using cGANs for Anomaly Detection: Identifying Astronomical Anomalies in JWST NIRcam Imaging | Ruby Pearce-Casey et.al. | 2310.09073 | link |
2023-10-13 | Generative AI-driven Semantic Communication Framework for NextG Wireless Network | Avi Deb Raha et.al. | 2310.09021 | null |
2023-10-13 | A Framework for Few-Shot Policy Transfer through Observation Mapping and Behavior Cloning | Yash Shukla et.al. | 2310.08836 | link |
2023-10-12 | A Benchmarking Protocol for SAR Colorization: From Regression to Deep Learning Approaches | Kangqing Shen et.al. | 2310.08705 | null |
2023-10-13 | Worst-Case Morphs using Wasserstein ALI and Improved MIPGAN | Una M. Kelly et.al. | 2310.08371 | null |
2023-10-12 | CleftGAN: Adapting A Style-Based Generative Adversarial Network To Create Images Depicting Cleft Lip Deformity | Abdullah Hayajneh et.al. | 2310.07969 | null |
2023-10-18 | Revolutionising inverse design of magnesium alloys through generative adversarial networks | Marzie Ghorbani et.al. | 2310.07836 | null |
2023-10-11 | Does resistance to Style-Transfer equal Shape Bias? Evaluating Shape Bias by Distorted Shape | Ziqi Wen et.al. | 2310.07555 | null |
2023-10-14 | Synthesizing Missing MRI Sequences from Available Modalities using Generative Adversarial Networks in BraTS Dataset | Ibrahim Ethem Hamamci et.al. | 2310.07250 | null |
2023-10-12 | Vec-Tok Speech: speech vectorization and tokenization for neural speech generation | Xinfa Zhu et.al. | 2310.07246 | link |
2023-10-11 | Crowd Counting in Harsh Weather using Image Denoising with Pix2Pix GANs | Muhammad Asif Khan et.al. | 2310.07245 | null |
2023-10-10 | Stochastic Super-resolution of Cosmological Simulations with Denoising Diffusion Models | Andreas Schanz et.al. | 2310.06929 | null |
2023-10-10 | SC2GAN: Rethinking Entanglement by Self-correcting Correlated GAN Space | Zikun Chen et.al. | 2310.06667 | null |
2023-10-10 | Data-level hybrid strategy selection for disk fault prediction model based on multivariate GAN | Shuangshuang Yuan et.al. | 2310.06537 | null |
2023-10-10 | An improved CTGAN for data processing method of imbalanced disk failure | Jingbo Jia et.al. | 2310.06481 | null |
2023-10-10 | Adversarial Masked Image Inpainting for Robust Detection of Mpox and Non-Mpox | Yubiao Yue et.al. | 2310.06318 | null |
2023-10-09 | Latent Diffusion Model for DNA Sequence Generation | Zehui Li et.al. | 2310.06150 | null |
2023-10-09 | Generative ensemble deep learning severe weather prediction from a deterministic convection-allowing model | Yingkai Sha et.al. | 2310.06045 | link |
2023-10-07 | WAIT: Feature Warping for Animation to Illustration video Translation using GANs | Samet Hicsonmez et.al. | 2310.04901 | link |
2023-10-07 | Towards Dynamic and Small Objects Refinement for Unsupervised Domain Adaptative Nighttime Semantic Segmentation | Jingyi Pan et.al. | 2310.04747 | null |
2023-10-07 | A dimension-reduced variational approach for solving physics-based inverse problems using generative adversarial network priors and normalizing flows | Agnimitra Dasgupta et.al. | 2310.04690 | null |
2023-10-07 | X-Transfer: A Transfer Learning-Based Framework for Robust GAN-Generated Fake Image Detection | Lei Zhang et.al. | 2310.04639 | null |
2023-10-06 | FluxGAN: A Physics-Aware Generative Adversarial Network Model for Generating Microstructures That Maintain Target Heat Flux | Artem K. Pimachev et.al. | 2310.04622 | null |
2023-10-06 | VTON-IT: Virtual Try-On using Image Translation | Santosh Adhikari et.al. | 2310.04558 | link |
2023-10-06 | A Deeply Supervised Semantic Segmentation Method Based on GAN | Wei Zhao et.al. | 2310.04081 | null |
2023-10-06 | CineTransfer: Controlling a Robot to Imitate Cinematographic Style from a Single Example | Pablo Pueyo et.al. | 2310.03953 | null |
2023-10-05 | Automatic and Human-AI Interactive Text Generation | Yao Dou et.al. | 2310.03878 | null |
2023-10-05 | Multimarginal generative modeling with stochastic interpolants | Michael S. Albergo et.al. | 2310.03695 | null |
2023-10-04 | Boosting Dermatoscopic Lesion Segmentation via Diffusion Models with Visual and Textual Prompts | Shiyi Du et.al. | 2310.02906 | null |
2023-10-04 | Generative Modeling of Regular and Irregular Time Series Data via Koopman VAEs | Ilan Naiman et.al. | 2310.02619 | null |
2023-10-03 | Learnable Data Augmentation for One-Shot Unsupervised Domain Adaptation | Julio Ivan Davila Carrazco et.al. | 2310.02201 | null |
2023-10-03 | Improving style transfer in dynamic contrast enhanced MRI using a spatio-temporal approach | Adam G. Tattersall et.al. | 2310.01908 | null |
2023-10-03 | A Dual Attentive Generative Adversarial Network for Remote Sensing Image Change Detection | Luyi Qiu et.al. | 2310.01876 | null |
2023-10-03 | Benchmarking and Improving Generator-Validator Consistency of Language Models | Xiang Lisa Li et.al. | 2310.01846 | null |
2023-10-02 | Home Electricity Data Generator (HEDGE): An open-access tool for the generation of electric vehicle, residential demand, and PV generation profiles | Flora Charbonnier et.al. | 2310.01661 | null |
2023-10-02 | Color and Texture Dual Pipeline Lightweight Style Transfer | ShiQi Jiang et.al. | 2310.01321 | null |
2023-10-02 | Generating 3D Brain Tumor Regions in MRI using Vector-Quantization Generative Adversarial Networks | Meng Zhou et.al. | 2310.01251 | null |
2023-10-02 | Ground-A-Video: Zero-shot Grounded Video Editing using Text-to-image Diffusion Models | Hyeonho Jeong et.al. | 2310.01107 | link |
2023-10-02 | Practical Radar Sensing Using Two Stage Neural Network for Denoising OTFS Signals | Ashok S Kumar et.al. | 2310.00897 | null |
2023-10-02 | Subsurface Characterization using Ensemble-based Approaches with Deep Generative Models | Jichao Bao et.al. | 2310.00839 | link |
2023-10-01 | Counterfactual Image Generation for adversarially robust and interpretable Classifiers | Rafael Bischof et.al. | 2310.00761 | null |
2023-10-01 | Quantum generative adversarial learning in photonics | Yizhi Wang et.al. | 2310.00585 | null |
2023-09-30 | The objective function equality property of infoGAN for two-layer network | Mahmud Hasan et.al. | 2310.00443 | null |
2023-09-30 | Controlling Neural Style Transfer with Deep Reinforcement Learning | Chengming Feng et.al. | 2310.00405 | null |
2023-10-04 | Structural Adversarial Objectives for Self-Supervised Representation Learning | Xiao Zhang et.al. | 2310.00357 | null |
2023-09-30 | Anomaly Detection in Power Generation Plants with Generative Adversarial Networks | Marcellin Atemkeng et.al. | 2310.00335 | null |
2023-09-30 | An easy zero-shot learning combination: Texture Sensitive Semantic Segmentation IceHrNet and Advanced Style Transfer Learning Strategy | Zhiyong Yang et.al. | 2310.00310 | link |
2023-09-30 | A hybrid quantum-classical conditional generative adversarial network algorithm for human-centered paradigm in cloud | Wenjie Liu et.al. | 2310.00246 | null |
2023-09-30 | Finding Pragmatic Differences Between Disciplines | Lee Kezar et.al. | 2310.00204 | null |
2023-09-29 | Unpaired Optical Coherence Tomography Angiography Image Super-Resolution via Frequency-Aware Inverse-Consistency GAN | Weiwen Zhang et.al. | 2309.17269 | null |
2023-09-29 | Style Transfer for Non-differentiable Audio Effects | Kieran Grant et.al. | 2309.17125 | null |
2023-09-29 | ACGAN-GNNExplainer: Auxiliary Conditional Generative Explainer for Graph Neural Networks | Yiqiao Li et.al. | 2309.16918 | null |
2023-09-28 | Cross-Modal Transformer GAN: Brain Structural-Functional Deep Fusing Network for Alzheimer's Disease Analysis | Qiankun Zuo et.al. | 2309.16206 | null |
2023-09-28 | DiffGAN-F2S: Symmetric and Efficient Denoising Diffusion GANs for Structural Connectivity Prediction from Brain fMRI | Qiankun Zuo et.al. | 2309.16205 | null |
2023-09-28 | Learning Effective NeRFs and SDFs Representations with 3D Generative Adversarial Networks for 3D Object Generation: Technical Report for ICCV 2023 OmniObject3D Challenge | Zheyuan Yang et.al. | 2309.16110 | null |
2023-09-27 | Synthetic Latent Fingerprint Generation Using Style Transfer | Amol S. Joshi et.al. | 2309.15734 | null |
2023-09-27 | Style Transfer and Self-Supervised Learning Powered Myocardium Infarction Super-Resolution Segmentation | Lichao Wang et.al. | 2309.15485 | null |
2023-09-26 | Locality-preserving Directions for Interpreting the Latent Space of Satellite Image GANs | Georgia Kourmouli et.al. | 2309.14883 | null |
2023-09-25 | Identity-preserving Editing of Multiple Facial Attributes by Learning Global Edit Directions and Local Adjustments | Najmeh Mohammadbagheri et.al. | 2309.14267 | null |
2023-09-25 | Informative Data Mining for One-Shot Cross-Domain Semantic Segmentation | Yuxi Wang et.al. | 2309.14241 | null |
2023-09-25 | Adapt then Unlearn: Exploiting Parameter Space Semantics for Unlearning in Generative Adversarial Networks | Piyush Tiwary et.al. | 2309.14054 | null |
2023-09-25 | In-Domain GAN Inversion for Faithful Reconstruction and Editability | Jiapeng Zhu et.al. | 2309.13956 | null |
2023-09-25 | On the Effectiveness of Adversarial Samples against Ensemble Learning-based Windows PE Malware Detectors | Trong-Nghia To et.al. | 2309.13841 | null |
2023-10-02 | Backorder Prediction in Inventory Management: Classification Techniques and Cost Considerations | Sarit Maitra et.al. | 2309.13837 | null |
2023-09-24 | MOSAIC: Multi-Object Segmented Arbitrary Stylization Using CLIP | Prajwal Ganugula et.al. | 2309.13716 | null |
2023-09-24 | MM-NeRF: Multimodal-Guided 3D Multi-Style Transfer of Neural Radiance Field | Zijiang Yang et.al. | 2309.13607 | null |
2023-09-24 | Solving Low-Dose CT Reconstruction via GAN with Local Coherence | Wenjie Liu et.al. | 2309.13584 | null |
2023-09-23 | Portrait Stylization: Artistic Style Transfer with Auxiliary Networks for Human Face Stylization | Thiago Ambiel et.al. | 2309.13492 | link |
2023-09-22 | MISFIT-V: Misaligned Image Synthesis and Fusion using Information from Thermal and Visual | Aadhar Chauhan et.al. | 2309.13216 | link |
2023-09-22 | Inter-vendor harmonization of Computed Tomography (CT) reconstruction kernels using unpaired image translation | Aravind R. Krishnan et.al. | 2309.12953 | null |
2023-09-22 | From Tight Gradient Bounds for Parameterized Quantum Circuits to the Absence of Barren Plateaus in QGANs | Alistair Letcher et.al. | 2309.12681 | null |
2023-09-21 | License Plate Super-Resolution Using Diffusion Models | Sawsan AlHalawani et.al. | 2309.12506 | null |
2023-09-21 | Adaptive Input-image Normalization for Solving Mode Collapse Problem in GAN-based X-ray Images | Muhammad Muneeb Saad et.al. | 2309.12245 | null |
2023-09-21 | Optimizing the Wasserstein GAN for TeV Gamma Ray Detection with VERITAS | Deivid Ribeiro et.al. | 2309.12221 | null |
2023-09-21 | A Discourse-level Multi-scale Prosodic Model for Fine-grained Emotion Analysis | Xianhao Wei et.al. | 2309.11849 | null |
2023-09-21 | Quasi-Monte Carlo for 3D Sliced Wasserstein | Khai Nguyen et.al. | 2309.11713 | null |
2023-09-20 | Interactive Flexible Style Transfer for Vector Graphics | Jeremy Warner et.al. | 2309.11628 | null |
2023-09-24 | Latent Diffusion Models for Structural Component Design | Ethan Herron et.al. | 2309.11601 | null |
2023-09-20 | Long-tail Augmented Graph Contrastive Learning for Recommendation | Qian Zhao et.al. | 2309.11177 | link |
2023-09-19 | Specializing Small Language Models towards Complex Style Transfer via Latent Attribute Pre-Training | Ruiqi Xu et.al. | 2309.10929 | link |
2023-09-19 | Assessing the capacity of a denoising diffusion probabilistic model to reproduce spatial context | Rucha Deshpande et.al. | 2309.10817 | null |
2023-09-19 | Locally Stylized Neural Radiance Fields | Hong-Wing Pang et.al. | 2309.10684 | null |
2023-09-19 | Retinex-guided Channel-grouping based Patch Swap for Arbitrary Style Transfer | Chang Liu et.al. | 2309.10528 | null |
2023-09-21 | Learning End-to-End Channel Coding with Diffusion Models | Muah Kim et.al. | 2309.10505 | null |
2023-09-19 | Augmenting Tactile Simulators with Real-like and Zero-Shot Capabilities | Osher Azulay et.al. | 2309.10409 | link |
2023-09-18 | Machine Learning for enhancing Wind Field Resolution in Complex Terrain | Jacob Wulff Wold et.al. | 2309.10172 | link |
2023-09-18 | Offline Detection of Misspelled Handwritten Words by Convolving Recognition Model Features with Text Labels | Andrey Totev et.al. | 2309.10158 | null |
2023-09-18 | Instant Photorealistic Style Transfer: A Lightweight and Adaptive Approach | Rong Liu et.al. | 2309.10011 | null |
2023-09-18 | Quantum Wasserstein GANs for State Preparation at Unseen Points of a Phase Diagram | Wiktor Jurasz et.al. | 2309.09543 | null |
2023-09-17 | Speech-Gesture GAN: Gesture Generation for Robots and Embodied Agents | Carson Yu Liu et.al. | 2309.09346 | null |
2023-09-17 | UGC: Unified GAN Compression for Efficient Image-to-Image Translation | Yuxi Ren et.al. | 2309.09310 | null |
2023-09-16 | Music Generation based on Generative Adversarial Networks with Transformer | Ziyi Jiang et.al. | 2309.09075 | null |
2023-09-16 | In-Style: Bridging Text and Uncurated Videos with Style Transfer for Text-Video Retrieval | Nina Shvetsova et.al. | 2309.08928 | link |
2023-09-16 | Bidirectional Graph GAN: Representing Brain Structure-Function Connections for Alzheimer's Disease | Shuqiang Wang et.al. | 2309.08916 | null |
2023-09-16 | Enhancing Visual Perception in Novel Environments via Incremental Data Augmentation Based on Style Transfer | Abhibha Gupta et.al. | 2309.08851 | link |
2023-09-15 | Quantifying Credit Portfolio sensitivity to asset correlations with interpretable generative neural networks | Sergio Caprioli et.al. | 2309.08652 | null |
2023-09-15 | ICLEF: In-Context Learning with Expert Feedback for Explainable Style Transfer | Arkadiy Saakyan et.al. | 2309.08583 | link |
2023-09-15 | Classical shadows meet quantum optimal mass transport | Giacomo De Palma et.al. | 2309.08426 | null |
2023-09-15 | Cross-Modal Synthesis of Structural MRI and Functional Connectivity Networks via Conditional ViT-GANs | Yuda Bi et.al. | 2309.08160 | null |
2023-09-15 | Increasing diversity of omni-directional images generated from single image using cGAN based on MLPMixer | Atsuya Nakata et.al. | 2309.08129 | link |
2023-09-14 | An Automated Machine Learning Approach for Detecting Anomalous Peak Patterns in Time Series Data from a Research Watershed in the Northeastern United States Critical Zone | Ijaz Ul Haq et.al. | 2309.07992 | null |
2023-09-14 | M3Dsynth: A dataset of medical 3D images with AI-generated local manipulations | Giada Zingarini et.al. | 2309.07973 | null |
2023-09-14 | SnakeGAN: A Universal Vocoder Leveraging DDSP Prior Knowledge and Periodic Inductive Bias | Sipan Li et.al. | 2309.07803 | null |
2023-09-14 | Market-GAN: Adding Control to Financial Market Data Generation with Semantic Context | Haochong Xia et.al. | 2309.07708 | null |
2023-09-14 | StarGAN-VC++: Towards Emotion Preserving Voice Conversion Using Deep Embeddings | Arnab Das et.al. | 2309.07592 | link |
2023-09-14 | Speech-to-Speech Translation with Discrete-Unit-Based Style Transfer | Yongqi Wang et.al. | 2309.07566 | null |
2023-09-13 | GAN-based Algorithm for Efficient Image Inpainting | Zhengyang Han et.al. | 2309.07293 | null |
2023-09-13 | DreamStyler: Paint by Style Inversion with Text-to-Image Diffusion Models | Namhyuk Ahn et.al. | 2309.06933 | null |
2023-09-13 | Integrating GAN and Texture Synthesis for Enhanced Road Damage Detection | Tengyang Chen et.al. | 2309.06747 | null |
2023-09-12 | CaloShowerGAN, a Generative Adversarial Networks model for fast calorimeter shower simulation | Michele Faucci Giannelli et.al. | 2309.06515 | null |
2023-09-17 | AGMDT: Virtual Staining of Renal Histology Images with Adjacency-Guided Multi-Domain Transfer | Tao Ma et.al. | 2309.06421 | null |
2023-09-12 | TSSAT: Two-Stage Statistics-Aware Transformation for Artistic Style Transfer | Haibo Chen et.al. | 2309.06004 | null |
2023-09-11 | Divergences in Color Perception between Deep Neural Networks and Humans | Ethan O. Nadler et.al. | 2309.05809 | null |
2023-09-11 | ViHOPE: Visuotactile In-Hand Object 6D Pose Estimation with Shape Completion | Hongyu Li et.al. | 2309.05662 | null |
2023-09-11 | PAI-Diffusion: Constructing and Serving a Family of Open Chinese Diffusion Models for Text-to-image Synthesis on the Cloud | Chengyu Wang et.al. | 2309.05534 | null |
2023-09-10 | SdCT-GAN: Reconstructing CT from Biplanar X-Rays with Self-driven Generative Adversarial Networks | Shuangqin Cheng et.al. | 2309.04960 | link |
2023-09-10 | Text-driven Editing of 3D Scenes without Retraining | Shuangkang Fang et.al. | 2309.04917 | null |
2023-09-10 | Effective Real Image Editing with Accelerated Iterative Diffusion Inversion | Zhihong Pan et.al. | 2309.04907 | null |
2023-09-09 | VeRi3D: Generative Vertex-based Radiance Fields for 3D Controllable Human Image Synthesis | Xinya Chen et.al. | 2309.04800 | null |
2023-09-09 | TCGAN: Convolutional Generative Adversarial Network for Time Series Classification and Clustering | Fanling Huang et.al. | 2309.04732 | link |
2023-09-08 | Style Generation: Image Synthesis based on Coarsely Matched Texts | Mengyao Cui et.al. | 2309.04608 | null |
2023-09-08 | Robot Localization and Mapping Final Report -- Sequential Adversarial Learning for Self-Supervised Deep Visual Odometry | Akankshya Kar et.al. | 2309.04147 | null |
2023-09-08 | Design of multifunctional color routers with Kerker switching using generative adversarial networks | Jiahao Yan et.al. | 2309.04104 | null |
2023-09-07 | Exploring Sparse MoE in GANs for Text-conditioned Image Synthesis | Jiapeng Zhu et.al. | 2309.03904 | link |
2023-09-07 | Stroke-based Neural Painting and Stylization with Dynamically Predicted Painting Region | Teng Hu et.al. | 2309.03504 | link |
2023-09-06 | Hierarchical-level rain image generative model based on GAN | Zhenyuan Liu et.al. | 2309.02964 | null |
2023-09-06 | BigVSAN: Enhancing GAN-based Neural Vocoders with Slicing Adversarial Network | Takashi Shibuya et.al. | 2309.02836 | link |
2023-09-05 | Generative Algorithms for Fusion of Physics-Based Wildfire Spread Models with Satellite Data for Initializing Wildfire Forecasts | Bryan Shaddy et.al. | 2309.02615 | null |
2023-09-05 | Utilizing Generative Adversarial Networks for Stable Structure Generation in Angry Birds | Frederic Abraham et.al. | 2309.02614 | link |
2023-09-05 | Generating Infinite-Resolution Texture using GANs with Patch-by-Patch Paradigm | Alhasan Abdellatif et.al. | 2309.02340 | link |
2023-09-04 | ATMS: Algorithmic Trading-Guided Market Simulation | Song Wei et.al. | 2309.01784 | null |
2023-09-07 | Generative-based Fusion Mechanism for Multi-Modal Tracking | Zhangyong Tang et.al. | 2309.01728 | link |
2023-09-04 | Toward Defensive Letter Design | Rentaro Kataoka et.al. | 2309.01452 | null |
2023-09-04 | Metric Learning for Projections Bias of Generalized Zero-shot Learning | Chong Zhang et.al. | 2309.01390 | null |
2023-09-04 | Mutual Information Maximizing Quantum Generative Adversarial Network and Its Applications in Finance | Mingyu Lee et.al. | 2309.01363 | null |
2023-09-03 | Large AI Model Empowered Multimodal Semantic Communications | Feibo Jiang et.al. | 2309.01249 | null |
2023-09-03 | MSM-VC: High-fidelity Source Style Transfer for Non-Parallel Voice Conversion by Multi-scale Style Modeling | Zhichao Wang et.al. | 2309.01142 | null |
2023-09-02 | Few shot font generation via transferring similarity guided global style and quantization local style | Wei Pan et.al. | 2309.00827 | link |
2023-09-01 | Data-driven Topology Optimization of Channel Flow Problems | Ce Guan et.al. | 2309.00278 | null |
2023-09-01 | Diffusion Model with Clustering-based Conditioning for Food Image Generation | Yue Han et.al. | 2309.00199 | null |
2023-08-31 | Segmentação e contagem de troncos de madeira utilizando deep learning e processamento de imagens (Segmentation and counting of wood logs using deep learning and image processing) | João V. C. Mazzochin et.al. | 2309.00123 | null |
2023-08-31 | StyleInV: A Temporal Style Modulated Inversion Network for Unconditional Video Generation | Yuhan Wang et.al. | 2308.16909 | link |
2023-08-31 | Terrain Diffusion Network: Climatic-Aware Terrain Generation with Geological Sketch Guidance | Zexin Hu et.al. | 2308.16725 | null |
2023-08-31 | Unsupervised Text Style Transfer with Deep Generative Models | Zhongtao Jiang et.al. | 2308.16584 | null |
2023-08-31 | Robust GAN inversion | Egor Sevriugov et.al. | 2308.16510 | null |
2023-08-30 | Ten Years of Generative Adversarial Nets (GANs): A survey of the state-of-the-art | Tanujit Chakraborty et.al. | 2308.16316 | null |
2023-08-30 | Semantic Image Synthesis via Class-Adaptive Cross-Attention | Tomaso Fontanini et.al. | 2308.16071 | null |
2023-08-31 | Influence of adversarial training on super-resolution turbulence models | Ludovico Nista et.al. | 2308.16015 | null |
2023-08-30 | Fully Embedded Time-Series Generative Adversarial Networks | Joe Beck et.al. | 2308.15730 | null |
2023-08-29 | Unveiling Camouflage: A Learnable Fourier-based Augmentation for Camouflaged Object Detection and Instance Segmentation | Minh-Quan Le et.al. | 2308.15660 | null |
2023-08-29 | Adversarial Style Transfer for Robust Policy Optimization in Deep Reinforcement Learning | Md Masudur Rahman et.al. | 2308.15550 | null |
2023-08-29 | On the Steganographic Capacity of Selected Learning Models | Rishit Agrawal et.al. | 2308.15502 | null |
2023-08-29 | Learning Modulated Transformation in GANs | Ceyuan Yang et.al. | 2308.15472 | link |
2023-09-04 | ParaGuide: Guided Diffusion Paraphrasers for Plug-and-Play Textual Style Transfer | Zachary Horvitz et.al. | 2308.15459 | link |
2023-08-29 | WSAM: Visual Explanations from Style Augmentation as Adversarial Attacker and Their Influence in Image Classification | Felipe Moreno-Vera et.al. | 2308.14995 | link |
2023-08-28 | Generating tabular datasets under differential privacy | Gianluca Truda et.al. | 2308.14784 | null |
2023-08-28 | Voice Conversion with Denoising Diffusion Probabilistic GAN Models | Xulong Zhang et.al. | 2308.14319 | null |
2023-08-27 | Unaligned 2D to 3D Translation with Conditional Vector-Quantized Code Diffusion using Transformers | Abril Corona-Figueroa et.al. | 2308.14152 | null |
2023-08-29 | Bi-Modality Medical Image Synthesis Using Semi-Supervised Sequential Generative Adversarial Networks | Xin Yang et.al. | 2308.14066 | null |
2023-08-27 | A Bayesian Non-parametric Approach to Generative Models: Integrating Variational Autoencoder and Generative Adversarial Networks using Wasserstein and Maximum Mean Discrepancy | Forough Fazeli-Asl et.al. | 2308.14048 | null |
2023-08-25 | Text Style Transfer Evaluation Using Large Language Models | Phil Ostheimer et.al. | 2308.13577 | null |
2023-08-23 | A Systematic Study on Quantifying Bias in GAN-Augmented Data | Denis Liu et.al. | 2308.13554 | null |
2023-08-21 | Feature Extraction Using Deep Generative Models for Bangla Text Classification on a New Comprehensive Dataset | Md. Rafi-Ur-Rashid et.al. | 2308.13545 | null |
2023-08-25 | Resolution-independent generative models based on operator learning for physics-constrained Bayesian inverse problems | Xinchao Jiang et.al. | 2308.13295 | null |
2023-08-25 | Unpaired Multi-domain Attribute Translation of 3D Facial Shapes with a Square and Symmetric Geometric Map | Zhenfeng Fan et.al. | 2308.13245 | link |
2023-08-25 | Self-supervised Scene Text Segmentation with Object-centric Layered Representations Augmented by Text Regions | Yibo Wang et.al. | 2308.13178 | null |
2023-08-27 | FFEINR: Flow Feature-Enhanced Implicit Neural Representation for Spatio-temporal Super-Resolution | Chenyue Jiao et.al. | 2308.12508 | null |
2023-08-23 | PFL-GAN: When Client Heterogeneity Meets Generative Models in Personalized Federated Learning | Achintha Wijesinghe et.al. | 2308.12454 | null |
2023-08-23 | ARF-Plus: Controlling Perceptual Factors in Artistic Radiance Fields for 3D Scene Stylization | Wenzhao Li et.al. | 2308.12452 | null |
2023-08-23 | TAI-GAN: Temporally and Anatomically Informed GAN for early-to-late frame conversion in dynamic cardiac PET motion correction | Xueqi Guo et.al. | 2308.12443 | link |
2023-08-23 | Quantum-Noise-driven Generative Diffusion Models | Marco Parigi et.al. | 2308.12013 | null |
2023-08-23 | CoC-GAN: Employing Context Cluster for Unveiling a New Pathway in Image Generation | Zihao Wang et.al. | 2308.11857 | null |
2023-08-24 | Can Authorship Representation Learning Capture Stylistic Features? | Andrew Wang et.al. | 2308.11490 | link |
2023-08-22 | Generating airshower images for the VERITAS telescopes with conditional Generative Adversarial Networks | J. Hoang et.al. | 2308.11431 | null |
2023-08-22 | MosaiQ: Quantum Generative Adversarial Networks for Image Generation on NISQ Computers | Daniel Silver et.al. | 2308.11096 | null |
2023-08-21 | PMVC: Data Augmentation-Based Prosody Modeling for Expressive Voice Conversion | Yimin Deng et.al. | 2308.11084 | null |
2023-08-21 | Harmonization Across Imaging Locations (HAIL): One-Shot Learning for Brain MRI | Abhijeet Parida et.al. | 2308.11047 | null |
2023-08-21 | MRI Field-transfer Reconstruction with Limited Data: Regularization by Neural Style Transfer | Guoyao Shen et.al. | 2308.10968 | null |
2023-08-21 | Color Prompting for Data-Free Continual Unsupervised Domain Adaptive Person Re-Identification | Jianyang Gu et.al. | 2308.10716 | link |
2023-08-21 | Improving the Transferability of Adversarial Examples with Arbitrary Style Transfer | Zhijin Ge et.al. | 2308.10601 | link |
2023-08-21 | RADIANCE: Radio-Frequency Adversarial Deep-learning Inference for Automated Network Coverage Estimation | Sopan Sarkar et.al. | 2308.10584 | null |
2023-08-20 | Turning Waste into Wealth: Leveraging Low-Quality Samples for Enhancing Continuous Conditional Generative Adversarial Networks | Xin Ding et.al. | 2308.10273 | null |
2023-08-20 | Contrastive Diffusion Model with Auxiliary Guidance for Coarse-to-Fine PET Reconstruction | Zeyu Han et.al. | 2308.10157 | link |
2023-08-19 | Deep Generative Modeling-based Data Augmentation with Demonstration using the BFBT Benchmark Void Fraction Datasets | Farah Alsafadi et.al. | 2308.10120 | null |
2023-08-19 | Controllable Multi-domain Semantic Artwork Synthesis | Yuantian Huang et.al. | 2308.10111 | null |
2023-08-19 | Physics-guided training of GAN to improve accuracy in airfoil design synthesis | Kazunari Wada et.al. | 2308.10038 | null |
2023-08-19 | EGANS: Evolutionary Generative Adversarial Network Search for Zero-Shot Learning | Shiming Chen et.al. | 2308.09915 | null |
2023-08-19 | Generative Adversarial Networks Unlearning | Hui Sun et.al. | 2308.09881 | null |
2023-08-18 | Data augmentation and explainability for bias discovery and mitigation in deep learning | Agnieszka Mikołajczyk-Bareła et.al. | 2308.09464 | null |
2023-08-18 | A review of technical factors to consider when designing neural networks for semantic segmentation of Earth Observation imagery | Sam Khallaghi et.al. | 2308.09221 | null |
2023-08-17 | Distributed Extra-gradient with Optimal Complexity and Communication Guarantees | Ali Ramezani-Kebrya et.al. | 2308.09187 | link |
2023-08-17 | Don't lose the message while paraphrasing: A study on content preserving style transfer | Nikolay Babakov et.al. | 2308.09055 | link |
2023-08-17 | SR-GAN for SR-gamma: photon super resolution at collider experiments | Johannes Erdmann et.al. | 2308.09025 | null |
2023-08-17 | A White-Box False Positive Adversarial Attack Method on Contrastive Loss-Based Offline Handwritten Signature Verification Models | Zhongliang Guo et.al. | 2308.08925 | null |
2023-08-17 | An Effective Deep Learning Based Multi-Class Classification of DoS and DDoS Attack Detection | Arun Kumar Silivery et.al. | 2308.08803 | null |
2023-08-16 | Fair GANs through model rebalancing with synthetic data | Anubhav Jain et.al. | 2308.08638 | null |
2023-08-15 | Implementing Quantum Generative Adversarial Network (qGAN) and QCBM in Finance | Santanu Ganguly et.al. | 2308.08448 | null |
2023-08-16 | Diff-CAPTCHA: An Image-based CAPTCHA with Security Enhanced by Denoising Diffusion Model | Ran Jiang et.al. | 2308.08367 | null |
2023-08-16 | Denoising Diffusion Probabilistic Model for Retinal Image Generation and Segmentation | Alnur Alimanov et.al. | 2308.08339 | link |
2023-08-15 | StyleDiffusion: Controllable Disentangled Style Transfer via Diffusion Models | Zhizhong Wang et.al. | 2308.07863 | null |
2023-08-16 | DiffGuard: Semantic Mismatch-Guided Out-of-Distribution Detection using Pre-trained Diffusion Models | Ruiyuan Gao et.al. | 2308.07687 | link |
2023-08-15 | Synthetic data generation method for hybrid image-tabular data using two generative adversarial networks | Tomohiro Kikuchi et.al. | 2308.07573 | null |
2023-08-14 | Jurassic World Remake: Bringing Ancient Fossils Back to Life via Zero-Shot Long Image-to-Image Translation | Alexander Martin et.al. | 2308.07316 | link |
2023-08-14 | A Unifying Generator Loss Function for Generative Adversarial Networks | Justin Veiner et.al. | 2308.07233 | null |
2023-08-14 | AdvCLIP: Downstream-agnostic Adversarial Examples in Multimodal Contrastive Learning | Ziqi Zhou et.al. | 2308.07026 | link |
2023-08-14 | Hierarchy Flow For High-Fidelity Image-to-Image Translation | Weichen Fan et.al. | 2308.06909 | link |
2023-08-13 | Unsupervised Image Denoising in Real-World Scenarios via Self-Collaboration Parallel Generative Adversarial Branches | Xin Lin et.al. | 2308.06776 | link |
2023-08-13 | Precipitation nowcasting with generative diffusion models | Andrea Asperti et.al. | 2308.06733 | null |
2023-08-13 | ALGAN: Time Series Anomaly Detection with Adjusted-LSTM GAN | Md Abul Bashar et.al. | 2308.06663 | null |
2023-08-12 | BigWavGAN: A Wave-To-Wave Generative Adversarial Network for Music Super-Resolution | Yenan Zhang et.al. | 2308.06483 | null |
2023-08-11 | A Review on Classification of White Blood Cells Using Machine Learning Models | Rabia Asghar et.al. | 2308.06296 | null |
2023-08-11 | Taming the Power of Diffusion Models for High-Quality Virtual Try-On with Appearance Flow | Junhong Gou et.al. | 2308.06101 | link |
2023-08-11 | Head Rotation in Denoising Diffusion Models | Andrea Asperti et.al. | 2308.06057 | link |
2023-08-10 | UFed-GAN: A Secure Federated Learning Framework with Constrained Computation and Unlabeled Data | Achintha Wijesinghe et.al. | 2308.05870 | null |
2023-08-09 | EEG-based Emotion Style Transfer Network for Cross-dataset Emotion Recognition | Yijin Zhou et.al. | 2308.05767 | null |
2023-08-10 | SAR Target Image Generation Method Using Azimuth-Controllable Generative Adversarial Network | Chenwei Wang et.al. | 2308.05489 | null |
2023-08-10 | Transforming Breast Cancer Diagnosis: Towards Real-Time Ultrasound to Mammogram Conversion for Cost-Effective Diagnosis | Sahar Almahfouz Nasser et.al. | 2308.05449 | null |
2023-08-09 | Vector quantization loss analysis in VQGANs: a single-GPU ablation study for image-to-image synthesis | Luv Verma et.al. | 2308.05242 | link |
2023-08-09 | Deep Generative Networks for Heterogeneous Augmentation of Cranial Defects | Kamil Kwarciak et.al. | 2308.04883 | null |
2023-08-11 | VAST: Vivify Your Talking Avatar via Zero-Shot Expressive Facial Style Transfer | Liyang Chen et.al. | 2308.04830 | null |
2023-08-09 | GIFD: A Generative Gradient Inversion Method with Feature Domain Optimization | Hao Fang et.al. | 2308.04699 | link |
2023-08-08 | Generating Modern Persian Carpet Map by Style-transfer | Dorsa Rahmatian et.al. | 2308.04529 | null |
2023-08-08 | Efficient option pricing with unary-based photonic computing chip and generative adversarial learning | Hui Zhang et.al. | 2308.04493 | null |
2023-08-08 | A Deep-Learning Method Using Auto-encoder and Generative Adversarial Network for Anomaly Detection on Ancient Stone Stele Surfaces | Yikun Liu et.al. | 2308.04426 | null |
2023-08-08 | DiffCR: A Fast Conditional Diffusion Framework for Cloud Removal from Optical Satellite Images | Xuechao Zou et.al. | 2308.04417 | null |
2023-08-08 | Learning Evaluation Models from Large Language Models for Sequence Generation | Chenglong Wang et.al. | 2308.04386 | null |
2023-08-08 | Domain Adaptive Person Search via GAN-based Scene Synthesis for Cross-scene Videos | Huibing Wang et.al. | 2308.04322 | link |
2023-08-08 | Vision-Based Autonomous Navigation for Unmanned Surface Vessel in Extreme Marine Conditions | Muhayyuddin Ahmed et.al. | 2308.04283 | link |
2023-08-07 | PMU measurements based short-term voltage stability assessment of power systems via deep transfer learning | Yang Li et.al. | 2308.03953 | null |
2023-08-07 | Deterministic Neural Illumination Mapping for Efficient Auto-White Balance Correction | Furkan Kınlı et.al. | 2308.03939 | link |
2023-08-05 | FAST: Font-Agnostic Scene Text Editing | Alloy Das et.al. | 2308.02905 | null |
2023-08-05 | Generative Adversarial Networks for Stain Normalisation in Histopathology | Jack Breen et.al. | 2308.02851 | null |
2023-08-08 | Generation of Realistic Synthetic Raw Radar Data for Automated Driving Applications using Generative Adversarial Networks | Eduardo C. Fidelis et.al. | 2308.02632 | null |
2023-08-01 | Learning to Generate Training Datasets for Robust Semantic Segmentation | Marwane Hariat et.al. | 2308.02535 | null |
2023-08-04 | Painterly Image Harmonization using Diffusion Model | Lingxiao Lu et.al. | 2308.02228 | link |
2023-08-03 | Deep Learning-based Prediction of Stress and Strain Maps in Arterial Walls for Improved Cardiovascular Risk Assessment | Yasin Shokrollahi et.al. | 2308.01771 | null |
2023-08-03 | Interleaving GANs with knowledge graphs to support design creativity for book covers | Alexandru Motogna et.al. | 2308.01626 | link |
2023-08-03 | Adversarial Training of Denoising Diffusion Model Using Dual Discriminators for High-Fidelity Multi-Speaker TTS | Myeongjin Ko et.al. | 2308.01573 | link |
2023-08-02 | Feature-aware conditional GAN for category text generation | Xinze Li et.al. | 2308.00939 | null |
2023-08-01 | Graph Contrastive Learning with Generative Adversarial Network | Cheng Wu et.al. | 2308.00535 | null |
2023-08-03 | A Deep Learning Approach for Virtual Contrast Enhancement in Contrast Enhanced Spectral Mammography | Aurora Rofena et.al. | 2308.00471 | null |
2023-08-01 | Generative adversarial networks with physical sound field priors | Xenofon Karakonstantis et.al. | 2308.00426 | link |
2023-08-01 | SkullGAN: Synthetic Skull CT Generation with Generative Adversarial Networks | Kasra Naftchi-Ardebili et.al. | 2308.00206 | link |
2023-07-31 | Controlling Geometric Abstraction and Texture for Artistic Images | Martin Büßemeyer et.al. | 2308.00148 | link |
2023-07-31 | A multiscale and multicriteria Generative Adversarial Network to synthesize 1-dimensional turbulent fields | Carlos Granero-Belinchon et.al. | 2307.16580 | null |
2023-07-31 | DiffProsody: Diffusion-based Latent Prosody Generation for Expressive Speech Synthesis with Prosody Conditional Adversarial Training | Hyung-Seok Oh et.al. | 2307.16549 | link |
2023-07-31 | Don't be so negative! Score-based Generative Modeling with Oracle-assisted Guidance | Saeid Naderiparizi et.al. | 2307.16463 | null |
2023-07-30 | Stylized Projected GAN: A Novel Architecture for Fast and Realistic Image Generation | Md Nurul Muttakin et.al. | 2307.16275 | null |
2023-07-30 | InfoStyler: Disentanglement Information Bottleneck for Artistic Style Transfer | Yueming Lyu et.al. | 2307.16227 | null |
2023-07-30 | HierVST: Hierarchical Adaptive Zero-shot Voice Style Transfer | Sang-Hoon Lee et.al. | 2307.16171 | null |
2023-07-30 | StylePrompter: All Styles Need Is Attention | Chenyi Zhuang et.al. | 2307.16151 | link |
2023-07-29 | What can Discriminator do? Towards Box-free Ownership Verification of Generative Adversarial Network | Ziheng Huang et.al. | 2307.15860 | null |
2023-07-28 | A Time-Frequency Generative Adversarial based method for Audio Packet Loss Concealment | Carlo Aironi et.al. | 2307.15611 | link |
2023-07-28 | Staging E-Commerce Products for Online Advertising using Retrieval Assisted Image Generation | Yueh-Ning Ku et.al. | 2307.15326 | null |
2023-07-28 | Learning with Constraint Learning: New Perspective, Solution Strategy and Various Applications | Risheng Liu et.al. | 2307.15257 | null |
2023-07-27 | Generative convective parametrization of dry atmospheric boundary layer | Florian Heyder et.al. | 2307.14857 | null |
2023-07-27 | Semantic Image Completion and Enhancement using GANs | Priyansh Saxena et.al. | 2307.14748 | null |
2023-07-27 | EqGAN: Feature Equalization Fusion for Few-shot Image Generation | Yingbo Zhou et.al. | 2307.14638 | null |
2023-07-26 | Controllable Generation of Dialogue Acts for Dialogue Systems via Few-Shot Response Generation and Ranking | Angela Ramirez et.al. | 2307.14440 | link |
2023-07-26 | Deepfake Image Generation for Improved Brain Tumor Segmentation | Roa'a Al-Emaryeen et.al. | 2307.14273 | null |
2023-07-26 | Artifact Restoration in Histology Images with Diffusion Probabilistic Models | Zhenqi He et.al. | 2307.14262 | link |
2023-07-27 | Creative Birds: Self-Supervised Single-View 3D Style Transfer | Renke Wang et.al. | 2307.14127 | link |
2023-07-26 | Deep learning-based radiointerferometric imaging with GAN-aided training | F. Geyer et.al. | 2307.14100 | null |
2023-07-26 | Controlling the Latent Space of GANs through Reinforcement Learning: A Case Study on Task-based Image-to-Image Translation | Mahyar Abbasian et.al. | 2307.13978 | null |
2023-07-25 | CosSIF: Cosine similarity-based image filtering to overcome low inter-class variation in synthetic medical image datasets | Mominul Islam et.al. | 2307.13842 | link |
2023-07-25 | The GANfather: Controllable generation of malicious activity to improve defence systems | Ricardo Ribeiro Pereira et.al. | 2307.13787 | null |
2023-07-25 | Personal Protective Equipment Detection in Extreme Construction Conditions | Yuexiong Ding et.al. | 2307.13654 | null |
2023-07-25 | Mitigating Cross-client GANs-based Attack in Federated Learning | Hong Huang et.al. | 2307.13314 | null |
2023-07-24 | Deep Learning Approaches for Data Augmentation in Medical Imaging: A Review | Aghiles Kebaili et.al. | 2307.13125 | null |
2023-07-24 | Volcanic ash delimitation using Artificial Intelligence based on Pix2Pix | Christian Carrillo et.al. | 2307.12970 | null |
2023-07-24 | TransFusion: Generating Long, High Fidelity Time Series using Diffusion Models with Transformers | Md Fahim Sikder et.al. | 2307.12667 | null |
2023-07-24 | De-confounding Representation Learning for Counterfactual Inference on Continuous Treatment via Generative Adversarial Network | Yonghe Zhao et.al. | 2307.12625 | null |
2023-07-24 | Attribute Regularized Soft Introspective VAE: Towards Cardiac Attribute Regularization Through MRI Domains | Maxime Di Folco et.al. | 2307.12618 | null |
2023-07-24 | AdvDiff: Generating Unrestricted Adversarial Examples using Diffusion Models | Xuelong Dai et.al. | 2307.12499 | null |
2023-07-22 | Security and Privacy Issues of Federated Learning | Jahid Hasan et.al. | 2307.12181 | null |
2023-07-22 | SCPAT-GAN: Structural Constrained and Pathology Aware Convolutional Transformer-GAN for Virtual Histology Staining of Human Coronary OCT images | Xueshen Li et.al. | 2307.12138 | null |
2023-07-22 | Synthesis of Batik Motifs using a Diffusion -- Generative Adversarial Network | One Octadion et.al. | 2307.12122 | link |
2023-07-20 | Adversarial Conversational Shaping for Intelligent Agents | Piotr Tarasiewicz et.al. | 2307.11785 | null |
2023-07-21 | CycleIK: Neuro-inspired Inverse Kinematics | Jan-Gerrit Habekost et.al. | 2307.11554 | null |
2023-07-21 | UWAT-GAN: Fundus Fluorescein Angiography Synthesis via Ultra-wide-angle Transformation Multi-scale GAN | Zhaojie Fang et.al. | 2307.11530 | link |
2023-07-21 | LatentAugment: Data Augmentation via Guided Manipulation of GAN's Latent Space | Lorenzo Tronchin et.al. | 2307.11375 | link |
2023-07-21 | ParGANDA: Making Synthetic Pedestrians A Reality For Object Detection | Daria Reshetova et.al. | 2307.11360 | null |
2023-07-21 | PI-VEGAN: Physics Informed Variational Embedding Generative Adversarial Networks for Stochastic Differential Equations | Ruisong Gao et.al. | 2307.11289 | null |
2023-07-20 | Joint one-sided synthetic unpaired image translation and segmentation for colorectal cancer prevention | Enric Moreu et.al. | 2307.11253 | null |
2023-07-20 | Frequency-aware optical coherence tomography image super-resolution via conditional generative adversarial neural network | Xueshen Li et.al. | 2307.11130 | null |
2023-07-20 | BlendFace: Re-designing Identity Encoders for Face-Swapping | Kaede Shiohara et.al. | 2307.10854 | link |
2023-07-20 | Enhancing Job Recommendation through LLM-based Generative Adversarial Networks | Yingpeng Du et.al. | 2307.10747 | null |
2023-07-19 | Make-A-Volume: Leveraging Latent Diffusion Models for Cross-Modality 3D Brain MRI Synthesis | Lingting Zhu et.al. | 2307.10094 | null |
2023-07-19 | Adversarial Likelihood Estimation with One-way Flows | Omri Ben-Dov et.al. | 2307.09882 | null |
2023-07-20 | AesPA-Net: Aesthetic Pattern-Aware Style Transfer Networks | Kibeom Hong et.al. | 2307.09724 | link |
2023-07-18 | Conditional 360-degree Image Synthesis for Immersive Indoor Scene Decoration | Ka Chun Shum et.al. | 2307.09621 | null |
2023-07-13 | Can Diffusion Model Conditionally Generate Astrophysical Images? | Xiaosheng Zhao et.al. | 2307.09568 | link |
2023-07-19 | A comparative analysis of SRGAN models | Fatemeh Rezapoor Nikroo et.al. | 2307.09456 | null |
2023-07-18 | SLMGAN: Exploiting Speech Language Model Representations for Unsupervised Zero-Shot Voice Conversion in GANs | Yinghao Aaron Li et.al. | 2307.09435 | null |
2023-07-18 | Face-PAST: Facial Pose Awareness and Style Transfer Networks | Sunder Ali Khowaja et.al. | 2307.09020 | null |
2023-07-17 | Harnessing the Power of AI based Image Generation Model DALLE 2 in Agricultural Settings | Ranjan Sapkota et.al. | 2307.08789 | null |
2023-07-17 | On the Fly Neural Style Smoothing for Risk-Averse Domain Generalization | Akshay Mehra et.al. | 2307.08551 | link |
2023-07-17 | Soft Curriculum for Learning Conditional GANs with Noisy-Labeled and Uncurated Unlabeled Data | Kai Katsumata et.al. | 2307.08319 | null |
2023-07-17 | Complexity Matters: Rethinking the Latent Space for Generative Modeling | Tianyang Hu et.al. | 2307.08283 | null |
2023-07-16 | Self-Attention Based Generative Adversarial Networks For Unsupervised Video Summarization | Maria Nektaria Minaidi et.al. | 2307.08145 | null |
2023-07-16 | Householder Projector for Unsupervised Latent Semantics Discovery | Yue Song et.al. | 2307.08012 | link |
2023-07-18 | Bidirectionally Deformable Motion Modulation For Video-based Human Pose Transfer | Wing-Yin Yu et.al. | 2307.07754 | link |
2023-07-14 | Generative adversarial networks for data-scarce spectral applications | Juan José García-Esteban et.al. | 2307.07454 | null |
2023-07-13 | Improving Nonalcoholic Fatty Liver Disease Classification Performance With Latent Diffusion Models | Romain Hardy et.al. | 2307.06507 | null |
2023-07-12 | Denoising Simulated Low-Field MRI (70mT) using Denoising Autoencoders (DAE) and Cycle-Consistent Generative Adversarial Networks (Cycle-GAN) | Fernando Vega et.al. | 2307.06338 | null |
2023-07-12 | Sem-CS: Semantic CLIPStyler for Text-Based Image Style Transfer | Chanda Grover Kamra et.al. | 2307.05934 | link |
2023-07-11 | Implicit regularisation in stochastic gradient descent: from single-objective to two-player games | Mihaela Rosca et.al. | 2307.05789 | null |
2023-07-11 | Line Art Colorization of Fakemon using Generative Adversarial Neural Networks | Erick Oliveira Rodrigues et.al. | 2307.05760 | null |
2023-07-11 | Image Reconstruction using Enhanced Vision Transformer | Nikhil Verma et.al. | 2307.05616 | null |
2023-07-10 | Substance or Style: What Does Your Image Embedding Know? | Cyrus Rashtchian et.al. | 2307.05610 | null |
2023-07-11 | Efficient 3D Articulated Human Generation with Layered Surface Volumes | Yinghao Xu et.al. | 2307.05462 | null |
2023-07-04 | SleepEGAN: A GAN-enhanced Ensemble Deep Learning Model for Imbalanced Classification of Sleep Stages | Xuewei Cheng et.al. | 2307.05362 | null |
2023-07-08 | A Physics-Informed Low-Shot Learning For sEMG-Based Estimation of Muscle Force and Joint Kinematics | Yue Shi et.al. | 2307.05361 | null |
2023-07-11 | Disentangled Contrastive Image Translation for Nighttime Surveillance | Guanzhou Lan et.al. | 2307.05038 | null |
2023-07-11 | Diffusion idea exploration for art generation | Nikhil Verma et.al. | 2307.04978 | null |
2023-07-11 | Multi-fidelity Emulator for Cosmological Large Scale 21 cm Lightcone Images: a Few-shot Transfer Learning Approach with GAN | Kangning Diao et.al. | 2307.04976 | link |
2023-07-10 | Toward a generative modeling analysis of CLAS exclusive | T. Alghamdi et.al. | 2307.04450 | null |
2023-07-13 | Seismic Data Interpolation based on Denoising Diffusion Implicit Models with Resampling | Xiaoli Wei et.al. | 2307.04226 | null |
2023-07-11 | DIFF-NST: Diffusion Interleaving For deFormable Neural Style Transfer | Dan Ruta et.al. | 2307.04157 | null |
2023-07-09 | A generative flow for conditional sampling via optimal transport | Jason Alfonso et.al. | 2307.04102 | link |
2023-07-07 | Synthesizing Forestry Images Conditioned on Plant Phenotype Using a Generative Adversarial Network | Debasmita Pal et.al. | 2307.03789 | null |
2023-07-06 | A Hybrid Quantum-Classical Generative Adversarial Network for Near-Term Quantum Processors | Albha O'Dwyer Boyle et.al. | 2307.03269 | null |
2023-07-06 | A Privacy-Preserving Walk in the Latent Space of Generative Models for Medical Applications | Matteo Pennisi et.al. | 2307.02984 | link |
2023-07-05 | DeSRA: Detect and Delete the Artifacts of GAN-based Real-World Super-Resolution Models | Liangbin Xie et.al. | 2307.02457 | null |
2023-07-05 | DiffFlow: A Unified SDE Framework for Score-Based Diffusion Models and Generative Adversarial Networks | Jingwei Zhang et.al. | 2307.02159 | null |
2023-07-05 | Generative Adversarial Networks for Dental Patient Identity Protection in Orthodontic Educational Imaging | Mingchuan Tian et.al. | 2307.02019 | null |
2023-07-04 | Approximate, Adapt, Anonymize (3A): a Framework for Privacy Preserving Training Data Release for Machine Learning | Tamas Madl et.al. | 2307.01875 | null |
2023-07-04 | Disentanglement in a GAN for Unconditional Speech Synthesis | Matthew Baas et.al. | 2307.01673 | link |
2023-07-04 | LEAT: Towards Robust Deepfake Disruption in Real-World Scenarios via Latent Ensemble Attack | Joonkyo Shim et.al. | 2307.01520 | null |
2023-07-02 | Unsupervised denoising of Raman spectra with cycle-consistent generative adversarial networks | Ciaran Bench et.al. | 2307.00513 | link |
2023-07-01 | CasTGAN: Cascaded Generative Adversarial Network for Realistic Tabular Data Synthesis | Abdallah Alshantti et.al. | 2307.00384 | link |
2023-07-01 | StyleStegan: Leak-free Style Transfer Based on Feature Steganography | Xiujian Liang et.al. | 2307.00225 | null |
2023-07-01 | Re-Think and Re-Design Graph Neural Networks in Spaces of Continuous Graph Diffusion Functionals | Tingting Dan et.al. | 2307.00222 | null |
2023-06-29 | Robust Roadside Perception for Autonomous Driving: an Annotation-free Strategy with Synthesized Data | Rusheng Zhang et.al. | 2306.17302 | null |
2023-06-29 | TemperatureGAN: Generative Modeling of Regional Atmospheric Temperatures | Emmanuel Balogun et.al. | 2306.17248 | null |
2023-06-29 | Synthetic Demographic Data Generation for Card Fraud Detection Using GANs | Shuo Wang et.al. | 2306.17109 | link |
2023-06-29 | ICDaeLST: Intensity-Controllable Detail Attention-enhanced for Lightweight Fast Style Transfer | Jiang Shi Qi et.al. | 2306.16846 | null |
2023-06-26 | Procedural content generation of puzzle games using conditional generative adversarial networks | Andreas Hald et.al. | 2306.15696 | null |
2023-06-27 | Style-transfer based Speech and Audio-visual Scene Understanding for Robot Action Sequence Acquisition from Videos | Chiori Hori et.al. | 2306.15644 | null |
2023-07-06 | A Simple and Effective Baseline for Attentional Generative Adversarial Networks | Mingyu Jin et.al. | 2306.14708 | link |
2023-07-09 | DragDiffusion: Harnessing Diffusion Models for Interactive Point-based Image Editing | Yujun Shi et.al. | 2306.14435 | link |
2023-06-24 | Creating Realistic Anterior Segment Optical Coherence Tomography Images using Generative Adversarial Networks | Jad F. Assaf et.al. | 2306.14058 | null |
2023-06-24 | Radio Generation Using Generative Adversarial Networks with An Unrolled Design | Weidong Wang et.al. | 2306.13893 | null |
2023-06-23 | A New Paradigm for Generative Adversarial Networks based on Randomized Decision Rules | Sehwan Kim et.al. | 2306.13641 | link |
2023-06-23 | Machine Learning methods for simulating particle response in the Zero Degree Calorimeter at the ALICE experiment, CERN | Jan Dubiński et.al. | 2306.13606 | null |
2023-06-23 | Penalty Gradient Normalization for Generative Adversarial Networks | Tian Xia et.al. | 2306.13576 | link |
2023-06-23 | PP-GAN : Style Transfer from Korean Portraits to ID Photos Using Landmark Extractor with GAN | Jongwook Si et.al. | 2306.13418 | null |
2023-06-22 | What to Learn: Features, Image Transformations, or Both? | Yuxuan Chen et.al. | 2306.13040 | null |
2023-06-23 | Semi-Implicit Denoising Diffusion Models (SIDDMs) | Yanwu Xu et.al. | 2306.12511 | null |
2023-06-21 | Benchmarking and Analyzing 3D-aware Image Synthesis with a Modularized Codebase | Qiuyu Wang et.al. | 2306.12423 | link |
2023-06-21 | A New Initial Distribution for Quantum Generative Adversarial Networks to Load Probability Distributions | Yuichi Sano et.al. | 2306.12303 | null |
2023-06-15 | Out of Distribution Generalization via Interventional Style Transfer in Single-Cell Microscopy | Wolfgang M. Pernice et.al. | 2306.11890 | null |
2023-06-20 | Towards a robust and reliable deep learning approach for detection of compact binary mergers in gravitational wave data | Shreejit Jadhav et.al. | 2306.11797 | null |
2023-06-22 | Data-Driven but Privacy-Conscious: Pedestrian Dataset De-identification via Full-Body Person Synthesis | Maxim Maximov et.al. | 2306.11710 | null |
2023-06-19 | Quantum state preparation of gravitational waves | Fergus Hayes et.al. | 2306.11073 | link |
2023-06-19 | Probabilistic matching of real and generated data statistics in generative adversarial networks | Philipp Pilar et.al. | 2306.10943 | null |
2023-06-19 | Robust Defect Detection with Contrastive Localization | Jiang Lin et.al. | 2306.10720 | null |
2023-06-18 | Stabilizing GANs' Training with Brownian Motion Controller | Tianjiao Luo et.al. | 2306.10468 | null |
2023-06-15 | Taming Diffusion Models for Music-driven Conducting Motion Generation | Zhuoran Zhao et.al. | 2306.10065 | link |
2023-06-16 | Query-Free Evasion Attacks Against Machine Learning-Based Malware Detectors with Generative Adversarial Networks | Daniel Gibert et.al. | 2306.09925 | null |
2023-06-16 | Understanding Deep Generative Models with Generalized Empirical Likelihoods | Suman Ravuri et.al. | 2306.09780 | null |
2023-06-15 | Training generative models from privatized data | Daria Reshetova et.al. | 2306.09547 | null |
2023-06-15 | Improving Path Planning Performance through Multimodal Generative Models with Local Critics | Jorge Ocampo Jimenez et.al. | 2306.09470 | null |
2023-06-19 | ArtFusion: Controllable Arbitrary Style Transfer using Dual Conditional Latent Diffusion Models | Dar-Yen Chen et.al. | 2306.09330 | link |
2023-06-15 | Contrast, Stylize and Adapt: Unsupervised Contrastive Learning Framework for Domain Adaptive Semantic Segmentation | Tianyu Li et.al. | 2306.09098 | link |
2023-06-15 | PUGAN: Physical Model-Guided Underwater Image Enhancement Using GAN with Dual-Discriminators | Runmin Cong et.al. | 2306.08918 | null |
2023-06-15 | Two-Way Semantic Transmission of Images without Feedback | Kaiwen Yu et.al. | 2306.08903 | link |
2023-06-15 | RIDNet Assisted cGAN Based Channel Estimation for One Bit ADC mmWave MIMO Systems | Erhan Karakoca et.al. | 2306.08882 | null |
2023-06-15 | Motion Capture Dataset for Practical Use of AI-based Motion Editing and Stylization | Makito Kobayashi et.al. | 2306.08861 | null |
2023-06-14 | Virtual Histology with Photon Absorption Remote Sensing using a Cycle-Consistent Generative Adversarial Network with Weakly Registered Pairs | James E. D. Tweel et.al. | 2306.08583 | null |
2023-06-14 | Gesper: A Restoration-Enhancement Framework for General Speech Reconstruction | Wenzhe Liu et.al. | 2306.08454 | null |
2023-06-14 | Pedestrian Recognition with Radar Data-Enhanced Deep Learning Approach Based on Micro-Doppler Signatures | Haoming Li et.al. | 2306.08303 | null |
2023-06-13 | Privacy Inference-Empowered Stealthy Backdoor Attack on Federated Learning under Non-IID Scenarios | Haochen Mei et.al. | 2306.08011 | null |
2023-06-12 | MSSRNet: Manipulating Sequential Style Representation for Unsupervised Text Style Transfer | Yazheng Yang et.al. | 2306.07994 | link |
2023-06-13 | ChatGPT vs Human-authored Text: Insights into Controllable Text Summarization and Sentence Style Transfer | Dongqi Pu et.al. | 2306.07799 | null |
2023-06-13 | Dynamically Masked Discriminator for Generative Adversarial Networks | Wentian Zhang et.al. | 2306.07716 | null |
2023-06-13 | Robustness of SAM: Segment Anything Under Corruptions and Beyond | Yu Qiao et.al. | 2306.07713 | null |
2023-06-12 | Transformer-based GAN for Terahertz Spatial-Temporal Channel Modeling and Generating | Zhengdong Hu et.al. | 2306.06902 | null |
2023-06-11 | Precise and Generalized Robustness Certification for Neural Networks | Yuanyuan Yuan et.al. | 2306.06747 | link |
2023-06-10 | Vocoder-Free Non-Parallel Conversion of Whispered Speech With Masked Cycle-Consistent Generative Adversarial Networks | Dominik Wagner et.al. | 2306.06514 | null |
2023-06-10 | Vista-Morph: Unsupervised Image Registration of Visible-Thermal Facial Pairs | Catherine Ordun et.al. | 2306.06505 | null |
2023-06-09 | Attention-stacked Generative Adversarial Network (AS-GAN)-empowered Sensor Data Augmentation for Online Monitoring of Manufacturing System | Yuxuan Li et.al. | 2306.06268 | null |
2023-06-09 | BioGAN: An unpaired GAN-based image to image translation model for microbiological images | Saber Mirzaee Bafti et.al. | 2306.06217 | link |
2023-06-09 | GANeRF: Leveraging Discriminators to Optimize Neural Radiance Fields | Barbara Roessle et.al. | 2306.06044 | null |
2023-06-09 | Prediction of Transportation Index for Urban Patterns in Small and Medium-sized Indian Cities using Hybrid RidgeGAN Model | Rahisha Thottolil et.al. | 2306.05951 | link |
2023-06-12 | GAN-CAN: A Novel Attack to Behavior-Based Driver Authentication Systems | Emad Efatinasab et.al. | 2306.05923 | null |
2023-06-09 | HRTF upsampling with a generative adversarial network using a gnomonic equiangular projection | Aidan O. T. Hogg et.al. | 2306.05812 | link |
2023-06-08 | Stochastic Multi-Person 3D Motion Forecasting | Sirui Xu et.al. | 2306.05421 | null |
2023-06-08 | Ownership Protection of Generative Adversarial Networks | Hailong Hu et.al. | 2306.05233 | null |
2023-06-08 | Solution of physics-based inverse problems using conditional generative adversarial networks with full gradient penalty | Deep Ray et.al. | 2306.04895 | null |
2023-06-07 | Estimating Uncertainty in PET Image Reconstruction via Deep Posterior Sampling | Tin Vlašić et.al. | 2306.04664 | null |
2023-06-07 | Interpretable Style Transfer for Text-to-Speech with ControlVAE and Diffusion Bridge | Wenhao Guan et.al. | 2306.04301 | null |
2023-06-07 | Phoenix: A Federated Generative Diffusion Model | Fiona Victoria Stanley Jothiraj et.al. | 2306.04098 | null |
2023-06-06 | An Open Patch Generator based Fingerprint Presentation Attack Detection using Generative Adversarial Network | Anuj Rai et.al. | 2306.03577 | null |
2023-06-10 | SDR-GAIN: A High Real-Time Occluded Pedestrian Pose Completion Method for Autonomous Driving | Honghao Fu et.al. | 2306.03538 | null |
2023-06-05 | Brain tumor segmentation using synthetic MR images -- A comparison of GANs and diffusion models | Muhammad Usman Akbar et.al. | 2306.02986 | null |
2023-06-05 | Identifying the style by a qualified reader on a short fragment of generated poetry | Boris Orekhov et.al. | 2306.02771 | null |
2023-06-05 | ZIGNeRF: Zero-shot 3D Scene Representation with Invertible Generative Neural Radiance Fields | Kanghyeok Ko et.al. | 2306.02741 | null |
2023-06-03 | A Conditional Generative Chatbot using Transformer Model | Nura Esfandiari et.al. | 2306.02074 | null |
2023-06-07 | Generative Adversarial Networks for Data Augmentation | Angona Biswas et.al. | 2306.02019 | null |
2023-06-03 | GAT-GAN : A Graph-Attention-based Time-Series Generative Adversarial Network | Srikrishna Iyer et.al. | 2306.01999 | null |
2023-06-02 | LIC-GAN: Language Information Conditioned Graph Generative GAN Model | Robert Lo et.al. | 2306.01937 | null |
2023-06-02 | GANs Settle Scores! | Siddarth Asokan et.al. | 2306.01654 | null |
2023-06-02 | PassGPT: Password Modeling and (Guided) Generation with Large Language Models | Javier Rando et.al. | 2306.01545 | null |
2023-06-02 | Text Style Transfer Back-Translation | Daimeng Wei et.al. | 2306.01318 | link |
2023-06-01 | Intelligent Grimm -- Open-ended Visual Storytelling via Latent Diffusion Models | Chang Liu et.al. | 2306.00973 | link |
2023-06-01 | Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis | Hubert Siuzdak et.al. | 2306.00814 | link |
2023-06-01 | Data Interpolants -- That's What Discriminators in Higher-order Gradient-regularized GANs Are | Siddarth Asokan et.al. | 2306.00785 | null |
2023-06-01 | Label- and slide-free tissue histology using 3D epi-mode quantitative phase imaging and virtual H&E staining | Tanishq Mathew Abraham et.al. | 2306.00548 | link |
2023-06-01 | A Call for Standardization and Validation of Text Style Transfer Evaluation | Phil Ostheimer et.al. | 2306.00539 | null |
2023-06-02 | Data-scarce surrogate modeling of shock-induced pore collapse process | Siu Wun Cheung et.al. | 2306.00184 | null |
2023-05-31 | GANDiffFace: Controllable Generation of Synthetic Datasets for Face Recognition with Realistic Variations | Pietro Melzi et.al. | 2305.19962 | null |
2023-05-31 | Neural Markov Jump Processes | Patrick Seifner et.al. | 2305.19744 | link |
2023-06-01 | PromptStyle: Controllable Style Transfer for Text-to-Speech with Natural Language Descriptions | Guanghou Liu et.al. | 2305.19522 | null |
2023-05-31 | Fine-grained Text Style Transfer with Diffusion-Based Language Models | Yiwei Lyu et.al. | 2305.19512 | link |
2023-05-31 | A Unified GAN Framework Regarding Manifold Alignment for Remote Sensing Images Generation | Xingzhe Su et.al. | 2305.19507 | null |
2023-05-30 | Calliffusion: Chinese Calligraphy Generation and Style Transfer with Diffusion Modeling | Qisheng Liao et.al. | 2305.19124 | null |
2023-05-30 | GAN-MPC: Training Model Predictive Controllers with Parameterized Cost Functions using Demonstrations from Non-identical Experts | Returaj Burnwal et.al. | 2305.19111 | null |
2023-05-31 | StyleAvatar3D: Leveraging Image-Text Diffusion Models for High-Fidelity 3D Avatar Generation | Chi Zhang et.al. | 2305.19012 | link |
2023-05-30 | Precision-Recall Divergence Optimization for Generative Modeling with GANs and Normalizing Flows | Alexandre Verine et.al. | 2305.18910 | null |
2023-05-30 | A Federated Channel Modeling System using Generative Neural Networks | Saira Bano et.al. | 2305.18856 | null |
2023-05-30 | SAVE: Spectral-Shift-Aware Adaptation of Image Diffusion Models for Text-guided Video Editing | Nazmul Karim et.al. | 2305.18670 | null |
2023-05-30 | Simulation-Aided Deep Learning for Laser Ultrasonic Visualization Testing | Miya Nakajima et.al. | 2305.18614 | null |
2023-05-29 | Conditional Diffusion Models for Semantic 3D Medical Image Synthesis | Zolnamar Dorjsembe et.al. | 2305.18453 | null |
2023-05-28 | Augmenting Character Designers Creativity Using Generative Adversarial Networks | Mohammad Lataifeh et.al. | 2305.18387 | null |
2023-05-28 | A Synergistic Framework Leveraging Autoencoders and Generative Adversarial Networks for the Synthesis of Computational Fluid Dynamics Results in Aerofoil Aerodynamics | Tanishk Nandal et.al. | 2305.18386 | null |
2023-05-29 | Generative Adversarial Networks based Skin Lesion Segmentation | Shubham Innani et.al. | 2305.18164 | link |
2023-05-29 | TD-GEM: Text-Driven Garment Editing Mapper | Reza Dadfar et.al. | 2305.18120 | link |
2023-05-29 | NaturalFinger: Generating Natural Fingerprint with Generative Adversarial Networks | Kang Yang et.al. | 2305.17868 | null |
2023-05-31 | SPAC-Net: Synthetic Pose-aware Animal ControlNet for Enhanced Pose Estimation | Le Jiang et.al. | 2305.17845 | null |
2023-06-01 | StyleS2ST: Zero-shot Style Transfer for Direct Speech-to-speech Translation | Kun Song et.al. | 2305.17732 | null |
2023-05-27 | CCDWT-GAN: Generative Adversarial Networks Based on Color Channel Using Discrete Wavelet Transform for Document Image Binarization | Rui-Yang Ju et.al. | 2305.17420 | null |
2023-05-26 | Fitting a Deep Generative Hadronization Model | Jay Chan et.al. | 2305.17169 | null |
2023-05-26 | Fast refacing of MR images with a generative neural network lowers re-identification risk and preserves volumetric consistency | Nataliia Molchanova et.al. | 2305.16922 | null |
2023-05-26 | Evaluating generation of chaotic time series by convolutional generative adversarial networks | Yuki Tanaka et.al. | 2305.16729 | link |
2023-05-25 | An AI-Ready Multiplex Staining Dataset for Reproducible and Accurate Characterization of Tumor Immune Microenvironment | Parmida Ghahremani et.al. | 2305.16465 | link |
2023-05-25 | The Representation Jensen-Shannon Divergence | Jhoan K. Hoyos-Osorio et.al. | 2305.16446 | link |
2023-05-25 | UDPM: Upsampling Diffusion Probabilistic Models | Shady Abu-Hussein et.al. | 2305.16269 | null |
2023-05-25 | Incomplete Multimodal Learning for Complex Brain Disorders Prediction | Reza Shirkavand et.al. | 2305.16222 | null |
2023-05-25 | Unifying GANs and Score-Based Diffusion as Generative Particle Models | Jean-Yves Franceschi et.al. | 2305.16150 | link |
2023-05-25 | Learning and accurate generation of stochastic dynamics based on multi-model Generative Adversarial Networks | Daniele Lanzoni et.al. | 2305.15920 | null |
2023-05-25 | Generative Adversarial Reduced Order Modelling | Dario Coscia et.al. | 2305.15881 | link |
2023-05-25 | DDDM-VC: Decoupled Denoising Diffusion Models with Disentangled Representation and Prior Mixup for Verified Robust Voice Conversion | Ha-Yeong Choi et.al. | 2305.15816 | null |
2023-05-26 | CLIP3Dstyler: Language Guided 3D Arbitrary Neural Style Transfer | Ming Gao et.al. | 2305.15732 | null |
2023-05-24 | Balancing Effect of Training Dataset Distribution of Multiple Styles for Multi-Style Text Transfer | Debarati Das et.al. | 2305.15582 | null |
2023-05-24 | SAMScore: A Semantic Structural Similarity Metric for Image Translation Evaluation | Yunxiang Li et.al. | 2305.15367 | link |
2023-05-24 | IoT Threat Detection Testbed Using Generative Adversarial Networks | Farooq Shaikh et.al. | 2305.15191 | null |
2023-05-24 | GAN-AE : An anomaly detection algorithm for New Physics search in LHC data | Louis Vaslin et.al. | 2305.15179 | link |
2023-05-24 | DuDGAN: Improving Class-Conditional GANs via Dual-Diffusion | Taesun Yeom et.al. | 2305.14849 | null |
2023-05-24 | ACE: Adversarial Correspondence Embedding for Cross Morphology Motion Retargeting from Human to Nonhuman Characters | Tianyu Li et.al. | 2305.14792 | null |
2023-05-24 | Revisit and Outstrip Entity Alignment: A Perspective of Generative Models | Lingbing Guo et.al. | 2305.14651 | null |
2023-05-22 | Design a Delicious Lunchbox in Style | Yutong Zhou et.al. | 2305.14522 | null |
2023-05-23 | Source-Free Domain Adaptation for RGB-D Semantic Segmentation with Vision Transformers | Giulia Rizzoli et.al. | 2305.14269 | null |
2023-05-23 | Realistic Noise Synthesis with Diffusion Models | Qi Wu et.al. | 2305.14022 | null |
2023-05-23 | Control-A-Video: Controllable Text-to-Video Generation with Diffusion Models | Weifeng Chen et.al. | 2305.13840 | null |
2023-05-23 | Generalizable Synthetic Image Detection via Language-guided Contrastive Learning | Haiwei Wu et.al. | 2305.13800 | link |
2023-05-22 | Attribute-Guided Encryption with Facial Texture Masking | Chun Pong Lau et.al. | 2305.13548 | null |
2023-05-22 | Statistical Guarantees of Group-Invariant GANs | Ziyu Chen et.al. | 2305.13517 | null |
2023-05-22 | Why current rain denoising models fail on CycleGAN created rain images in autonomous driving | Michael Kranl et.al. | 2305.12983 | null |
2023-05-22 | ZS-MSTM: Zero-Shot Style Transfer for Gesture Animation driven by Text and Speech using Adversarial Disentanglement of Multimodal Style Encoding | Mireille Fares et.al. | 2305.12887 | null |
2023-05-22 | SG-GAN: Fine Stereoscopic-Aware Generation for 3D Brain Point Cloud Up-sampling from a Single Image | Bowen Hu et.al. | 2305.12646 | null |
2023-05-21 | iWarpGAN: Disentangling Identity and Style to Generate Synthetic Iris Images | Shivangi Yadav et.al. | 2305.12596 | null |
2023-05-21 | Generalizable synthetic MRI with physics-informed convolutional networks | Luuk Jacobs et.al. | 2305.12570 | null |
2023-05-21 | PCF-GAN: generating sequential data via the characteristic function of measures on the path space | Hang Lou et.al. | 2305.12511 | link |
2023-05-21 | Exploring How Generative Adversarial Networks Learn Phonological Representations | Jingyi Chen et.al. | 2305.12501 | null |
2023-05-21 | Study of GANs for Noisy Speech Simulation from Clean Speech | Leander Melroy Maben et.al. | 2305.12460 | null |
2023-05-21 | InstructVid2Vid: Controllable Video Editing with Natural Language Instructions | Bosheng Qin et.al. | 2305.12328 | null |
2023-05-20 | AI-assisted super-resolution cosmological simulations III: Time evolution | Xiaowen Zhang et.al. | 2305.12222 | null |
2023-05-19 | Reducing Sequence Length by Predicting Edit Operations with Large Language Models | Masahiro Kaneko et.al. | 2305.11862 | null |
2023-05-19 | Sim-to-Real Segmentation in Robot-assisted Transoral Tracheal Intubation | Guankun Wang et.al. | 2305.11686 | null |
2023-05-19 | Latent Imitator: Generating Natural Individual Discriminatory Instances for Black-Box Fairness Testing | Yisong Xiao et.al. | 2305.11602 | null |
2023-05-19 | Brain Captioning: Decoding human brain activity into images and text | Matteo Ferrante et.al. | 2305.11560 | null |
2023-05-19 | PS-FedGAN: An Efficient Federated Learning Framework Based on Partially Shared Generative Adversarial Networks For Data Privacy | Achintha Wijesinghe et.al. | 2305.11437 | null |
2023-05-19 | A Preliminary Study on Augmenting Speech Emotion Recognition using a Diffusion Model | Ibrahim Malik et.al. | 2305.11413 | null |
2023-05-19 | Few-Shot Continual Learning for Conditional Generative Adversarial Networks | Cat P. Le et.al. | 2305.11400 | null |
2023-05-18 | JoIN: Joint GANs Inversion for Intrinsic Image Decomposition | Viraj Shah et.al. | 2305.11321 | null |
2023-05-18 | Drag Your GAN: Interactive Point-based Manipulation on the Generative Image Manifold | Xingang Pan et.al. | 2305.10973 | link |
2023-05-18 | Domain Adaptive Sim-to-Real Segmentation of Oropharyngeal Organs | Guankun Wang et.al. | 2305.10883 | link |
2023-05-18 | StawGAN: Structural-Aware Generative Adversarial Networks for Infrared Image Translation | Luigi Sigillo et.al. | 2305.10882 | link |
2023-05-18 | Constructing a personalized AI assistant for shear wall layout using Stable Diffusion | Lufeng Wang et.al. | 2305.10830 | null |
2023-05-18 | BrutePrint: Expose Smartphone Fingerprint Authentication to Brute-force Attack | Yu Chen et.al. | 2305.10791 | null |
2023-05-16 | How does agency impact human-AI collaborative design space exploration? A case study on ship design with deep generative models | Shahroz Khan et.al. | 2305.10451 | null |
2023-05-13 | CBAGAN-RRT: Convolutional Block Attention Generative Adversarial Network for Sampling-Based Path Planning | Abhinav Sagar et.al. | 2305.10442 | null |
2023-05-19 | Spiking Generative Adversarial Network with Attention Scoring Decoding | Linghao Feng et.al. | 2305.10246 | null |
2023-05-17 | Bridging the Gap: Enhancing the Utility of Synthetic Data via Post-Processing Techniques | Andrea Lampis et.al. | 2305.10118 | null |
2023-05-16 | BSGAN: A Novel Oversampling Technique for Imbalanced Pattern Recognitions | Md Manjurul Ahsan et.al. | 2305.09777 | null |
2023-05-16 | Wavelet-based Unsupervised Label-to-Image Translation | George Eskandar et.al. | 2305.09647 | link |
2023-05-16 | Urban-StyleGAN: Learning to Generate and Manipulate Images of Urban Scenes | George Eskandar et.al. | 2305.09602 | null |
2023-05-16 | Improved Type III solar radio burst detection using congruent deep learning models | Jeremiah Scully et.al. | 2305.09327 | null |
2023-05-16 | Style Transfer Enabled Sim2Real Framework for Efficient Learning of Robotic Ultrasound Image Analysis Using Simulated Data | Keyu Li et.al. | 2305.09169 | null |
2023-05-14 | Smart Home Energy Management: VAE-GAN synthetic dataset generator and Q-learning | Mina Razghandi et.al. | 2305.08885 | null |
2023-05-15 | Generative Adversarial Networks for Spatio-Spectral Compression of Hyperspectral Images | Akshara Preethy Byju et.al. | 2305.08514 | null |
2023-05-14 | Local Convergence of Gradient Descent-Ascent for Training Generative Adversarial Networks | Evan Becker et.al. | 2305.08277 | null |
2023-05-14 | Street Layout Design via Conditional Adversarial Learning | Lehao Yang et.al. | 2305.08186 | null |
2023-05-12 | Spider GAN: Leveraging Friendly Neighbors to Accelerate GAN Training | Siddarth Asokan et.al. | 2305.07613 | null |
2023-05-12 | Color Deconvolution applied to Domain Adaptation in HER2 histopathological images | David Anglada-Rotger et.al. | 2305.07404 | null |
2023-05-12 | A Full Quantum Generative Adversarial Network Model for High Energy Physics Simulations | Florian Rehm et.al. | 2305.07284 | null |
2023-05-11 | Realization RGBD Image Stylization | Bhavya Sehgal et.al. | 2305.06565 | null |
2023-05-10 | Analyzing Bias in Diffusion-based Face Generation Models | Malsha V. Perera et.al. | 2305.06402 | null |
2023-05-10 | DaGAN++: Depth-Aware Generative Adversarial Network for Talking Head Video Generation | Fa-Ting Hong et.al. | 2305.06225 | link |
2023-05-10 | Post-training Model Quantization Using GANs for Synthetic Data Generation | Athanasios Masouris et.al. | 2305.06052 | link |
2023-05-10 | Adapter-TST: A Parameter Efficient Method for Multiple-Attribute Text Style Transfer | Zhiqiang Hu et.al. | 2305.05945 | null |
2023-05-09 | Enhancing Gappy Speech Audio Signals with Generative Adversarial Networks | Deniss Strods et.al. | 2305.05780 | null |
2023-05-09 | Style-A-Video: Agile Diffusion for Arbitrary Text-based Video Style Transfer | Nisha Huang et.al. | 2305.05464 | link |
2023-05-09 | Joint Multi-scale Cross-lingual Speaking Style Transfer with Bidirectional Attention Mechanism for Automatic Dubbing | Jingbei Li et.al. | 2305.05203 | null |
2023-05-09 | Who is Speaking Actually? Robust and Versatile Speaker Traceability for Voice Conversion | Yanzhen Ren et.al. | 2305.05152 | null |
2023-05-08 | Enhancing synthetic training data for quantitative photoacoustic tomography with generative deep learning | Ciaran Bench et.al. | 2305.04714 | null |
2023-05-06 | A Sea-Land Clutter Classification Framework for Over-the-Horizon-Radar Based on Weighted Loss Semi-supervised GAN | Xiaoxuan Zhang et.al. | 2305.04021 | null |
2023-05-05 | Learning Stochastic Dynamical System via Flow Map Operator | Yuan Chen et.al. | 2305.03874 | null |
2023-05-05 | Towards Applying Powerful Large AI Models in Classroom Teaching: Opportunities, Challenges and Prospects | Kehui Tan et.al. | 2305.03433 | null |
2023-05-04 | A Generative Modeling Framework for Inferring Families of Biomechanical Constitutive Laws in Data-Sparse Regimes | Minglang Yin et.al. | 2305.03184 | null |
2023-05-04 | Critical heat flux diagnosis using conditional generative adversarial networks | UngJin Na et.al. | 2305.02622 | null |
2023-05-04 | LayoutDM: Transformer-based Diffusion Model for Layout Generation | Shang Chai et.al. | 2305.02567 | null |
2023-05-03 | GANonymization: A GAN-based Face Anonymization Framework for Preserving Emotional Expressions | Fabio Hellmann et.al. | 2305.02143 | null |
2023-05-02 | Basic syntax from speech: Spontaneous concatenation in unsupervised deep neural networks | Gašper Beguš et.al. | 2305.01626 | null |
2023-05-02 | Learning Hard Distributions with Quantum-enhanced Variational Autoencoders | Anantha Rao et.al. | 2305.01592 | null |
2023-05-08 | AQ-GT: a Temporally Aligned and Quantized GRU-Transformer for Co-Speech Gesture Synthesis | Hendric Voß et.al. | 2305.01241 | null |
2023-05-01 | Hypernuclear event detection in the nuclear emulsion with Monte Carlo simulation and machine learning | A. Kasagi et.al. | 2305.00884 | null |
2023-04-30 | StyleGenes: Discrete and Efficient Latent Distributions for GANs | Evangelos Ntavelis et.al. | 2305.00599 | null |
2023-04-30 | Towards Computational Architecture of Liberty: A Comprehensive Survey on Deep Learning for Generating Virtual Architecture in the Metaverse | Anqi Wang et.al. | 2305.00510 | null |
2023-04-30 | Identity-driven Three-Player Generative Adversarial Network for Synthetic-based Face Recognition | Jan Niklas Kolf et.al. | 2305.00358 | link |
2023-04-29 | ShipHullGAN: A generic parametric modeller for ship hull design using deep convolutional generative model | Shahroz Khan et.al. | 2305.00210 | null |
2023-04-29 | Visualizing chest X-ray dataset biases using GANs | Hao Liang et.al. | 2305.00147 | null |
2023-04-29 | LD-GAN: Low-Dimensional Generative Adversarial Network for Spectral Image Generation with Variance Regularization | Emmanuel Martinez et.al. | 2305.00132 | link |
2023-04-28 | SceneGenie: Scene Graph Guided Diffusion Models for Image Synthesis | Azade Farshad et.al. | 2304.14573 | null |
2023-04-27 | Quantum Generative Adversarial Networks For Anomaly Detection In High Energy Physics | Elie Bermot et.al. | 2304.14439 | null |
2023-04-26 | Deep Learning Techniques for Hyperspectral Image Analysis in Agriculture: A Review | Mohamed Fadhlallah Guerri et.al. | 2304.13880 | null |
2023-04-26 | Multidimensional Evaluation for Text Style Transfer Using ChatGPT | Huiyuan Lai et.al. | 2304.13462 | link |
2023-04-26 | DiffuseExpand: Expanding dataset for 2D medical image segmentation using diffusion models | Shitong Shao et.al. | 2304.13416 | null |
2023-04-25 | LumiGAN: Unconditional Generation of Relightable 3D Human Faces | Boyang Deng et.al. | 2304.13153 | null |
2023-05-05 | Directed Chain Generative Adversarial Networks | Ming Min et.al. | 2304.13131 | null |
2023-04-25 | Learning Volatility Surfaces using Generative Adversarial Networks | Andrew Na et.al. | 2304.13128 | null |
2023-04-25 | Diffusion Probabilistic Model Based Accurate and High-Degree-of-Freedom Metasurface Inverse Design | Zezhou Zhang et.al. | 2304.13038 | null |
2023-04-25 | The Score-Difference Flow for Implicit Generative Modeling | Romann M. Weber et.al. | 2304.12906 | null |
2023-04-25 | Latent diffusion models for generative precipitation nowcasting with accurate uncertainty quantification | Jussi Leinonen et.al. | 2304.12891 | null |
2023-04-24 | Unsupervised Style-based Explicit 3D Face Reconstruction from Single Image | Heng Yu et.al. | 2304.12455 | null |
2023-04-24 | GRIG: Few-Shot Generative Residual Image Inpainting | Wanglong Lu et.al. | 2304.12035 | null |
2023-04-29 | Incorporating Experts' Judgment into Machine Learning Models | Hogun Park et.al. | 2304.11870 | null |
2023-04-24 | Portfolio Optimization using Predictive Auxiliary Classifier Generative Adversarial Networks with Measuring Uncertainty | Jiwook Kim et.al. | 2304.11856 | null |
2023-04-24 | Master: Meta Style Transformer for Controllable Zero-Shot and Few-Shot Artistic Style Transfer | Hao Tang et.al. | 2304.11818 | null |
2023-04-23 | Controlled physics-informed data generation for deep learning-based remaining useful life prediction under unseen operation conditions | Jiawei Xiong et.al. | 2304.11702 | null |
2023-04-23 | Child Face Recognition at Scale: Synthetic Data Generation and Performance Benchmark | Magnus Falkenberg et.al. | 2304.11685 | null |
2023-04-23 | Towards Controllable Audio Texture Morphing | Chitralekha Gupta et.al. | 2304.11648 | null |
2023-04-22 | Physics-guided generative adversarial network to learn physical models | Kazuo Yonekura et.al. | 2304.11488 | null |
2023-04-22 | Conditional Denoising Diffusion for Sequential Recommendation | Yu Wang et.al. | 2304.11433 | null |
2023-04-22 | Medium. Permeation: SARS-COV-2 Painting Creation by Generative Model | Yuan-Fu Yang et.al. | 2304.11354 | null |
2023-04-22 | Two Birds, One Stone: A Unified Framework for Joint Learning of Image and Video Style Transfers | Bohai Gu et.al. | 2304.11335 | null |
2023-04-22 | Spectral normalized dual contrastive regularization for image-to-image translation | Chen Zhao et.al. | 2304.11319 | null |
2023-04-22 | BiTrackGAN: Cascaded CycleGANs to Constraint Face Aging | Tsung-Han Kuo et.al. | 2304.11313 | null |
2023-04-19 | Affective social anthropomorphic intelligent system | Md. Adyelullahil Mamun et.al. | 2304.11046 | null |
2023-04-21 | Near-Optimal Decentralized Momentum Method for Nonconvex-PL Minimax Problems | Feihu Huang et.al. | 2304.10902 | null |
2023-04-21 | Application of quantum-inspired generative models to small molecular datasets | C. Moussa et.al. | 2304.10867 | null |
2023-04-21 | Matching-based Data Valuation for Generative Model | Jiaxi Yang et.al. | 2304.10701 | null |
2023-04-20 | A Plug-and-Play Defensive Perturbation for Copyright Protection of DNN-based Applications | Donghua Wang et.al. | 2304.10679 | null |
2023-04-20 | LiDAR-NeRF: Novel LiDAR View Synthesis via Neural Radiance Fields | Tang Tao et.al. | 2304.10406 | null |
2023-04-21 | Conditional Generative Models for Learning Stochastic Processes | Salvatore Certo et.al. | 2304.10382 | null |
2023-04-20 | Adaptive Consensus Optimization Method for GANs | Sachin Kumar Danisetty et.al. | 2304.10317 | null |
2023-04-20 | Towards replacing precipitation ensemble predictions systems using machine learning | Rüdiger Brecht et.al. | 2304.10251 | link |
2023-04-20 | Machine learning traction force maps of cell monolayers | Changhao Li et.al. | 2304.10065 | null |
2023-04-19 | GREAT Score: Global Robustness Evaluation of Adversarial Perturbation using Generative Models | Li Zaitang et.al. | 2304.09875 | null |
2023-04-20 | Any-to-Any Style Transfer: Making Picasso and Da Vinci Collaborate | Songhua Liu et.al. | 2304.09728 | link |
2023-04-19 | StyleDEM: a Versatile Model for Authoring Terrains | Simon Perche et.al. | 2304.09626 | null |
2023-04-19 | Towards Co-Creative Generative Adversarial Networks for Fashion Designers | Imke Grabe et.al. | 2304.09477 | null |
2023-04-19 | SP-BatikGAN: An Efficient Generative Adversarial Network for Symmetric Pattern Generation | Chrystian et.al. | 2304.09384 | null |
2023-04-19 | Physical Knowledge Enhanced Deep Neural Network for Sea Surface Temperature Prediction | Yuxin Meng et.al. | 2304.09376 | null |
2023-04-18 | Performance of GAN-based augmentation for deep learning COVID-19 image classification | Oleksandr Fedoruk et.al. | 2304.09067 | null |
2023-04-18 | Look ATME: The Discriminator Mean Entropy Needs Attention | Edgardo Solano-Carrillo et.al. | 2304.09024 | link |
2023-04-18 | TTIDA: Controllable Generative Data Augmentation via Text-to-Text and Text-to-Image Models | Yuwei Yin et.al. | 2304.08821 | null |
2023-04-17 | Insta(nt) Pet Therapy: GAN-generated Images for Therapeutic Social Media Content | Tanish Jain et.al. | 2304.08665 | null |
2023-04-17 | Two-stage MR Image Segmentation Method for Brain Tumors based on Attention Mechanism | Li Zhu et.al. | 2304.08072 | null |
2023-04-16 | Predicting unavailable parameters from existing velocity fields of turbulent flows using a GAN-based model | Linqi Yu et.al. | 2304.07762 | null |
2023-04-16 | A Novel end-to-end Framework for Occluded Pixel Reconstruction with Spatio-temporal Features for Improved Person Re-identification | Prathistith Raj Medi et.al. | 2304.07721 | null |
2023-04-14 | Memory Efficient Diffusion Probabilistic Models via Patch-based Generation | Shinei Arakawa et.al. | 2304.07087 | null |
2023-04-13 | Improving novelty detection with generative adversarial networks on hand gesture data | Miguel Simão et.al. | 2304.06696 | null |
2023-04-13 | Intriguing properties of synthetic images: from generative adversarial networks to diffusion models | Riccardo Corvi et.al. | 2304.06408 | null |
2023-04-13 | ALR-GAN: Adaptive Layout Refinement for Text-to-Image Synthesis | Hongchen Tan et.al. | 2304.06297 | null |
2023-04-12 | VidStyleODE: Disentangled Video Editing via StyleGAN and NeuralODEs | Moayed Haji Ali et.al. | 2304.06020 | null |
2023-04-12 | ALADIN-NST: Self-supervised disentangled representation learning of artistic style through Neural Style Transfer | Dan Ruta et.al. | 2304.05755 | link |
2023-04-12 | Generative Adversarial Networks-Driven Cyber Threat Intelligence Detection Framework for Securing Internet of Things | Mohamed Amine Ferrag et.al. | 2304.05644 | null |
2023-04-12 | Improving Diffusion Models for Scene Text Editing with Dual Encoders | Jiabao Ji et.al. | 2304.05568 | link |
2023-04-12 | An End-to-End Network for Upright Adjustment of Panoramic Images | Heyu Chen et.al. | 2304.05556 | null |
2023-04-11 | GraphGANFed: A Federated Generative Framework for Graph-Structured Molecules Towards Efficient Drug Discovery | Daniel Manu et.al. | 2304.05498 | null |
2023-04-11 | Mask-conditioned latent diffusion for generating gastrointestinal polyp images | Roman Macháček et.al. | 2304.05233 | null |
2023-04-11 | NeAT: Neural Artistic Tracing for Beautiful Style Transfer | Dan Ruta et.al. | 2304.05139 | link |
2023-04-17 | Diffusion Recommender Model | Wenjie Wang et.al. | 2304.04971 | link |
2023-04-10 | DDRF: Denoising Diffusion Model for Remote Sensing Image Fusion | ZiHan Cao et.al. | 2304.04774 | null |
2023-04-10 | Reinforcement Learning-Based Black-Box Model Inversion Attacks | Gyojin Han et.al. | 2304.04625 | link |
2023-04-10 | Sequential Recommendation with Diffusion Models | Hanwen Du et.al. | 2304.04541 | null |
2023-04-10 | Modernizing Old Photos Using Multiple References via Photorealistic Style Transfer | Agus Gunawan et.al. | 2304.04461 | null |
2023-04-10 | Generating Adversarial Attacks in the Latent Space | Nitish Shukla et.al. | 2304.04386 | null |
2023-04-10 | ITportrait: Image-Text Coupled 3D Portrait Domain Adaptation | Xiangwen Deng et.al. | 2304.04364 | null |
2023-04-09 | ForamViT-GAN: Exploring New Paradigms in Deep Learning for Micropaleontological Image Analysis | Ivan Ferreira-Chacua et.al. | 2304.04291 | null |
2023-04-09 | Distributed Conditional GAN (discGAN) For Synthetic Healthcare Data Generation | David Fuentes et.al. | 2304.04290 | null |
2023-04-08 | Towards Realistic Ultrasound Fetal Brain Imaging Synthesis | Michelle Iskandar et.al. | 2304.03941 | link |
2023-04-08 | 3D GANs and Latent Space: A comprehensive survey | Satya Pratheek Tata et.al. | 2304.03932 | null |
2023-04-07 | Correcting Model Misspecification via Generative Adversarial Networks | Pronoma Banerjee et.al. | 2304.03805 | null |
2023-04-07 | Leveraging GANs for data scarcity of COVID-19: Beyond the hype | Hazrat Ali et.al. | 2304.03536 | null |
2023-04-06 | Zero-shot Generative Model Adaptation via Image-specific Prompt Learning | Jiayi Guo et.al. | 2304.03119 | link |
2023-04-06 | Spritz-PS: Validation of Synthetic Face Images Using a Large Dataset of Printed Documents | Ehsan Nowroozi et.al. | 2304.02982 | null |
2023-04-06 | A review of ensemble learning and data augmentation models for class imbalanced problems: combination, implementation and evaluation | Azal Ahmad Khan et.al. | 2304.02858 | link |
2023-04-05 | Bengali Fake Review Detection using Semi-supervised Generative Adversarial Networks | Md. Tanvir Rouf Shawon et.al. | 2304.02739 | null |
2023-04-05 | Learning Stage-wise GANs for Whistle Extraction in Time-Frequency Spectrograms | Pu Li et.al. | 2304.02714 | link |
2023-04-05 | Face Transformer: Towards High Fidelity and Accurate Face Swapping | Kaiwen Cui et.al. | 2304.02530 | null |
2023-04-05 | A Diffusion-based Method for Multi-turn Compositional Image Generation | Chao Wang et.al. | 2304.02192 | null |
2023-04-04 | FakET: Simulating Cryo-Electron Tomograms with Neural Style Transfer | Pavol Harar et.al. | 2304.02011 | link |
2023-04-04 | Revisiting the Evaluation of Image Synthesis with GANs | Mengping Yang et.al. | 2304.01999 | null |
2023-04-04 | A Practical Framework for Unsupervised Structure Preservation Medical Image Enhancement | Quan Huu Cap et.al. | 2304.01864 | link |
2023-04-03 | ViT-DAE: Transformer-driven Diffusion Autoencoder for Histopathology Image Analysis | Xuan Xu et.al. | 2304.01053 | null |
2023-04-03 | Tunable Convolutions with Parametric Multi-Loss Optimization | Matteo Maggioni et.al. | 2304.00898 | null |
2023-04-03 | CG-3DSRGAN: A classification guided 3D generative adversarial network for image quality recovery from low-dose PET images | Yuxin Xue et.al. | 2304.00725 | null |
2023-04-02 | Improving RF-DNA Fingerprinting Performance in an Indoor Multipath Environment Using Semi-Supervised Learning | Mohamed k. Fadul et.al. | 2304.00648 | null |
2023-04-02 | A Unified Compression Framework for Efficient Speech-Driven Talking-Face Generation | Bo-Kyeong Kim et.al. | 2304.00471 | null |
2023-04-02 | Ideal Observer Computation by Use of Markov-Chain Monte Carlo with Generative Adversarial Networks | Weimin Zhou et.al. | 2304.00433 | null |
2023-04-02 | Learning Dynamic Style Kernels for Artistic Style Transfer | Xu Wenju et.al. | 2304.00414 | null |
2023-03-31 | Fides: A Generative Framework for Result Validation of Outsourced Machine Learning Workloads via TEE | Abhinav Kumar et.al. | 2304.00083 | null |
2023-03-31 | One-shot Unsupervised Domain Adaptation with Personalized Diffusion Models | Yasser Benigmim et.al. | 2303.18080 | link |
2023-03-31 | Exploiting Multilingualism in Low-resource Neural Machine Translation via Adversarial Learning | Amit Kumar et.al. | 2303.18011 | null |
2023-03-31 | Unsupervised Anomaly Detection and Localization of Machine Audio: A GAN-based Approach | Anbai Jiang et.al. | 2303.17949 | link |
2023-03-31 | Comparing Adversarial and Supervised Learning for Organs at Risk Segmentation in CT images | Leonardo Crespi et.al. | 2303.17941 | null |
2023-03-31 | CAP-VSTNet: Content Affinity Preserved Versatile Style Transfer | Linfeng Wen et.al. | 2303.17867 | null |
2023-04-01 | Semantic Image Translation for Repairing the Texture Defects of Building Models | Qisen Shang et.al. | 2303.17418 | null |
2023-03-30 | Retrospective Motion Correction in Gradient Echo MRI by Explicit Motion Estimation Using Deep CNNs | Mathias S. Feinler et.al. | 2303.17239 | null |
2023-03-30 | LatentForensics: Towards lighter deepfake detection in the StyleGAN latent space | Matthieu Delmas et.al. | 2303.17222 | null |
2023-03-30 | SARGAN: Spatial Attention-based Residuals for Facial Expression Manipulation | Arbish Akram et.al. | 2303.17212 | null |
2023-03-30 | KD-DLGAN: Data Limited Image Generation via Knowledge Distillation | Kaiwen Cui et.al. | 2303.17158 | null |
2023-03-29 | A comparative evaluation of image-to-image translation methods for stain transfer in histopathology | Igor Zingman et.al. | 2303.17009 | null |
2023-03-29 | WordStylist: Styled Verbatim Handwritten Text Generation with Latent Diffusion Models | Konstantina Nikolaidou et.al. | 2303.16576 | null |
2023-03-28 | Information-Theoretic GAN Compression with Variational Energy-based Model | Minsoo Kang et.al. | 2303.16050 | null |
2023-03-28 | Physics-guided adversarial networks for artificial digital image correlation data generation | David Melching et.al. | 2303.15939 | null |
2023-03-28 | fRegGAN with K-space Loss Regularization for Medical Image Translation | Ivo M. Baltruschat et.al. | 2303.15938 | null |
2023-03-28 | PosterLayout: A New Benchmark and Approach for Content-aware Visual-Textual Presentation Layout | HsiaoYuan Hsu et.al. | 2303.15937 | link |
2023-03-27 | A Framework for Demonstrating Practical Quantum Advantage: Racing Quantum against Classical Generative Models | Mohamed Hibat-Allah et.al. | 2303.15626 | null |
2023-03-27 | Sequential training of GANs against GAN-classifiers reveals correlated "knowledge gaps" present among independently trained GAN instances | Arkanath Pathak et.al. | 2303.15533 | null |
2023-03-27 | Training-free Style Transfer Emerges from h-space in Diffusion models | Jaeseok Jeong et.al. | 2303.15403 | null |
2023-03-27 | How far generated data can impact Neural Networks performance? | Sayeh Gholipour Picha et.al. | 2303.15223 | link |
2023-03-29 | Generalizable Denoising of Microscopy Images using Generative Adversarial Networks and Contrastive Learning | Felix Fuentes-Hurtado et.al. | 2303.15214 | link |
2023-03-29 | Data Augmentation for Environmental Sound Classification Using Diffusion Probabilistic Model with Top-k Selection Discriminator | Yunhao Chen et.al. | 2303.15161 | link |
2023-03-26 | Query Generation based on Generative Adversarial Networks | Weihua Sun et.al. | 2303.14777 | null |
2023-03-25 | Spatial Latent Representations in Generative Adversarial Networks for Image Generation | Maciej Sypetkowski et.al. | 2303.14552 | null |
2023-03-25 | GANTEE: Generative Adversatial Network for Taxonomy Entering Evaluation | Zhouhong Gu et.al. | 2303.14480 | null |
2023-03-24 | Wave-U-Net Discriminator: Fast and Lightweight Discriminator for Generative Adversarial Network-Based Speech Synthesis | Takuhiro Kaneko et.al. | 2303.13909 | null |
2023-03-24 | A Three-Player GAN for Super-Resolution in Magnetic Resonance Imaging | Qi Wang et.al. | 2303.13900 | null |
2023-03-24 | Factor Decomposed Generative Adversarial Networks for Text-to-Image Synthesis | Jiguo Li et.al. | 2303.13821 | null |
2023-03-24 | Neural Preset for Color Style Transfer | Zhanghan Ke et.al. | 2303.13511 | link |
2023-03-23 | Transforming Radiance Field with Lipschitz Network for Photorealistic 3D Scene Stylization | Zicheng Zhang et.al. | 2303.13232 | null |
2023-03-23 | PanoHead: Geometry-Aware 3D Full-Head Synthesis in 360° | Sizhe An et.al. | 2303.13071 | null |
2023-03-23 | Reimagining Application User Interface (UI) Design using Deep Learning Methods: Challenges and Opportunities | Subtain Malik et.al. | 2303.13055 | null |
2023-03-22 | TSI-GAN: Unsupervised Time Series Anomaly Detection using Convolutional Cycle-Consistent Generative Adversarial Networks | Shyam Sundar Saravanan et.al. | 2303.12952 | link |
2023-03-22 | NeRF-GAN Distillation for Efficient 3D-Aware Generation with Convolutions | Mohamad Shahbazi et.al. | 2303.12865 | null |
2023-03-22 | Synthetic Health-related Longitudinal Data with Mixed-type Variables Generated using Diffusion Models | Nicholas I-Hsien Kuo et.al. | 2303.12281 | null |
2023-03-21 | Generative AI for Cyber Threat-Hunting in 6G-enabled IoT Networks | Mohamed Amine Ferrag et.al. | 2303.11751 | null |
2023-03-21 | Linking generative semi-supervised learning and generative open-set recognition | Emile Reyn Engelbrecht et.al. | 2303.11702 | null |
2023-03-21 | CoopInit: Initializing Generative Adversarial Networks via Cooperative Learning | Yang Zhao et.al. | 2303.11649 | null |
2023-03-20 | AnimeDiffusion: Anime Face Line Drawing Colorization via Diffusion Models | Yu Cao et.al. | 2303.11137 | null |
2023-03-20 | Discovering Interpretable Directions in the Semantic Latent Space of Diffusion Models | René Haas et.al. | 2303.11073 | null |
2023-03-20 | k-SALSA: k-anonymous synthetic averaging of retinal images via local style alignment | Minkyu Jeon et.al. | 2303.10824 | link |
2023-03-19 | Cross-GAN Auditing: Unsupervised Identification of Attribute Level Similarities and Differences between Pretrained Generative Models | Matthew L. Olson et.al. | 2303.10774 | null |
2023-03-24 | StyleRF: Zero-shot 3D Style Transfer of Neural Radiance Fields | Kunhao Liu et.al. | 2303.10598 | null |
2023-03-17 | Exploring contrast generalisation in deep learning-based brain MRI-to-CT synthesis | Lotte Nijskens et.al. | 2303.10202 | null |
2023-03-17 | Unsupervised Domain Transfer with Conditional Invertible Neural Networks | Kris K. Dreher et.al. | 2303.10191 | null |
2023-03-17 | DialogPaint: A Dialog-based Image Editing Model | Jingxuan Wei et.al. | 2303.10073 | null |
2023-03-17 | Configurable EBEN: Extreme Bandwidth Extension Network to enhance body-conducted speech capture | Julien Hauret et.al. | 2303.10008 | link |
2023-03-17 | Exploiting Semantic Attributes for Transductive Zero-Shot Learning | Zhengbo Wang et.al. | 2303.09849 | null |
2023-03-22 | Style Transfer for 2D Talking Head Animation | Trong-Thang Pham et.al. | 2303.09799 | null |
2023-03-17 | Diffusing the Optimal Topology: A Generative Optimization Approach | Giorgio Giannone et.al. | 2303.09760 | null |
2023-03-16 | 3D Masked Autoencoding and Pseudo-labeling for Domain Adaptive Segmentation of Heterogeneous Infant Brain MRI | Xuzhe Zhang et.al. | 2303.09373 | null |
2023-03-15 | Copyright Protection and Accountability of Generative AI:Attack, Watermarking and Attribution | Haonan Zhong et.al. | 2303.09272 | null |
2023-03-16 | SpectralCLIP: Preventing Artifacts in Text-Guided Style Transfer from a Spectral Perspective | Zipeng Xu et.al. | 2303.09270 | link |
2023-03-16 | StylerDALLE: Language-Guided Style Transfer Using a Vector-Quantized Tokenizer of a Large-Scale Generative Model | Zipeng Xu et.al. | 2303.09268 | link |
2023-03-20 | Generative Adversarial Network for Personalized Art Therapy in Melanoma Disease Management | Lennart Jütte et.al. | 2303.09232 | null |
2023-03-17 | NLUT: Neural-based 3D Lookup Tables for Video Photorealistic Style Transfer | Yaosen Chen et.al. | 2303.09170 | link |
2023-03-18 | Taming Diffusion Models for Audio-Driven Co-Speech Gesture Generation | Lingting Zhu et.al. | 2303.09119 | link |
2023-03-16 | Self-Consistent Learning: Cooperation between Generators and Discriminators | Tong Wu et.al. | 2303.09075 | null |
2023-03-16 | Exploring the Power of Generative Deep Learning for Image-to-Image Translation and MRI Reconstruction: A Cross-Domain Review | Yuda Bi et.al. | 2303.09012 | null |
2023-03-16 | Conditional Synthetic Food Image Generation | Wenjin Fu et.al. | 2303.09005 | null |
2023-03-15 | A parsimonious neural network approach to solve portfolio optimization problems without using dynamic programming | Pieter M. van Staden et.al. | 2303.08968 | null |
2023-03-15 | Class-Guided Image-to-Image Diffusion: Cell Painting from Brightfield Images with Class Labels | Jan Oscar Cross-Zamirski et.al. | 2303.08863 | null |
2023-03-15 | Zero-Shot Contrastive Loss for Text-Guided Diffusion Image Style Transfer | Serin Yang et.al. | 2303.08622 | null |
2023-03-15 | Investigating GANsformer: A Replication Study of a State-of-the-Art Image Generation Model | Giorgia Adorni et.al. | 2303.08577 | null |
2023-03-15 | MRGAN360: Multi-stage Recurrent Generative Adversarial Network for 360 Degree Image Saliency Prediction | Pan Gao et.al. | 2303.08525 | null |
2023-03-15 | Black-box Adversarial Example Attack towards FCG Based Android Malware Detection under Incomplete Feature Information | Heng Li et.al. | 2303.08509 | null |
2023-03-14 | Graph Transformer GANs for Graph-Constrained House Generation | Hao Tang et.al. | 2303.08225 | null |
2023-03-14 | Improving Prosody for Cross-Speaker Style Transfer by Semi-Supervised Style Extractor and Hierarchical Modeling in Speech Synthesis | Chunyu Qiang et.al. | 2303.07711 | null |
2023-03-14 | 3D Face Arbitrary Style Transfer | Xiangwen Deng et.al. | 2303.07709 | null |
2023-03-13 | AGTGAN: Unpaired Image Translation for Photographic Ancient Character Generation | Hongxiang Huang et.al. | 2303.07012 | link |
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2024-05-21 | Personalized Residuals for Concept-Driven Text-to-Image Generation | Cusuh Ham et.al. | 2405.12978 | null |
2024-05-21 | Face Adapter for Pre-Trained Diffusion Models with Fine-Grained ID and Attribute Control | Yue Han et.al. | 2405.12970 | null |
2024-05-21 | Impact of inhomogeneous diffusion on secondary cosmic ray and antiproton local spectra | Álvaro Tovar-Pardo et.al. | 2405.12918 | null |
2024-05-21 | Diffusion-RSCC: Diffusion Probabilistic Model for Change Captioning in Remote Sensing Images | Xiaofei Yu et.al. | 2405.12875 | link |
2024-05-21 | Model Free Prediction with Uncertainty Assessment | Yuling Jiao et.al. | 2405.12684 | null |
2024-05-21 | CustomText: Customized Textual Image Generation using Diffusion Models | Shubham Paliwal et.al. | 2405.12531 | null |
2024-05-21 | Customize Your Own Paired Data via Few-shot Way | Jinshu Chen et.al. | 2405.12490 | null |
2024-05-21 | One-step data-driven generative model via Schrödinger Bridge | Hanwen Huang et.al. | 2405.12453 | null |
2024-05-20 | Diffusion for World Modeling: Visual Details Matter in Atari | Eloi Alonso et.al. | 2405.12399 | link |
2024-05-20 | Images that Sound: Composing Images and Sounds on a Single Canvas | Ziyang Chen et.al. | 2405.12221 | null |
2024-05-20 | Slicedit: Zero-Shot Video Editing With Text-to-Image Diffusion Models Using Spatio-Temporal Slices | Nathaniel Cohen et.al. | 2405.12211 | null |
2024-05-20 | Nonequilbrium physics of generative diffusion models | Zhendong Yu et.al. | 2405.11932 | null |
2024-05-20 | "Set It Up!": Functional Object Arrangement with Compositional Generative Models | Yiqing Xu et.al. | 2405.11928 | null |
2024-05-20 | Diff-BGM: A Diffusion Model for Video Background Music Generation | Sizhe Li et.al. | 2405.11913 | null |
2024-05-20 | Out-of-Distribution Detection with a Single Unconditional Diffusion Model | Alvin Heng et.al. | 2405.11881 | link |
2024-05-20 | Evolving Storytelling: Benchmarks and Methods for New Character Customization with Diffusion Models | Xiyu Wang et.al. | 2405.11852 | null |
2024-05-20 | Alternators For Sequence Modeling | Mohammad Reza Rezaei et.al. | 2405.11848 | null |
2024-05-20 | ViViD: Video Virtual Try-on using Diffusion Models | Zixun Fang et.al. | 2405.11794 | null |
2024-05-20 | Guided Multi-objective Generative AI to Enhance Structure-based Drug Design | Amit Kadan et.al. | 2405.11785 | null |
2024-05-20 | Diffusion Models for Generating Ballistic Spacecraft Trajectories | Tyler Presser et.al. | 2405.11738 | null |
2024-05-19 | InterAct: Capture and Modelling of Realistic, Expressive and Interactive Activities between Two Persons in Daily Scenarios | Yinghao Huang et.al. | 2405.11690 | null |
2024-05-19 | Uncertainty-Aware PPG-2-ECG for Enhanced Cardiovascular Diagnosis using Diffusion Models | Omer Belhasin et.al. | 2405.11566 | null |
2024-05-19 | Diffusion-Based Hierarchical Image Steganography | Youmin Xu et.al. | 2405.11523 | null |
2024-05-19 | FIFO-Diffusion: Generating Infinite Videos from Text without Training | Jihwan Kim et.al. | 2405.11473 | null |
2024-05-19 | Discrete-state Continuous-time Diffusion for Graph Generation | Zhe Xu et.al. | 2405.11416 | null |
2024-05-18 | On the Trajectory Regularity of ODE-based Diffusion Sampling | Defang Chen et.al. | 2405.11326 | null |
2024-05-18 | Diffusion Model Driven Test-Time Image Adaptation for Robust Skin Lesion Classification | Ming Hu et.al. | 2405.11289 | null |
2024-05-18 | HR Human: Modeling Human Avatars with Triangular Mesh and High-Resolution Textures from Videos | Qifeng Chen et.al. | 2405.11270 | null |
2024-05-18 | AquaLoRA: Toward White-box Protection for Customized Stable Diffusion Models via Watermark LoRA | Weitao Feng et.al. | 2405.11135 | null |
2024-05-17 | Improving face generation quality and prompt following with synthetic captions | Michail Tarasiou et.al. | 2405.10864 | null |
2024-05-17 | Deep Data Consistency: a Fast and Robust Diffusion Model-based Solver for Inverse Problems | Hanyu Chen et.al. | 2405.10748 | link |
2024-05-17 | Numerical Recovery of the Diffusion Coefficient in Diffusion Equations from Terminal Measurement | Bangti Jin et.al. | 2405.10708 | null |
2024-05-17 | LoCI-DiffCom: Longitudinal Consistency-Informed Diffusion Model for 3D Infant Brain Image Completion | Zihao Zhu et.al. | 2405.10691 | null |
2024-05-17 | LighTDiff: Surgical Endoscopic Image Low-Light Enhancement with T-Diffusion | Tong Chen et.al. | 2405.10550 | link |
2024-05-17 | ART3D: 3D Gaussian Splatting for Text-Guided Artistic Scenes Generation | Pengzhi Li et.al. | 2405.10508 | null |
2024-05-16 | Text-to-Vector Generation with Neural Path Representation | Peiying Zhang et.al. | 2405.10317 | null |
2024-05-16 | Analogist: Out-of-the-box Visual In-Context Learning with Image Diffusion Model | Zheng Gu et.al. | 2405.10316 | null |
2024-05-16 | CAT3D: Create Anything in 3D with Multi-View Diffusion Models | Ruiqi Gao et.al. | 2405.10314 | null |
2024-05-16 | Generating Coherent Sequences of Visual Illustrations for Real-World Manual Tasks | João Bordalo et.al. | 2405.10122 | null |
2024-05-16 | Spurious reconstruction from brain activity | Ken Shirakawa et.al. | 2405.10078 | null |
2024-05-16 | Frequency-Domain Refinement with Multiscale Diffusion for Super Resolution | Xingjian Wang et.al. | 2405.10014 | null |
2024-05-16 | VirtualModel: Generating Object-ID-retentive Human-object Interaction Image by Diffusion Model for E-commerce Marketing | Binghui Chen et.al. | 2405.09985 | null |
2024-05-16 | Language-Oriented Semantic Latent Representation for Image Transmission | Giordano Cicchetti et.al. | 2405.09976 | link |
2024-05-16 | Whole-Song Hierarchical Generation of Symbolic Music Using Cascaded Diffusion Models | Ziyu Wang et.al. | 2405.09901 | link |
2024-05-16 | DiffAM: Diffusion-based Adversarial Makeup Transfer for Facial Privacy Protection | Yuhao Sun et.al. | 2405.09882 | link |
2024-05-16 | Dual3D: Efficient and Consistent Text-to-3D Generation with Dual-mode Multi-view Latent Diffusion | Xinyang Li et.al. | 2405.09874 | null |
2024-05-16 | Rethinking Multi-User Semantic Communications with Deep Generative Models | Eleonora Grassucci et.al. | 2405.09866 | null |
2024-05-16 | MediSyn: Text-Guided Diffusion Models for Broad Medical 2D and 3D Image Synthesis | Joseph Cho et.al. | 2405.09806 | null |
2024-05-15 | A Survey of Generative Techniques for Spatial-Temporal Data Mining | Qianru Zhang et.al. | 2405.09592 | null |
2024-05-16 | MMFusion: Multi-modality Diffusion Model for Lymph Node Metastasis Diagnosis in Esophageal Cancer | Chengyu Wu et.al. | 2405.09539 | link |
2024-05-15 | Diffusion-based Contrastive Learning for Sequential Recommendation | Ziqiang Cui et.al. | 2405.09369 | null |
2024-05-15 | Dance Any Beat: Blending Beats with Visuals in Dance Video Generation | Xuanchen Wang et.al. | 2405.09266 | null |
2024-05-15 | SOEDiff: Efficient Distillation for Small Object Editing | Qihe Pan et.al. | 2405.09114 | null |
2024-05-15 | RSHazeDiff: A Unified Fourier-aware Diffusion Model for Remote Sensing Image Dehazing | Jiamei Xiong et.al. | 2405.09083 | link |
2024-05-15 | Naturalistic Music Decoding from EEG Data via Latent Diffusion Models | Emilian Postolache et.al. | 2405.09062 | null |
2024-05-15 | Response Matching for generating materials and molecules | Bingqing Cheng et.al. | 2405.09057 | null |
2024-05-15 | CTS: A Consistency-Based Medical Image Segmentation Model | Kejia Zhang et.al. | 2405.09056 | null |
2024-05-14 | Expensive Multi-Objective Bayesian Optimization Based on Diffusion Models | Bingdong Li et.al. | 2405.08674 | null |
2024-05-14 | Towards Multi-Task Generative-AI Edge Services with an Attention-based Diffusion DRL Approach | Yaju Liu et.al. | 2405.08328 | null |
2024-05-14 | Compositional Text-to-Image Generation with Dense Blob Representations | Weili Nie et.al. | 2405.08246 | null |
2024-05-13 | Infinite Texture: Text-guided High Resolution Diffusion Texture Synthesis | Yifan Wang et.al. | 2405.08210 | null |
2024-05-13 | Do Bayesian imaging methods report trustworthy probabilities? | David Y. W. Thong et.al. | 2405.08179 | null |
2024-05-13 | DiffTF++: 3D-aware Diffusion Transformer for Large-Vocabulary 3D Generation | Ziang Cao et.al. | 2405.08055 | link |
2024-05-13 | Coin3D: Controllable and Interactive 3D Assets Generation with Proxy-Guided Conditioning | Wenqi Dong et.al. | 2405.08054 | null |
2024-05-11 | Diff-ETS: Learning a Diffusion Probabilistic Model for Electromyography-to-Speech Conversion | Zhao Ren et.al. | 2405.08021 | null |
2024-05-13 | Stable Diffusion-based Data Augmentation for Federated Learning with Non-IID Data | Mahdi Morafah et.al. | 2405.07925 | null |
2024-05-13 | CTRLorALTer: Conditional LoRAdapter for Efficient 0-Shot Control & Altering of T2I Models | Nick Stracke et.al. | 2405.07913 | null |
2024-05-13 | SAR Image Synthesis with Diffusion Models | Denisa Qosja et.al. | 2405.07776 | null |
2024-05-13 | CDFormer:When Degradation Prediction Embraces Diffusion Model for Blind Image Super-Resolution | Qingguo Liu et.al. | 2405.07648 | link |
2024-05-13 | De novo antibody design with SE(3) diffusion | Daniel Cutting et.al. | 2405.07622 | null |
2024-05-13 | Reducing Risk for Assistive Reinforcement Learning Policies with Diffusion Models | Andrii Tytarenko et.al. | 2405.07603 | null |
2024-05-13 | PeRFlow: Piecewise Rectified Flow as Universal Plug-and-Play Accelerator | Hanshu Yan et.al. | 2405.07510 | link |
2024-05-13 | GaussianVTON: 3D Human Virtual Try-ON via Multi-Stage Gaussian Splatting Editing with Image Prompting | Haodong Chen et.al. | 2405.07472 | null |
2024-05-12 | Erasing Concepts from Text-to-Image Diffusion Models with Few-shot Unlearning | Masane Fuchi et.al. | 2405.07288 | link |
2024-05-12 | Modeling Pedestrian Intrinsic Uncertainty for Multimodal Stochastic Trajectory Prediction via Energy Plan Denoising | Yao Liu et.al. | 2405.07164 | null |
2024-05-12 | Stable Signature is Unstable: Removing Image Watermark from Diffusion Models | Yuepeng Hu et.al. | 2405.07145 | null |
2024-05-11 | Diffusion models as probabilistic neural operators for recovering unobserved states of dynamical systems | Katsiaryna Haitsiukevich et.al. | 2405.07097 | null |
2024-05-11 | Semantic Guided Large Scale Factor Remote Sensing Image Super-resolution with Generative Diffusion Prior | Ce Wang et.al. | 2405.07044 | link |
2024-05-11 | Non-confusing Generation of Customized Concepts in Diffusion Models | Wang Lin et.al. | 2405.06914 | null |
2024-05-10 | Self-Consistent Recursive Diffusion Bridge for Medical Image Translation | Fuat Arslan et.al. | 2405.06789 | null |
2024-05-10 | Shape Conditioned Human Motion Generation with Diffusion Model | Kebing Xue et.al. | 2405.06778 | null |
2024-05-10 | OneTo3D: One Image to Re-editable Dynamic 3D Model and Video Generation | Jinwei Lin et.al. | 2405.06547 | link |
2024-05-14 | SketchDream: Sketch-based Text-to-3D Generation and Editing | Feng-Lin Liu et.al. | 2405.06461 | null |
2024-05-10 | PUMA: margin-based data pruning | Javier Maroto et.al. | 2405.06298 | null |
2024-05-10 | Prior-guided Diffusion Model for Cell Segmentation in Quantitative Phase Imaging | Zhuchen Shao et.al. | 2405.06175 | null |
2024-05-09 | Distilling Diffusion Models into Conditional GANs | Minguk Kang et.al. | 2405.05967 | null |
2024-05-09 | Self-Supervised Learning of Time Series Representation via Diffusion Process and Imputation-Interpolation-Forecasting Mask | Zineb Senane et.al. | 2405.05959 | link |
2024-05-09 | Frame Interpolation with Consecutive Brownian Bridge Diffusion | Zonglin Lyu et.al. | 2405.05953 | null |
2024-05-09 | Composable Part-Based Manipulation | Weiyu Liu et.al. | 2405.05876 | null |
2024-05-09 | Pre-trained Text-to-Image Diffusion Models Are Versatile Representation Learners for Control | Gunshi Gupta et.al. | 2405.05852 | link |
2024-05-09 | Could It Be Generated? Towards Practical Analysis of Memorization in Text-To-Image Diffusion Models | Zhe Ma et.al. | 2405.05846 | null |
2024-05-09 | MSDiff: Multi-Scale Diffusion Model for Ultra-Sparse View CT Reconstruction | Pinhuang Tan et.al. | 2405.05814 | null |
2024-05-10 | MasterWeaver: Taming Editability and Identity for Personalized Text-to-Image Generation | Yuxiang Wei et.al. | 2405.05806 | link |
2024-05-09 | DragGaussian: Enabling Drag-style Manipulation on 3D Gaussian Representation | Sitian Shen et.al. | 2405.05800 | null |
2024-05-09 | Sequential Amodal Segmentation via Cumulative Occlusion Learning | Jiayang Ao et.al. | 2405.05791 | null |
2024-05-09 | DP-MDM: Detail-Preserving MR Reconstruction via Multiple Diffusion Models | Mengxiao Geng et.al. | 2405.05763 | null |
2024-05-09 | LatentColorization: Latent Diffusion-Based Speaker Video Colorization | Rory Ward et.al. | 2405.05707 | null |
2024-05-09 | StableMoFusion: Towards Robust and Efficient Diffusion-based Motion Generation Framework | Yiheng Huang et.al. | 2405.05691 | null |
2024-05-09 | SubGDiff: A Subgraph Diffusion Model to Improve Molecular Representation Learning | Jiying Zhang et.al. | 2405.05665 | null |
2024-05-09 | AI in Your Toolbox: A Plugin for Generating Renderings from 3D Models | Mingming Wang et.al. | 2405.05627 | null |
2024-05-09 | Denoising Diffusion Delensing Delight: Reconstructing the Non-Gaussian CMB Lensing Potential with Diffusion Models | Thomas Flöss et.al. | 2405.05598 | link |
2024-05-09 | Vision-Language Modeling with Regularized Spatial Transformer Networks for All Weather Crosswind Landing of Aircraft | Debabrata Pal et.al. | 2405.05574 | null |
2024-05-09 | A Survey on Personalized Content Synthesis with Diffusion Models | Xulu Zhang et.al. | 2405.05538 | null |
2024-05-08 | Diffusion-HMC: Parameter Inference with Diffusion Model driven Hamiltonian Monte Carlo | Nayantara Mudur et.al. | 2405.05255 | link |
2024-05-08 | Attention-Driven Training-Free Efficiency Enhancement of Diffusion Models | Hongjie Wang et.al. | 2405.05252 | null |
2024-05-08 | Imagine Flash: Accelerating Emu Diffusion Models with Backward Distillation | Jonas Kohler et.al. | 2405.05224 | null |
2024-05-08 | FinePOSE: Fine-Grained Prompt-Driven 3D Human Pose Estimation via Diffusion Models | Jinglin Xu et.al. | 2405.05216 | link |
2024-05-08 | An anti-noise seismic inversion method based on diffusion model | Yingtian Liu et.al. | 2405.05026 | null |
2024-05-08 | Discrepancy-based Diffusion Models for Lesion Detection in Brain MRI | Keqiang Fan et.al. | 2405.04974 | null |
2024-05-08 | Empowering Wireless Networks with Artificial Intelligence Generated Graph | Jiacheng Wang et.al. | 2405.04907 | null |
2024-05-08 | Fast LiDAR Upsampling using Conditional Diffusion Models | Sander Elias Magnussen Helgesen et.al. | 2405.04889 | null |
2024-05-08 | FlexEControl: Flexible and Efficient Multimodal Control for Text-to-Image Generation | Xuehai He et.al. | 2405.04834 | null |
2024-05-08 | Variational Schrödinger Diffusion Models | Wei Deng et.al. | 2405.04795 | null |
2024-05-07 | Remote Diffusion | Kunal Sunil Kasodekar et.al. | 2405.04717 | null |
2024-05-07 | TexControl: Sketch-Based Two-Stage Fashion Image Generation Using Diffusion Model | Yongming Zhang et.al. | 2405.04675 | null |
2024-05-07 | Tactile-Augmented Radiance Fields | Yiming Dou et.al. | 2405.04534 | null |
2024-05-07 | Edit-Your-Motion: Space-Time Diffusion Decoupling Learning for Video Motion Editing | Yi Zuo et.al. | 2405.04496 | null |
2024-05-07 | CloudDiff: Super-resolution ensemble retrieval of cloud properties for all day using the generative diffusion model | Haixia Xiao et.al. | 2405.04483 | null |
2024-05-07 | Diff-IP2D: Diffusion-Based Hand-Object Interaction Prediction on Egocentric Videos | Junyi Ma et.al. | 2405.04370 | null |
2024-05-07 | Diffusion-driven GAN Inversion for Multi-Modal Face Image Generation | Jihyun Kim et.al. | 2405.04356 | null |
2024-05-08 | Inf-DiT: Upsampling Any-Resolution Image with Memory-Efficient Diffusion Transformer | Zhuoyi Yang et.al. | 2405.04312 | link |
2024-05-07 | BUDDy: Single-Channel Blind Unsupervised Dereverberation with Diffusion Models | Eloi Moliner et.al. | 2405.04272 | null |
2024-05-07 | Vidu: a Highly Consistent, Dynamic and Skilled Text-to-Video Generator with Diffusion Models | Fan Bao et.al. | 2405.04233 | null |
2024-05-07 | Simple Drop-in LoRA Conditioning on Attention Layers Will Improve Your Diffusion Model | Joo Young Choi et.al. | 2405.03958 | null |
2024-05-06 | MVDiff: Scalable and Flexible Multi-View Diffusion for 3D Object Reconstruction from Single-View | Emmanuelle Bourigault et.al. | 2405.03894 | null |
2024-05-06 | MoDiPO: text-to-motion alignment via AI-feedback-driven Direct Preference Optimization | Massimiliano Pappa et.al. | 2405.03803 | null |
2024-05-06 | Synthetic Data from Diffusion Models Improve Drug Discovery Prediction | Bing Hu et.al. | 2405.03799 | null |
2024-05-06 | GraphSL: An Open-Source Library for Graph Source Localization Approaches and Benchmark Datasets | Junxiang Wang et.al. | 2405.03724 | null |
2024-05-06 | Bridging discrete and continuous state spaces: Exploring the Ehrenfest process in time-continuous diffusion models | Ludwig Winkler et.al. | 2405.03549 | null |
2024-05-06 | CCDM: Continuous Conditional Diffusion Models for Image Generation | Xin Ding et.al. | 2405.03546 | link |
2024-05-06 | LGTM: Local-to-Global Text-Driven Human Motion Diffusion Model | Haowen Sun et.al. | 2405.03485 | link |
2024-05-06 | Exploring the Frontiers of Softmax: Provable Optimization, Applications in Diffusion Model, and Beyond | Jiuxiang Gu et.al. | 2405.03251 | null |
2024-05-06 | Hyperbolic Geometric Latent Diffusion Model for Graph Generation | Xingcheng Fu et.al. | 2405.03188 | null |
2024-05-06 | DeepMpMRI: Tensor-decomposition Regularized Learning for Fast and High-Fidelity Multi-Parametric Microstructural MR Imaging | Wenxin Fan et.al. | 2405.03159 | null |
2024-05-06 | Video Diffusion Models: A Survey | Andrew Melnik et.al. | 2405.03150 | null |
2024-05-06 | AniTalker: Animate Vivid and Diverse Talking Faces through Identity-Decoupled Facial Motion Encoding | Tao Liu et.al. | 2405.03121 | link |
2024-05-05 | Matten: Video Generation with Mamba-Attention | Yu Gao et.al. | 2405.03025 | null |
2024-05-05 | Exploring Text-based Realistic Building Facades Editing Applicaiton | Jing Wang et.al. | 2405.02967 | null |
2024-05-05 | Efficient Text-driven Motion Generation via Latent Consistency Training | Mengxian Hu et.al. | 2405.02791 | null |
2024-05-04 | DiffuseTrace: A Transparent and Flexible Watermarking Scheme for Latent Diffusion Model | Liangqi Lei et.al. | 2405.02696 | null |
2024-05-03 | Functional Imaging Constrained Diffusion for Brain PET Synthesis from Structural MRI | Minhui Yu et.al. | 2405.02504 | null |
2024-05-03 | Continuous Learned Primal Dual | Christina Runkel et.al. | 2405.02478 | null |
2024-05-03 | CogDPM: Diffusion Probabilistic Models via Cognitive Predictive Coding | Kaiyuan Chen et.al. | 2405.02384 | null |
2024-05-03 | DreamScene4D: Dynamic Multi-Object Scene Generation from Monocular Videos | Wen-Hsuan Chu et.al. | 2405.02280 | null |
2024-05-03 | Multi-grid reaction-diffusion master equation: applications to morphogen gradient modelling | Radek Erban et.al. | 2405.02117 | null |
2024-05-03 | DiffMap: Enhancing Map Segmentation with Map Prior Using Diffusion Model | Peijin Jia et.al. | 2405.02008 | null |
2024-05-03 | Defect Image Sample Generation With Diffusion Prior for Steel Surface Defect Recognition | Yichun Tai et.al. | 2405.01872 | null |
2024-05-03 | Creation of Novel Soft Robot Designs using Generative AI | Wee Kiat Chan et.al. | 2405.01824 | null |
2024-05-03 | Report on the AAPM Grand Challenge on deep generative modeling for learning medical image statistics | Rucha Deshpande et.al. | 2405.01822 | null |
2024-05-02 | Converting Anyone's Voice: End-to-End Expressive Voice Conversion with a Conditional Diffusion Model | Zongyang Du et.al. | 2405.01730 | null |
2024-05-02 | Long Tail Image Generation Through Feature Space Augmentation and Iterated Learning | Rafael Elberg et.al. | 2405.01705 | link |
2024-05-02 | LocInv: Localization-aware Inversion for Text-Guided Image Editing | Chuanming Tang et.al. | 2405.01496 | link |
2024-05-02 | Navigating Heterogeneity and Privacy in One-Shot Federated Learning with Diffusion Models | Matias Mendieta et.al. | 2405.01494 | null |
2024-05-02 | Statistical algorithms for low-frequency diffusion data: A PDE approach | Matteo Giordano et.al. | 2405.01372 | null |
2024-05-02 | DiffusionPipe: Training Large Diffusion Models with Efficient Pipelines | Ye Tian et.al. | 2405.01248 | null |
2024-05-02 | Automated Virtual Product Placement and Assessment in Images using Diffusion Models | Mohammad Mahmudul Alam et.al. | 2405.01130 | null |
2024-05-02 | Part-aware Shape Generation with Latent 3D Diffusion of Neural Voxel Fields | Yuhang Huang et.al. | 2405.00998 | null |
2024-05-02 | Generative manufacturing systems using diffusion models and ChatGPT | Xingyu Li et.al. | 2405.00958 | null |
2024-05-02 | EchoScene: Indoor Scene Generation via Information Echo over Scene Graph Diffusion | Guangyao Zhai et.al. | 2405.00915 | null |
2024-05-01 | SonicDiffusion: Audio-Driven Image Generation and Editing with Pretrained Diffusion Models | Burak Can Biner et.al. | 2405.00878 | null |
2024-05-01 | Guided Conditional Diffusion Classifier (ConDiff) for Enhanced Prediction of Infection in Diabetic Foot Ulcers | Palawat Busaranuvong et.al. | 2405.00858 | null |
2024-05-01 | ADM: Accelerated Diffusion Model via Estimated Priors for Robust Motion Prediction under Uncertainties | Jiahui Li et.al. | 2405.00797 | null |
2024-05-01 | Obtaining Favorable Layouts for Multiple Object Generation | Barak Battash et.al. | 2405.00791 | null |
2024-05-01 | Deep Reward Supervisions for Tuning Text-to-Image Diffusion Models | Xiaoshi Wu et.al. | 2405.00760 | null |
2024-05-01 | TexSliders: Diffusion-Based Texture Editing in CLIP Space | Julia Guerrero-Viu et.al. | 2405.00672 | null |
2024-05-01 | RGB | Zheng Zeng et.al. | 2405.00666 | null |
2024-05-01 | Deep Metric Learning-Based Out-of-Distribution Detection with Synthetic Outlier Exposure | Assefa Seyoum Wahd et.al. | 2405.00631 | null |
2024-05-01 | Lane Segmentation Refinement with Diffusion Models | Antonio Ruiz et.al. | 2405.00620 | null |
2024-05-01 | Pricing and delta computation in jump-diffusion models with stochastic intensity by Malliavin calculus | Ayub Ahmadi et.al. | 2405.00473 | null |
2024-05-01 | Lazy Layers to Make Fine-Tuned Diffusion Models More Traceable | Haozhe Liu et.al. | 2405.00466 | null |
2024-05-01 | Detail-Enhancing Framework for Reference-Based Image Super-Resolution | Zihan Wang et.al. | 2405.00431 | null |
2024-05-01 | Streamlining Image Editing with Layered Diffusion Brushes | Peyman Gholami et.al. | 2405.00313 | null |
2024-05-02 | An Unstructured Mesh Reaction-Drift-Diffusion Master Equation with Reversible Reactions | Samuel A. Isaacson et.al. | 2405.00283 | null |
2024-05-01 | ASAM: Boosting Segment Anything Model with Adversarial Tuning | Bo Li et.al. | 2405.00256 | link |
2024-04-30 | Semantically Consistent Video Inpainting with Conditional Diffusion Models | Dylan Green et.al. | 2405.00251 | null |
2024-04-30 | IgCONDA-PET: Implicitly-Guided Counterfactual Diffusion for Detecting Anomalies in PET Images | Shadab Ahamed et.al. | 2405.00239 | link |
2024-04-30 | SemantiCodec: An Ultra Low Bitrate Semantic Audio Codec for General Sound | Haohe Liu et.al. | 2405.00233 | null |
2024-04-30 | Target-Specific De Novo Peptide Binder Design with DiffPepBuilder | Fanhao Wang et.al. | 2405.00128 | null |
2024-04-30 | MotionLCM: Real-time Controllable Motion Generation via Latent Consistency Model | Wenxun Dai et.al. | 2404.19759 | null |
2024-04-30 | Invisible Stitch: Generating Smooth 3D Scenes with Depth Inpainting | Paul Engstler et.al. | 2404.19758 | null |
2024-04-30 | Mixed Continuous and Categorical Flow Matching for 3D De Novo Molecule Generation | Ian Dunn et.al. | 2404.19739 | link |
2024-04-30 | X-Diffusion: Generating Detailed 3D MRI Volumes From a Single Image Using Cross-Sectional Diffusion Models | Emmanuelle Bourigault et.al. | 2404.19604 | null |
2024-04-30 | MicroDreamer: Zero-shot 3D Generation in | Luxi Chen et.al. | 2404.19525 | link |
2024-04-30 | TwinDiffusion: Enhancing Coherence and Efficiency in Panoramic Image Generation with Diffusion Models | Teng Zhou et.al. | 2404.19475 | null |
2024-04-30 | Probing Unlearned Diffusion Models: A Transferable Adversarial Attack Perspective | Xiaoxuan Han et.al. | 2404.19382 | null |
2024-04-30 | Bridge to Non-Barrier Communication: Gloss-Prompted Fine-grained Cued Speech Gesture Generation with Diffusion Model | Wentao Lei et.al. | 2404.19277 | null |
2024-04-30 | DiffuseLoco: Real-Time Legged Locomotion Control with Diffusion from Offline Datasets | Xiaoyu Huang et.al. | 2404.19264 | null |
2024-04-30 | CONTUNER: Singing Voice Beautifying with Pitch and Expressiveness Condition | Jianzong Wang et.al. | 2404.19187 | null |
2024-04-29 | Stylus: Automatic Adapter Selection for Diffusion Models | Michael Luo et.al. | 2404.18928 | null |
2024-04-29 | TheaterGen: Character Management with LLM for Consistent Multi-turn Image Generation | Junhao Cheng et.al. | 2404.18919 | null |
2024-04-29 | Learning general Gaussian mixtures with efficient score matching | Sitan Chen et.al. | 2404.18893 | null |
2024-04-29 | A Survey on Diffusion Models for Time Series and Spatio-Temporal Data | Yiyuan Yang et.al. | 2404.18886 | link |
2024-04-29 | Learning Mixtures of Gaussians Using Diffusion Models | Khashayar Gatmiry et.al. | 2404.18869 | null |
2024-04-29 | Towards Extreme Image Compression with Latent Feature Guidance and Diffusion Prior | Zhiyuan Li et.al. | 2404.18820 | null |
2024-04-29 | Bootstrap 3D Reconstructed Scenes from 3D Gaussian Splatting | Yifei Gao et.al. | 2404.18669 | null |
2024-04-29 | FlexiFilm: Long Video Generation with Flexible Conditions | Yichen Ouyang et.al. | 2404.18620 | link |
2024-04-29 | Anywhere: A Multi-Agent Framework for Reliable and Diverse Foreground-Conditioned Image Inpainting | Tianyidan Xie et.al. | 2404.18598 | null |
2024-05-01 | U-Nets as Belief Propagation: Efficient Classification, Denoising, and Diffusion in Generative Hierarchical Models | Song Mei et.al. | 2404.18444 | null |
2024-04-28 | Fisher Information Improved Training-Free Conditional Diffusion Model | Kaiyu Song et.al. | 2404.18252 | null |
2024-04-28 | Paint by Inpaint: Learning to Add Image Objects by Removing Them First | Navve Wasserman et.al. | 2404.18212 | null |
2024-04-28 | Generative AI for Visualization: State of the Art and Future Directions | Yilin Ye et.al. | 2404.18144 | null |
2024-04-28 | Generative AI for Low-Carbon Artificial Intelligence of Things | Jinbo Wen et.al. | 2404.18077 | null |
2024-04-28 | Grounded Compositional and Diverse Text-to-3D with Pretrained Multi-View Diffusion Model | Xiaolong Li et.al. | 2404.18065 | null |
2024-04-28 | Exposing Text-Image Inconsistency Using Diffusion Models | Mingzhen Huang et.al. | 2404.18033 | null |
2024-04-30 | Control randomisation approach for policy gradient and application to reinforcement learning in optimal switching | Robert Denkert et.al. | 2404.17939 | null |
2024-04-27 | Unsupervised Anomaly Detection via Masked Diffusion Posterior Sampling | Di Wu et.al. | 2404.17900 | null |
2024-04-27 | DPER: Diffusion Prior Driven Neural Representation for Limited Angle and Sparse View CT Reconstruction | Chenhe Du et.al. | 2404.17890 | null |
2024-04-27 | Diffusion-Aided Joint Source Channel Coding For High Realism Wireless Image Transmission | Mingyu Yang et.al. | 2404.17736 | null |
2024-04-26 | MaPa: Text-driven Photorealistic Material Painting for 3D Shapes | Shangzhan Zhang et.al. | 2404.17569 | null |
2024-04-26 | Chemotaxis-inspired PDE model for airborne infectious disease transmission: analysis and simulations | Pierluigi Colli et.al. | 2404.17506 | null |
2024-04-26 | Multi-view Image Prompted Multi-view Diffusion for Improved 3D Generation | Seungwook Kim et.al. | 2404.17419 | null |
2024-04-29 | MV-VTON: Multi-View Virtual Try-On with Diffusion Models | Haoyu Wang et.al. | 2404.17364 | link |
2024-04-26 | Simultaneous Tri-Modal Medical Image Fusion and Super-Resolution using Conditional Diffusion Model | Yushen Xu et.al. | 2404.17357 | null |
2024-04-26 | Trinity Detector:text-assisted and attention mechanisms based spectral fusion for diffusion generation image detection | Jiawei Song et.al. | 2404.17254 | null |
2024-04-26 | Few-shot Calligraphy Style Learning | Fangda Chen et.al. | 2404.17199 | link |
2024-04-25 | CyNetDiff -- A Python Library for Accelerated Implementation of Network Diffusion Models | Eliot W. Robson et.al. | 2404.17059 | null |
2024-04-25 | Universal fragmentation in annihilation reactions with constrained kinetics | Enrique Rozas Garcia et.al. | 2404.16950 | null |
2024-04-25 | Inferring solid-state diffusivity in lithium-ion battery active materials: improving upon the classical GITT method | A. Emir Gumrukcuoglu et.al. | 2404.16658 | null |
2024-04-25 | MuseumMaker: Continual Style Customization without Catastrophic Forgetting | Chenxi Liu et.al. | 2404.16612 | null |
2024-04-29 | Conditional Distribution Modelling for Few-Shot Image Synthesis with Diffusion Models | Parul Gupta et.al. | 2404.16556 | null |
2024-04-25 | DiffSeg: A Segmentation Model for Skin Lesions Based on Diffusion Difference | Zhihao Shuai et.al. | 2404.16474 | null |
2024-04-25 | TI2V-Zero: Zero-Shot Image Conditioning for Text-to-Video Diffusion Models | Haomiao Ni et.al. | 2404.16306 | null |
2024-04-25 | CFMW: Cross-modality Fusion Mamba for Multispectral Object Detection under Adverse Weather Conditions | Haoyuan Li et.al. | 2404.16302 | link |
2024-04-25 | One Noise to Rule Them All: Learning a Unified Model of Spatially-Varying Noise Patterns | Arman Maesumi et.al. | 2404.16292 | null |
2024-04-24 | Editable Image Elements for Controllable Synthesis | Jiteng Mu et.al. | 2404.16029 | null |
2024-04-24 | RetinaRegNet: A Versatile Approach for Retinal Image Registration | Vishal Balaji Sivaraman et.al. | 2404.16017 | link |
2024-04-24 | MYCloth: Towards Intelligent and Interactive Online T-Shirt Customization based on User's Preference | Yexin Liu et.al. | 2404.15801 | null |
2024-04-24 | MotionMaster: Training-free Camera Motion Transfer For Video Generation | Teng Hu et.al. | 2404.15789 | null |
2024-04-24 | Unifying Bayesian Flow Networks and Diffusion Models through Stochastic Differential Equations | Kaiwen Xue et.al. | 2404.15766 | link |
2024-04-24 | DeepFeatureX Net: Deep Features eXtractors based Network for discriminating synthetic from real images | Orazio Pontorno et.al. | 2404.15697 | null |
2024-04-24 | Generative Diffusion Model (GDM) for Optimization of Wi-Fi Networks | Tie Liu et.al. | 2404.15684 | null |
2024-04-24 | AnoFPDM: Anomaly Segmentation with Forward Process of Diffusion Models for Brain MRI | Yiming Che et.al. | 2404.15683 | null |
2024-04-24 | CharacterFactory: Sampling Consistent Characters with GANs for Diffusion Models | Qinghe Wang et.al. | 2404.15677 | link |
2024-04-24 | Optimizing OOD Detection in Molecular Graphs: A Novel Approach with Diffusion Models | Xu Shen et.al. | 2404.15625 | null |
2024-04-26 | A Dynamic Kernel Prior Model for Unsupervised Blind Image Super-Resolution | Zhixiong Yang et.al. | 2404.15620 | link |
2024-04-23 | ID-Aligner: Enhancing Identity-Preserving Text-to-Image Generation with Reward Feedback Learning | Weifeng Chen et.al. | 2404.15449 | null |
2024-04-23 | GLoD: Composing Global Contexts and Local Details in Image Generation | Moyuru Yamada et.al. | 2404.15447 | null |
2024-04-23 | ControlTraj: Controllable Trajectory Generation with Topology-Constrained Diffusion Model | Yuanshao Zhu et.al. | 2404.15380 | null |
2024-04-23 | Heat flow, log-concavity, and Lipschitz transport maps | Giovanni Brigati et.al. | 2404.15205 | null |
2024-04-23 | CutDiffusion: A Simple, Fast, Cheap, and Strong Diffusion Extrapolation Method | Mingbao Lin et.al. | 2404.15141 | link |
2024-04-23 | Taming Diffusion Probabilistic Models for Character Control | Rui Chen et.al. | 2404.15121 | null |
2024-04-23 | Perturbing Attention Gives You More Bang for the Buck: Subtle Imaging Perturbations That Efficiently Fool Customized Diffusion Models | Jingyao Xu et.al. | 2404.15081 | null |
2024-04-23 | Music Style Transfer With Diffusion Model | Hong Huang et.al. | 2404.14771 | null |
2024-04-23 | Gradient Guidance for Diffusion Models: An Optimization Perspective | Yingqing Guo et.al. | 2404.14743 | null |
2024-04-25 | FlashSpeech: Efficient Zero-Shot Speech Synthesis | Zhen Ye et.al. | 2404.14700 | null |
2024-04-23 | DreamPBR: Text-driven Generation of High-resolution SVBRDF with Multi-modal Guidance | Linxuan Xin et.al. | 2404.14676 | null |
2024-04-22 | UVMap-ID: A Controllable and Personalized UV Map Generative Model | Weijie Wang et.al. | 2404.14568 | null |
2024-04-22 | Align Your Steps: Optimizing Sampling Schedules in Diffusion Models | Amirmojtaba Sabour et.al. | 2404.14507 | null |
2024-04-22 | Guess The Unseen: Dynamic 3D Scene Reconstruction from Partial 2D Glimpses | Inhee Lee et.al. | 2404.14410 | null |
2024-04-22 | GeoDiffuser: Geometry-Based Image Editing with Diffusion Models | Rahul Sajnani et.al. | 2404.14403 | null |
2024-04-22 | TAVGBench: Benchmarking Text to Audible-Video Generation | Yuxin Mao et.al. | 2404.14381 | link |
2024-04-22 | Full Event Particle-Level Unfolding with Variable-Length Latent Variational Diffusion | Alexander Shmakov et.al. | 2404.14332 | null |
2024-04-22 | X-Ray: A Sequential 3D Representation for Generation | Tao Hu et.al. | 2404.14329 | null |
2024-04-22 | Collaborative Filtering Based on Diffusion Models: Unveiling the Potential of High-Order Connectivity | Yu Hou et.al. | 2404.14240 | null |
2024-04-22 | MultiBooth: Towards Generating All Your Concepts in an Image from Text | Chenyang Zhu et.al. | 2404.14239 | link |
2024-04-22 | Face2Face: Label-driven Facial Retouching Restoration | Guanhua Zhao et.al. | 2404.14177 | null |
2024-04-22 | FLDM-VTON: Faithful Latent Diffusion Model for Virtual Try-on | Chenhui Wang et.al. | 2404.14162 | null |
2024-04-22 | Generative Artificial Intelligence Assisted Wireless Sensing: Human Flow Detection in Practical Communication Environments | Jiacheng Wang et.al. | 2404.14140 | null |
2024-04-23 | RingID: Rethinking Tree-Ring Watermarking for Enhanced Multi-Key Identification | Hai Ci et.al. | 2404.14055 | null |
2024-04-22 | RHanDS: Refining Malformed Hands for Generated Images with Decoupled Structure and Style Guidance | Chengrui Wang et.al. | 2404.13984 | null |
2024-04-22 | MaterialSeg3D: Segmenting Dense Materials from 2D Priors for 3D Assets | Zeyu Li et.al. | 2404.13923 | null |
2024-04-23 | Accelerating Image Generation with Sub-path Linear Approximation Model | Chen Xu et.al. | 2404.13903 | null |
2024-04-22 | Towards Better Text-to-Image Generation Alignment via Attention Modulation | Yihang Wu et.al. | 2404.13899 | null |
2024-04-23 | Decoherence of a charged Brownian particle in a magnetic field : an analysis of the roles of coupling via position and momentum variables | Suraka Bhattacharjee et.al. | 2404.13883 | null |
2024-04-21 | Universal Fingerprint Generation: Controllable Diffusion Model with Multimodal Conditions | Steven A. Grosz et.al. | 2404.13791 | null |
2024-04-21 | Object-Attribute Binding in Text-to-Image Generation: Evaluation and Control | Maria Mihaela Trusca et.al. | 2404.13766 | null |
2024-04-21 | A Splice Method for Local-to-Nonlocal Coupling of Weak Forms | Shuai Jiang et.al. | 2404.13744 | null |
2024-04-21 | Concept Arithmetics for Circumventing Concept Inhibition in Diffusion Models | Vitali Petsiuk et.al. | 2404.13706 | null |
2024-04-19 | Analysis of Classifier-Free Guidance Weight Schedulers | Xi Wang et.al. | 2404.13040 | null |
2024-04-19 | RadRotator: 3D Rotation of Radiographs with Diffusion Models | Pouria Rouzrokh et.al. | 2404.13000 | null |
2024-04-19 | Cross-modal Diffusion Modelling for Super-resolved Spatial Transcriptomics | Xiaofei Wang et.al. | 2404.12973 | null |
2024-04-19 | Neural Flow Diffusion Models: Learnable Forward Process for Improved Diffusion Modelling | Grigory Bartosh et.al. | 2404.12940 | null |
2024-04-19 | Zero-Shot Medical Phrase Grounding with Off-the-shelf Diffusion Models | Konstantinos Vilouras et.al. | 2404.12920 | null |
2024-04-19 | Robust CLIP-Based Detector for Exposing Diffusion Model-Generated Images | Santosh et.al. | 2404.12908 | link |
2024-04-19 | ConCLVD: Controllable Chinese Landscape Video Generation via Diffusion Model | Dingming Liu et.al. | 2404.12903 | null |
2024-04-19 | Training-and-prompt-free General Painterly Harmonization Using Image-wise Attention Sharing | Teng-Fang Hsiao et.al. | 2404.12900 | link |
2024-04-19 | MCM: Multi-condition Motion Synthesis Framework | Zeyu Ling et.al. | 2404.12886 | null |
2024-04-19 | Detecting Out-Of-Distribution Earth Observation Images with Diffusion Models | Georges Le Bellier et.al. | 2404.12667 | null |
2024-04-19 | F2FLDM: Latent Diffusion Models with Histopathology Pre-Trained Embeddings for Unpaired Frozen Section to FFPE Translation | Man M. Ho et.al. | 2404.12650 | null |
2024-04-19 | Dragtraffic: A Non-Expert Interactive and Point-Based Controllable Traffic Scene Generation Framework | Sheng Wang et.al. | 2404.12624 | null |
2024-04-19 | Rethinking Clothes Changing Person ReID: Conflicts, Synthesis, and Optimization | Junjie Li et.al. | 2404.12611 | null |
2024-04-18 | GenVideo: One-shot Target-image and Shape Aware Video Editing using T2I Diffusion Models | Sai Sree Harsha et.al. | 2404.12541 | null |
2024-04-18 | G-HOP: Generative Hand-Object Prior for Interaction Reconstruction and Grasp Synthesis | Yufei Ye et.al. | 2404.12383 | null |
2024-04-18 | Learning the Domain Specific Inverse NUFFT for Accelerated Spiral MRI using Diffusion Models | Trevor J. Chan et.al. | 2404.12361 | null |
2024-04-18 | AniClipart: Clipart Animation with Text-to-Video Priors | Ronghuan Wu et.al. | 2404.12347 | null |
2024-04-18 | Guided Discrete Diffusion for Electronic Health Record Generation | Zixiang Chen et.al. | 2404.12314 | null |
2024-04-18 | StyleBooth: Image Style Editing with Multimodal Instruction | Zhen Han et.al. | 2404.12154 | link |
2024-04-18 | LD-Pruner: Efficient Pruning of Latent Diffusion Models using Task-Agnostic Insights | Thibault Castells et.al. | 2404.11936 | null |
2024-04-18 | FreeDiff: Progressive Frequency Truncation for Image Editing with Diffusion Models | Wei Wu et.al. | 2404.11895 | null |
2024-04-17 | Prompt-Driven Feature Diffusion for Open-World Semi-Supervised Learning | Marzi Heidari et.al. | 2404.11795 | null |
2024-04-17 | Diffusion Schrödinger Bridge Models for High-Quality MR-to-CT Synthesis for Head and Neck Proton Treatment Planning | Muheng Li et.al. | 2404.11741 | null |
2024-04-17 | Factorized Diffusion: Perceptual Illusions by Noise Decomposition | Daniel Geng et.al. | 2404.11615 | null |
2024-04-17 | IntrinsicAnything: Learning Diffusion Priors for Inverse Rendering Under Unknown Illumination | Xi Chen et.al. | 2404.11593 | null |
2024-04-17 | Prompt Optimizer of Text-to-Image Diffusion Models for Abstract Concept Understanding | Zezhong Fan et.al. | 2404.11589 | null |
2024-04-17 | MoA: Mixture-of-Attention for Subject-Context Disentanglement in Personalized Image Generation | Kuan-Chieh et.al. | 2404.11565 | null |
2024-04-17 | Predicting Long-horizon Futures by Conditioning on Geometry and Time | Tarasha Khurana et.al. | 2404.11554 | null |
2024-04-17 | SSDiff: Spatial-spectral Integrated Diffusion Model for Remote Sensing Pansharpening | Yu Zhong et.al. | 2404.11537 | null |
2024-04-17 | Towards Highly Realistic Artistic Style Transfer via Stable Diffusion with Step-aware and Layer-aware Prompt | Zhanjie Zhang et.al. | 2404.11474 | link |
2024-04-17 | Closely Interactive Human Reconstruction with Proxemics and Physics-Guided Adaption | Buzhen Huang et.al. | 2404.11291 | link |
2024-04-17 | Optical Image-to-Image Translation Using Denoising Diffusion Models: Heterogeneous Change Detection as a Use Case | João Gabriel Vinholi et.al. | 2404.11243 | null |
2024-04-17 | RiboDiffusion: Tertiary Structure-based RNA Inverse Folding with Generative Diffusion Models | Han Huang et.al. | 2404.11199 | link |
2024-04-19 | LAPTOP-Diff: Layer Pruning and Normalized Distillation for Compressing Diffusion Models | Dingkun Zhang et.al. | 2404.11098 | null |
2024-04-16 | Molecular relaxation by reverse diffusion with time step prediction | Khaled Kahouli et.al. | 2404.10935 | link |
2024-04-16 | RefFusion: Reference Adapted Diffusion Models for 3D Scene Inpainting | Ashkan Mirzaei et.al. | 2404.10765 | null |
2024-04-16 | LaDiC: Are Diffusion Models Really Inferior to Autoregressive Counterparts for Image-to-Text Generation? | Yuchi Wang et.al. | 2404.10763 | link |
2024-04-16 | GazeHTA: End-to-end Gaze Target Detection with Head-Target Association | Zhi-Yi Lin et.al. | 2404.10718 | null |
2024-04-16 | Efficient Conditional Diffusion Model with Probability Flow Sampling for Image Super-resolution | Yutao Yuan et.al. | 2404.10688 | link |
2024-04-16 | Generating Human Interaction Motions in Scenes with Text Control | Hongwei Yi et.al. | 2404.10685 | null |
2024-04-16 | StyleCity: Large-Scale 3D Urban Scenes Stylization with Vision-and-Text Reference via Progressive Optimization | Yingshu Chen et.al. | 2404.10681 | null |
2024-04-18 | Continual Offline Reinforcement Learning via Diffusion-based Dual Generative Replay | Jinmei Liu et.al. | 2404.10662 | link |
2024-04-16 | Enhancing 3D Fidelity of Text-to-3D using Cross-View Correspondences | Seungwook Kim et.al. | 2404.10603 | null |
2024-04-17 | Do Counterfactual Examples Complicate Adversarial Training? | Eric Yeats et.al. | 2404.10588 | null |
2024-04-17 | AAVDiff: Experimental Validation of Enhanced Viability and Diversity in Recombinant Adeno-Associated Virus (AAV) Capsids through Diffusion Generation | Lijun Liu et.al. | 2404.10573 | null |
2024-04-16 | A bridge between spatial and first-passage properties of continuous and discrete time stochastic processes: from hard walls to absorbing boundary conditions | Mathis Guéneau et.al. | 2404.10537 | null |
2024-04-16 | Four-hour thunderstorm nowcasting using deep diffusion models of satellite | Kuai Dai et.al. | 2404.10512 | null |
2024-04-16 | SparseDM: Toward Sparse Efficient Diffusion Models | Kafeng Wang et.al. | 2404.10445 | null |
2024-04-16 | Portrait3D: Text-Guided High-Quality 3D Portrait Generation Using Pyramid Representation and GANs Prior | Yiqian Wu et.al. | 2404.10394 | null |
2024-04-16 | Generating Counterfactual Trajectories with Latent Diffusion Models for Concept Discovery | Payal Varshney et.al. | 2404.10356 | null |
2024-04-16 | Efficiently Adversarial Examples Generation for Visual-Language Models under Targeted Transfer Scenarios using Diffusion Models | Qi Guo et.al. | 2404.10335 | null |
2024-04-17 | OmniSSR: Zero-shot Omnidirectional Image Super-Resolution using Stable Diffusion Model | Runyi Li et.al. | 2404.10312 | null |
2024-04-16 | EucliDreamer: Fast and High-Quality Texturing for 3D Models with Depth-Conditioned Stable Diffusion | Cindy Le et.al. | 2404.10279 | null |
2024-04-16 | OneActor: Consistent Character Generation via Cluster-Conditioned Guidance | Jiahao Wang et.al. | 2404.10267 | null |
2024-04-16 | Diffusion assisted image reconstruction in optoacoustic tomography | M. G. González et.al. | 2404.10239 | null |
2024-04-15 | Equipping Diffusion Models with Differentiable Spatial Entropy for Low-Light Image Enhancement | Wenyi Lian et.al. | 2404.09735 | link |
2024-04-15 | Photo-Realistic Image Restoration in the Wild with Controlled Vision-Language Models | Ziwei Luo et.al. | 2404.09732 | link |
2024-04-15 | All-in-one simulation-based inference | Manuel Gloeckler et.al. | 2404.09636 | link |
2024-04-15 | TMPQ-DM: Joint Timestep Reduction and Quantization Precision Selection for Efficient Diffusion Models | Haojun Sun et.al. | 2404.09532 | null |
2024-04-15 | Magic Clothing: Controllable Garment-Driven Image Synthesis | Weifeng Chen et.al. | 2404.09512 | link |
2024-04-15 | PhyScene: Physically Interactable 3D Scene Synthesis for Embodied AI | Yandan Yang et.al. | 2404.09465 | null |
2024-04-15 | Watermark-embedded Adversarial Examples for Copyright Protection against Diffusion Models | Peifei Zhu et.al. | 2404.09401 | null |
2024-04-14 | Fault Detection in Mobile Networks Using Diffusion Models | Mohamad Nabeel et.al. | 2404.09240 | null |
2024-04-14 | DreamScape: 3D Scene Creation via Gaussian Splatting joint Correlation Modeling | Xuening Yuan et.al. | 2404.09227 | null |
2024-04-16 | LoopAnimate: Loopable Salient Object Animation | Fanyi Wang et.al. | 2404.09172 | null |
2024-04-14 | RF-Diffusion: Radio Signal Generation via Time-Frequency Diffusion | Guoxuan Chi et.al. | 2404.09140 | link |
2024-04-13 | Rethinking Iterative Stereo Matching from Diffusion Bridge Model Perspective | Yuguang Shi et.al. | 2404.09051 | null |
2024-04-13 | Theoretical research on generative diffusion models: an overview | Melike Nur Yeğin et.al. | 2404.09016 | null |
2024-04-13 | Multimodal Cross-Document Event Coreference Resolution Using Linear Semantic Transfer and Mixed-Modality Ensembles | Abhijnan Nath et.al. | 2404.08949 | link |
2024-04-13 | Enforcing Paraphrase Generation via Controllable Latent Diffusion | Wei Zou et.al. | 2404.08938 | link |
2024-04-13 | Diffusion Models Meet Remote Sensing: Principles, Methods, and Perspectives | Yidan Liu et.al. | 2404.08926 | null |
2024-04-13 | ChangeAnywhere: Sample Generation for Remote Sensing Change Detection via Semantic Latent Diffusion Model | Kai Tang et.al. | 2404.08892 | null |
2024-04-12 | Semantic Approach to Quantifying the Consistency of Diffusion Model Image Generation | Brinnae Bent et.al. | 2404.08799 | null |
2024-04-12 | Diffusion-Based Joint Temperature and Precipitation Emulation of Earth System Models | Katie Christensen et.al. | 2404.08797 | null |
2024-04-12 | Lossy Image Compression with Foundation Diffusion Models | Lucas Relic et.al. | 2404.08580 | null |
2024-04-12 | PiRD: Physics-informed Residual Diffusion for Flow Field Reconstruction | Siming Shan et.al. | 2404.08412 | null |
2024-04-12 | Struggle with Adversarial Defense? Try Diffusion | Yujie Li et.al. | 2404.08273 | null |
2024-04-12 | Balanced Mixed-Type Tabular Data Synthesis with Diffusion Models | Zeyu Yang et.al. | 2404.08254 | null |
2024-04-12 | Interest Maximization in Social Networks | Rahul Kumar Gautam et.al. | 2404.08236 | null |
2024-04-11 | ControlNet++: Improving Conditional Controls with Efficient Consistency Feedback | Ming Li et.al. | 2404.07987 | null |
2024-04-11 | Taming Stable Diffusion for Text to 360° Panorama Image Generation | Cheng Zhang et.al. | 2404.07949 | link |
2024-04-11 | Adaptive Hyperbolic-cross-space Mapped Jacobi Method on Unbounded Domains with Applications to Solving Multidimensional Spatiotemporal Integrodifferential Equations | Yunhong Deng et.al. | 2404.07844 | null |
2024-04-11 | ConsistencyDet: Robust Object Detector with Denoising Paradigm of Consistency Model | Lifan Jiang et.al. | 2404.07773 | null |
2024-04-11 | An Overview of Diffusion Models: Applications, Guided Generation, Statistical Rates and Optimization | Minshuo Chen et.al. | 2404.07771 | null |
2024-04-11 | Joint Conditional Diffusion Model for Image Restoration with Mixed Degradations | Yufeng Yue et.al. | 2404.07770 | null |
2024-04-11 | Diffusing in Someone Else's Shoes: Robotic Perspective Taking with Diffusion | Josua Spisak et.al. | 2404.07735 | null |
2024-04-11 | Applying Guidance in a Limited Interval Improves Sample and Distribution Quality in Diffusion Models | Tuomas Kynkäänniemi et.al. | 2404.07724 | null |
2024-04-11 | Implicit and Explicit Language Guidance for Diffusion-based Visual Perception | Hefeng Wang et.al. | 2404.07600 | null |
2024-04-11 | ObjBlur: A Curriculum Learning Approach With Progressive Object-Level Blurring for Improved Layout-to-Image Generation | Stanislav Frolov et.al. | 2404.07564 | null |
2024-04-11 | Effects of phase separation on extinction times in population models | Janik Schüttler et.al. | 2404.07563 | null |
2024-04-11 | CAT: Contrastive Adapter Training for Personalized Image Generation | Jae Wan Park et.al. | 2404.07554 | link |
2024-04-10 | Object-Conditioned Energy-Based Attention Map Alignment in Text-to-Image Diffusion Models | Yasi Zhang et.al. | 2404.07389 | null |
2024-04-10 | GoodDrag: Towards Good Practices for Drag Editing with Diffusion Models | Zewei Zhang et.al. | 2404.07206 | null |
2024-04-10 | RealmDreamer: Text-Driven 3D Scene Generation with Inpainting and Depth Diffusion | Jaidev Shriram et.al. | 2404.07199 | null |
2024-04-10 | InstantMesh: Efficient 3D Mesh Generation from a Single Image with Sparse-view Large Reconstruction Models | Jiale Xu et.al. | 2404.07191 | link |
2024-04-10 | Move Anything with Layered Scene Diffusion | Jiawei Ren et.al. | 2404.07178 | null |
2024-04-10 | Diffusion-based inpainting of incomplete Euclidean distance matrices of trajectories generated by a fractional Brownian motion | Alexander Lobashev et.al. | 2404.07029 | link |
2024-04-10 | DreamScene360: Unconstrained Text-to-3D Scene Generation with Panoramic Gaussian Splatting | Shijie Zhou et.al. | 2404.06903 | null |
2024-04-10 | Fine color guidance in diffusion models and its application to image compression at extremely low bitrates | Tom Bordin et.al. | 2404.06865 | null |
2024-04-10 | UDiFF: Generating Conditional Unsigned Distance Fields with Optimal Wavelet Diffusion | Junsheng Zhou et.al. | 2404.06851 | null |
2024-04-10 | Tuning-Free Adaptive Style Incorporation for Structure-Consistent Text-Driven Style Transfer | Yanqi Ge et.al. | 2404.06835 | null |
2024-04-10 | Zero-shot Point Cloud Completion Via 2D Priors | Tianxin Huang et.al. | 2404.06814 | null |
2024-04-10 | Urban Architect: Steerable 3D Urban Scene Generation with Layout Prior | Fan Lu et.al. | 2404.06780 | null |
2024-04-10 | DiffusionDialog: A Diffusion Model for Diverse Dialog Generation with Latent Space | Jianxiang Xiang et.al. | 2404.06760 | null |
2024-04-11 | Disguised Copyright Infringement of Latent Diffusion Models | Yiwei Lu et.al. | 2404.06737 | null |
2024-04-10 | Efficient Denoising using Score Embedding in Score-based Diffusion Models | Andrew S. Na et.al. | 2404.06661 | null |
2024-04-09 | Training-Free Open-Vocabulary Segmentation with Offline Diffusion-Augmented Prototype Generation | Luca Barsellotti et.al. | 2404.06542 | null |
2024-04-09 | GeoDirDock: Guiding Docking Along Geodesic Paths | Raúl Miñán et.al. | 2404.06481 | null |
2024-04-09 | Magic-Boost: Boost 3D Generation with Mutli-View Conditioned Diffusion | Fan Yang et.al. | 2404.06429 | null |
2024-04-09 | ZeST: Zero-Shot Material Transfer from a Single Image | Ta-Ying Cheng et.al. | 2404.06425 | null |
2024-04-09 | Policy-Guided Diffusion | Matthew Thomas Jackson et.al. | 2404.06356 | link |
2024-04-09 | Quantum State Generation with Structure-Preserving Diffusion Model | Yuchen Zhu et.al. | 2404.06336 | null |
2024-04-09 | DiffHarmony: Latent Diffusion Model Meets Image Harmonization | Pengfei Zhou et.al. | 2404.06139 | null |
2024-04-09 | Hash3D: Training-free Acceleration for 3D Generation | Xingyi Yang et.al. | 2404.06091 | link |
2024-04-09 | Diffusion-Based Point Cloud Super-Resolution for mmWave Radar Data | Kai Luan et.al. | 2404.06012 | null |
2024-04-09 | Tackling Structural Hallucination in Image Translation with Local Diffusion | Seunghoi Kim et.al. | 2404.05980 | null |
2024-04-09 | Map Optical Properties to Subwavelength Structures Directly via a Diffusion Model | Shijie Rao et.al. | 2404.05959 | null |
2024-04-08 | MoMA: Multimodal LLM Adapter for Fast Personalized Image Generation | Kunpeng Song et.al. | 2404.05674 | null |
2024-04-08 | YaART: Yet Another ART Rendering Technology | Sergey Kastryulin et.al. | 2404.05666 | null |
2024-04-08 | BinaryDM: Towards Accurate Binarization of Diffusion Model | Xingyu Zheng et.al. | 2404.05662 | link |
2024-04-08 | Resistive Memory-based Neural Differential Equation Solver for Score-based Diffusion Model | Jichang Yang et.al. | 2404.05648 | null |
2024-04-08 | Learning a Category-level Object Pose Estimator without Pose Annotations | Fengrui Tian et.al. | 2404.05626 | null |
2024-04-08 | UniFL: Improve Stable Diffusion via Unified Feedback Learning | Jiacheng Zhang et.al. | 2404.05595 | null |
2024-04-08 | Investigating the Effectiveness of Cross-Attention to Unlock Zero-Shot Editing of Text-to-Video Diffusion Models | Saman Motamed et.al. | 2404.05519 | null |
2024-04-08 | Taming Transformers for Realistic Lidar Point Cloud Generation | Hamed Haghighi et.al. | 2404.05505 | link |
2024-04-08 | Rethinking the Spatial Inconsistency in Classifier-Free Diffusion Guidance | Dazhong Shen et.al. | 2404.05384 | link |
2024-04-08 | Mask-ControlNet: Higher-Quality Image Generation with An Additional Mask Prompt | Zhiqi Huang et.al. | 2404.05331 | null |
2024-04-08 | Text-to-Image Synthesis for Any Artistic Styles: Advancements in Personalized Artistic Image Generation via Subdivision and Dual Binding | Junseo Park et.al. | 2404.05256 | null |
2024-04-08 | DiffCJK: Conditional Diffusion Model for High-Quality and Wide-coverage CJK Character Generation | Yingtao Tian et.al. | 2404.05212 | null |
2024-04-07 | Context-dependent Causality (the Non-Monotonic Case) | Nir Billfeld et.al. | 2404.05021 | null |
2024-04-07 | Generative downscaling of PDE solvers with physics-guided diffusion models | Yulong Lu et.al. | 2404.05009 | null |
2024-04-07 | Gaussian Shading: Provable Performance-Lossless Image Watermarking for Diffusion Models | Zijin Yang et.al. | 2404.04956 | null |
2024-04-07 | Regularized Conditional Diffusion Model for Multi-Task Preference Alignment | Xudong Yu et.al. | 2404.04920 | null |
2024-04-07 | Correcting Diffusion-Based Perceptual Image Compression with Privileged End-to-End Decoder | Yiyang Ma et.al. | 2404.04916 | null |
2024-04-07 | ShoeModel: Learning to Wear on the User-specified Shoes via Diffusion Model | Binghui Chen et.al. | 2404.04833 | null |
2024-04-07 | Light the Night: A Multi-Condition Diffusion Framework for Unpaired Low-Light Enhancement in Autonomous Driving | Jinlong Li et.al. | 2404.04804 | null |
2024-04-07 | Rethinking Diffusion Model for Multi-Contrast MRI Super-Resolution | Guangyuan Li et.al. | 2404.04785 | link |
2024-04-05 | Identity Decoupling for Multi-Subject Personalization of Text-to-Image Models | Sangwon Jang et.al. | 2404.04243 | null |
2024-04-05 | ToolEENet: Tool Affordance 6D Pose Estimation | Yunlong Wang et.al. | 2404.04193 | null |
2024-04-05 | Dynamic Prompt Optimizing for Text-to-Image Generation | Wenyi Mo et.al. | 2404.04095 | link |
2024-04-05 | Score identity Distillation: Exponentially Fast Distillation of Pretrained Diffusion Models for One-Step Generation | Mingyuan Zhou et.al. | 2404.04057 | null |
2024-04-05 | Concept Weaver: Enabling Multi-Concept Fusion in Text-to-Image Models | Gihyun Kwon et.al. | 2404.03913 | null |
2024-04-04 | Bi-level Guided Diffusion Models for Zero-Shot Medical Imaging Inverse Problems | Hossein Askari et.al. | 2404.03706 | null |
2024-04-04 | Mitigating analytical variability in fMRI results with style transfer | Elodie Germani et.al. | 2404.03703 | null |
2024-04-04 | MVD-Fusion: Single-view 3D via Depth-consistent Multi-view Generation | Hanzhe Hu et.al. | 2404.03656 | null |
2024-04-04 | CoMat: Aligning Text-to-Image Diffusion Model with Image-to-Text Concept Matching | Dongzhi Jiang et.al. | 2404.03653 | link |
2024-04-04 | The More You See in 2D, the More You Perceive in 3D | Xinyang Han et.al. | 2404.03652 | null |
2024-04-04 | DiffBody: Human Body Restoration by Imagining with Generative Diffusion Prior | Yiming Zhang et.al. | 2404.03642 | null |
2024-04-04 | LCM-Lookahead for Encoder-based Text-to-Image Personalization | Rinon Gal et.al. | 2404.03620 | null |
2024-04-04 | DiffDet4SAR: Diffusion-based Aircraft Target Detection Network for SAR Images | Zhou Jie et.al. | 2404.03595 | link |
2024-04-04 | PointInfinity: Resolution-Invariant Point Diffusion Models | Zixuan Huang et.al. | 2404.03566 | null |
2024-04-04 | Segmentation-Guided Knee Radiograph Generation using Conditional Diffusion Models | Siyuan Mei et.al. | 2404.03541 | null |
2024-04-04 | A Directional Diffusion Graph Transformer for Recommendation | Zixuan Yi et.al. | 2404.03326 | null |
2024-04-04 | SiloFuse: Cross-silo Synthetic Data Generation with Latent Tabular Diffusion Models | Aditya Shankar et.al. | 2404.03299 | null |
2024-04-04 | Future-Proofing Class Incremental Learning | Quentin Jodelet et.al. | 2404.03200 | null |
2024-04-04 | HandDiff: 3D Hand Pose Estimation with Diffusion on Image-Point Cloud | Wencan Cheng et.al. | 2404.03159 | link |
2024-04-04 | DreamWalk: Style Space Exploration using Diffusion Guidance | Michelle Shu et.al. | 2404.03145 | null |
2024-04-04 | Diverse and Tailored Image Generation for Zero-shot Multi-label Classification | Kaixin Zhang et.al. | 2404.03144 | null |
2024-04-04 | The Diffusive Ultrasound Modulated Bioluminescence Tomography with Partial Data and Uncertain Optical Parameters | Tianyu Yang et.al. | 2404.03124 | null |
2024-04-03 | Many-to-many Image Generation with Auto-regressive Diffusion Models | Ying Shen et.al. | 2404.03109 | null |
2024-04-03 | Computing macroscopic reaction rates in reaction-diffusion systems using Monte Carlo simulations | Mohamed Swailem et.al. | 2404.03089 | null |
2024-04-03 | ASAP: Interpretable Analysis and Summarization of AI-generated Image Patterns at Scale | Jinbin Huang et.al. | 2404.02990 | null |
2024-04-03 | Deep Generative Models through the Lens of the Manifold Hypothesis: A Survey and New Connections | Gabriel Loaiza-Ganem et.al. | 2404.02954 | null |
2024-04-03 | LidarDM: Generative LiDAR Simulation in a Generated World | Vlas Zyrianov et.al. | 2404.02903 | null |
2024-04-03 | Fast Diffusion Model For Seismic Data Noise Attenuation | Junheng Peng et.al. | 2404.02767 | null |
2024-04-03 | Cross-Attention Makes Inference Cumbersome in Text-to-Image Diffusion Models | Wentian Zhang et.al. | 2404.02747 | link |
2024-04-03 | Deep Privacy Funnel Model: From a Discriminative to a Generative Approach with an Application to Face Recognition | Behrooz Razeghi et.al. | 2404.02696 | null |
2024-04-03 | Diffexplainer: Towards Cross-modal Global Explanations with Diffusion Models | Matteo Pennisi et.al. | 2404.02618 | null |
2024-04-03 | A Unified Editing Method for Co-Speech Gesture Generation via Diffusion Inversion | Zeyu Zhao et.al. | 2404.02411 | null |
2024-04-03 | Enhancing Diffusion-based Point Cloud Generation with Smoothness Constraint | Yukun Li et.al. | 2404.02396 | null |
2024-04-02 | Semantic Augmentation in Images using Language | Sahiti Yerramilli et.al. | 2404.02353 | null |
2024-04-02 | Heat Death of Generative Models in Closed-Loop Learning | Matteo Marchi et.al. | 2404.02325 | null |
2024-04-02 | APEX: Ambidextrous Dual-Arm Robotic Manipulation Using Collision-Free Generative Diffusion Models | Apan Dastider et.al. | 2404.02284 | null |
2024-04-02 | Linear Combination of Saved Checkpoints Makes Consistency and Diffusion Models Better | Enshu Liu et.al. | 2404.02241 | link |
2024-04-02 | Diffusion | Zeyu Yang et.al. | 2404.02148 | link |
2024-04-02 | WcDT: World-centric Diffusion Transformer for Traffic Scene Generation | Chen Yang et.al. | 2404.02082 | link |
2024-04-03 | AUTODIFF: Autoregressive Diffusion Modeling for Structure-based Drug Design | Xinze Li et.al. | 2404.02003 | null |
2024-04-02 | Bi-LORA: A Vision-Language Approach for Synthetic Image Detection | Mamadou Keita et.al. | 2404.01959 | null |
2024-04-02 | Co-Speech Gesture Video Generation via Motion-Decoupled Diffusion Model | Xu He et.al. | 2404.01862 | link |
2024-04-02 | Upsample Guidance: Scale Up Diffusion Models without Training | Juno Hwang et.al. | 2404.01709 | null |
2024-04-02 | FashionEngine: Interactive Generation and Editing of 3D Clothed Humans | Tao Hu et.al. | 2404.01655 | null |
2024-04-02 | Diffusion Deepfake | Chaitali Bhattacharyya et.al. | 2404.01579 | link |
2024-04-01 | Prior Frequency Guided Diffusion Model for Limited Angle (LA)-CBCT Reconstruction | Jiacheng Xie et.al. | 2404.01448 | null |
2024-04-01 | DPMesh: Exploiting Diffusion Prior for Occluded Human Mesh Recovery | Yixuan Zhu et.al. | 2404.01424 | link |
2024-04-01 | Is Model Collapse Inevitable? Breaking the Curse of Recursion by Accumulating Real and Synthetic Data | Matthias Gerstgrasser et.al. | 2404.01413 | null |
2024-04-01 | Bigger is not Always Better: Scaling Properties of Latent Diffusion Models | Kangfu Mei et.al. | 2404.01367 | null |
2024-04-01 | MagicMirror: Fast and High-Quality Avatar Generation with a Constrained Search Space | Armand Comas-Massagué et.al. | 2404.01296 | null |
2024-04-01 | CosmicMan: A Text-to-Image Foundation Model for Humans | Shikai Li et.al. | 2404.01294 | null |
2024-04-01 | Measuring Style Similarity in Diffusion Models | Gowthami Somepalli et.al. | 2404.01292 | link |
2024-04-01 | A Unified and Interpretable Emotion Representation and Expression Generation | Reni Paskaleva et.al. | 2404.01243 | null |
2024-04-02 | StructLDM: Structured Latent Diffusion for 3D Human Generation | Tao Hu et.al. | 2404.01241 | null |
2024-04-01 | Video Interpolation with Diffusion Models | Siddhant Jain et.al. | 2404.01203 | null |
2024-04-01 | Uncovering the Text Embedding in Text-to-Image Diffusion Models | Hu Yu et.al. | 2404.01154 | null |
2024-04-01 | UFID: A Unified Framework for Input-level Backdoor Detection on Diffusion Models | Zihan Guan et.al. | 2404.01101 | null |
2024-03-29 | Relation Rectification in Diffusion Model | Yinwei Wu et.al. | 2403.20249 | null |
2024-03-29 | Motion Inversion for Video Customization | Luozhou Wang et.al. | 2403.20193 | null |
2024-03-29 | FreeSeg-Diff: Training-Free Open-Vocabulary Segmentation with Diffusion Models | Barbara Toniella Corradini et.al. | 2403.20105 | null |
2024-03-29 | SGD: Street View Synthesis with Gaussian Splatting and Diffusion Prior | Zhongrui Yu et.al. | 2403.20079 | null |
2024-03-29 | Probing solar modulation analytic models with cosmic ray periodic spectra | Wei-Cheng Long et.al. | 2403.20038 | null |
2024-04-01 | Structure Matters: Tackling the Semantic Discrepancy in Diffusion Models for Image Inpainting | Haipeng Liu et.al. | 2403.19898 | link |
2024-03-28 | Vision-Language Synthetic Data Enhances Echocardiography Downstream Tasks | Pooria Ashrafian et.al. | 2403.19880 | link |
2024-03-28 | ShapeFusion: A 3D diffusion model for localized shape editing | Rolandos Alexandros Potamias et.al. | 2403.19773 | null |
2024-03-28 | MIST: Mitigating Intersectional Bias with Disentangled Cross-Attention Editing in Text-to-Image Diffusion Models | Hidir Yesiltepe et.al. | 2403.19738 | null |
2024-03-28 | Detecting Image Attribution for Text-to-Image Diffusion Models in RGB and Beyond | Katherine Xu et.al. | 2403.19653 | link |
2024-03-28 | InterDreamer: Zero-Shot Text to 3D Dynamic Human-Object Interaction | Sirui Xu et.al. | 2403.19652 | null |
2024-03-28 | GANTASTIC: GAN-based Transfer of Interpretable Directions for Disentangled Image Editing in Text-to-Image Diffusion Models | Yusuf Dalva et.al. | 2403.19645 | null |
2024-03-28 | In the driver's mind: modeling the dynamics of human overtaking decisions in interactions with oncoming automated vehicles | Samir H. A. Mohammad et.al. | 2403.19637 | null |
2024-03-28 | Enhance Image Classification via Inter-Class Image Mixup with Diffusion Model | Zhicai Wang et.al. | 2403.19600 | link |
2024-03-28 | Frame by Familiar Frame: Understanding Replication in Video Diffusion Models | Aimon Rahman et.al. | 2403.19593 | null |
2024-03-28 | Impact of Resin Molecular Weight on Drying Kinetics and Sag of Coatings | Marola W. Issa et.al. | 2403.19544 | null |
2024-03-28 | Debiasing Cardiac Imaging with Controlled Latent Diffusion Models | Grzegorz Skorupko et.al. | 2403.19508 | link |
2024-03-28 | Burst Super-Resolution with Diffusion Models for Improving Perceptual Quality | Kyotaro Tokoro et.al. | 2403.19428 | link |
2024-03-28 | Imperceptible Protection against Style Imitation from Diffusion Models | Namhyuk Ahn et.al. | 2403.19254 | null |
2024-03-28 | RecDiffusion: Rectangling for Image Stitching with Diffusion Models | Tianhao Zhou et.al. | 2403.19164 | link |
2024-03-28 | MoDiTalker: Motion-Disentangled Diffusion Model for High-Fidelity Talking Head Generation | Seyeon Kim et.al. | 2403.19144 | null |
2024-03-28 | QNCD: Quantization Noise Correction for Diffusion Models | Huanpeng Chu et.al. | 2403.19140 | link |
2024-03-27 | Egocentric Scene-aware Human Trajectory Prediction | Weizhuo Wang et.al. | 2403.19026 | null |
2024-03-27 | TextCraftor: Your Text Encoder Can be Image Quality Controller | Yanyu Li et.al. | 2403.18978 | null |
2024-03-27 | CPR: Retrieval Augmented Generation for Copyright Protection | Aditya Golatkar et.al. | 2403.18920 | null |
2024-03-27 | A Geometric Explanation of the Likelihood OOD Detection Paradox | Hamidreza Kamkari et.al. | 2403.18910 | link |
2024-03-27 | ObjectDrop: Bootstrapping Counterfactuals for Photorealistic Object Removal and Insertion | Daniel Winter et.al. | 2403.18818 | null |
2024-03-28 | ECoDepth: Effective Conditioning of Diffusion Models for Monocular Depth Estimation | Suraj Patni et.al. | 2403.18807 | link |
2024-03-27 | Object Pose Estimation via the Aggregation of Diffusion Features | Tianfu Wang et.al. | 2403.18791 | link |
2024-03-27 | ImageNet-D: Benchmarking Neural Network Robustness on Diffusion Synthetic Object | Chenshuang Zhang et.al. | 2403.18775 | link |
2024-03-27 | A Diffusion-Based Generative Equalizer for Music Restoration | Eloi Moliner et.al. | 2403.18636 | null |
2024-03-27 | HandBooster: Boosting 3D Hand-Mesh Reconstruction by Conditional Synthesis and Sampling of Hand-Object Interactions | Hao Xu et.al. | 2403.18575 | link |
2024-03-27 | Artifact Reduction in 3D and 4D Cone-beam Computed Tomography Images with Deep Learning -- A Review | Mohammadreza Amirian et.al. | 2403.18565 | null |
2024-03-27 | CosalPure: Learning Concept from Group Images for Robust Co-Saliency Detection | Jiayi Zhu et.al. | 2403.18554 | null |
2024-03-27 | CT-3DFlow : Leveraging 3D Normalizing Flows for Unsupervised Detection of Pathological Pulmonary CT scans | Aissam Djahnine et.al. | 2403.18514 | null |
2024-03-27 | Synthesizing EEG Signals from Event-Related Potential Paradigms with Conditional Diffusion Models | Guido Klein et.al. | 2403.18486 | null |
2024-03-27 | DiffusionFace: Towards a Comprehensive Dataset for Diffusion-Based Face Forgery Analysis | Zhongxi Chen et.al. | 2403.18471 | link |
2024-03-27 | DiffStyler: Diffusion-based Localized Image Style Transfer | Shaoxu Li et.al. | 2403.18461 | null |
2024-03-27 | SingularTrajectory: Universal Trajectory Predictor Using Diffusion Model | Inhwan Bae et.al. | 2403.18452 | link |
2024-03-27 | U-Sketch: An Efficient Approach for Sketch to Image Diffusion Models | Ilias Mitsouras et.al. | 2403.18425 | null |
2024-03-27 | ECNet: Effective Controllable Text-to-Image Diffusion Models | Sicheng Li et.al. | 2403.18417 | null |
2024-03-27 | Ship in Sight: Diffusion Models for Ship-Image Super Resolution | Luigi Sigillo et.al. | 2403.18370 | link |
2024-03-27 | DODA: Diffusion for Object-detection Domain Adaptation in Agriculture | Shuai Xiang et.al. | 2403.18334 | null |
2024-03-27 | RoboKeyGen: Robot Pose and Joint Angles Estimation via Diffusion-based 3D Keypoint Generation | Yang Tian et.al. | 2403.18259 | null |
2024-03-27 | NeuroPictor: Refining fMRI-to-Image Reconstruction via Multi-individual Pretraining and Multi-level Modulation | Jingyang Huo et.al. | 2403.18211 | null |
2024-03-28 | Oh! We Freeze: Improving Quantized Knowledge Distillation via Signal Propagation Analysis for Large Language Models | Kartikeya Bhardwaj et.al. | 2403.18159 | null |
2024-03-26 | AID: Attention Interpolation of Text-to-Image Diffusion | Qiyuan He et.al. | 2403.17924 | link |
2024-03-26 | Boosting Diffusion Models with Moving Average Sampling in Frequency Domain | Yurui Qian et.al. | 2403.17870 | null |
2024-03-26 | DiffH2O: Diffusion-Based Synthesis of Hand-Object Interactions from Textual Descriptions | Sammy Christen et.al. | 2403.17827 | null |
2024-03-26 | Annotated Biomedical Video Generation using Denoising Diffusion Probabilistic Models and Flow Fields | Rüveyda Yilmaz et.al. | 2403.17808 | null |
2024-03-26 | GenesisTex: Adapting Image Denoising Diffusion to Texture Space | Chenjian Gao et.al. | 2403.17782 | null |
2024-03-26 | CT Synthesis with Conditional Diffusion Models for Abdominal Lymph Node Segmentation | Yongrui Yu et.al. | 2403.17770 | null |
2024-03-26 | AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation | Huawei Wei et.al. | 2403.17694 | link |
2024-03-26 | Manifold-Guided Lyapunov Control with Diffusion Models | Amartya Mukherjee et.al. | 2403.17692 | null |
2024-03-26 | Not All Similarities Are Created Equal: Leveraging Data-Driven Biases to Inform GenAI Copyright Disputes | Uri Hacohen et.al. | 2403.17691 | null |
2024-03-26 | DiffFAE: Advancing High-fidelity One-shot Facial Appearance Editing with Space-sensitive Customization and Semantic Preservation | Qilin Wang et.al. | 2403.17664 | null |
2024-03-26 | AniArtAvatar: Animatable 3D Art Avatar from a Single Image | Shaoxu Li et.al. | 2403.17631 | null |
2024-03-26 | DiffGaze: A Diffusion Model for Continuous Gaze Sequence Generation on 360° Images | Chuhan Jiao et.al. | 2403.17477 | null |
2024-03-26 | LaRE^2: Latent Reconstruction Error Based Method for Diffusion-Generated Image Detection | Yunpeng Luo et.al. | 2403.17465 | null |
2024-03-26 | Building Bridges across Spatial and Temporal Resolutions: Reference-Based Super-Resolution via Change Priors and Conditional Diffusion Model | Runmin Dong et.al. | 2403.17460 | link |
2024-03-26 | InterHandGen: Two-Hand Interaction Generation via Cascaded Reverse Diffusion | Jihyun Lee et.al. | 2403.17422 | null |
2024-03-26 | A framework to identify supercritical and subcritical Turing bifurcations: Case study of a system sustaining cubic and quadratic autocatalysis | Deepak Kumar et.al. | 2403.17386 | null |
2024-03-26 | Self-Rectifying Diffusion Sampling with Perturbed-Attention Guidance | Donghoon Ahn et.al. | 2403.17377 | null |
2024-03-25 | Diffusion-based Negative Sampling on Graphs for Link Prediction | Trung-Kien Nguyen et.al. | 2403.17259 | link |
2024-03-25 | Latency-Aware Generative Semantic Communications with Pre-Trained Diffusion Models | Li Qiao et.al. | 2403.17256 | null |
2024-03-25 | DiffusionAct: Controllable Diffusion Autoencoder for One-shot Face Reenactment | Stella Bounareli et.al. | 2403.17217 | null |
2024-03-25 | Improving Diffusion Models's Data-Corruption Resistance using Scheduled Pseudo-Huber Loss | Artem Khrapov et.al. | 2403.16728 | null |
2024-03-25 | SDXS: Real-Time One-Step Latent Diffusion Models with Image Conditions | Yuda Song et.al. | 2403.16627 | null |
2024-03-25 | SatSynth: Augmenting Image-Mask Pairs through Diffusion Models for Aerial Semantic Segmentation | Aysim Toker et.al. | 2403.16605 | null |
2024-03-25 | Antigen-Specific Antibody Design via Direct Energy-based Preference Optimization | Xiangxin Zhou et.al. | 2403.16576 | null |
2024-03-25 | An Intermediate Fusion ViT Enables Efficient Text-Image Alignment in Diffusion Models | Zizhao Hu et.al. | 2403.16530 | null |
2024-03-25 | Let Real Images be as a Judger, Spotting Fake Images Synthesized with Generative Models | Ziyou Liang et.al. | 2403.16513 | null |
2024-03-25 | Make-Your-Anchor: A Diffusion-based 2D Avatar Generation Framework | Ziyao Huang et.al. | 2403.16510 | link |
2024-03-25 | Refining Text-to-Image Generation: Towards Accurate Training-Free Glyph-Enhanced Image Generation | Sanyam Lakhanpal et.al. | 2403.16422 | null |
2024-03-25 | FlashEval: Towards Fast and Accurate Evaluation of Text-to-image Diffusion Generative Models | Lin Zhao et.al. | 2403.16379 | null |
2024-03-24 | Laplacian-guided Entropy Model in Neural Codec with Blur-dissipated Synthesis | Atefeh Khoshkhahtinat et.al. | 2403.16258 | null |
2024-03-24 | Skull-to-Face: Anatomy-Guided 3D Facial Reconstruction and Editing | Yongqing Liang et.al. | 2403.16207 | null |
2024-03-24 | Diffusion Model is a Good Pose Estimator from 3D RF-Vision | Junqiao Fan et.al. | 2403.16198 | null |
2024-03-24 | Pose-Guided Self-Training with Two-Stage Clustering for Unsupervised Landmark Discovery | Siddharth Tourani et.al. | 2403.16194 | link |
2024-03-26 | Gaze-guided Hand-Object Interaction Synthesis: Benchmark and Method | Jie Tian et.al. | 2403.16169 | null |
2024-03-24 | Robust Diffusion Models for Adversarial Purification | Guang Lin et.al. | 2403.16067 | null |
2024-03-24 | A Unified Module for Accelerating STABLE-DIFFUSION: LCM-LORA | Ayush Thakur et.al. | 2403.16024 | null |
2024-03-23 | Feature Manipulation for DDPM based Change Detection | Zhenglin Li et.al. | 2403.15943 | null |
2024-03-26 | X-Portrait: Expressive Portrait Animation with Hierarchical Motion Attention | You Xie et.al. | 2403.15931 | null |
2024-03-23 | Diffusion-based Aesthetic QR Code Generation via Scanning-Robust Perceptual Guidance | Jia-Wei Liao et.al. | 2403.15878 | link |
2024-03-23 | In-Context Matting | He Guo et.al. | 2403.15789 | null |
2024-03-23 | Time-dependent localized patterns in a predator-prey model | Fahad Al Saadi et.al. | 2403.15788 | null |
2024-03-22 | DiffusionMTL: Learning Multi-Task Denoising Diffusion Model from Partially Annotated Data | Hanrong Ye et.al. | 2403.15389 | null |
2024-03-22 | Ultrasound Imaging based on the Variance of a Diffusion Restoration Model | Yuxin Zhang et.al. | 2403.15316 | null |
2024-03-22 | Controlled Training Data Generation with Diffusion Models | Teresa Yeo et.al. | 2403.15309 | null |
2024-03-22 | Spectral Motion Alignment for Video Motion Transfer using Diffusion Models | Geon Yeong Park et.al. | 2403.15249 | null |
2024-03-22 | Shadow Generation for Composite Image Using Diffusion model | Qingyang Liu et.al. | 2403.15234 | link |
2024-03-22 | MM-Diff: High-Fidelity Image Personalization via Multi-Modal Condition Integration | Zhichao Wei et.al. | 2403.15059 | null |
2024-03-22 | Toward Tiny and High-quality Facial Makeup with Data Amplify Learning | Qiaoqiao Jin et.al. | 2403.15033 | null |
2024-03-22 | Dynamics of a memory-based diffusion model with spatial heterogeneity and nonlinear boundary condition | Quanli Ji et.al. | 2403.14969 | null |
2024-03-22 | DreamFlow: High-Quality Text-to-3D Generation by Approximating Probability Flow | Kyungmin Lee et.al. | 2403.14966 | null |
2024-03-22 | CLIP-VQDiffusion: Language Free Training of Text To Image generation using CLIP and vector quantized diffusion model | Seungdae Han et.al. | 2403.14944 | null |
2024-03-22 | STAG4D: Spatial-Temporal Anchored Generative 4D Gaussians | Yifei Zeng et.al. | 2403.14939 | null |
2024-03-21 | Osmosis: RGBD Diffusion Prior for Underwater Image Restoration | Opher Bar Nathan et.al. | 2403.14837 | null |
2024-03-21 | Multimodal-Conditioned Latent Diffusion Models for Fashion Image Editing | Alberto Baldrati et.al. | 2403.14828 | null |
2024-03-21 | Latent Diffusion Models for Attribute-Preserving Image Anonymization | Luca Piano et.al. | 2403.14790 | null |
2024-03-21 | Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance | Shenhao Zhu et.al. | 2403.14781 | null |
2024-03-21 | StreamingT2V: Consistent, Dynamic, and Extendable Long Video Generation from Text | Roberto Henschel et.al. | 2403.14773 | null |
2024-03-21 | GRM: Large Gaussian Reconstruction Model for Efficient 3D Reconstruction and Generation | Yinghao Xu et.al. | 2403.14621 | link |
2024-03-21 | DreamReward: Text-to-3D Generation with Human Preference | Junliang Ye et.al. | 2403.14613 | null |
2024-03-21 | ReNoise: Real Image Inversion Through Iterative Noising | Daniel Garibi et.al. | 2403.14602 | null |
2024-03-21 | Denoising Diffusion Models for 3D Healthy Brain Tissue Inpainting | Alicia Durrer et.al. | 2403.14499 | link |
2024-03-21 | Style-Extracting Diffusion Models for Semi-Supervised Histopathology Segmentation | Mathias Öttl et.al. | 2403.14429 | null |
2024-03-21 | DP-RDM: Adapting Diffusion Models to Private Domains Without Fine-Tuning | Jonathan Lebensold et.al. | 2403.14421 | null |
2024-03-21 | Physics-Informed Diffusion Models | Jan-Hendrik Bastek et.al. | 2403.14404 | null |
2024-03-21 | Open-Vocabulary Attention Maps with Token Optimization for Semantic Segmentation in Diffusion Models | Pablo Marcos-Manchón et.al. | 2403.14291 | link |
2024-03-21 | Zero123-6D: Zero-shot Novel View Synthesis for RGB Category-level 6D Pose Estimation | Francesco Di Felice et.al. | 2403.14279 | null |
2024-03-21 | Diffusion Models with Ensembled Structure-Based Anomaly Scoring for Unsupervised Anomaly Detection | Finn Behrendt et.al. | 2403.14262 | link |
2024-03-21 | Efficient Video Diffusion Models via Content-Frame Motion-Latent Decomposition | Sihyun Yu et.al. | 2403.14148 | null |
2024-03-21 | Protein Conformation Generation via Force-Guided SE(3) Diffusion Models | Yan Wang et.al. | 2403.14088 | null |
2024-03-21 | QSMDiff: Unsupervised 3D Diffusion Models for Quantitative Susceptibility Mapping | Zhuang Xiong et.al. | 2403.14070 | null |
2024-03-21 | LeFusion: Synthesizing Myocardial Pathology on Cardiac MRI via Lesion-Focus Diffusion Models | Hantao Zhang et.al. | 2403.14066 | null |
2024-03-21 | DiffSTOCK: Probabilistic relational Stock Market Predictions using Diffusion Models | Divyanshu Daiya et.al. | 2403.14063 | null |
2024-03-20 | Enhancing Fingerprint Image Synthesis with GANs, Diffusion Models, and Style Transfer Techniques | W. Tang et.al. | 2403.13916 | null |
2024-03-20 | Towards Learning Contrast Kinetics with Multi-Condition Latent Diffusion Models | Richard Osuala et.al. | 2403.13890 | link |
2024-03-20 | Editing Massive Concepts in Text-to-Image Diffusion Models | Tianwei Xiong et.al. | 2403.13807 | link |
2024-03-20 | ZigMa: Zigzag Mamba Diffusion Model | Vincent Tao Hu et.al. | 2403.13802 | null |
2024-03-20 | TimeRewind: Rewinding Time with Image-and-Events Video Diffusion | Jingxi Chen et.al. | 2403.13800 | null |
2024-03-20 | DepthFM: Fast Monocular Depth Estimation with Flow Matching | Ming Gui et.al. | 2403.13788 | null |
2024-03-20 | Be-Your-Outpainter: Mastering Video Outpainting through Input-Specific Adaptation | Fu-Yun Wang et.al. | 2403.13745 | null |
2024-03-20 | DanceCamera3D: 3D Camera Movement Synthesis with Music and Dance | Zixuan Wang et.al. | 2403.13667 | null |
2024-03-20 | ZoDi: Zero-Shot Domain Adaptation with Diffusion-Based Image Transfer | Hiroki Azuma et.al. | 2403.13652 | null |
2024-03-20 | ReGround: Improving Textual and Spatial Grounding at No Cost | Yuseung Lee et.al. | 2403.13589 | null |
2024-03-20 | Ground-A-Score: Scaling Up the Score Distillation for Multi-Attribute Editing | Hangeol Chang et.al. | 2403.13551 | null |
2024-03-20 | Compress3D: a Compressed Latent Space for 3D Generation from a Single Image | Bowen Zhang et.al. | 2403.13524 | null |
2024-03-20 | VSTAR: Generative Temporal Nursing for Longer Dynamic Video Synthesis | Yumeng Li et.al. | 2403.13501 | null |
2024-03-20 | Scaling Diffusion Models to Real-World 3D LiDAR Scene Completion | Lucas Nunes et.al. | 2403.13470 | link |
2024-03-20 | S2DM: Sector-Shaped Diffusion Models for Video Generation | Haoran Lang et.al. | 2403.13408 | null |
2024-03-20 | IIDM: Image-to-Image Diffusion Model for Semantic Image Synthesis | Feng Liu et.al. | 2403.13378 | null |
2024-03-20 | AGFSync: Leveraging AI-Generated Feedback for Preference Optimization in Text-to-Image Generation | Jingkun An et.al. | 2403.13352 | null |
2024-03-20 | LaserHuman: Language-guided Scene-aware Human Motion Generation in Free Environment | Peishan Cong et.al. | 2403.13307 | null |
2024-03-20 | DetDiffusion: Synergizing Generative and Perceptive Models for Enhanced Data Generation and Perception | Yibo Wang et.al. | 2403.13304 | null |
2024-03-20 | Building Optimal Neural Architectures using Interpretable Knowledge | Keith G. Mills et.al. | 2403.13293 | null |
2024-03-20 | Beyond Skeletons: Integrative Latent Mapping for Coherent 4D Sequence Generation | Qitong Yang et.al. | 2403.13238 | null |
2024-03-20 | A Contact Model based on Denoising Diffusion to Learn Variable Impedance Control for Contact-rich Manipulation | Masashi Okada et.al. | 2403.13221 | null |
2024-03-19 | FouriScale: A Frequency Perspective on Training-Free High-Resolution Image Synthesis | Linjiang Huang et.al. | 2403.12963 | link |
2024-03-19 | FRESCO: Spatial-Temporal Correspondence for Zero-Shot Video Translation | Shuai Yang et.al. | 2403.12962 | link |
2024-03-19 | Zero-Reference Low-Light Enhancement via Physical Quadruple Priors | Wenjing Wang et.al. | 2403.12933 | null |
2024-03-19 | Ultra-High-Resolution Image Synthesis with Pyramid Diffusion Model | Jiajie Yang et.al. | 2403.12915 | link |
2024-03-19 | D-Cubed: Latent Diffusion Trajectory Optimisation for Dexterous Deformable Manipulation | Jun Yamada et.al. | 2403.12861 | null |
2024-03-19 | Generative Enhancement for 3D Medical Images | Lingting Zhu et.al. | 2403.12852 | link |
2024-03-19 | Compositional 3D Scene Synthesis with Scene Graph Guided Layout-Shape Generation | Yao Wei et.al. | 2403.12848 | null |
2024-03-19 | DreamDA: Generative Data Augmentation with Diffusion Models | Yunxiang Fu et.al. | 2403.12803 | link |
2024-03-19 | WaveFace: Authentic Face Restoration with Efficient Frequency Recovery | Yunqi Miao et.al. | 2403.12760 | null |
2024-03-19 | Towards Controllable Face Generation with Semantic Latent Diffusion Models | Alex Ergasti et.al. | 2403.12743 | link |
2024-03-19 | AnimateDiff-Lightning: Cross-Model Diffusion Distillation | Shanchuan Lin et.al. | 2403.12706 | null |
2024-03-19 | Tuning-Free Image Customization with Image and Text Guidance | Pengzhi Li et.al. | 2403.12658 | null |
2024-03-19 | LASPA: Latent Spatial Alignment for Fast Training-free Single Image Editing | Yazeed Alharbi et.al. | 2403.12585 | null |
2024-03-19 | Generalized Consistency Trajectory Models for Image Manipulation | Beomsu Kim et.al. | 2403.12510 | link |
2024-03-19 | SC-Diff: 3D Shape Completion with Latent Diffusion Models | Juan D. Galvis et.al. | 2403.12470 | null |
2024-03-19 | Do Generated Data Always Help Contrastive Learning? | Yifei Wang et.al. | 2403.12448 | link |
2024-03-19 | Precise-Physics Driven Text-to-3D Generation | Qingshan Xu et.al. | 2403.12438 | null |
2024-03-19 | ComboVerse: Compositional 3D Assets Creation Using Spatially-Aware Diffusion Guidance | Yongwei Chen et.al. | 2403.12409 | null |
2024-03-19 | Understanding Training-free Diffusion Guidance: Mechanisms and Limitations | Yifei Shen et.al. | 2403.12404 | null |
2024-03-19 | OV9D: Open-Vocabulary Category-Level 9D Object Pose and Size Estimation | Junhao Cai et.al. | 2403.12396 | null |
2024-03-18 | Generalized Multi-Source Inference for Text Conditioned Music Diffusion Models | Emilian Postolache et.al. | 2403.11706 | link |
2024-03-19 | Urban Scene Diffusion through Semantic Occupancy Map | Junge Zhang et.al. | 2403.11697 | null |
2024-03-18 | Binary Noise for Binary Tasks: Masked Bernoulli Diffusion for Unsupervised Anomaly Detection | Julia Wolleb et.al. | 2403.11667 | null |
2024-03-18 | Arc2Face: A Foundation Model of Human Faces | Foivos Paraperas Papantoniou et.al. | 2403.11641 | null |
2024-03-18 | LoRA-Composer: Leveraging Low-Rank Adaptation for Multi-Concept Customization in Training-Free Diffusion Models | Yang Yang et.al. | 2403.11627 | link |
2024-03-18 | CRS-Diff: Controllable Generative Remote Sensing Foundation Model | Datao Tang et.al. | 2403.11614 | null |
2024-03-18 | EffiVED: Efficient Video Editing via Text-instruction Diffusion Models | Zhenghao Zhang et.al. | 2403.11568 | null |
2024-03-18 | EchoReel: Enhancing Action Generation of Existing Video Diffusion Models | Jianzhi Liu et.al. | 2403.11535 | link |
2024-03-18 | Diffusion Models are Geometry Critics: Single Image 3D Editing Using Pre-Trained Diffusion Priors | Ruicheng Wang et.al. | 2403.11503 | null |
2024-03-18 | SeisFusion: Constrained Diffusion Model with Input Guidance for 3D Seismic Data Interpolation and Reconstruction | Shuang Wang et.al. | 2403.11482 | link |
2024-03-18 | ALDM-Grasping: Diffusion-aided Zero-Shot Sim-to-Real Transfer for Robot Grasping | Yiwei Li et.al. | 2403.11459 | null |
2024-03-18 | CasSR: Activating Image Power for Real-World Image Super-Resolution | Haolan Chen et.al. | 2403.11451 | null |
2024-03-18 | VmambaIR: Visual State Space Model for Image Restoration | Yuan Shi et.al. | 2403.11423 | link |
2024-03-18 | DreamSampler: Unifying Diffusion Sampling and Score Distillation for Image Manipulation | Jeongsol Kim et.al. | 2403.11415 | null |
2024-03-18 | Divide-and-Conquer Posterior Sampling for Denoising Diffusion Priors | Yazid Janati et.al. | 2403.11407 | null |
2024-03-17 | StainDiffuser: MultiTask Dual Diffusion Model for Virtual Staining | Tushar Kataria et.al. | 2403.11340 | null |
2024-03-17 | Fast Personalized Text-to-Image Syntheses With Attention Injection | Yuxuan Zhang et.al. | 2403.11284 | null |
2024-03-17 | Understanding Diffusion Models by Feynman's Path Integral | Yuji Hirono et.al. | 2403.11262 | null |
2024-03-17 | THOR: Text to Human-Object Interaction Diffusion via Relation Intervention | Qianyang Wu et.al. | 2403.11208 | null |
2024-03-17 | MaskDiffusion: Exploiting Pre-trained Diffusion Models for Semantic Segmentation | Yasufumi Kawano et.al. | 2403.11194 | link |
2024-03-15 | Lodge: A Coarse to Fine Diffusion Network for Long Dance Generation Guided by the Characteristic Dance Primitives | Ronghui Li et.al. | 2403.10518 | link |
2024-03-15 | Isotropic3D: Image-to-3D Generation Based on a Single CLIP Embedding | Pengkun Liu et.al. | 2403.10395 | link |
2024-03-15 | Denoising Task Difficulty-based Curriculum for Training Diffusion Models | Jin-Young Kim et.al. | 2403.10348 | null |
2024-03-15 | Optimal Control of Stationary Doubly Diffusive Flows on Two and Three Dimensional Bounded Lipschitz Domains: Numerical Analysis | Jai Tushar et.al. | 2403.10282 | null |
2024-03-15 | Arbitrary-Scale Image Generation and Upsampling using Latent Diffusion Model and Implicit Neural Decoder | Jinseok Kim et.al. | 2403.10255 | null |
2024-03-15 | FDGaussian: Fast Gaussian Splatting from Single Image via Geometric-aware Diffusion Model | Qijun Feng et.al. | 2403.10242 | null |
2024-03-15 | BlindDiff: Empowering Degradation Modelling in Diffusion Models for Blind Image Super-Resolution | Feng Li et.al. | 2403.10211 | link |
2024-03-15 | Spectral CT Two-step and One-step Material Decomposition using Diffusion Posterior Sampling | Corentin Vazia et.al. | 2403.10183 | null |
2024-03-15 | Animate Your Motion: Turning Still Images into Dynamic Videos | Mingxiao Li et.al. | 2403.10179 | null |
2024-03-15 | Being heterogeneous is disadvantageous: Brownian non-Gaussian searches | Vittoria Sposini et.al. | 2403.10138 | null |
2024-03-15 | DiffMAC: Diffusion Manifold Hallucination Correction for High Generalization Blind Face Restoration | Nan Gao et.al. | 2403.10098 | null |
2024-03-15 | RangeLDM: Fast Realistic LiDAR Point Cloud Generation | Qianjiang Hu et.al. | 2403.10094 | null |
2024-03-15 | SphereDiffusion: Spherical Geometry-Aware Distortion Resilient Diffusion Model | Tao Wu et.al. | 2403.10044 | null |
2024-03-15 | ST-LDM: A Universal Framework for Text-Grounded Object Generation in Real Images | Xiangtian Xue et.al. | 2403.10004 | null |
2024-03-15 | Controllable Text-to-3D Generation via Surface-Aligned Gaussian Splatting | Zhiqi Li et.al. | 2403.09981 | null |
2024-03-14 | ProMark: Proactive Diffusion Watermarking for Causal Attribution | Vishal Asnani et.al. | 2403.09914 | null |
2024-03-14 | DTG: Diffusion-based Trajectory Generation for Mapless Global Navigation | Jing Liang et.al. | 2403.09900 | null |
2024-03-14 | SCP-Diff: Photo-Realistic Semantic Image Synthesis with Spatial-Categorical Joint Prior | Huan-ang Gao et.al. | 2403.09638 | null |
2024-03-14 | 3D-VLA: A 3D Vision-Language-Action Generative World Model | Haoyu Zhen et.al. | 2403.09631 | null |
2024-03-14 | Generalized Predictive Model for Autonomous Driving | Jiazhi Yang et.al. | 2403.09630 | link |
2024-03-14 | Make-Your-3D: Fast and Consistent Subject-Driven 3D Content Generation | Fangfu Liu et.al. | 2403.09625 | null |
2024-03-14 | Score-Guided Diffusion for 3D Human Recovery | Anastasis Stathopoulos et.al. | 2403.09623 | link |
2024-03-14 | Explore In-Context Segmentation via Latent Diffusion Models | Chaoyang Wang et.al. | 2403.09616 | null |
2024-03-14 | MambaTalk: Efficient Holistic Gesture Synthesis with Selective State Space Models | Zunnan Xu et.al. | 2403.09471 | null |
2024-03-14 | Eta Inversion: Designing an Optimal Eta Function for Diffusion-based Real Image Editing | Wonjun Kang et.al. | 2403.09468 | link |
2024-03-14 | Shake to Leak: Fine-tuning Diffusion Models Can Amplify the Generative Privacy Risk | Zhangheng Li et.al. | 2403.09450 | null |
2024-03-14 | 3D-SceneDreamer: Text-Driven 3D-Consistent Scene Generation | Frank Zhang et.al. | 2403.09439 | null |
2024-03-14 | LM2D: Lyrics- and Music-Driven Dance Synthesis | Wenjie Yin et.al. | 2403.09407 | null |
2024-03-14 | Mitigating Data Consistency Induced Discrepancy in Cascaded Diffusion Models for Sparse-view CT Reconstruction | Hanyu Chen et.al. | 2403.09355 | null |
2024-03-14 | HeadEvolver: Text to Head Avatars via Locally Learnable Mesh Deformation | Duotun Wang et.al. | 2403.09326 | null |
2024-03-14 | Regularity and trend to equilibrium for a non-local advection-diffusion model of active particles | Luca Alasio et.al. | 2403.09282 | null |
2024-03-14 | XReal: Realistic Anatomy and Pathology-Aware X-ray Generation via Controllable Diffusion Model | Anees Ur Rehman Hashmi et.al. | 2403.09240 | null |
2024-03-14 | Intention-driven Ego-to-Exo Video Generation | Hongchen Luo et.al. | 2403.09194 | null |
2024-03-14 | Intention-aware Denoising Diffusion Model for Trajectory Prediction | Chen Liu et.al. | 2403.09190 | null |
2024-03-14 | Switch Diffusion Transformer: Synergizing Denoising Tasks with Sparse Mixture-of-Experts | Byeongjun Park et.al. | 2403.09176 | null |
2024-03-14 | Sculpt3D: Multi-View Consistent Text-to-3D Generation with Sparse 3D Prior | Cheng Chen et.al. | 2403.09140 | null |
2024-03-14 | Rethinking Referring Object Removal | Xiangtian Xue et.al. | 2403.09128 | null |
2024-03-13 | VLOGGER: Multimodal Diffusion for Embodied Avatar Synthesis | Enric Corona et.al. | 2403.08764 | null |
2024-03-13 | Spatiotemporal Diffusion Model with Paired Sampling for Accelerated Cardiac Cine MRI | Shihan Qiu et.al. | 2403.08758 | null |
2024-03-13 | Clinically Feasible Diffusion Reconstruction for Highly-Accelerated Cardiac Cine MRI | Shihan Qiu et.al. | 2403.08749 | null |
2024-03-14 | GaussCtrl: Multi-View Consistent Text-Driven 3D Gaussian Splatting Editing | Jing Wu et.al. | 2403.08733 | null |
2024-03-13 | Ambient Diffusion Posterior Sampling: Solving Inverse Problems with Diffusion Models trained on Corrupted Data | Asad Aali et.al. | 2403.08728 | link |
2024-03-13 | Data Augmentation in Human-Centric Vision | Wentao Jiang et.al. | 2403.08650 | null |
2024-03-13 | ActionDiffusion: An Action-aware Diffusion Model for Procedure Planning in Instructional Videos | Lei Shi et.al. | 2403.08591 | null |
2024-03-13 | Federated Knowledge Graph Unlearning via Diffusion Model | Bingchen Liu et.al. | 2403.08554 | null |
2024-03-13 | Model Will Tell: Training Membership Inference for Diffusion Models | Xiaomeng Fu et.al. | 2403.08487 | null |
2024-03-13 | MD-Dose: A Diffusion Model based on the Mamba for Radiotherapy Dose Prediction | Linjie Fu et.al. | 2403.08479 | null |
2024-03-13 | An Analysis of Human Alignment of Latent Diffusion Models | Lorenz Linhardt et.al. | 2403.08469 | null |
2024-03-13 | Diffusion Models with Implicit Guidance for Medical Anomaly Detection | Cosmin I. Bercea et.al. | 2403.08464 | null |
2024-03-13 | Towards Dense and Accurate Radar Perception Via Efficient Cross-Modal Diffusion Model | Ruibin Zhang et.al. | 2403.08460 | null |
2024-03-13 | PFStorer: Personalized Face Restoration and Super-Resolution | Tuomas Varanka et.al. | 2403.08436 | null |
2024-03-13 | Iterative Online Image Synthesis via Diffusion Model for Imbalanced Classification | Shuhan Li et.al. | 2403.08407 | null |
2024-03-13 | Tackling the Singularities at the Endpoints of Time Intervals in Diffusion Models | Pengze Zhang et.al. | 2403.08381 | link |
2024-03-13 | Mitigate Target-level Insensitivity of Infrared Small Target Detection via Posterior Distribution Modeling | Haoqing Li et.al. | 2403.08380 | link |
2024-03-13 | VIGFace: Virtual Identity Generation Model for Face Image Synthesis | Minsoo Kim et.al. | 2403.08277 | null |
2024-03-13 | Sketch2Manga: Shaded Manga Screening from Sketch with Diffusion Models | Jian Lin et.al. | 2403.08266 | null |
2024-03-13 | Make Me Happier: Evoking Emotions Through Image Diffusion Models | Qing Lin et.al. | 2403.08255 | null |
2024-03-12 | Bridging Different Language Models and Generative Vision Models for Text-to-Image Generation | Shihao Zhao et.al. | 2403.07860 | link |
2024-03-12 | Quantifying and Mitigating Privacy Risks for Tabular Generative Models | Chaoyi Zhu et.al. | 2403.07842 | null |
2024-03-12 | MPCPA: Multi-Center Privacy Computing with Predictions Aggregation based on Denoising Diffusion Probabilistic Model | Guibo Luo et.al. | 2403.07838 | null |
2024-03-13 | SemCity: Semantic Scene Generation with Triplane Diffusion | Jumin Lee et.al. | 2403.07773 | link |
2024-03-12 | Stable-Makeup: When Real-World Makeup Transfer Meets Diffusion Model | Yuxuan Zhang et.al. | 2403.07764 | null |
2024-03-12 | SSM Meets Video Diffusion Models: Efficient Video Generation with Structured State Spaces | Yuta Oshima et.al. | 2403.07711 | link |
2024-03-12 | Visual Privacy Auditing with Diffusion Models | Kristian Schwethelm et.al. | 2403.07588 | null |
2024-03-12 | D4D: An RGBD diffusion model to boost monocular depth estimation | L. Papa et.al. | 2403.07516 | link |
2024-03-12 | Block-wise LoRA: Revisiting Fine-grained LoRA for Effective Personalization and Stylization in Text-to-Image Generation | Likun Li et.al. | 2403.07500 | null |
2024-03-12 | Time-Efficient and Identity-Consistent Virtual Try-On Using A Variant of Altered Diffusion Models | Phuong Dam et.al. | 2403.07371 | null |
2024-03-12 | Efficient Diffusion Model for Image Restoration by Residual Shifting | Zongsheng Yue et.al. | 2403.07319 | link |
2024-03-12 | It's All About Your Sketch: Democratising Sketch Control in Diffusion Models | Subhadeep Koley et.al. | 2403.07234 | link |
2024-03-12 | Text-to-Image Diffusion Models are Great Sketch-Photo Matchmakers | Subhadeep Koley et.al. | 2403.07214 | null |
2024-03-11 | 3M-Diffusion: Latent Multi-Modal Diffusion for Text-Guided Generation of Molecular Graphs | Huaisheng Zhu et.al. | 2403.07179 | null |
2024-03-11 | One Category One Prompt: Dataset Distillation using Diffusion Models | Ali Abbasi et.al. | 2403.07142 | null |
2024-03-11 | BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion | Xuan Ju et.al. | 2403.06976 | link |
2024-03-11 | Bayesian Diffusion Models for 3D Shape Reconstruction | Haiyang Xu et.al. | 2403.06973 | null |
2024-03-11 | POD-ROM methods: from a finite set of snapshots to continuous-in-time approximations | Bosco Garcia-Archilla et.al. | 2403.06967 | null |
2024-03-11 | SELMA: Learning and Merging Skill-Specific Text-to-Image Experts with Auto-Generated Data | Jialu Li et.al. | 2403.06952 | null |
2024-03-12 | DEADiff: An Efficient Stylization Diffusion Model with Disentangled Representations | Tianhao Qi et.al. | 2403.06951 | null |
2024-03-11 | Conditional Score-Based Diffusion Model for Cortical Thickness Trajectory Prediction | Qing Xiao et.al. | 2403.06940 | null |
2024-03-11 | Estimation of parameters and local times in a discretely observed threshold diffusion model | Sara Mazzonetto et.al. | 2403.06858 | null |
2024-03-11 | Multistep Consistency Models | Jonathan Heek et.al. | 2403.06807 | null |
2024-03-11 | Distribution-Aware Data Expansion with Diffusion Models | Haowei Zhu et.al. | 2403.06741 | link |
2024-03-11 | V3D: Video Diffusion Models are Effective 3D Generators | Zilong Chen et.al. | 2403.06738 | link |
2024-03-11 | Active Generation for Image Classification | Tao Huang et.al. | 2403.06517 | null |
2024-03-11 | Advancing Text-Driven Chest X-Ray Generation with Policy-Based Reinforcement Learning | Woojung Han et.al. | 2403.06516 | null |
2024-03-11 | Incorporating Improved Sinusoidal Threshold-based Semi-supervised Method and Diffusion Models for Osteoporosis Diagnosis | Wenchi Ke et.al. | 2403.06498 | null |
2024-03-11 | Are you sure? Modelling Drivers' Confidence Judgments in Left-Turn Gap Acceptance Decisions | Arkady Zgonnikov et.al. | 2403.06496 | null |
2024-03-11 | Text2QR: Harmonizing Aesthetic Customization and Scanning Robustness for Text-Guided QR Code Generation | Guangyang Wu et.al. | 2403.06452 | null |
2024-03-11 | DivCon: Divide and Conquer for Progressive Text-to-Image Generation | Yuhao Jia et.al. | 2403.06400 | link |
2024-03-11 | FSViewFusion: Few-Shots View Generation of Novel Objects | Rukhshanda Hussain et.al. | 2403.06394 | null |
2024-03-11 | Enhancing Semantic Fidelity in Text-to-Image Synthesis: Attention Regulation in Diffusion Models | Yang Zhang et.al. | 2403.06381 | null |
2024-03-12 | Style2Talker: High-Resolution Talking Head Generation with Emotion Style and Art Style | Shuai Tan et.al. | 2403.06365 | null |
2024-03-10 | Transferable Reinforcement Learning via Generalized Occupancy Models | Chuning Zhu et.al. | 2403.06328 | null |
2024-03-08 | VideoElevator: Elevating Video Generation Quality with Versatile Text-to-Image Diffusion Models | Yabo Zhang et.al. | 2403.05438 | link |
2024-03-08 | DiffSF: Diffusion Models for Scene Flow Estimation | Yushan Zhang et.al. | 2403.05327 | null |
2024-03-08 | Noise Level Adaptive Diffusion Model for Robust Reconstruction of Accelerated MRI | Shoujin Huang et.al. | 2403.05245 | null |
2024-03-08 | Towards Effective Usage of Human-Centric Priors in Diffusion Models for Text-based Human Image Generation | Junyan Wang et.al. | 2403.05239 | null |
2024-03-08 | Denoising Autoregressive Representation Learning | Yazhe Li et.al. | 2403.05196 | null |
2024-03-08 | DiffuLT: How to Make Diffusion Model Useful for Long-tail Recognition | Jie Shao et.al. | 2403.05170 | null |
2024-03-08 | GSEdit: Efficient Text-Guided Editing of 3D Objects via Gaussian Splatting | Francesco Palandra et.al. | 2403.05154 | null |
2024-03-08 | Improving Diffusion Models for Virtual Try-on | Yisol Choi et.al. | 2403.05139 | null |
2024-03-08 | ELLA: Equip Diffusion Models with LLM for Enhanced Semantic Alignment | Xiwei Hu et.al. | 2403.05135 | null |
2024-03-08 | CogView3: Finer and Faster Text-to-Image Generation via Relay Diffusion | Wendi Zheng et.al. | 2403.05121 | null |
2024-03-08 | Face2Diffusion for Fast and Editable Face Personalization | Kaede Shiohara et.al. | 2403.05094 | link |
2024-03-08 | Spectrum Translation for Refinement of Image Generation (STIG) Based on Contrastive Learning and Spectral Filter Profile | Seokjun Lee et.al. | 2403.05093 | null |
2024-03-08 | Improving Diffusion-Based Generative Models via Approximated Optimal Transport | Daegyu Kim et.al. | 2403.05069 | null |
2024-03-08 | XPSR: Cross-modal Priors for Diffusion-based Image Super-Resolution | Yunpeng Qu et.al. | 2403.05049 | null |
2024-03-08 | BjTT: A Large-scale Multimodal Dataset for Traffic Prediction | Chengyang Zhang et.al. | 2403.05029 | link |
2024-03-08 | InstructGIE: Towards Generalizable Image Editing | Zichong Meng et.al. | 2403.05018 | null |
2024-03-08 | DiffClass: Diffusion-Based Class Incremental Learning | Zichong Meng et.al. | 2403.05016 | null |
2024-03-08 | RFWave: Multi-band Rectified Flow for Audio Waveform Reconstruction | Peng Liu et.al. | 2403.05010 | link |
2024-03-08 | StereoDiffusion: Training-Free Stereo Image Generation Using Latent Diffusion Models | Lezhong Wang et.al. | 2403.04965 | null |
2024-03-07 | AFreeCA: Annotation-Free Counting for All | Adriano D'Alessandro et.al. | 2403.04943 | null |
2024-03-07 | ObjectCompose: Evaluating Resilience of Vision-Based Models on Object-to-Background Compositional Changes | Hashmat Shadab Malik et.al. | 2403.04701 | null |
2024-03-07 | Delving into the Trajectory Long-tail Distribution for Muti-object Tracking | Sijia Chen et.al. | 2403.04700 | link |
2024-03-07 | PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation | Junsong Chen et.al. | 2403.04692 | null |
2024-03-08 | Pix2Gif: Motion-Guided Diffusion for GIF Generation | Hitesh Kandala et.al. | 2403.04634 | null |
2024-03-07 | A Domain Translation Framework with an Adversarial Denoising Diffusion Model to Generate Synthetic Datasets of Echocardiography Images | Cristiana Tiago et.al. | 2403.04612 | null |
2024-03-07 | Anatomy-Guided Surface Diffusion Model for Alzheimer's Disease Normative Modeling | Jianwei Zhang et.al. | 2403.04531 | null |
2024-03-07 | Effect of turbulent diffusion in modeling anaerobic digestion | Jeremy Z. Yan et.al. | 2403.04457 | null |
2024-03-07 | Disentangled Diffusion-Based 3D Human Pose Estimation with Hierarchical Spatial and Temporal Denoiser | Qingyuan Cai et.al. | 2403.04444 | null |
2024-03-07 | StableDrag: Stable Dragging for Point-based Image Editing | Yutao Cui et.al. | 2403.04437 | null |
2024-03-07 | On-demand Quantization for Green Federated Generative Diffusion in Mobile Edge Networks | Bingkun Lai et.al. | 2403.04430 | null |
2024-03-07 | Controllable Generation with Text-to-Image Diffusion Models: A Survey | Pu Cao et.al. | 2403.04279 | link |
2024-03-06 | PromptCharm: Text-to-Image Generation through Multi-modal Prompting and Refinement | Zhijie Wang et.al. | 2403.04014 | link |
2024-03-06 | GUIDE: Guidance-based Incremental Learning with Diffusion Models | Bartosz Cywiński et.al. | 2403.03938 | link |
2024-03-06 | Latent Dataset Distillation with Diffusion Models | Brian B. Moser et.al. | 2403.03881 | null |
2024-03-06 | Accelerating Convergence of Score-Based Diffusion Models, Provably | Gen Li et.al. | 2403.03852 | null |
2024-03-06 | Diffusion on language model embeddings for protein sequence generation | Viacheslav Meshchaninov et.al. | 2403.03726 | null |
2024-03-06 | Efficient Search and Learning for Agile Locomotion on Stepping Stones | Adithya Kumar Chinnakkonda Ravi et.al. | 2403.03639 | null |
2024-03-06 | Diffusion-based Generative Prior for Low-Complexity MIMO Channel Estimation | Benedikt Fesl et.al. | 2403.03545 | link |
2024-03-06 | NoiseCollage: A Layout-Aware Text-to-Image Diffusion Model Based on Noise Cropping and Merging | Takahiro Shirakawa et.al. | 2403.03485 | null |
2024-03-06 | FLAME Diffuser: Grounded Wildfire Image Synthesis using Mask Guided Diffusion | Hao Wang et.al. | 2403.03463 | null |
2024-03-06 | Towards Understanding Cross and Self-Attention in Stable Diffusion for Text-Guided Image Editing | Bingyan Liu et.al. | 2403.03431 | null |
2024-03-05 | Scaling Rectified Flow Transformers for High-Resolution Image Synthesis | Patrick Esser et.al. | 2403.03206 | null |
2024-03-05 | MAGID: An Automated Pipeline for Generating Synthetic Multi-modal Datasets | Hossein Aboutalebi et.al. | 2403.03194 | null |
2024-03-05 | NaturalSpeech 3: Zero-Shot Speech Synthesis with Factorized Codec and Diffusion Models | Zeqian Ju et.al. | 2403.03100 | null |
2024-03-05 | Global N-body Simulation of Gap Edge Structures Created by Perturbations from a Small Satellite Embedded in Saturn's Rings | Naoya Torii et.al. | 2403.03012 | null |
2024-03-05 | Cross-Domain Image Conversion by CycleDM | Sho Shimotsumagari et.al. | 2403.02919 | null |
2024-03-05 | MMoFusion: Multi-modal Co-Speech Motion Generation with Diffusion Model | Sen Wang et.al. | 2403.02905 | null |
2024-03-05 | Enhancing the Rate-Distortion-Perception Flexibility of Learned Image Codecs with Conditional Diffusion Decoders | Daniele Mari et.al. | 2403.02887 | null |
2024-03-05 | Zero-LED: Zero-Reference Lighting Estimation Diffusion Model for Low-Light Image Enhancement | Jinhong He et.al. | 2403.02879 | null |
2024-03-05 | Scalable Continuous-time Diffusion Framework for Network Inference and Influence Estimation | Keke Huang et.al. | 2403.02867 | null |
2024-03-05 | Tuning-Free Noise Rectification for High Fidelity Image-to-Video Generation | Weijie Li et.al. | 2403.02827 | null |
2024-03-05 | Fast, Scale-Adaptive, and Uncertainty-Aware Downscaling of Earth System Model Fields with Generative Foundation Models | Philipp Hess et.al. | 2403.02774 | null |
2024-03-05 | Few-shot Learner Parameterization by Diffusion Time-steps | Zhongqi Yue et.al. | 2403.02649 | null |
2024-03-05 | Semantic Human Mesh Reconstruction with Textures | Xiaoyu Zhan et.al. | 2403.02561 | null |
2024-03-05 | Updating the Minimum Information about CLinical Artificial Intelligence (MI-CLAIM) checklist for generative modeling research | Brenda Y. Miao et.al. | 2403.02558 | link |
2024-03-05 | UniCtrl: Improving the Spatiotemporal Consistency of Text-to-Video Diffusion Models via Training-Free Unified Attention Control | Xuweiyi Chen et.al. | 2403.02332 | link |
2024-03-04 | 3DTopia: Large Text-to-3D Generation Model with Hybrid Diffusion Priors | Fangzhou Hong et.al. | 2403.02234 | link |
2024-03-04 | DragTex: Generative Point-Based Texture Editing on 3D Mesh | Yudi Zhang et.al. | 2403.02217 | null |
2024-03-04 | ResAdapter: Domain Consistent Resolution Adapter for Diffusion Models | Jiaxiang Cheng et.al. | 2403.02084 | null |
2024-03-04 | FaceChain-ImagineID: Freely Crafting High-Fidelity Diverse Talking Faces from Disentangled Audio | Chao Xu et.al. | 2403.01901 | link |
2024-03-04 | ViewDiff: 3D-Consistent Image Generation with Text-to-Image Models | Lukas Höllein et.al. | 2403.01807 | link |
2024-03-02 | DiffSal: Joint Audio and Video Learning for Diffusion Saliency Prediction | Junwen Xiong et.al. | 2403.01226 | null |
2024-03-02 | TCIG: Two-Stage Controlled Image Generation with Quality Enhancement through Diffusion | Salaheldin Mohamed et.al. | 2403.01212 | null |
2024-03-02 | Training Unbiased Diffusion Models From Biased Dataset | Yeongmin Kim et.al. | 2403.01189 | link |
2024-03-02 | Volume diffusion modelling of a sheared granular gas | Duncan Dockar et.al. | 2403.01188 | null |
2024-03-02 | Text-guided Explorable Image Super-resolution | Kanchana Vaishnavi Gandikota et.al. | 2403.01124 | null |
2024-03-02 | Face Swap via Diffusion Model | Feifei Wang et.al. | 2403.01108 | null |
2024-03-01 | A time-stepping deep gradient flow method for option pricing in (rough) diffusion models | Antonis Papapantoleon et.al. | 2403.00746 | null |
2024-03-01 | Diff-Plugin: Revitalizing Details for Diffusion-based Low-level Tasks | Yuhao Liu et.al. | 2403.00644 | null |
2024-03-01 | Improving Explicit Spatial Relationships in Text-to-Image Generation through an Automatically Derived Dataset | Ander Salaberria et.al. | 2403.00587 | link |
2024-03-01 | Rethinking cluster-conditioned diffusion models | Nikolas Adaloglou et.al. | 2403.00570 | null |
2024-03-01 | Waves, patterns and bifurcations: a tutorial review on the vertebrate segmentation clock | Paul François et.al. | 2403.00457 | null |
2024-03-01 | An Ordinal Diffusion Model for Generating Medical Images with Different Severity Levels | Shumpei Takezaki et.al. | 2403.00452 | null |
2024-03-01 | LoMOE: Localized Multi-Object Editing via Multi-Diffusion | Goirik Chakrabarty et.al. | 2403.00437 | null |
2024-03-01 | Abductive Ego-View Accident Video Understanding for Safe Driving Perception | Jianwu Fang et.al. | 2403.00436 | null |
2024-03-01 | HyperSDFusion: Bridging Hierarchical Structures in Language and Geometry for Enhanced 3D Text2Shape Generation | Zhiying Leng et.al. | 2403.00372 | null |
2024-03-01 | Robust Policy Learning via Offline Skill Diffusion | Woo Kyung Kim et.al. | 2403.00225 | null |
2024-02-29 | DistriFusion: Distributed Parallel Inference for High-Resolution Diffusion Models | Muyang Li et.al. | 2402.19481 | null |
2024-02-29 | Towards Generalizable Tumor Synthesis | Qi Chen et.al. | 2402.19470 | null |
2024-02-29 | Listening to the Noise: Blind Denoising with Gibbs Diffusion | David Heurtel-Depeiges et.al. | 2402.19455 | link |
2024-02-29 | Structure Preserving Diffusion Models | Haoye Lu et.al. | 2402.19369 | null |
2024-02-29 | A Novel Approach to Industrial Defect Generation through Blended Latent Diffusion Model with Online Adaptation | Hanxi Li et.al. | 2402.19330 | null |
2024-02-29 | DiffAssemble: A Unified Graph-Diffusion Model for 2D and 3D Reassembly | Gianluca Scarpellini et.al. | 2402.19302 | link |
2024-02-29 | TEncDM: Understanding the Properties of Diffusion Model in the Space of Language Model Encodings | Alexander Shabalin et.al. | 2402.19097 | null |
2024-03-01 | Graph Convolutional Neural Networks for Automated Echocardiography View Recognition: A Holistic Approach | Sarina Thomas et.al. | 2402.19062 | null |
2024-02-29 | WDM: 3D Wavelet Diffusion Models for High-Resolution Medical Image Synthesis | Paul Friedrich et.al. | 2402.19043 | link |
2024-02-29 | Generating, Reconstructing, and Representing Discrete and Continuous Data: Generalized Diffusion with Learnable Encoding-Decoding | Guangyi Liu et.al. | 2402.19009 | null |
2024-02-29 | ViewFusion: Towards Multi-View Consistency via Interpolated Denoising | Xianghui Yang et.al. | 2402.18842 | link |
2024-02-29 | Extended Flow Matching: a Method of Conditional Generation with Generalized Continuity Equation | Noboru Isobe et.al. | 2402.18839 | null |
2024-02-29 | A Quantitative Evaluation of Score Distillation Sampling Based Text-to-3D | Xiaohan Fei et.al. | 2402.18780 | null |
2024-02-28 | Exploring Privacy and Fairness Risks in Sharing Diffusion Models: An Adversarial Perspective | Xinjian Luo et.al. | 2402.18607 | null |
2024-02-28 | Logarithmic Sobolev Inequalities for Bounded Domains and Applications to Drift-Diffusion Equations | Elie Abdo et.al. | 2402.18572 | null |
2024-02-28 | Dynamical Regimes of Diffusion Models | Giulio Biroli et.al. | 2402.18491 | null |
2024-02-28 | Deep Confident Steps to New Pockets: Strategies for Docking Generalization | Gabriele Corso et.al. | 2402.18396 | link |
2024-02-28 | Objective and Interpretable Breast Cosmesis Evaluation with Attention Guided Denoising Diffusion Anomaly Detection Model | Sangjoon Park et.al. | 2402.18362 | null |
2024-02-28 | FineDiffusion: Scaling up Diffusion Models for Fine-grained Image Generation with 10,000 Classes | Ziying Pan et.al. | 2402.18331 | link |
2024-02-28 | Balancing Act: Distribution-Guided Debiasing in Diffusion Models | Rishubh Parihar et.al. | 2402.18206 | null |
2024-02-28 | Diffusion-based Neural Network Weights Generation | Bedionita Soro et.al. | 2402.18153 | null |
2024-02-28 | Context-aware Talking Face Video Generation | Meidai Xuanyuan et.al. | 2402.18092 | null |
2024-02-28 | Coarse-to-Fine Latent Diffusion for Pose-Guided Person Image Synthesis | Yanzuo Lu et.al. | 2402.18078 | link |
2024-02-28 | SynArtifact: Classifying and Alleviating Artifacts in Synthetic Images via Vision-Language Model | Bin Cao et.al. | 2402.18068 | null |
2024-02-28 | Diffusion Models as Constrained Samplers for Optimization with Unknown Constraints | Lingkai Kong et.al. | 2402.18012 | null |
2024-02-28 | Imagine, Initialize, and Explore: An Effective Exploration Method in Multi-Agent Reinforcement Learning | Zeyang Liu et.al. | 2402.17978 | null |
2024-02-27 | Box It to Bind It: Unified Layout Control and Attribute Binding in T2I Diffusion Models | Ashkan Taghipour et.al. | 2402.17910 | null |
2024-02-27 | Diffusion Meets DAgger: Supercharging Eye-in-hand Imitation Learning | Xiaoyu Zhang et.al. | 2402.17768 | null |
2024-02-27 | Structure-Guided Adversarial Training of Diffusion Models | Ling Yang et.al. | 2402.17563 | null |
2024-02-27 | Diffusion Model-Based Image Editing: A Survey | Yi Huang et.al. | 2402.17525 | link |
2024-02-27 | Label-Noise Robust Diffusion Models | Byeonghu Na et.al. | 2402.17517 | link |
2024-02-27 | EMO: Emote Portrait Alive - Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions | Linrui Tian et.al. | 2402.17485 | null |
2024-02-28 | DiffuseKronA: A Parameter Efficient Fine-tuning Method for Personalized Diffusion Models | Shyam Marjit et.al. | 2402.17412 | null |
2024-02-27 | Generative diffusion model for surface structure discovery | Nikolaj Rønne et.al. | 2402.17404 | null |
2024-02-27 | Denoising Diffusion Models for Inpainting of Healthy Brain Tissue | Alicia Durrer et.al. | 2402.17307 | null |
2024-02-27 | DivAvatar: Diverse 3D Avatar Generation with a Single Prompt | Weijing Tao et.al. | 2402.17292 | null |
2024-02-27 | Enhancing Hyperspectral Images via Diffusion Model and Group-Autoencoder Super-resolution Network | Zhaoyang Wang et.al. | 2402.17285 | null |
2024-02-27 | DiFashion: Towards Personalized Outfit Generation | Yiyan Xu et.al. | 2402.17279 | null |
2024-02-27 | One-Shot Structure-Aware Stylized Image Synthesis | Hansam Cho et.al. | 2402.17275 | null |
2024-02-27 | Playground v2.5: Three Insights towards Enhancing Aesthetic Quality in Text-to-Image Generation | Daiqing Li et.al. | 2402.17245 | null |
2024-02-27 | CharacterGen: Efficient 3D Character Generation from Single Images with Multi-View Pose Canonicalization | Hao-Yang Peng et.al. | 2402.17214 | null |
2024-02-27 | Generative Learning for Forecasting the Dynamics of Complex Systems | Han Gao et.al. | 2402.17157 | null |
2024-02-27 | TaxDiff: Taxonomic-Guided Diffusion Model for Protein Sequence Generation | Lin Zongying et.al. | 2402.17156 | link |
2024-02-27 | SAM-DiffSR: Structure-Modulated Diffusion Model for Image Super-Resolution | Chengcheng Wang et.al. | 2402.17133 | link |
2024-02-27 | Transparent Image Layer Diffusion using Latent Transparency | Lvmin Zhang et.al. | 2402.17113 | null |
2024-02-26 | Renormalization Group flow, Optimal Transport and Diffusion-based Generative Model | Artan Sheshmani et.al. | 2402.17090 | null |
2024-02-26 | A Phase Transition in Diffusion Models Reveals the Hierarchical Nature of Data | Antonio Sclocchi et.al. | 2402.16991 | null |
2024-02-26 | Stochastic Conditional Diffusion Models for Semantic Image Synthesis | Juyeon Ko et.al. | 2402.16506 | null |
2024-02-26 | Outline-Guided Object Inpainting with Diffusion Models | Markus Pobitzer et.al. | 2402.16421 | null |
2024-02-26 | Placing Objects in Context via Inpainting for Out-of-distribution Segmentation | Pau de Jorge et.al. | 2402.16392 | link |
2024-02-26 | Generative AI in Vision: A Survey on Models, Metrics and Applications | Gaurav Raut et.al. | 2402.16369 | null |
2024-02-26 | Feedback Efficient Online Fine-Tuning of Diffusion Models | Masatoshi Uehara et.al. | 2402.16359 | null |
2024-02-26 | Graph Diffusion Policy Optimization | Yijing Liu et.al. | 2402.16302 | link |
2024-02-25 | Photon-counting CT using a Conditional Diffusion Model for Super-resolution and Texture-preservation | Christopher Wiedeman et.al. | 2402.16212 | null |
2024-02-25 | Towards Efficient Quantum Hybrid Diffusion Models | Francesca De Falco et.al. | 2402.16147 | null |
2024-02-25 | Cinematographic Camera Diffusion Model | Hongda Jiang et.al. | 2402.16143 | null |
2024-02-25 | Behavioral Refinement via Interpolant-based Policy Diffusion | Kaiqi Chen et.al. | 2402.16075 | null |
2024-02-24 | HIR-Diff: Unsupervised Hyperspectral Image Restoration Via Improved Diffusion Models | Li Pang et.al. | 2402.15865 | link |
2024-02-23 | Minimax Optimality of Score-based Diffusion Models: Beyond the Density Lower Bound Assumptions | Kaihong Zhang et.al. | 2402.15602 | null |
2024-02-23 | Gen4Gen: Generative Data Pipeline for Generative Multi-Concept Composition | Chun-Hsiao Yeh et.al. | 2402.15504 | link |
2024-02-23 | ProTIP: Probabilistic Robustness Verification on Text-to-Image Diffusion Models against Stochastic Perturbation | Yi Zhang et.al. | 2402.15429 | link |
2024-02-23 | Let's Rectify Step by Step: Improving Aspect-based Sentiment Analysis with Diffusion Models | Shunyu Liu et.al. | 2402.15289 | link |
2024-02-23 | Weak Reproductive Solutions for a Convection-Diffusion Model Describing a Binary Alloy Solidification Processes | Blanca Climent-Ezquerra et.al. | 2402.15221 | null |
2024-02-23 | Label-efficient Multi-organ Segmentation Method with Diffusion Model | Yongzhi Huang et.al. | 2402.15216 | null |
2024-02-23 | Fine-Tuning of Continuous-Time Diffusion Models as Entropy-Regularized Control | Masatoshi Uehara et.al. | 2402.15194 | null |
2024-02-23 | Dynamics-Guided Diffusion Model for Robot Manipulator Design | Xiaomeng Xu et.al. | 2402.15038 | null |
2024-02-22 | Cameras as Rays: Pose Estimation via Ray Diffusion | Jason Y. Zhang et.al. | 2402.14817 | null |
2024-02-22 | Customize-A-Video: One-Shot Motion Customization of Text-to-Video Diffusion Models | Yixuan Ren et.al. | 2402.14780 | null |
2024-02-22 | Debiasing Text-to-Image Diffusion Models | Ruifei He et.al. | 2402.14577 | null |
2024-02-22 | Model-Based Reinforcement Learning Control of Reaction-Diffusion Problems | Christina Schenk et.al. | 2402.14446 | null |
2024-02-22 | Large-Scale Actionless Video Pre-Training via Discrete Diffusion for Efficient Policy Learning | Haoran He et.al. | 2402.14407 | null |
2024-02-22 | Diffusion Model Based Visual Compensation Guidance and Visual Difference Analysis for No-Reference Image Quality Assessment | Zhaoyang Wang et.al. | 2402.14401 | null |
2024-02-22 | Typographic Text Generation with Off-the-Shelf Diffusion Model | KhayTze Peong et.al. | 2402.14314 | null |
2024-02-22 | Font Style Interpolation with Diffusion Models | Tetta Kondo et.al. | 2402.14311 | null |
2024-02-23 | Symbolic Music Generation with Non-Differentiable Rule Guided Diffusion | Yujia Huang et.al. | 2402.14285 | link |
2024-02-22 | MVD | Xin-Yang Zheng et.al. | 2402.14253 | null |
2024-02-21 | T-Stitch: Accelerating Sampling in Pre-Trained Diffusion Models with Trajectory Stitching | Zizheng Pan et.al. | 2402.14167 | link |
2024-02-21 | Non-asymptotic Convergence of Discrete-time Diffusion Models: New Approach and Improved Rate | Yuchen Liang et.al. | 2402.13901 | null |
2024-02-21 | NeuralDiffuser: Controllable fMRI Reconstruction with Primary Visual Feature Guided Diffusion | Haoyu Li et.al. | 2402.13809 | null |
2024-02-22 | Deep Generative Models for Offline Policy Learning: Tutorial, Survey, and Perspectives on Future Directions | Jiayu Chen et.al. | 2402.13777 | null |
2024-02-21 | Cas-DiffCom: Cascaded diffusion model for infant longitudinal super-resolution 3D medical image completion | Lianghu Guo et.al. | 2402.13776 | null |
2024-02-21 | Music Style Transfer with Time-Varying Inversion of Diffusion Models | Sifei Li et.al. | 2402.13763 | null |
2024-02-21 | SRNDiff: Short-term Rainfall Nowcasting with Condition Diffusion Model | Xudong Ling et.al. | 2402.13737 | null |
2024-02-21 | Hybrid Video Diffusion Models with 2D Triplane and 3D Wavelet Representation | Kihong Kim et.al. | 2402.13729 | null |
2024-02-21 | Flexible Physical Camouflage Generation Based on a Differential Approach | Yang Li et.al. | 2402.13575 | null |
2024-02-21 | ToDo: Token Downsampling for Efficient Generation of High-Resolution Images | Ethan Smith et.al. | 2402.13573 | null |
2024-02-21 | Generative AI for Secure Physical Layer Communications: A Survey | Changyuan Zhao et.al. | 2402.13553 | null |
2024-02-21 | DiffPLF: A Conditional Diffusion Model for Probabilistic Forecasting of EV Charging Load | Siyang Li et.al. | 2402.13548 | link |
2024-02-21 | Contrastive Prompts Improve Disentanglement in Text-to-Image Diffusion Models | Chen Wu et.al. | 2402.13490 | null |
2024-02-20 | Layout-to-Image Generation with Localized Descriptions using ControlNet with Cross-Attention Control | Denis Lukovnikov et.al. | 2402.13404 | null |
2024-02-20 | The Uncanny Valley: A Comprehensive Analysis of Diffusion Models | Karam Ghanem et.al. | 2402.13369 | null |
2024-02-20 | Neural Network Diffusion | Kai Wang et.al. | 2402.13144 | link |
2024-02-20 | Text-Guided Molecule Generation with Diffusion Language Model | Haisong Gong et.al. | 2402.13040 | link |
2024-02-21 | Visual Style Prompting with Swapping Self-Attention | Jaeseok Jeong et.al. | 2402.12974 | null |
2024-02-20 | CLIPping the Deception: Adapting Vision-Language Models for Universal Deepfake Detection | Sohail Ahmed Khan et.al. | 2402.12927 | null |
2024-02-20 | RealCompo: Dynamic Equilibrium between Realism and Compositionality Improves Text-to-Image Diffusion Models | Xinchen Zhang et.al. | 2402.12908 | link |
2024-02-20 | Two-stage Rainfall-Forecasting Diffusion Model | XuDong Ling et.al. | 2402.12779 | link |
2024-02-20 | MuLan: Multimodal-LLM Agent for Progressive Multi-Object Diffusion | Sen Li et.al. | 2402.12741 | link |
2024-02-20 | Diffusion Posterior Sampling is Computationally Intractable | Shivam Gupta et.al. | 2402.12727 | null |
2024-02-20 | MVDiffusion++: A Dense High-resolution Multi-view Diffusion Model for Single or Sparse-view 3D Object Reconstruction | Shitao Tang et.al. | 2402.12712 | null |
2024-02-20 | SingVisio: Visual Analytics of Diffusion Model for Singing Voice Conversion | Liumeng Xue et.al. | 2402.12660 | null |
2024-02-20 | DiffusionNOCS: Managing Symmetry and Uncertainty in Sim2Real Multi-Modal Category-level Pose Estimation | Takuya Ikeda et.al. | 2402.12647 | null |
2024-02-19 | Hierarchical Bayes Approach to Personalized Federated Unsupervised Learning | Kaan Ozkara et.al. | 2402.12537 | null |
2024-02-19 | Improving Deep Generative Models on Many-To-One Image-to-Image Translation | Sagar Saxena et.al. | 2402.12531 | null |
2024-02-19 | On the Semantic Latent Space of Diffusion-Based Text-to-Speech Models | Miri Varshavsky Hassid et.al. | 2402.12423 | null |
2024-02-19 | FiT: Flexible Vision Transformer for Diffusion Model | Zeyu Lu et.al. | 2402.12376 | link |
2024-02-19 | Synthetic location trajectory generation using categorical diffusion models | Simon Dirmeier et.al. | 2402.12242 | link |
2024-02-19 | Adversarial Feature Alignment: Balancing Robustness and Accuracy in Deep Learning via Adversarial Training | Leo Hyun Park et.al. | 2402.12187 | null |
2024-02-19 | Human Video Translation via Query Warping | Haiming Zhu et.al. | 2402.12099 | null |
2024-02-19 | Direct Consistency Optimization for Compositional Text-to-Image Personalization | Kyungmin Lee et.al. | 2402.12004 | null |
2024-02-19 | Privacy-Preserving Low-Rank Adaptation for Latent Diffusion Models | Zihao Luo et.al. | 2402.11989 | link |
2024-02-19 | DiLightNet: Fine-grained Lighting Control for Diffusion-based Image Generation | Chong Zeng et.al. | 2402.11929 | null |
2024-02-19 | A Generative Pre-Training Framework for Spatio-Temporal Graph Transfer Learning | Yuan Yuan et.al. | 2402.11922 | null |
2024-02-19 | ComFusion: Personalized Subject Generation in Multiple Specific Scenes From Single Image | Yan Hong et.al. | 2402.11849 | null |
2024-02-19 | UnlearnCanvas: A Stylized Image Dataset to Benchmark Machine Unlearning for Diffusion Models | Yihua Zhang et.al. | 2402.11846 | null |
2024-02-19 | WildFake: A Large-scale Challenging Dataset for AI-Generated Images Detection | Yan Hong et.al. | 2402.11843 | null |
2024-02-19 | Statistical Test for Generated Hypotheses by Diffusion Models | Teruyuki Katsuoka et.al. | 2402.11789 | null |
2024-02-19 | Towards Theoretical Understandings of Self-Consuming Generative Models | Shi Fu et.al. | 2402.11778 | null |
2024-02-18 | SDiT: Spiking Diffusion Model with Transformer | Shu Yang et.al. | 2402.11588 | null |
2024-02-18 | CaloGraph: Graph-based diffusion model for fast shower generation in calorimeters with irregular geometry | Dmitrii Kobylianskii et.al. | 2402.11575 | null |
2024-02-18 | Temporal Disentangled Contrastive Diffusion Model for Spatiotemporal Imputation | Yakun Chen et.al. | 2402.11558 | null |
2024-02-18 | Visual Concept-driven Image Generation with Text-to-Image Diffusion Model | Tanzila Rahman et.al. | 2402.11487 | null |
2024-02-17 | Partial Ly | Georg Wolschin et.al. | 2402.11320 | null |
2024-02-17 | TC-DiffRecon: Texture coordination MRI reconstruction method based on diffusion model and modified MF-UNet method | Chenyan Zhang et.al. | 2402.11274 | link |
2024-02-17 | DiffPoint: Single and Multi-view Point Cloud Reconstruction with ViT Based Diffusion Model | Yu Feng et.al. | 2402.11241 | null |
2024-02-16 | 3D Diffuser Actor: Policy Diffusion with 3D Scene Representations | Tsung-Wei Ke et.al. | 2402.10885 | null |
2024-02-16 | Training Class-Imbalanced Diffusion Model Via Overlap Optimization | Divin Yan et.al. | 2402.10821 | link |
2024-02-16 | VATr++: Choose Your Words Wisely for Handwritten Text Generation | Bram Vanherle et.al. | 2402.10798 | null |
2024-02-16 | Rethinking Human-like Translation Strategy: Integrating Drift-Diffusion Model with Large Language Models for Machine Translation | Hongbin Na et.al. | 2402.10699 | null |
2024-02-16 | Generative AI and Attentive User Interfaces: Five Strategies to Enhance Take-Over Quality in Automated Driving | Patrick Ebel et.al. | 2402.10664 | null |
2024-02-16 | Speaking in Wavelet Domain: A Simple and Efficient Approach to Speed up Speech Diffusion Model | Xiangyu Zhang et.al. | 2402.10642 | null |
2024-02-16 | U | Ziqi Gao et.al. | 2402.10609 | null |
2024-02-16 | A maximum likelihood estimation of Lévy-driven stochastic systems for univariate and multivariate time series of observations | Babak M. S. Arani et.al. | 2402.10608 | null |
2024-02-16 | Make a Cheap Scaling: A Self-Cascade Diffusion Model for Higher-Resolution Adaptation | Lanqing Guo et.al. | 2402.10491 | null |
2024-02-16 | Explaining generative diffusion models via visual analysis for interpretable decision-making process | Ji-Hoon Park et.al. | 2402.10404 | null |
2024-02-15 | GaussianObject: Just Taking Four Images to Get A High-Quality 3D Object with Gaussian Splatting | Chen Yang et.al. | 2402.10259 | null |
2024-02-15 | Self-Play Fine-Tuning of Diffusion Models for Text-to-Image Generation | Huizhuo Yuan et.al. | 2402.10210 | null |
2024-02-15 | Rewards-in-Context: Multi-objective Alignment of Foundation Models with Dynamic Preference Adjustment | Rui Yang et.al. | 2402.10207 | null |
2024-02-15 | Radio-astronomical Image Reconstruction with Conditional Denoising Diffusion Model | Mariia Drozdova et.al. | 2402.10204 | link |
2024-02-15 | Classification Diffusion Models | Shahar Yadin et.al. | 2402.10095 | null |
2024-02-15 | Diffusion Models Meet Contextual Bandits with Large Action Spaces | Imad Aouali et.al. | 2402.10028 | null |
2024-02-16 | Zero-Shot Unsupervised and Text-Based Audio Editing Using DDPM Inversion | Hila Manor et.al. | 2402.10009 | null |
2024-02-15 | Accelerating Parallel Sampling of Diffusion Models | Zhiwei Tang et.al. | 2402.09970 | null |
2024-02-15 | Textual Localization: Decomposing Multi-concept Images for Subject-Driven Text-to-Image Generation | Junjie Shentu et.al. | 2402.09966 | link |
2024-02-15 | Lester: rotoscope animation through video object segmentation and tracking | Ruben Tous et.al. | 2402.09883 | link |
2024-02-15 | Diffusion Models for Audio Restoration | Jean-Marie Lemercier et.al. | 2402.09821 | null |
2024-02-15 | DreamMatcher: Appearance Matching Self-Attention for Semantically-Consistent Text-to-Image Personalization | Jisu Nam et.al. | 2402.09812 | null |
2024-02-15 | Diffusion Model with Cross Attention as an Inductive Bias for Disentanglement | Tao Yang et.al. | 2402.09712 | null |
2024-02-14 | Synthesizing Knowledge-enhanced Features for Real-world Zero-shot Food Detection | Pengfei Zhou et.al. | 2402.09242 | link |
2024-02-14 | Semi-Supervised Diffusion Model for Brain Age Prediction | Ayodeji Ijishakin et.al. | 2402.09137 | null |
2024-02-14 | L3GO: Language Agents with Chain-of-3D-Thoughts for Generating Unconventional Objects | Yutaro Yamada et.al. | 2402.09052 | null |
2024-02-14 | Extreme Video Compression with Pre-trained Diffusion Models | Bohan Li et.al. | 2402.08934 | link |
2024-02-14 | The Mirrored Influence Hypothesis: Efficient Data Influence Estimation by Harnessing Forward Passes | Myeongseob Ko et.al. | 2402.08922 | null |
2024-02-13 | Percolating transition to turbulence without puffs or bands | Sébastien Gomé et.al. | 2402.08829 | null |
2024-02-13 | LDTrack: Dynamic People Tracking by Service Robots using Diffusion Models | Angus Fung et.al. | 2402.08774 | null |
2024-02-13 | Towards the Detection of AI-Synthesized Human Face Images | Yuhang Lu et.al. | 2402.08750 | null |
2024-02-13 | PRDP: Proximal Reward Difference Prediction for Large-Scale Reward Finetuning of Diffusion Models | Fei Deng et.al. | 2402.08714 | null |
2024-02-13 | Zero Shot Molecular Generation via Similarity Kernels | Rokas Elijošius et.al. | 2402.08708 | link |
2024-02-13 | Chain Reaction of Ideas: Can Radioactive Decay Predict Technological Innovation? | Guilherme S. Y. Giardini et.al. | 2402.08681 | null |
2024-02-13 | Target Score Matching | Valentin De Bortoli et.al. | 2402.08667 | null |
2024-02-13 | Learning Continuous 3D Words for Text-to-Image Generation | Ta-Ying Cheng et.al. | 2402.08654 | null |
2024-02-14 | Denoising Diffusion Restoration Tackles Forward and Inverse Problems for the Laplace Operator | Amartya Mukherjee et.al. | 2402.08563 | null |
2024-02-13 | Confronting Reward Overoptimization for Diffusion Models: A Perspective of Inductive and Primacy Biases | Ziyi Zhang et.al. | 2402.08552 | null |
2024-02-13 | A Dense Reward View on Aligning Text-to-Image Diffusion with Preference | Shentao Yang et.al. | 2402.08265 | link |
2024-02-13 | Fine-Tuning Text-To-Image Diffusion Models for Class-Wise Spurious Feature Generation | AprilPyone MaungMaung et.al. | 2402.08200 | null |
2024-02-14 | Convergence Analysis of Discrete Diffusion Model: Exact Implementation through Uniformization | Hongrui Chen et.al. | 2402.08095 | null |
2024-02-12 | Nearest Neighbour Score Estimators for Diffusion Generative Models | Matthew Niedoba et.al. | 2402.08018 | null |
2024-02-12 | Towards a mathematical theory for consistency training in diffusion models | Gen Li et.al. | 2402.07802 | null |
2024-02-12 | Diffusion of Thoughts: Chain-of-Thought Reasoning in Diffusion Language Models | Jiacheng Ye et.al. | 2402.07754 | null |
2024-02-12 | Cosmology at the Field Level with Probabilistic Machine Learning | Adam Rouhiainen et.al. | 2402.07694 | null |
2024-02-12 | Trustworthy SR: Resolving Ambiguity in Image Super-resolution via Diffusion Models and Human Feedback | Cansu Korkmaz et.al. | 2402.07597 | null |
2024-02-12 | Score-based Diffusion Models via Stochastic Differential Equations -- a Technical Tutorial | Wenpin Tang et.al. | 2402.07487 | null |
2024-02-13 | SALAD: Smart AI Language Assistant Daily | Ragib Amin Nihal et.al. | 2402.07431 | null |
2024-02-12 | Diff-RNTraj: A Structure-aware Diffusion Model for Road Network-constrained Trajectory Generation | Tonglong Wei et.al. | 2402.07369 | null |
2024-02-11 | Stitching Sub-Trajectories with Conditional Diffusion Model for Goal-Conditioned Offline RL | Sungyoon Kim et.al. | 2402.07226 | link |
2024-02-13 | Towards Fast Stochastic Sampling in Diffusion Generative Models | Kushagra Pandey et.al. | 2402.07211 | null |
2024-02-10 | Synthesizing CTA Image Data for Type-B Aortic Dissection using Stable Diffusion Models | Ayman Abaid et.al. | 2402.06969 | null |
2024-02-09 | Towards Principled Assessment of Tabular Data Synthesis Algorithms | Yuntao Du et.al. | 2402.06806 | link |
2024-02-09 | Diffusion-ES: Gradient-free Planning with Diffusion for Autonomous Driving and Zero-Shot Instruction Following | Brian Yang et.al. | 2402.06559 | null |
2024-02-09 | Sequential Flow Matching for Generative Modeling | Jongmin Yoon et.al. | 2402.06461 | null |
2024-02-09 | ControlUDA: Controllable Diffusion-assisted Unsupervised Domain Adaptation for Cross-Weather Semantic Segmentation | Fengyi Shen et.al. | 2402.06446 | null |
2024-02-09 | Improving 2D-3D Dense Correspondences with Diffusion Models for 6D Object Pose Estimation | Peter Hönig et.al. | 2402.06436 | null |
2024-02-09 | Particle Denoising Diffusion Sampler | Angus Phillips et.al. | 2402.06320 | link |
2024-02-09 | Controllable seismic velocity synthesis using generative diffusion models | Fu Wang et.al. | 2402.06277 | null |
2024-02-09 | MusicMagus: Zero-Shot Text-to-Music Editing via Diffusion Models | Yixiao Zhang et.al. | 2402.06178 | null |
2024-02-08 | CLR-Face: Conditional Latent Refinement for Blind Face Restoration Using Score-Based Diffusion Models | Maitreya Suin et.al. | 2402.06106 | null |
2024-02-08 | Animated Stickers: Bringing Stickers to Life with Video Diffusion | David Yan et.al. | 2402.06088 | null |
2024-02-08 | DiscDiff: Latent Diffusion Model for DNA Sequence Generation | Zehui Li et.al. | 2402.06079 | null |
2024-02-08 | InstaGen: Enhancing Object Detection by Training on Synthetic Dataset | Chengjian Feng et.al. | 2402.05937 | null |
2024-02-08 | Time Series Diffusion in the Frequency Domain | Jonathan Crabbé et.al. | 2402.05933 | link |
2024-02-08 | AvatarMMC: 3D Head Avatar Generation and Editing with Multi-Modal Conditioning | Wamiq Reyaz Para et.al. | 2402.05803 | null |
2024-02-08 | DiffSpeaker: Speech-Driven 3D Facial Animation with Diffusion Transformer | Zhiyuan Ma et.al. | 2402.05712 | link |
2024-02-08 | Scalable Diffusion Models with State Space Backbone | Zhengcong Fei et.al. | 2402.05608 | link |
2024-02-08 | Get What You Want, Not What You Don't: Image Content Suppression for Text-to-Image Diffusion Models | Senmao Li et.al. | 2402.05375 | link |
2024-02-08 | Descanning: From Scanned to the Original Images with a Color Correction Diffusion Model | Junghun Cha et.al. | 2402.05350 | null |
2024-02-07 | SPAD : Spatially Aware Multiview Diffusers | Yash Kant et.al. | 2402.05235 | null |
2024-02-09 | Anatomically-Controllable Medical Image Generation with Segmentation-Guided Diffusion Models | Nicholas Konz et.al. | 2402.05210 | link |
2024-02-07 | | Maitreya Patel et.al. | 2402.05195 | null |
2024-02-07 | On diffusion models for amortized inference: Benchmarking and improving stochastic control and sampling | Marcin Sendera et.al. | 2402.05098 | link |
2024-02-07 | NITO: Neural Implicit Fields for Resolution-free Topology Optimization | Amin Heyrani Nobari et.al. | 2402.05073 | null |
2024-02-07 | LGM: Large Multi-View Gaussian Model for High-Resolution 3D Content Creation | Jiaxiang Tang et.al. | 2402.05054 | null |
2024-02-07 | Generative Flows on Discrete State-Spaces: Enabling Multimodal Flows with Applications to Protein Co-Design | Andrew Campbell et.al. | 2402.04997 | link |
2024-02-07 | Blue noise for diffusion models | Xingchang Huang et.al. | 2402.04930 | null |
2024-02-07 | Source-Free Domain Adaptation with Diffusion-Guided Source Data Generation | Shivang Chopra et.al. | 2402.04929 | null |
2024-02-07 | Towards Aligned Layout Generation via Diffusion Model with Aesthetic Constraints | Jian Chen et.al. | 2402.04754 | null |
2024-02-07 | Cortical Surface Diffusion Generative Models | Zhenshan Xie et.al. | 2402.04753 | null |
2024-02-07 | EvoSeed: Unveiling the Threat on Deep Neural Networks with Real-World Illusions | Shashank Kotyan et.al. | 2402.04699 | link |
2024-02-07 | Noise Map Guidance: Inversion with Spatial Context for Real Image Editing | Hansam Cho et.al. | 2402.04625 | link |
2024-02-07 | BRI3L: A Brightness Illusion Image Dataset for Identification and Localization of Regions of Illusory Perception | Aniket Roy et.al. | 2402.04541 | link |
2024-02-07 | Text2Street: Controllable Text-to-image Generation for Street Views | Jinming Su et.al. | 2402.04504 | null |
2024-02-06 | Fine-Tuned Language Models Generate Stable Inorganic Materials as Text | Nate Gruver et.al. | 2402.04379 | link |
2024-02-06 | Bidirectional Autoregressive Diffusion Model for Dance Generation | Canyu Zhang et.al. | 2402.04356 | null |
2024-02-06 | Polyp-DDPM: Diffusion-Based Semantic Polyp Synthesis for Enhanced Segmentation | Zolnamar Dorjsembe et.al. | 2402.04031 | link |
2024-02-06 | Space Group Constrained Crystal Generation | Rui Jiao et.al. | 2402.03992 | null |
2024-02-06 | Controllable Diverse Sampling for Diffusion Based Motion Behavior Forecasting | Yiming Xu et.al. | 2402.03981 | null |
2024-02-06 | EscherNet: A Generative Model for Scalable View Synthesis | Xin Kong et.al. | 2402.03908 | null |
2024-02-06 | On gauge freedom, conservativity and intrinsic dimensionality estimation in diffusion models | Christian Horvat et.al. | 2402.03845 | null |
2024-02-06 | SDEMG: Score-based Diffusion Model for Surface Electromyographic Signal Denoising | Yu-Tung Liu et.al. | 2402.03808 | link |
2024-02-06 | FoolSDEdit: Deceptively Steering Your Edits Towards Targeted Attribute-aware Distribution | Qi Zhou et.al. | 2402.03705 | null |
2024-02-06 | Improving and Unifying Discrete&Continuous-time Discrete Denoising Diffusion | Lingxiao Zhao et.al. | 2402.03701 | null |
2024-02-06 | Pard: Permutation-Invariant Autoregressive Diffusion for Graph Generation | Lingxiao Zhao et.al. | 2402.03687 | null |
2024-02-06 | QuEST: Low-bit Diffusion Model Quantization via Efficient Selective Finetuning | Haoxuan Wang et.al. | 2402.03666 | null |
2024-02-05 | Diffusion World Model | Zihan Ding et.al. | 2402.03570 | null |
2024-02-05 | Projected Generative Diffusion Models for Constraint Satisfaction | Jacob K Christopher et.al. | 2402.03559 | null |
2024-02-05 | AnaMoDiff: 2D Analogical Motion Diffusion via Disentangled Denoising | Maham Tanveer et.al. | 2402.03549 | null |
2024-02-05 | Hyper-Diffusion: Estimating Epistemic and Aleatoric Uncertainty with a Single Model | Matthew A. Chan et.al. | 2402.03478 | null |
2024-02-05 | Denoising Diffusion via Image-Based Rendering | Titas Anciukevicius et.al. | 2402.03445 | null |
2024-02-05 | Do Diffusion Models Learn Semantically Meaningful and Efficient Representations? | Qiyao Liang et.al. | 2402.03305 | null |
2024-02-05 | Zero-shot Object-Level OOD Detection with Context-Aware Inpainting | Quang-Huy Nguyen et.al. | 2402.03292 | null |
2024-02-05 | InstanceDiffusion: Instance-level Control for Image Generation | Xudong Wang et.al. | 2402.03290 | null |
2024-02-06 | Organic or Diffused: Can We Distinguish Human Art from AI-generated Images? | Anna Yoo Jeong Ha et.al. | 2402.03214 | null |
2024-02-05 | Light and Optimal Schrödinger Bridge Matching | Nikita Gushchin et.al. | 2402.03207 | null |
2024-02-05 | Guidance with Spherical Gaussian Constraint for Conditional Diffusion | Lingxiao Yang et.al. | 2402.03201 | null |
2024-02-05 | Direct-a-Video: Customized Video Generation with User-Directed Camera Movement and Object Motion | Shiyuan Yang et.al. | 2402.03162 | null |
2024-02-05 | PFDM: Parser-Free Virtual Try-on via Diffusion Model | Yunfang Niu et.al. | 2402.03047 | null |
2024-02-05 | Diffusive Gibbs Sampling | Wenlin Chen et.al. | 2402.03008 | null |
2024-02-05 | DexDiffuser: Generating Dexterous Grasps with Diffusion Models | Zehang Weng et.al. | 2402.02989 | null |
2024-02-05 | Retrieval-Augmented Score Distillation for Text-to-3D Generation | Junyoung Seo et.al. | 2402.02972 | null |
2024-02-05 | ViewFusion: Learning Composable Diffusion Models for Novel View Synthesis | Bernard Spiegl et.al. | 2402.02906 | link |
2024-02-05 | SynthVision -- Harnessing Minimal Input for Maximal Output in Computer Vision Models using Synthetic Image data | Yudara Kularathne et.al. | 2402.02826 | null |
2024-02-05 | Extreme Two-View Geometry From Object Poses with Diffusion Models | Yujing Sun et.al. | 2402.02800 | null |
2024-02-05 | Contrastive Diffuser: Planning Towards High Return States via Contrastive Learning | Yixiang Shan et.al. | 2402.02772 | null |
2024-02-05 | DisDet: Exploring Detectability of Backdoor Attack on Diffusion Models | Yang Sui et.al. | 2402.02739 | null |
2024-02-04 | DiffEditor: Boosting Accuracy and Flexibility on Diffusion-based Image Editing | Chong Mou et.al. | 2402.02583 | link |
2024-02-04 | Latent Graph Diffusion: A Unified Framework for Generation and Prediction on Graphs | Zhou Cai et.al. | 2402.02518 | null |
2024-02-04 | PoCo: Policy Composition from and for Heterogeneous Robot Learning | Lirui Wang et.al. | 2402.02511 | null |
2024-02-04 | PromptRR: Diffusion Models as Prompt Generators for Single Image Reflection Removal | Tao Wang et.al. | 2402.02374 | link |
2024-02-02 | NeuroCine: Decoding Vivid Video Sequences from Human Brain Activties | Jingyuan Sun et.al. | 2402.01590 | null |
2024-02-02 | Boximator: Generating Rich and Controllable Motions for Video Synthesis | Jiawei Wang et.al. | 2402.01566 | null |
2024-02-02 | Cross-view Masked Diffusion Transformers for Person Image Synthesis | Trung X. Pham et.al. | 2402.01516 | null |
2024-02-02 | Conditioning non-linear and infinite-dimensional diffusion processes | Elizabeth Louise Baker et.al. | 2402.01434 | null |
2024-02-02 | Bass Accompaniment Generation via Latent Diffusion | Marco Pasini et.al. | 2402.01412 | null |
2024-02-02 | Cheating Suffix: Targeted Attack to Text-To-Image Diffusion Models with Multi-Modal Priors | Dingcheng Yang et.al. | 2402.01369 | link |
2024-02-02 | Unsupervised Generation of Pseudo Normal PET from MRI with Diffusion Model for Epileptic Focus Localization | Wentao Chen et.al. | 2402.01191 | null |
2024-02-01 | Unconditional Latent Diffusion Models Memorize Patient Imaging Data | Salman Ul Hassan Dar et.al. | 2402.01054 | null |
2024-02-01 | pop-cosmos: A comprehensive picture of the galaxy population from COSMOS data | Justin Alsing et.al. | 2402.00935 | null |
2024-02-01 | Data-Space Validation of High-Dimensional Models by Comparing Sample Quantiles | Stephen Thorp et.al. | 2402.00930 | null |
2024-02-01 | ViCA-NeRF: View-Consistency-Aware 3D Editing of Neural Radiance Fields | Jiahua Dong et.al. | 2402.00864 | link |
2024-02-01 | An Analysis of the Variance of Diffusion-based Speech Enhancement | Bunlong Lay et.al. | 2402.00811 | null |
2024-02-01 | Distilling Conditional Diffusion Models for Offline Reinforcement Learning through Trajectory Stitching | Shangzhe Li et.al. | 2402.00807 | null |
2024-02-01 | AnimateLCM: Accelerating the Animation of Personalized Diffusion Models and Adapters with Decoupled Consistency Learning | Fu-Yun Wang et.al. | 2402.00769 | link |
2024-01-31 | SeFi-IDE: Semantic-Fidelity Identity Embedding for Personalized Diffusion-Based Generation | Yang Li et.al. | 2402.00631 | null |
2024-02-01 | Cylindrically symmetric diffusion model for relativistic heavy-ion collisions | Johannes Hoelck et.al. | 2402.00628 | null |
2024-02-01 | CapHuman: Capture Your Moments in Parallel Universes | Chao Liang et.al. | 2402.00627 | link |
2024-02-01 | Masked Conditional Diffusion Model for Enhancing Deepfake Detection | Tiewen Chen et.al. | 2402.00541 | null |
2024-02-01 | Energetic Particles in the Central Starburst, Disc, and Halo of NGC253 | Yoel Rephaeli et.al. | 2402.00523 | null |
2024-02-01 | LRDif: Diffusion Models for Under-Display Camera Emotion Recognition | Zhifeng Wang et.al. | 2402.00250 | null |
2024-02-02 | SuperDiff: Diffusion Models for Conditional Generation of Hypothetical New Families of Superconductors | Samuel Yuan et.al. | 2402.00198 | null |
2024-01-31 | Motion Guidance: Diffusion-Based Image Editing with Differentiable Motion Estimators | Daniel Geng et.al. | 2401.18085 | null |
2024-01-31 | Ljusternik-Schnirelmann eigenvalues for the fractional $m-$Laplacian without the | Julian Fernandez Bonder et.al. | 2401.18041 | null |
2024-01-31 | Diagnosing the particle transport mechanism in the pulsar halo via X-ray observations | Qi-Zuo Wu et.al. | 2401.17982 | null |
2024-01-31 | Convergence Analysis for General Probability Flow ODEs of Diffusion Models in Wasserstein Distances | Xuefeng Gao et.al. | 2401.17958 | null |
2024-01-31 | AEROBLADE: Training-Free Detection of Latent Diffusion Images Using Autoencoder Reconstruction Error | Jonas Ricker et.al. | 2401.17879 | null |
2024-01-31 | Drift Diffusion Model to understand (mis)information sharing dynamic in complex networks | Lucila G. Alvarez-Zuzek et.al. | 2401.17846 | null |
2024-01-31 | A new class of efficient high order semi-Lagrangian IMEX discontinuous Galerkin methods on staggered unstructured meshes | M. Tavelli et.al. | 2401.17806 | null |
2024-01-31 | Dance-to-Music Generation with Encoder-based Textual Inversion of Diffusion Models | Sifei Li et.al. | 2401.17800 | null |
2024-01-31 | Image Anything: Towards Reasoning-coherent and Training-free Multi-modal Image Generation | Yuanhuiyi Lyu et.al. | 2401.17664 | null |
2024-01-31 | Spatial-and-Frequency-aware Restoration method for Images based on Diffusion Models | Kyungsung Lee et.al. | 2401.17629 | null |
2024-01-31 | Topology-Aware Latent Diffusion for 3D Shape Generation | Jiangbei Hu et.al. | 2401.17603 | null |
2024-01-31 | Head and Neck Tumor Segmentation from [18F]F-FDG PET/CT Images Based on 3D Diffusion Model | Yafei Dong et.al. | 2401.17593 | null |
2024-01-31 | Task-Oriented Diffusion Model Compression | Geonung Kim et.al. | 2401.17547 | null |
2024-01-31 | Enhancing Score-Based Sampling Methods with Ensembles | Tobias Bischoff et.al. | 2401.17539 | null |
2024-01-30 | You Only Need One Step: Fast Super-Resolution with Stable Diffusion via Scale Distillation | Mehdi Noroozi et.al. | 2401.17258 | null |
2024-01-30 | ContactGen: Contact-Guided Interactive 3D Human Generation for Partners | Dongjun Gu et.al. | 2401.17212 | null |
2024-01-30 | Transfer Learning for Text Diffusion Models | Kehang Han et.al. | 2401.17181 | null |
2024-01-30 | PlantoGraphy: Incorporating Iterative Design Process into Generative Artificial Intelligence for Landscape Rendering | Rong Huang et.al. | 2401.17120 | null |
2024-01-30 | Local modification of subdiffusion by initial Fickian diffusion: Multiscale modeling, analysis and computation | Xiangcheng Zheng et.al. | 2401.16885 | null |
2024-01-30 | A Literature Review on Fetus Brain Motion Correction in MRI | Haoran Zhang et.al. | 2401.16782 | null |
2024-01-30 | BoostDream: Efficient Refining for High-Quality Text-to-3D Generation from Multi-View Diffusion | Yonghao Yu et.al. | 2401.16764 | null |
2024-01-30 | Pick-and-Draw: Training-free Semantic Guidance for Text-to-Image Personalization | Henglei Lv et.al. | 2401.16762 | null |
2024-01-30 | Diffusion model for relational inference | Shuhan Zheng et.al. | 2401.16755 | null |
2024-01-29 | Bridging Generative and Discriminative Models for Unified Visual Perception with Diffusion Priors | Shiyin Dong et.al. | 2401.16459 | null |
2024-01-29 | Using multiple Dirac delta points to describe inhomogeneous flux density over a cell boundary in a single-cell diffusion model | Qiyao Peng et.al. | 2401.16261 | null |
2024-01-29 | Diffutoon: High-Resolution Editable Toon Shading via Diffusion Models | Zhongjie Duan et.al. | 2401.16224 | null |
2024-01-29 | Spatial-Aware Latent Initialization for Controllable Image Generation | Wenqiang Sun et.al. | 2401.16157 | null |
2024-01-29 | DMCE: Diffusion Model Channel Enhancer for Multi-User Semantic Communication Systems | Youcheng Zeng et.al. | 2401.16017 | null |
2024-01-31 | Motion-I2V: Consistent and Controllable Image-to-Video Generation with Explicit Motion Modeling | Xiaoyu Shi et.al. | 2401.15977 | null |
2024-01-29 | EmoDM: A Diffusion Model for Evolutionary Multi-objective Optimization | Xueming Yan et.al. | 2401.15931 | null |
2024-01-28 | Object-Driven One-Shot Fine-tuning of Text-to-Image Diffusion with Prototypical Embedding | Jianxiang Lu et.al. | 2401.15708 | null |
2024-01-30 | Media2Face: Co-speech Facial Animation Generation With Multi-Modality Guidance | Qingcheng Zhao et.al. | 2401.15687 | null |
2024-01-28 | CPDM: Content-Preserving Diffusion Model for Underwater Image Enhancement | Xiaowen Shi et.al. | 2401.15649 | null |
2024-01-28 | FreeStyle: Free Lunch for Text-guided Style Transfer using Diffusion Models | Feihong He et.al. | 2401.15636 | null |
2024-01-28 | Generative AI-enabled Blockchain Networks: Fundamentals, Applications, and Case Study | Cong T. Nguyen et.al. | 2401.15625 | null |
2024-01-28 | Diffusion-based graph generative methods | Hongyang Chen et.al. | 2401.15617 | null |
2024-01-28 | Neural Network-Based Score Estimation in Diffusion Models: Optimization and Generalization | Yinbin Han et.al. | 2401.15604 | null |
2024-01-28 | BrepGen: A B-rep Generative Diffusion Model with Structured Latent Geometry | Xiang Xu et.al. | 2401.15563 | null |
2024-01-27 | Wind speed super-resolution and validation: from ERA5 to CERRA via diffusion models | Fabio Merizzi et.al. | 2401.15469 | null |
2024-01-27 | A Survey on Data Augmentation in Large Model Era | Yue Zhou et.al. | 2401.15422 | null |
2024-01-27 | GEM: Boost Simple Network for Glass Surface Segmentation via Segment Anything Model and Data Synthesis | Jing Hao et.al. | 2401.15282 | null |
2024-01-26 | Annotated Hands for Generative Models | Yue Yang et.al. | 2401.15075 | link |
2024-01-26 | Text Image Inpainting via Global Structure-Guided Diffusion Models | Shipeng Zhu et.al. | 2401.14832 | link |
2024-01-25 | Opposite variations for pore pressure on and off the fault during simulated earthquakes in the laboratory | Dong Liu et.al. | 2401.14506 | null |
2024-01-24 | No Longer Trending on Artstation: Prompt Analysis of Generative AI Art | Jon McCormack et.al. | 2401.14425 | null |
2024-01-25 | Deconstructing Denoising Diffusion Models for Self-Supervised Learning | Xinlei Chen et.al. | 2401.14404 | null |
2024-01-25 | pix2gestalt: Amodal Segmentation by Synthesizing Wholes | Ege Ozguroglu et.al. | 2401.14398 | link |
2024-01-25 | UrbanGenAI: Reconstructing Urban Landscapes using Panoptic Segmentation and Diffusion Models | Timo Kapsalis et.al. | 2401.14379 | null |
2024-01-27 | Sketch2NeRF: Multi-view Sketch-guided Text-to-3D Generation | Minglin Chen et.al. | 2401.14257 | null |
2024-01-26 | Image Synthesis with Graph Conditioning: CLIP-Guided Diffusion Models for Scene Graphs | Rameshwar Mishra et.al. | 2401.14111 | null |
2024-01-25 | CreativeSynth: Creative Blending and Synthesis of Visual Arts based on Multimodal Diffusion | Nisha Huang et.al. | 2401.14066 | link |
2024-01-25 | Diffusion-based Data Augmentation for Object Counting Problems | Zhen Wang et.al. | 2401.13992 | null |
2024-01-25 | BootPIG: Bootstrapping Zero-shot Personalized Image Generation Capabilities in Pretrained Diffusion Models | Senthil Purushwalkam et.al. | 2401.13974 | null |
2024-01-25 | StyleInject: Parameter Efficient Tuning of Text-to-Image Diffusion Models | Yalong Bai et.al. | 2401.13942 | null |
2024-01-24 | Inverse Molecular Design with Multi-Conditional Diffusion Guidance | Gang Liu et.al. | 2401.13858 | link |
2024-01-24 | Diffuse to Choose: Enriching Image Conditioned Inpainting in Latent Diffusion Models for Virtual Try-All | Mehmet Saygin Seyfioglu et.al. | 2401.13795 | null |
2024-01-24 | Guided Diffusion for Fast Inverse Design of Density-based Mechanical Metamaterials | Yanyan Yang et.al. | 2401.13570 | null |
2024-01-25 | UNIMO-G: Unified Image Generation through Multimodal Conditional Diffusion | Wei Li et.al. | 2401.13388 | null |
2024-01-24 | Generative Design of Crystal Structures by Point Cloud Representations and Diffusion Model | Zhelin Li et.al. | 2401.13192 | null |
2024-01-24 | Towards Multi-domain Face Landmark Detection with Synthetic Data from Diffusion model | Yuanming Li et.al. | 2401.13191 | null |
2024-01-24 | Compositional Generative Inverse Design | Tailin Wu et.al. | 2401.13171 | link |
2024-01-24 | Choose Your Diffusion: Efficient and flexible ways to accelerate the diffusion model in fast high energy physics simulation | Cheng Jiang et.al. | 2401.13162 | null |
2024-01-23 | GALA: Generating Animatable Layered Assets from a Single Scan | Taeksoo Kim et.al. | 2401.12979 | null |
2024-01-24 | Zero-Shot Learning for the Primitives of 3D Affordance in General Objects | Hyeonwoo Kim et.al. | 2401.12978 | null |
2024-01-23 | Lumiere: A Space-Time Diffusion Model for Video Generation | Omer Bar-Tal et.al. | 2401.12945 | null |
2024-01-23 | UniHDA: Towards Universal Hybrid Domain Adaptation of Image Generators | Hengjia Li et.al. | 2401.12596 | null |
2024-01-23 | ToDA: Target-oriented Diffusion Attacker against Recommendation System | Xiaohao Liu et.al. | 2401.12578 | null |
2024-01-23 | DDMI: Domain-Agnostic Latent Diffusion Models for Synthesizing High-Quality Implicit Neural Representations | Dogyun Park et.al. | 2401.12517 | null |
2024-01-20 | Large-scale Reinforcement Learning for Diffusion Models | Yinan Zhang et.al. | 2401.12244 | null |
2024-01-22 | DITTO: Diffusion Inference-Time T-Optimization for Music Generation | Zachary Novack et.al. | 2401.12179 | null |
2024-01-22 | Single-View 3D Human Digitalization with Large Reconstruction Models | Zhenzhen Weng et.al. | 2401.12175 | null |
2024-01-22 | Feature Denoising Diffusion Model for Blind Image Quality Assessment | Xudong Li et.al. | 2401.11949 | null |
2024-01-22 | EmerDiff: Emerging Pixel-level Semantic Knowledge in Diffusion Models | Koichi Namekata et.al. | 2401.11739 | null |
2024-01-22 | Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs | Ling Yang et.al. | 2401.11708 | link |
2024-01-21 | Scalable High-Resolution Pixel-Space Image Synthesis with Hourglass Diffusion Transformers | Katherine Crowson et.al. | 2401.11605 | null |
2024-01-20 | Diffusion Model Conditioning on Gaussian Mixture Model and Negative Gaussian Mixture Gradient | Weiguo Lu et.al. | 2401.11261 | null |
2024-01-20 | Product-Level Try-on: Characteristics-preserving Try-on with Realistic Clothes Shading and Wrinkles | Yanlong Zang et.al. | 2401.11239 | null |
2024-01-24 | MotionMix: Weakly-Supervised Diffusion for Controllable Motion Generation | Nhat M. Hoang et.al. | 2401.11115 | null |
2024-01-20 | UltrAvatar: A Realistic Animatable 3D Avatar Diffusion Model with Authenticity Guided Textures | Mingyuan Zhou et.al. | 2401.11078 | null |
2024-01-20 | Make-A-Shape: a Ten-Million-scale 3D Shape Model | Ka-Hei Hui et.al. | 2401.11067 | null |
2024-01-19 | Synthesizing Moving People with 3D Control | Boyi Li et.al. | 2401.10889 | null |
2024-01-19 | ActAnywhere: Subject-Aware Video Background Generation | Boxiao Pan et.al. | 2401.10822 | null |
2024-01-19 | From Market Saturation to Social Reinforcement: Understanding the Impact of Non-Linearity in Information Diffusion Models | Tobias Friedrich et.al. | 2401.10818 | null |
2024-01-19 | Sat2Scene: 3D Urban Scene Generation from Satellite Images with Diffusion | Zuoyue Li et.al. | 2401.10786 | null |
2024-01-19 | Safe Offline Reinforcement Learning with Feasibility-Guided Diffusion Model | Yinan Zheng et.al. | 2401.10700 | link |
2024-01-19 | MAEDiff: Masked Autoencoder-enhanced Diffusion Models for Unsupervised Anomaly Detection in Brain Images | Rui Xu et.al. | 2401.10561 | null |
2024-01-18 | Inflation with Diffusion: Efficient Temporal Adaptation for Text-to-Video Super-Resolution | Xin Yuan et.al. | 2401.10404 | null |
2024-01-18 | A Simple Latent Diffusion Approach for Panoptic Segmentation and Mask Inpainting | Wouter Van Gansbeke et.al. | 2401.10227 | link |
2024-01-22 | Motion-Zero: Zero-Shot Moving Object Control Framework for Diffusion-Based Video Generation | Changgu Chen et.al. | 2401.10150 | null |
2024-01-18 | DiffusionGPT: LLM-Driven Text-to-Image Generation System | Jie Qin et.al. | 2401.10061 | null |
2024-01-18 | CustomVideo: Customizing Text-to-Video Generation with Multiple Subjects | Zhao Wang et.al. | 2401.09962 | null |
2024-01-18 | BlenDA: Domain Adaptive Object Detection through diffusion-based blending | Tzuhsuan Huang et.al. | 2401.09921 | null |
2024-01-18 | Exploring Latent Cross-Channel Embedding for Accurate 3D Human Pose Reconstruction in a Diffusion Framework | Junkun Jiang et.al. | 2401.09836 | null |
2024-01-18 | Wavelet-Guided Acceleration of Text Inversion in Diffusion-Based Image Editing | Gwanhyeong Koo et.al. | 2401.09794 | null |
2024-01-18 | Image Translation as Diffusion Visual Programmers | Cheng Han et.al. | 2401.09742 | null |
2024-01-17 | Total fraction of drug released from diffusion-controlled delivery systems with binding reactions | Elliot J. Carr et.al. | 2401.09644 | null |
2024-01-17 | Efficient generative adversarial networks using linear additive-attention Transformers | Emilio Morales-Juarez et.al. | 2401.09596 | null |
2024-01-17 | TextureDreamer: Image-guided Texture Synthesis through Geometry-aware Diffusion | Yu-Ying Yeh et.al. | 2401.09416 | null |
2024-01-17 | Vlogger: Make Your Dream A Vlog | Shaobin Zhuang et.al. | 2401.09414 | link |
2024-01-17 | On the | Mireille Bossy et.al. | 2401.09338 | null |
2024-01-17 | Siamese Meets Diffusion Network: SMDNet for Enhanced Change Detection in High-Resolution RS Imagery | Jia Jia et.al. | 2401.09325 | null |
2024-01-17 | T-FOLEY: A Controllable Waveform-Domain Diffusion Model for Temporal-Event-Guided Foley Sound Synthesis | Yoonjin Chung et.al. | 2401.09294 | null |
2024-01-17 | Training-Free Semantic Video Composition via Pre-trained Diffusion Model | Jiaqi Guo et.al. | 2401.09195 | null |
2024-01-17 | Consistent3D: Towards Consistent High-Fidelity Text-to-3D Generation with Deterministic Sampling Prior | Zike Wu et.al. | 2401.09050 | null |
2024-01-17 | Compose and Conquer: Diffusion-Based 3D Depth Aware Composable Image Synthesis | Jonghyun Lee et.al. | 2401.09048 | link |
2024-01-17 | VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models | Haoxin Chen et.al. | 2401.09047 | link |
2024-01-17 | Data Attribution for Diffusion Models: Timestep-induced Bias in Influence Estimation | Tong Xie et.al. | 2401.09031 | null |
2024-01-17 | 3D Human Pose Analysis via Diffusion Synthesis | Haorui Ji et.al. | 2401.08930 | null |
2024-01-16 | Adversarial Supervision Makes Layout-to-Image Diffusion Models Thrive | Yumeng Li et.al. | 2401.08815 | link |
2024-01-16 | Fixed Point Diffusion Models | Xingjian Bai et.al. | 2401.08741 | null |
2024-01-16 | SiT: Exploring Flow and Diffusion-based Generative Models with Scalable Interpolant Transformers | Nanye Ma et.al. | 2401.08740 | null |
2024-01-16 | RoHM: Robust Human Motion Reconstruction via Diffusion | Siwei Zhang et.al. | 2401.08570 | null |
2024-01-16 | Multi-Track Timeline Control for Text-Driven 3D Human Motion Generation | Mathis Petrovich et.al. | 2401.08559 | null |
2024-01-16 | Modeling Spoof Noise by De-spoofing Diffusion and its Application in Face Anti-spoofing | Bin Zhang et.al. | 2401.08275 | null |
2024-01-16 | Multi-scale 2D Temporal Map Diffusion Models for Natural Language Video Localization | Chongzhi Zhang et.al. | 2401.08232 | null |
2024-01-16 | Photonic Modes Prediction via Multi-Modal Diffusion Model | Jinyang Sun et.al. | 2401.08199 | null |
2024-01-16 | Key-point Guided Deformable Image Manipulation Using Diffusion Model | Seok-Hwan Oh et.al. | 2401.08178 | null |
2024-01-16 | SpecSTG: A Fast Spectral Diffusion Framework for Probabilistic Spatio-Temporal Traffic Forecasting | Lequan Lin et.al. | 2401.08119 | null |
2024-01-16 | DIFFRENT: A Diffusion Model for Recording Environment Transfer of Speech | Jaekwon Im et.al. | 2401.08102 | null |
2024-01-16 | EmoTalker: Emotionally Editable Talking Face Generation via Diffusion Model | Bingyuan Zhang et.al. | 2401.08049 | null |
2024-01-16 | Forging Vision Foundation Models for Autonomous Driving: Challenges, Methodologies, and Opportunities | Xu Yan et.al. | 2401.08045 | link |
2024-01-15 | Regularity in diffusion models with gradient activation | Damião Araújo et.al. | 2401.07979 | null |
2024-01-15 | HexaGen3D: StableDiffusion is just one step away from Fast and Diverse Text-to-3D Generation | Antoine Mercier et.al. | 2401.07727 | null |
2024-01-15 | Towards Efficient Diffusion-Based Image Editing with Instant Attention Masks | Siyu Zou et.al. | 2401.07709 | null |
2024-01-15 | Multifractal-spectral features enhance classification of anomalous diffusion | Henrik Seckler et.al. | 2401.07646 | null |
2024-01-15 | InstantID: Zero-shot Identity-Preserving Generation in Seconds | Qixun Wang et.al. | 2401.07519 | link |
2024-01-15 | Robo-ABC: Affordance Generalization Beyond Categories via Semantic Correspondence for Robot Manipulation | Yuanchen Ju et.al. | 2401.07487 | null |
2024-01-15 | Hierarchical Fashion Design with Multi-stage Diffusion Models | Zhifeng Xie et.al. | 2401.07450 | null |
2024-01-14 | A Survey on Statistical Theory of Deep Learning: Approximation, Training Dynamics, and Generative Models | Namjoon Suh et.al. | 2401.07187 | null |
2024-01-13 | Exploring Adversarial Attacks against Latent Diffusion Model from the Perspective of Adversarial Transferability | Junxi Chen et.al. | 2401.07087 | null |
2024-01-13 | Quantum Denoising Diffusion Models | Michael Kölle et.al. | 2401.07049 | null |
2024-01-13 | Quantum Generative Diffusion Model | Chuangtao Chen et.al. | 2401.07039 | null |
2024-01-13 | Denoising Diffusion Recommender Model | Jujia Zhao et.al. | 2401.06982 | null |
2024-01-12 | A deep implicit-explicit minimizing movement method for option pricing in jump-diffusion models | Emmanuil H. Georgoulis et.al. | 2401.06740 | null |
2024-01-12 | Decoupling Pixel Flipping and Occlusion Strategy for Consistent XAI Benchmarks | Stefan Blücher et.al. | 2401.06654 | link |
2024-01-12 | Adversarial Examples are Misaligned in Diffusion Model Manifolds | Peter Lorenz et.al. | 2401.06637 | null |
2024-01-12 | Motion2VecSets: 4D Latent Vector Set Diffusion for Non-rigid Shape Reconstruction and Tracking | Wei Cao et.al. | 2401.06614 | null |
2024-01-12 | 360DVD: Controllable Panorama Video Generation with 360-Degree Video Diffusion Model | Qian Wang et.al. | 2401.06578 | null |
2024-01-12 | RotationDrag: Point-based Image Editing with Rotated Diffusion Features | Minxing Luo et.al. | 2401.06442 | link |
2024-01-12 | Seek for Incantations: Towards Accurate Text-to-Image Diffusion Synthesis through Prompt Engineering | Chang Yu et.al. | 2401.06345 | null |
2024-01-11 | Frequency-Time Diffusion with Neural Cellular Automata | John Kalkhof et.al. | 2401.06291 | null |
2024-01-11 | Demystifying Variational Diffusion Models | Fabio De Sousa Ribeiro et.al. | 2401.06281 | null |
2024-01-11 | Efficient Deformable ConvNets: Rethinking Dynamic and Sparse Operator for Vision Applications | Yuwen Xiong et.al. | 2401.06197 | link |
2024-01-11 | TriNeRFLet: A Wavelet Based Multiscale Triplane NeRF Representation | Rajaei Khatib et.al. | 2401.06191 | null |
2024-01-11 | E | Yifan Gong et.al. | 2401.06127 | null |
2024-01-11 | DiffDA: a diffusion model for weather-scale data assimilation | Langwen Huang et.al. | 2401.05932 | null |
2024-01-11 | Efficient Image Deblurring Networks based on Diffusion Models | Kang Chen et.al. | 2401.05907 | link |
2024-01-11 | HiCAST: Highly Customized Arbitrary Style Transfer with Adapter Enhanced Diffusion Models | Hanzhang Wang et.al. | 2401.05870 | null |
2024-01-11 | EraseDiff: Erasing Data Influence in Diffusion Models | Jing Wu et.al. | 2401.05779 | null |
2024-01-10 | Diffusion Priors for Dynamic View Synthesis from Monocular Videos | Chaoyang Wang et.al. | 2401.05583 | null |
2024-01-10 | From Pampas to Pixels: Fine-Tuning Diffusion Models for Gaúcho Heritage | Marcellus Amadeus et.al. | 2401.05520 | null |
2024-01-10 | InseRF: Text-Driven Generative Object Insertion in Neural 3D Scenes | Mohamad Shahbazi et.al. | 2401.05335 | null |
2024-01-10 | Score Distillation Sampling with Learned Manifold Corrective | Thiemo Alldieck et.al. | 2401.05293 | null |
2024-01-10 | PIXART-δ: Fast and Controllable Image Generation with Latent Consistency Models | Junsong Chen et.al. | 2401.05252 | link |
2024-01-10 | Derm-T2IM: Harnessing Synthetic Skin Lesion Data via Stable Diffusion Models for Enhanced Skin Disease Classification using ViT and CNN | Muhammad Ali Farooq et.al. | 2401.05159 | null |
2024-01-10 | CrossDiff: Exploring Self-Supervised Representation of Pansharpening via Cross-Predictive Diffusion Model | Yinghui Xing et.al. | 2401.05153 | null |
2024-01-10 | SwiMDiff: Scene-wide Matching Contrastive Learning with Diffusion Constraint for Remote Sensing Image | Jiayuan Tian et.al. | 2401.05093 | null |
2024-01-10 | A novel bond-based nonlocal diffusion model with matrix-valued coefficients in non-divergence form and its collocation discretization | Lili Ju et.al. | 2401.04973 | null |
2024-01-09 | Transmission-eigenchannel velocity and diffusion | Azriel Z. Genack et.al. | 2401.04818 | null |
2024-01-09 | DiffSHEG: A Diffusion-Based Approach for Real-Time Speech-driven Holistic 3D Expression and Gesture Generation | Junming Chen et.al. | 2401.04747 | null |
2024-01-09 | Morphable Diffusion: 3D-Consistent Diffusion for Single-image Avatar Creation | Xiyi Chen et.al. | 2401.04728 | null |
2024-01-09 | Efficient estimation for ergodic diffusion processes sampled at high frequency | Michael Sørensen et.al. | 2401.04689 | null |
2024-01-09 | EmoGen: Emotional Image Content Generation with Text-to-Image Diffusion Models | Jingyuan Yang et.al. | 2401.04608 | null |
2024-01-09 | Enhanced Distribution Alignment for Post-Training Quantization of Diffusion Models | Xuewen Liu et.al. | 2401.04585 | null |
2024-01-09 | MagicVideo-V2: Multi-Stage High-Aesthetic Video Generation | Weimin Wang et.al. | 2401.04468 | null |
2024-01-09 | D3AD: Dynamic Denoising Diffusion Probabilistic Model for Anomaly Detection | Justin Tebbe et.al. | 2401.04463 | null |
2024-01-09 | SonicVisionLM: Playing Sound with Vision Language Models | Zhifeng Xie et.al. | 2401.04394 | null |
2024-01-09 | Representative Feature Extraction During Diffusion Process for Sketch Extraction with One Example | Kwan Yun et.al. | 2401.04362 | null |
2024-01-09 | Memory-Efficient Personalization using Quantized Diffusion Model | Hyogon Ryu et.al. | 2401.04339 | null |
2024-01-08 | FADI-AEC: Fast Score Based Diffusion Model Guided by Far-end Signal for Acoustic Echo Cancellation | Yang Liu et.al. | 2401.04283 | null |
2024-01-08 | Robust Image Watermarking using Stable Diffusion | Lijun Zhang et.al. | 2401.04247 | null |
2024-01-07 | The Stronger the Diffusion Model, the Easier the Backdoor: Data Poisoning to Induce Copyright Breaches Without Adjusting Finetuning Pipeline | Haonan Wang et.al. | 2401.04136 | null |
2024-01-08 | scDiffusion: conditional generation of high-quality single-cell data using diffusion model | Erpai Luo et.al. | 2401.03968 | link |
2024-01-08 | D3PRefiner: A Diffusion-based Denoise Method for 3D Human Pose Refinement | Danqi Yan et.al. | 2401.03914 | null |
2024-01-08 | DDM-Lag : A Diffusion-based Decision-making Model for Autonomous Vehicles with Lagrangian Safety Enhancement | Jiaqi Liu et.al. | 2401.03629 | null |
2024-01-09 | ROIC-DM: Robust Text Inference and Classification via Diffusion Model | Shilong Yuan et.al. | 2401.03514 | null |
2024-01-07 | Freetalker: Controllable Speech and Text-Driven Gesture Generation Based on Diffusion Models for Enhanced Speaker Naturalness | Sicheng Yang et.al. | 2401.03476 | null |
2024-01-07 | Deep Learning-based Image and Video Inpainting: A Survey | Weize Quan et.al. | 2401.03395 | null |
2024-01-06 | Reflected Schrödinger Bridge for Constrained Generative Modeling | Wei Deng et.al. | 2401.03228 | null |
2024-01-06 | MirrorDiffusion: Stabilizing Diffusion Process in Zero-shot Image Translation by Prompts Redescription and Beyond | Yupei Lin et.al. | 2401.03221 | null |
2024-01-06 | Fair Sampling in Diffusion Models through Switching Mechanism | Yujin Choi et.al. | 2401.03140 | link |
2024-01-05 | Latte: Latent Diffusion Transformer for Video Generation | Xin Ma et.al. | 2401.03048 | link |
2024-01-05 | The Rise of Diffusion Models in Time-Series Forecasting | Caspar Meijer et.al. | 2401.03006 | null |
2024-01-08 | Uncovering the human motion pattern: Pattern Memory-based Diffusion Model for Trajectory Prediction | Yuxin Yang et.al. | 2401.02916 | null |
2024-01-05 | Plug-in Diffusion Model for Sequential Recommendation | Haokai Ma et.al. | 2401.02913 | null |
2024-01-05 | Diffusion Variational Inference: Diffusion Models as Expressive Variational Posteriors | Top Piriyakulkij et.al. | 2401.02739 | null |
2024-01-05 | Geometric-Facilitated Denoising Diffusion Model for 3D Molecule Generation | Can Xu et.al. | 2401.02683 | null |
2024-01-04 | Comprehensive Exploration of Synthetic Data Generation: A Survey | André Bauer et.al. | 2401.02524 | null |
2024-01-04 | VASE: Object-Centric Appearance and Shape Manipulation of Real Videos | Elia Peruzzo et.al. | 2401.02473 | null |
2024-01-04 | Bring Metric Functions into Diffusion Models | Jie An et.al. | 2401.02414 | null |
2024-01-06 | GUESS:GradUally Enriching SyntheSis for Text-Driven Human Motion Generation | Xuehao Gao et.al. | 2401.02142 | null |
2024-01-04 | Preserving Image Properties Through Initializations in Diffusion Models | Jeffrey Zhang et.al. | 2401.02097 | null |
2024-01-04 | Energy based diffusion generator for efficient sampling of Boltzmann distributions | Yan Wang et.al. | 2401.02080 | null |
2024-01-04 | DiffusionEdge: Diffusion Probabilistic Model for Crisp Edge Detection | Yunfan Ye et.al. | 2401.02032 | link |
2024-01-04 | Improving Diffusion-Based Image Synthesis with Context Prediction | Ling Yang et.al. | 2401.02015 | null |
2024-01-03 | Instruct-Imagen: Image Generation with Multi-modal Instruction | Hexiang Hu et.al. | 2401.01952 | null |
2024-01-03 | Can We Generate Realistic Hands Only Using Convolution? | Mehran Hosseini et.al. | 2401.01951 | null |
2024-01-03 | Moonshot: Towards Controllable Video Generation and Editing with Multimodal Conditions | David Junhao Zhang et.al. | 2401.01827 | link |
2024-01-03 | DiffYOLO: Object Detection for Anti-Noise via YOLO and Diffusion Models | Yichen Liu et.al. | 2401.01659 | null |
2024-01-03 | SIGNeRF: Scene Integrated Generation for Neural Radiance Fields | Jan-Niklas Dihlmann et.al. | 2401.01647 | null |
2024-01-03 | S | Yixuan Wang et.al. | 2401.01520 | link |
2024-01-02 | ColorizeDiffusion: Adjustable Sketch Colorization with Reference Image and Text | Dingkun Yan et.al. | 2401.01456 | link |
2024-01-02 | VALD-MD: Visual Attribution via Latent Diffusion for Medical Diagnostics | Ammar A. Siddiqui et.al. | 2401.01414 | null |
2024-01-01 | DiffAugment: Diffusion based Long-Tailed Visual Relationship Recognition | Parul Gupta et.al. | 2401.01387 | null |
2024-01-02 | VideoDrafter: Content-Consistent Multi-Scene Video Generation with LLM | Fuchen Long et.al. | 2401.01256 | null |
2024-01-02 | Towards a Simultaneous and Granular Identity-Expression Control in Personalized Face Generation | Renshuai Liu et.al. | 2401.01207 | null |
2024-01-02 | A comparative study of resistivity models for simulations of magnetic reconnection in the solar atmosphere. II. Plasmoid formation | Øystein Håvard Færder et.al. | 2401.01177 | null |
2024-01-02 | Joint Generative Modeling of Scene Graphs and Images via Diffusion Models | Bicheng Xu et.al. | 2401.01130 | null |
2024-01-02 | Robust single-particle cryo-EM image denoising and restoration | Jing Zhang et.al. | 2401.01097 | null |
2024-01-02 | Auffusion: Leveraging the Power of Diffusion and Large Language Models for Text-to-Audio Generation | Jinlong Xue et.al. | 2401.01044 | link |
2023-12-30 | Improving the Stability of Diffusion Models for Content Consistent Super-Resolution | Lingchen Sun et.al. | 2401.00877 | link |
2023-12-30 | FlashVideo: A Framework for Swift Inference in Text-to-Video Generation | Bin Lei et.al. | 2401.00869 | null |
2024-01-01 | DiffMorph: Text-less Image Morphing with Diffusion Models | Shounak Chatterjee et.al. | 2401.00739 | null |
2024-01-01 | Diffusion Models, Image Super-Resolution And Everything: A Survey | Brian B. Moser et.al. | 2401.00736 | null |
2024-01-02 | GD^2-NeRF: Generative Detail Compensation via GAN and Diffusion for One-shot Generalizable Neural Radiance Fields | Xiao Pan et.al. | 2401.00616 | null |
2024-01-03 | Diff-PCR: Diffusion-Based Correspondence Searching in Doubly Stochastic Matrix Space for Point Cloud Registration | Qianliang Wu et.al. | 2401.00436 | null |
2023-12-31 | SynCDR : Training Cross Domain Retrieval Models with Synthetic Data | Samarth Mishra et.al. | 2401.00420 | link |
2023-12-31 | Controllable Safety-Critical Closed-loop Traffic Simulation via Guided Diffusion | Wei-Jer Chang et.al. | 2401.00391 | null |
2023-12-30 | Probing the Limits and Capabilities of Diffusion Models for the Anatomic Editing of Digital Twins | Karim Kadry et.al. | 2401.00247 | null |
2023-12-30 | Inpaint4DNeRF: Promptable Spatio-Temporal NeRF Inpainting with Generative Diffusion Models | Han Jiang et.al. | 2401.00208 | null |
2023-12-30 | Diffusion Model with Perceptual Loss | Shanchuan Lin et.al. | 2401.00110 | null |
2023-12-29 | Generating Enhanced Negatives for Training Language-Based Object Detectors | Shiyu Zhao et.al. | 2401.00094 | null |
2023-12-29 | FlowVid: Taming Imperfect Optical Flows for Consistent Video-to-Video Synthesis | Feng Liang et.al. | 2312.17681 | null |
2023-12-29 | Data Augmentation for Supervised Graph Outlier Detection with Latent Diffusion Models | Kay Liu et.al. | 2312.17679 | link |
2023-12-29 | Leveraging Open-Vocabulary Diffusion to Camouflaged Instance Segmentation | Tuan-Anh Vu et.al. | 2312.17505 | null |
2023-12-28 | Classifier-free graph diffusion for molecular property targeting | Matteo Ninniri et.al. | 2312.17397 | null |
2023-12-28 | iFusion: Inverting Diffusion for Pose-Free Reconstruction from Sparse Views | Chin-Hsuan Wu et.al. | 2312.17250 | link |
2023-12-28 | Personalized Restoration via Dual-Pivot Tuning | Pradyumna Chari et.al. | 2312.17234 | null |
2023-12-28 | 4DGen: Grounded 4D Content Generation with Spatial-temporal Consistency | Yuyang Yin et.al. | 2312.17225 | null |
2023-12-28 | Restoration by Generation with Constrained Priors | Zheng Ding et.al. | 2312.17161 | null |
2023-12-28 | DiffKG: Knowledge Graph Diffusion Model for Recommendation | Yangqin Jiang et.al. | 2312.16890 | link |
2023-12-29 | DiffusionGAN3D: Boosting Text-guided 3D Generation and Domain Adaption by Combining 3D GANs and Diffusion Priors | Biwen Lei et.al. | 2312.16837 | null |
2023-12-27 | I2V-Adapter: A General Image-to-Video Adapter for Video Diffusion Models | Xun Guo et.al. | 2312.16693 | null |
2023-12-27 | Forgery-aware Adaptive Transformer for Generalizable Synthetic Image Detection | Huan Liu et.al. | 2312.16649 | null |
2023-12-27 | Image Restoration by Denoising Diffusion Models with Iteratively Preconditioned Guidance | Tomer Garber et.al. | 2312.16519 | null |
2023-12-29 | PanGu-Draw: Advancing Resource-Efficient Text-to-Image Synthesis with Time-Decoupled Training and Reusable Coop-Diffusion | Guansong Lu et.al. | 2312.16486 | null |
2023-12-27 | SVGDreamer: Text Guided SVG Generation with Diffusion Model | Ximing Xing et.al. | 2312.16476 | null |
2023-12-27 | Natural Adversarial Patch Generation Method Based on Latent Diffusion Model | Xianyi Chen et.al. | 2312.16401 | null |
2023-12-26 | One-dimensional Adapter to Rule Them All: Concepts, Diffusion Models and Erasing Applications | Mengyao Lyu et.al. | 2312.16145 | null |
2023-12-26 | Compositional Search of Stable Crystalline Structures in Multi-Component Alloys Using Generative Diffusion Models | Grzegorz Kaszuba et.al. | 2312.16073 | null |
2023-12-26 | HarmonyView: Harmonizing Consistency and Diversity in One-Image-to-3D | Sangmin Woo et.al. | 2312.15980 | link |
2023-12-26 | Semantic Guidance Tuning for Text-To-Image Diffusion Models | Hyun Kang et.al. | 2312.15964 | null |
2023-12-26 | Implied volatility (also) is path-dependent | Hervé Andrès et.al. | 2312.15950 | link |
2023-12-26 | EnchantDance: Unveiling the Potential of Music-Driven Dance Movement | Bo Han et.al. | 2312.15946 | link |
2023-12-26 | Generating and Reweighting Dense Contrastive Patterns for Unsupervised Anomaly Detection | Songmin Dai et.al. | 2312.15911 | null |
2023-12-26 | Cross Initialization for Personalized Text-to-Image Generation | Lianyu Pang et.al. | 2312.15905 | link |
2023-12-25 | Adversarial Item Promotion on Visually-Aware Recommender Systems by Guided Diffusion | Lijian Chen et.al. | 2312.15826 | null |
2023-12-25 | High-Fidelity Diffusion-based Image Editing | Chen Hou et.al. | 2312.15707 | null |
2023-12-25 | A Multi-Modal Contrastive Diffusion Model for Therapeutic Peptide Generation | Yongkang Wang et.al. | 2312.15665 | link |
2023-12-25 | Balanced SNR-Aware Distillation for Guided Text-to-Audio Generation | Bingzhi Liu et.al. | 2312.15628 | null |
2023-12-25 | Conversational Co-Speech Gesture Generation via Modeling Dialog Intention, Emotion, and Context with Diffusion Models | Haiwei Xue et.al. | 2312.15567 | null |
2023-12-24 | A-SDM: Accelerating Stable Diffusion through Redundancy Removal and Performance Optimization | Jinchao Zhu et.al. | 2312.15516 | null |
2023-12-24 | Diffusion-EXR: Controllable Review Generation for Explainable Recommendation via Diffusion Models | Ling Li et.al. | 2312.15490 | null |
2023-12-24 | A Two-stage Personalized Virtual Try-on Framework with Shape Control and Texture Guidance | Shufang Zhang et.al. | 2312.15480 | null |
2023-12-23 | Prompt-Propose-Verify: A Reliable Hand-Object-Interaction Data Generation Framework using Foundational Models | Gurusha Juneja et.al. | 2312.15247 | null |
2023-12-23 | CaLDiff: Camera Localization in NeRF via Pose Diffusion | Rashik Shrestha et.al. | 2312.15242 | null |
2023-12-23 | Majority-based Preference Diffusion on Social Networks | Ahad N. Zehmakan et.al. | 2312.15140 | null |
2023-12-22 | Spectrally Decomposed Diffusion Models for Generative Turbulence Recovery | Mohammed Sardar et.al. | 2312.15029 | null |
2023-12-22 | MACS: Mass Conditioned 3D Hand and Object Motion Synthesis | Soshi Shimada et.al. | 2312.14929 | null |
2023-12-22 | BrainVis: Exploring the Bridge between Brain and Visual Signals via Image Reconstruction | Honghao Fu et.al. | 2312.14871 | null |
2023-12-22 | Neural-network-based regularization methods for inverse problems in imaging | Andreas Habring et.al. | 2312.14849 | null |
2023-12-22 | Dreaming of Electrical Waves: Generative Modeling of Cardiac Excitation Waves using Diffusion Models | Tanish Baranwal et.al. | 2312.14830 | null |
2023-12-22 | Neural network models for preferential concentration of particles in two-dimensional turbulence | Thibault Maurel-Oujia et.al. | 2312.14829 | null |
2023-12-22 | Plan, Posture and Go: Towards Open-World Text-to-Motion Generation | Jinpeng Liu et.al. | 2312.14828 | null |
2023-12-22 | Harnessing Diffusion Models for Visual Perception with Meta Prompts | Qiang Wan et.al. | 2312.14733 | link |
2023-12-22 | FM-OV3D: Foundation Model-based Cross-modal Knowledge Blending for Open-Vocabulary 3D Detection | Dongmei Zhang et.al. | 2312.14465 | null |
2023-12-22 | Generative AI Beyond LLMs: System Implications of Multi-Modal Generation | Alicia Golden et.al. | 2312.14385 | null |
2023-12-21 | Single-Cell RNA-seq Synthesis with Latent Diffusion Model | Yixuan Wang et.al. | 2312.14220 | null |
2023-12-21 | DreamDistribution: Prompt Distribution Learning for Text-to-Image Diffusion Models | Brian Nlong Zhao et.al. | 2312.14216 | null |
2023-12-21 | Diffusion Reward: Learning Rewards via Conditional Video Diffusion | Tao Huang et.al. | 2312.14134 | null |
2023-12-21 | Neural Point Cloud Diffusion for Disentangled 3D Shape and Appearance Generation | Philipp Schröppel et.al. | 2312.14124 | link |
2023-12-21 | HD-Painter: High-Resolution and Prompt-Faithful Text-Guided Image Inpainting with Diffusion Models | Hayk Manukyan et.al. | 2312.14091 | link |
2023-12-21 | Carve3D: Improving Multi-view Reconstruction Consistency for Diffusion Models with RL Finetuning | Desai Xie et.al. | 2312.13980 | null |
2023-12-22 | Paint3D: Paint Anything 3D with Lighting-Less Texture Diffusion Models | Xianfang Zeng et.al. | 2312.13913 | link |
2023-12-21 | Align Your Gaussians: Text-to-4D with Dynamic 3D Gaussians and Composed Diffusion Models | Huan Ling et.al. | 2312.13763 | null |
2023-12-21 | Free-Editor: Zero-shot Text-driven 3D Scene Editing | Nazmul Karim et.al. | 2312.13663 | null |
2023-12-21 | Diff-Oracle: Diffusion Model for Oracle Character Generation with Controllable Styles and Contents | Jing Li et.al. | 2312.13631 | null |
2023-12-21 | Navigating the Structured What-If Spaces: Counterfactual Generation via Structured Diffusion | Nishtha Madaan et.al. | 2312.13616 | null |
2023-12-21 | Front stability of infinitely steep travelling waves in population biology | Matthew J Simpson et.al. | 2312.13601 | null |
2023-12-20 | Unlocking Pre-trained Image Backbones for Semantic Image Synthesis | Tariq Berrada et.al. | 2312.13314 | null |
2023-12-21 | Repaint123: Fast and High-quality One Image to 3D Generation with Progressive Controllable 2D Repainting | Junwu Zhang et.al. | 2312.13271 | link |
2023-12-20 | Conditional Image Generation with Pretrained Generative Model | Rajesh Shrestha et.al. | 2312.13253 | null |
2023-12-20 | Zero-Shot Metric Depth with a Field-of-View Conditioned Diffusion Model | Saurabh Saxena et.al. | 2312.13252 | null |
2023-12-20 | Diffusion Models With Learned Adaptive Noise | Subham Sekhar Sahoo et.al. | 2312.13236 | link |
2023-12-21 | DiffPortrait3D: Controllable Diffusion for Zero-Shot Portrait View Synthesis | Yuming Gu et.al. | 2312.13016 | link |
2023-12-20 | RadEdit: stress-testing biomedical vision models via diffusion image editing | Fernando Pérez-García et.al. | 2312.12865 | null |
2023-12-20 | ReCo-Diff: Explore Retinex-Based Condition Strategy in Diffusion Model for Low-Light Image Enhancement | Yuhui Wu et.al. | 2312.12826 | null |
2023-12-20 | All but One: Surgical Concept Erasing with Model Preservation in Text-to-Image Diffusion Models | Seunghoo Hong et.al. | 2312.12807 | null |
2023-12-21 | AMD:Anatomical Motion Diffusion with Interpretable Motion Decomposition and Fusion | Beibei Jing et.al. | 2312.12763 | null |
2023-12-20 | How Good Are Deep Generative Models for Solving Inverse Problems? | Shichong Peng et.al. | 2312.12691 | null |
2023-12-19 | Surf-CDM: Score-Based Surface Cold-Diffusion Model For Medical Image Segmentation | Fahim Ahmed Zaman et.al. | 2312.12649 | null |
2023-12-19 | Fixed-point Inversion for Text-to-image diffusion models | Barak Meiri et.al. | 2312.12540 | null |
2023-12-19 | StreamDiffusion: A Pipeline-level Solution for Real-time Interactive Generation | Akio Kodaira et.al. | 2312.12491 | null |
2023-12-19 | InstructVideo: Instructing Video Diffusion Models with Human Feedback | Hangjie Yuan et.al. | 2312.12490 | null |
2023-12-19 | Adaptive Guidance: Training-free Acceleration of Conditional Diffusion Models | Angela Castillo et.al. | 2312.12487 | null |
2023-12-19 | On Inference Stability for Diffusion Models | Viet Nguyen et.al. | 2312.12431 | link |
2023-12-19 | Scene-Conditional 3D Object Stylization and Composition | Jinghao Zhou et.al. | 2312.12419 | null |
2023-12-19 | Prompting Hard or Hardly Prompting: Prompt Inversion for Text-to-Image Diffusion Models | Shweta Mahajan et.al. | 2312.12416 | null |
2023-12-19 | Travelling pulses on three spatial scales in a Klausmeier-type vegetation-autotoxicity model | Paul Carter et.al. | 2312.12277 | null |
2023-12-19 | Intrinsic Image Diffusion for Single-view Material Estimation | Peter Kocsis et.al. | 2312.12274 | null |
2023-12-19 | Brush Your Text: Synthesize Any Scene Text on Images via Diffusion Model | Lingjun Zhang et.al. | 2312.12232 | link |
2023-12-19 | HuTuMotion: Human-Tuned Navigation of Latent Motion Diffusion Models with Minimal Feedback | Gaoge Han et.al. | 2312.12227 | null |
2023-12-19 | FontDiffuser: One-Shot Font Generation via Denoising Diffusion with Multi-Scale Content Aggregation and Style Contrastive Learning | Zhenhua Yang et.al. | 2312.12142 | link |
2023-12-19 | GazeMoDiff: Gaze-guided Diffusion Model for Stochastic Human Motion Prediction | Haodong Yan et.al. | 2312.12090 | null |
2023-12-19 | Learning Subject-Aware Cropping by Outpainting Professional Photos | James Hong et.al. | 2312.12080 | null |
2023-12-19 | Resource-efficient Generative Mobile Edge Networks in 6G Era: Fundamentals, Framework and Case Study | Bingkun Lai et.al. | 2312.12063 | null |
2023-12-19 | Towards Accurate Guided Diffusion Sampling through Symplectic Adjoint Method | Jiachun Pan et.al. | 2312.12030 | null |
2023-12-19 | Diffusing More Objects for Semi-Supervised Domain Adaptation with Less Labeling | Leander van den Heuvel et.al. | 2312.12000 | null |
2023-12-19 | Optimizing Diffusion Noise Can Serve As Universal Motion Priors | Korrawe Karunratanakul et.al. | 2312.11994 | null |
2023-12-19 | Extending intraday solar forecast horizons with deep generative models | Alberto Carpentieri et.al. | 2312.11966 | null |
2023-12-19 | Text-Image Conditioned Diffusion for Consistent Text-to-3D Generation | Yuze He et.al. | 2312.11774 | null |
2023-12-18 | Learning a Diffusion Model Policy from Rewards via Q-Score Matching | Michael Psenka et.al. | 2312.11752 | null |
2023-12-18 | Unified framework for diffusion generative models in SO(3): applications in computer vision and astrophysics | Yesukhei Jagvaral et.al. | 2312.11707 | null |
2023-12-18 | HAAR: Text-Conditioned Generative Model of 3D Strand-based Human Hairstyles | Vanessa Sklyarova et.al. | 2312.11666 | null |
2023-12-18 | VolumeDiffusion: Flexible Text-to-3D Generation with Efficient Volumetric Encoder | Zhicong Tang et.al. | 2312.11459 | null |
2023-12-18 | A novel diffusion recommendation algorithm based on multi-scale cnn and residual lstm | Yong Niu et.al. | 2312.10885 | null |
2023-12-17 | Your Student is Better Than Expected: Adaptive Teacher-Student Collaboration for Text-Conditional Diffusion Models | Nikita Starodubcev et.al. | 2312.10835 | link |
2023-12-17 | CogCartoon: Towards Practical Story Visualization | Zhongyang Zhu et.al. | 2312.10718 | null |
2023-12-17 | VidToMe: Video Token Merging for Zero-Shot Video Editing | Xirui Li et.al. | 2312.10656 | null |
2023-12-16 | VecFusion: Vector Font Generation with Diffusion | Vikas Thamizharasan et.al. | 2312.10540 | null |
2023-12-16 | A Unified Filter Method for Jointly Estimating State and Parameters of Stochastic Dynamical Systems via the Ensemble Score Filter | Feng Bao et.al. | 2312.10503 | null |
2023-12-16 | Continuous Diffusion for Mixed-Type Tabular Data | Markus Mueller et.al. | 2312.10431 | null |
2023-12-16 | Lecture Notes in Probabilistic Diffusion Models | Inga Strümke et.al. | 2312.10393 | null |
2023-12-16 | Image Restoration Through Generalized Ornstein-Uhlenbeck Bridge | Conghan Yue et.al. | 2312.10299 | link |
2023-12-15 | Two simple criterion to prove the existence of patterns in reaction-diffusion models of two components | Francisco J. Vielma-Leal et.al. | 2312.10231 | null |
2023-12-15 | Tell Me What You See: Text-Guided Real-World Image Denoising | Erez Yosef et.al. | 2312.10191 | null |
2023-12-15 | Improving new physics searches with diffusion models for event observables and jet constituents | Debajyoti Sengupta et.al. | 2312.10130 | null |
2023-12-15 | MVHuman: Tailoring 2D Diffusion with Multi-view Sampling For Realistic 3D Human Generation | Suyi Jiang et.al. | 2312.10120 | null |
2023-12-15 | Plasticine3D: Non-rigid 3D editting with text guidance | Yige Chen et.al. | 2312.10111 | null |
2023-12-15 | Latent Diffusion Models with Image-Derived Annotations for Enhanced AI-Assisted Cancer Diagnosis in Histopathology | Pedro Osorio et.al. | 2312.09792 | null |
2023-12-15 | DreamTalk: When Expressive Talking Head Generation Meets Diffusion Probabilistic Models | Yifeng Ma et.al. | 2312.09767 | null |
2023-12-15 | PPFM: Image denoising in photon-counting CT using single-step posterior sampling Poisson flow generative models | Dennis Hein et.al. | 2312.09754 | link |
2023-12-15 | Positivity and global existence for nonlocal advection-diffusion models of interacting populations | Valeria Giunta et.al. | 2312.09692 | null |
2023-12-15 | Exploring the Feasibility of Generating Realistic 3D Models of Endangered Species Using DreamGaussian: An Analysis of Elevation Angle's Impact on Model Generation | Selcuk Anil Karatopak et.al. | 2312.09682 | null |
2023-12-15 | Faster Diffusion: Rethinking the Role of UNet Encoder in Diffusion Models | Senmao Li et.al. | 2312.09608 | link |
2023-12-15 | Single PW takes a shortcut to compound PW in US imaging | Zhiqiang Li et.al. | 2312.09514 | null |
2023-12-15 | Fast Sampling generative model for Ultrasound image reconstruction | Hengrong Lan et.al. | 2312.09510 | null |
2023-12-14 | Unbiasing Enhanced Sampling on a High-dimensional Free Energy Surface with Deep Generative Model | Yikai Liu et.al. | 2312.09404 | null |
2023-12-14 | LatentEditor: Text Driven Local Editing of 3D Scenes | Umar Khalid et.al. | 2312.09313 | link |
2023-12-14 | LIME: Localized Image Editing via Attention Regularization in Diffusion Models | Enis Simsar et.al. | 2312.09256 | null |
2023-12-14 | FineControlNet: Fine-level Text Control for Image Generation with Spatially Aligned Text Control Injection | Hongsuk Choi et.al. | 2312.09252 | null |
2023-12-14 | Single Mesh Diffusion Models with Field Latents for Texture Generation | Thomas W. Mitchel et.al. | 2312.09250 | null |
2023-12-14 | A framework for conditional diffusion modelling with applications in motif scaffolding for protein design | Kieran Didi et.al. | 2312.09236 | null |
2023-12-14 | Mosaic-SDF for 3D Generative Models | Lior Yariv et.al. | 2312.09222 | null |
2023-12-14 | Fast Sampling via De-randomization for Discrete Diffusion Models | Zixiang Chen et.al. | 2312.09193 | null |
2023-12-14 | Improving Efficiency of Diffusion Models via Multi-Stage Framework and Tailored Multi-Decoder Architectures | Huijie Zhang et.al. | 2312.09181 | null |
2023-12-14 | DiffusionLight: Light Probes for Free by Painting a Chrome Ball | Pakkapon Phongthawee et.al. | 2312.09168 | link |
2023-12-14 | Triplane Meets Gaussian Splatting: Fast and Generalizable Single-View 3D Reconstruction with Transformers | Zi-Xin Zou et.al. | 2312.09147 | null |
2023-12-14 | VideoLCM: Video Latent Consistency Model | Xiang Wang et.al. | 2312.09109 | null |
2023-12-14 | PI3D: Efficient Text-to-3D Generation with Pseudo-Image Diffusion | Ying-Tian Liu et.al. | 2312.09069 | null |
2023-12-14 | Brain Diffuser with Hierarchical Transformer for MCI Causality Analysis | Qiankun Zuo et.al. | 2312.09022 | null |
2023-12-14 | OMG: Towards Open-vocabulary Motion Generation via Mixture of Controllers | Han Liang et.al. | 2312.08985 | null |
2023-12-14 | Motion Flow Matching for Human Motion Synthesis and Editing | Vincent Tao Hu et.al. | 2312.08895 | null |
2023-12-14 | VaLID: Variable-Length Input Diffusion for Novel View Synthesis | Shijie Li et.al. | 2312.08892 | null |
2023-12-14 | Diffusion-C: Unveiling the Generative Challenges of Diffusion Models through Corrupted Data | Keywoong Bae et.al. | 2312.08843 | null |
2023-12-14 | Speeding up Photoacoustic Imaging using Diffusion Models | Irem Loc et.al. | 2312.08834 | link |
2023-12-14 | Guided Diffusion from Self-Supervised Diffusion Features | Vincent Tao Hu et.al. | 2312.08825 | null |
2023-12-14 | Reconstruction of Sound Field through Diffusion Models | Federico Miotello et.al. | 2312.08821 | null |
2023-12-14 | Local Conditional Controlling for Text-to-Image Diffusion Models | Yibo Zhao et.al. | 2312.08768 | link |
2023-12-13 | PhenDiff: Revealing Invisible Phenotypes with Conditional Diffusion Models | Anis Bourou et.al. | 2312.08290 | link |
2023-12-13 | Black-box Membership Inference Attacks against Fine-tuned Diffusion Models | Yan Pang et.al. | 2312.08207 | null |
2023-12-13 | Concept-centric Personalization with Large-scale Diffusion Priors | Pu Cao et.al. | 2312.08195 | link |
2023-12-13 | | Maxwell X. Cai et.al. | 2312.08153 | null |
2023-12-13 | Clockwork Diffusion: Efficient Generation With Model-Step Distillation | Amirhossein Habibian et.al. | 2312.08128 | null |
2023-12-13 | Knowledge-Aware Artifact Image Synthesis with LLM-Enhanced Prompting and Multi-Source Supervision | Shengguang Wu et.al. | 2312.08056 | null |
2023-12-14 | Compositional Inversion for Stable Diffusion Models | Xu-Lu Zhang et.al. | 2312.08048 | link |
2023-12-13 | AdapEdit: Spatio-Temporal Guided Adaptive Editing Algorithm for Text-Based Continuity-Sensitive Image Editing | Zhiyuan Ma et.al. | 2312.08019 | link |
2023-12-13 | Time Series Diffusion Method: A Denoising Diffusion Probabilistic Model for Vibration Signal Generation | Haiming Yi et.al. | 2312.07981 | null |
2023-12-13 | LMD: Faster Image Reconstruction with Latent Masking Diffusion | Zhiyuan Ma et.al. | 2312.07971 | link |
2023-12-13 | Semantic-aware Data Augmentation for Text-to-image Synthesis | Zhaorui Tan et.al. | 2312.07951 | null |
2023-12-13 | BOTH2Hands: Inferring 3D Hands from Both Text Prompts and Body Dynamics | Wenqian Zhang et.al. | 2312.07937 | null |
2023-12-13 | SimAC: A Simple Anti-Customization Method against Text-to-Image Synthesis of Diffusion Models | Feifei Wang et.al. | 2312.07865 | null |
2023-12-13 | Diffusion Models Enable Zero-Shot Pose Estimation for Lower-Limb Prosthetic Users | Tianxun Zhou et.al. | 2312.07854 | null |
2023-12-14 | Noise in the reverse process improves the approximation capabilities of diffusion models | Karthik Elamvazhuthi et.al. | [2312.07851](http://arxiv.org/abs/2312.0 |