Computer-Vision-Research-Group / Computer-Vision-Reading-Group

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

About the reading group

Welcome to the space for the Computer Vision Reading Group, hosted by the Computer Vision Research Group at the Department of Computer Science, University of Sheffield!

Our group convenes to explore cutting-edge research, discuss pivotal papers, and delve into emerging trends in CV. Involved topics may include, but are not limited to, object detection, semantic segmentation, medical image analysis, generative AI, foundation models, and multimodal learning.

The events will normally take place every three weeks from 4 PM to 5 PM, on Wednesdays, at Ada Lovelace (108) meeting room.

The slides and recordings will be shared at internal Google Drive space.

We welcome students and academic staff from various groups and departments who share a genuine interest in advancing their understanding of this dynamic field, as well as those who are willing to help others advance their knowledge beyond their own progression. Please email me if you are willing to propose and deliver a presentation. You can do it either with or without a partner.

Upcoming presentations

Paper Title (link) Presenter Time / Place Notes
TBD Yanran Zhang 26th June, 2024, 4pm-5pm / Ada Lovelace
TBD TBD TBD

Previous presentations

Paper Title (link) Presenter Time / Place Notes
Wang, Z., Bovik, A.C., Sheikh, H.R. and Simoncelli, E.P., 2004. Image Quality Assessment: From Error Visibility to Structural Similarity. IEEE Transactions on Image Processing (link)
Nilsson, J. and Akenine-Möller, T., 2020. Understanding SSIM. (link)
Zhang, R., Isola, P., Efros, A.A., Shechtman, E. and Wang, O., 2018. The Unreasonable Effectiveness of Deep Features as a Perceptual Metric. (link)
Eleftherios Ioannou 22nd May, 2024, 4pm-5pm / Ada Lovelace Topic: Image Quality Assessment: Understanding SSIM (and LPIPS)
Previously arranged on 15th May; Postponed to 22nd May.
Shi, Y., Wang, P., Ye, J., Long, M., Li, K. and Yang, X., 2023. Mvdream: Multi-view diffusion for 3d generation. arXiv preprint arXiv:2308.16512 (link)
Mildenhall, B., Srinivasan, P.P., Tancik, M., Barron, J.T., Ramamoorthi, R. and Ng, R., 2021. Nerf: Representing scenes as neural radiance fields for view synthesis. Communications of the ACM, 65(1), pp.99-106 (link)
Poole, B., Jain, A., Barron, J.T. and Mildenhall, B., 2022. Dreamfusion: Text-to-3d using 2d diffusion. arXiv preprint arXiv:2209.14988 (link)
Ming He 24th April, 2024, 4pm-5pm / Ada Lovelace Topic: Using 2D Diffusion Model for 3D Content Generation
Chefer, Hila, et al. "Attend-and-excite: Attention-based semantic guidance for text-to-image diffusion models." ACM Transactions on Graphics (TOG) 42.4 (2023) (link)
Feng, Weixi, et al. "Training-free structured diffusion guidance for compositional text-to-image synthesis." ICLR 2023 (link)
Chen, Minghao, Iro Laina, and Andrea Vedaldi. "Training-free layout control with cross-attention guidance." Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision. 2024. (link)
Xiaoqi Zhuang 10th April, 2024, 4pm-5pm / Ada Lovelace Topic: Compositional Generation Challeng
Previously arranged on 3rd April; Postponed to 10th April.
Rombach, Robin, et al. "High-resolution image synthesis with latent diffusion models." Proceedings of the IEEE/CVF conference on computer vision and pattern recognition. 2022. (link)
Peebles, William, and Saining Xie. "Scalable diffusion models with transformers." Proceedings of the IEEE/CVF International Conference on Computer Vision. 2023. (link)
Farheen Ramzan & Sijie Li 13th March, 2024, 4pm-5pm / Ada Lovelace Topic: Diffusion-based Generative Modeling Methods and Applications

About