Liviu-Daniel's repositories
APTM
The official code of "Towards Unified Text-based Person Retrieval: A Large-scale Multi-Attribute and Language Search Benchmark"
avalanche
Avalanche: an End-to-End Library for Continual Learning based on PyTorch.
CLL-STR
Cross-lingual learning in scene text recognition (ICASSP2024)
CO-MOT
CO-MOT: Bridging the Gap Between End-to-end and Non-End-to-end Multi-Object Tracking
codabench
Codabench is a flexible, easy-to-use and reproducible benchmarking platform. Check our paper at Patterns Cell Press https://hubs.li/Q01fwRWB0
deep-license-plate-recognition
Automatic License Plate Recognition (ALPR) or Automatic Number Plate Recognition (ANPR) software that works with any camera.
DeepSolo
The official repo for [CVPR'23] "DeepSolo: Let Transformer Decoder with Explicit Points Solo for Text Spotting" & [ArXiv'23] "DeepSolo++: Let Transformer Decoder with Explicit Points Solo for Text Spotting"
Depth-Anything-V2
Depth Anything V2. A More Capable Foundation Model for Monocular Depth Estimation
DIVOTrack
A Novel Dataset and Baseline Method for Cross-View Multi-Object Tracking in DIVerse Open Scenes (Accepted to IJCV 2023)
event-jekyll-theme
Jekyll Theme package for your event
feeling-responsive
»Feeling Responsive« is a free flexible theme for Jekyll built on Foundation framework. You can use it for your company site, as a portfolio or as a blog.
hiertext
The HierText dataset contains ~12k images from the Open Images dataset v6 with large amount of text entities. We provide word, line and paragraph level annotations.
HighwayEnv
A minimalist environment for decision-making in autonomous driving
jekyll-theme-conference
Jekyll template for a conference website containing program, speaker, talks and room overview
LAVIS
LAVIS - A One-stop Library for Language-Vision Intelligence
LION
Official repository of ”LION: Linear Group RNN for 3D Object Detection in Point Clouds“
marker
Convert PDF to markdown quickly with high accuracy
Multi-Task-Transformer
Code of ICLR2023 paper "TaskPrompter: Spatial-Channel Multi-Task Prompting for Dense Scene Understanding" and ECCV2022 paper "Inverted Pyramid Multi-task Transformer for Dense Scene Understanding"
post-to-email
Supporting contact forms for static websites
PySyft
Perform data science on data that remains in someone else's server
QCNet
[CVPR 2023] Query-Centric Trajectory Prediction
RPG-DiffusionMaster
Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (PRG)
SwinTextSpotter
Pytorch re-implementation of Paper: SwinTextSpotter: Scene Text Spotting via Better Synergy between Text Detection and Text Recognition (CVPR 2022)
TCM
Turning a CLIP Model into a Scene Text Detector (CVPR2023)
Uni3D
[ICLR'24 Spotlight] Uni3D: 3D Visual Representation from BAAI
zmeventnotification
Machine Learning powered Secure Websocket & MQTT based ZoneMinder event notification server