verigle's repositories
animate-anything
Fine-Grained Open Domain Image Animation with Motion Guidance
Aurora
[NeurIPS2023] Parameter-efficient Tuning of Large-scale Multimodal Foundation Model
AutoGPT
An experimental open-source attempt to make GPT-4 fully autonomous.
BXC_VideoAnalyzer_v4
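Video behavior analysis system, v4 series. Without having to handle streaming audio/video development, codec development, or UI development, you only need to train your own model and develop your own behavior-algorithm plugin to easily build any safety behavior detection you want, such as face recognition, vehicle recognition, perimeter intrusion, fighting, brawling, falls, crowd gathering, leaving or sleeping on duty, hard-hat detection, charging piles, work uniforms, fatigue detection, traffic congestion, and so on.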
Chat-UniVi
Chat-UniVi: Unified Visual Representation Empowers Large Language Models with Image and Video Understanding
CompreFace
Leading free and open-source face recognition system
compreface-python-sdk
Python SDK for CompreFace - free and open-source face recognition system from Exadel
dash-to-panel
An icon taskbar for the Gnome Shell. This extension moves the dash into the gnome main panel so that the application launchers and system tray are combined into a single panel, similar to that found in KDE Plasma and Windows 7+. A separate dock is no longer needed for easy access to running and favorited applications.
Data-Copilot
Data-Copilot: Bridging Billions of Data and Humans with Autonomous Workflow
decord
An efficient video loader for deep learning with smart shuffling that's super easy to digest
FaceX-Zoo
A PyTorch Toolbox for Face Recognition
funNLP
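Chinese and English sensitive words, language detection, Chinese and international mobile/landline number location and carrier lookup, gender inference from names, mobile number extraction, ID card number extraction, email extraction, Chinese and Japanese personal name databases, Chinese abbreviation database, character decomposition dictionary, word sentiment values, stop words, reactionary word list, violence and terrorism word list, Traditional/Simplified Chinese conversion, English simulation of Chinese pronunciation, Wang Feng lyrics generator, occupation name lexicon, synonym lexicon, antonym lexicon, negation word lexicon, car brand lexicon, car parts lexicon, segmentation of continuous English text, various Chinese word vectors, comprehensive company name list, classical poetry corpus, IT lexicon, finance lexicon, idiom lexicon, place name lexicon, historical figures lexicon, poetry lexicon, medical lexicon, food and drink lexicon, legal lexicon, automotive lexicon, animal lexicon, Chinese chat corpus, Chinese rumor data, Baidu Chinese question-answering dataset, collection of sentence similarity matching algorithms, BERT resources, text generation & summarization tools, cocoNLP information extraction tool, regex matching for Chinese domestic phone numbers, Tsinghua University XLORE: Chinese-English cross-lingual encyclopedic knowledge graph, Tsinghua University artificial intelligence technology series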
grok-1
Grok open release
InternImage
[CVPR 2023 Highlight] InternImage: Exploring Large-Scale Vision Foundation Models with Deformable Convolutions
LaVIN
Official implementations of "Cheap and Quick: Efficient Vision-Language Instruction Tuning for Large Language Models"
LAVIS
LAVIS - A One-stop Library for Language-Vision Intelligence
moviepy
Video editing with Python
mybatis-pro-plus
A powerful enhanced toolkit for MyBatis to simplify development (todo pro)
open-interpreter
OpenAI's Code Interpreter in your terminal, running locally
opencv-python-cuda-wheels
Automated CI toolchain to produce precompiled opencv-python, opencv-python-headless, opencv-contrib-python and opencv-contrib-python-headless packages.
pix2seq
Pix2Seq - A general framework for turning RGB pixels into semantically meaningful sequences
SAM-Med2D
Official implementation of SAM-Med2D
TigerBot
TigerBot: A multi-language multi-task LLM
unidiffuser
Code and models for the paper "One Transformer Fits All Distributions in Multi-Modal Diffusion"
UNINEXT
[CVPR'23] Universal Instance Perception as Object Discovery and Retrieval
VideoPipe
A cross-platform video structuring (video analysis) framework. If you find it helpful, please give it a star :).
zotero-pdf-translate
PDF translation add-on for Zotero 6