sulasen's repositories
videollm-online
VideoLLM-online: Online Video Large Language Model for Streaming Video (CVPR 2024)
autogen
Enable Next-Gen Large Language Model Applications. Join our Discord: https://discord.gg/pAbnFJrkgZ
grok-1
Grok open release
Open-Sora
Open-Sora: Democratizing Efficient Video Production for All
self-rag
This includes the original implementation of SELF-RAG: Learning to Retrieve, Generate and Critique through self-reflection by Akari Asai, Zeqiu Wu, Yizhong Wang, Avirup Sil, and Hannaneh Hajishirzi.
T-Rex
Detect and count any objects by visual prompting
openai-cookbook
Examples and guides for using the OpenAI API
DevOpsGPT
Multi agent system for AI-driven software development. Combine LLM with DevOps tools to convert natural language requirements into working software. Supports any development language and extends the existing code.
MetaGPT
🌟 The Multi-Agent Framework: Given one line Requirement, return PRD, Design, Tasks, Repo
gpt-researcher
GPT based autonomous agent that does online comprehensive research on any given topic
ChatCaptioner
Official Repository of ChatCaptioner
pytorch-handwriting-synthesis-toolkit
Handwriting generation and handwriting synthesis as described in Alex Graves's paper https://arxiv.org/abs/1308.0850. Pytorch implementation.
minGPT
A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training
football
Check out the new game server:
imagezmq
A set of Python classes that transport OpenCV images from one computer to another using PyZMQ messaging.
video-anomaly-detection
Anomaly detection in videos using deep learning.
label-studio
Label Studio is a multi-type data labeling and annotation tool with standardized output format
simpletransformers
Transformers made simple with training, evaluation, and prediction possible with one line each. Currently supports Sequence Classification (binary, multiclass, multilabel, sentence pair), Token Classification (NER), Question Answering, Language Modeling, Regression, Conversational AI, and Multi-Modal tasks. Built on top of the Hugging Face Transformer library.
ktrain
ktrain is a Python library that makes deep learning and AI more accessible and easier to apply
SecLists
SecLists is the security tester's companion. It's a collection of multiple types of lists used during security assessments, collected in one place. List types include usernames, passwords, URLs, sensitive data patterns, fuzzing payloads, web shells, and many more.
fastai-serving
A Docker image for serving fast.ai models, mimicking the API of Tensorflow Serving
pytorch-transformers-classification
Based on the Pytorch-Transformers library by HuggingFace. To be used as a starting point for employing Transformer models in text classification tasks. Contains code to easily train BERT, XLNet, RoBERTa, and XLM models for text classification.
Object-Detection-and-Tracking
YOLO & RCNN Object Detection and Multi-Object Tracking
socceraction
Convert existing soccer event stream data to SPADL and value player actions
YOLO3-4-Py
A Python wrapper on Darknet. Compatible with YOLO V3.
AlphaPose
Real-Time and Accurate Multi-Person Pose Estimation&Tracking System
mlflow
Open source platform for the machine learning lifecycle
openpose
OpenPose: Real-time multi-person keypoint detection library for body, face, hands, and foot estimation
labelImg
:metal: LabelImg is a graphical image annotation tool and label object bounding boxes in images