Chevolier's starred repositories
SEED-Story
SEED-Story: Multimodal Long Story Generation with Large Language Model
StoryDiffusion
Create Magic Story!
FourCastNet
Initial public release of code, data, and model weights for FourCastNet
Pangu-Weather
An official implementation of Pangu-Weather
LargeBatchCTR
Large batch training of CTR models based on DeepCTR with CowClip.
DeepCTR-Torch
【PyTorch】Easy-to-use,Modular and Extendible package of deep-learning based CTR models.
ultralytics
NEW - YOLOv8 🚀 in PyTorch > ONNX > OpenVINO > CoreML > TFLite
YOLO-World
[CVPR 2024] Real-Time Open-Vocabulary Object Detection
whisper_streaming
Whisper realtime streaming for long speech-to-text transcription and translation
StreamSpeech
StreamSpeech is an “All in One” seamless model for offline and simultaneous speech recognition, speech translation and speech synthesis.
audio-slicer
Python script that slices audio with silence detection
amazon-sagemaker-visual-search
This repository is part of a blog post that guides users through creating a visual search application using Amazon SageMaker and Amazon Elasticsearch service
CTranslate2
Fast inference engine for Transformer models
faster-whisper
Faster Whisper transcription with CTranslate2