Beast code in Giters

Antonio Scaiella's repositories

ast

Code for the Interspeech 2021 paper "AST: Audio Spectrogram Transformer".

Language: Jupyter Notebook; License: BSD-3-Clause

audiocraft

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.

Language: Python; License: MIT

core

Production ready AI assistant framework

Language: Python; License: GPL-3.0

DeepSpeech-Italian-Model

Tooling for producing an Italian model for DeepSpeech (public release available) and a text corpus

Language: Python

GLiNER

Generalist and Lightweight Model for Named Entity Recognition (Extract any entity types from texts) @ NAACL 24

Language: Python; License: Apache-2.0

mamba

Language: Python; License: Apache-2.0

MyNN1

Language: Python

OCRmyImage

Language: Python

OmniFusion

OmniFusion — a multimodal model to communicate using text and images

Language: Python; License: Apache-2.0

parler-tts

Inference and training library for high-quality TTS models.

Language: Python; License: Apache-2.0

squad-it

A large-scale dataset for Question Answering in Italian


video-caption.pytorch

Language: Python; License: MIT

skynet

AI core services for Jitsi

Language: Python; License: Apache-2.0

VAR

[GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction"

Language: Python; License: MIT

VoiceCraft

Zero-Shot Speech Editing and Text-to-Speech in the Wild

Language: Jupyter Notebook; License: NOASSERTION

piperino11
