view1234567's starred repositories

Ovis

A novel Multimodal Large Language Model (MLLM) architecture, designed to structurally align visual and textual embeddings.

Language:PythonLicense:Apache-2.0Stargazers:355Issues:0Issues:0

UniSpeech

UniSpeech - Large Scale Self-Supervised Learning for Speech

Language:PythonLicense:NOASSERTIONStargazers:424Issues:0Issues:0

LLaMA-Omni

LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speech capabilities at the GPT-4o level.

Language:PythonLicense:Apache-2.0Stargazers:2130Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:5957Issues:0Issues:0

g1

g1: Using Llama-3.1 70b on Groq to create o1-like reasoning chains

Language:PythonLicense:MITStargazers:3407Issues:0Issues:0

LeCo

This the implementation of LeCo

Language:PythonStargazers:24Issues:0Issues:0

LongCite

LongCite: Enabling LLMs to Generate Fine-grained Citations in Long-context QA

Language:PythonLicense:Apache-2.0Stargazers:276Issues:0Issues:0

TAG-Bench

TAG-Bench: A benchmark for table-augmented generation (TAG)

Language:PythonLicense:MITStargazers:457Issues:0Issues:0

DAMO-ConvAI

DAMO-ConvAI: The official repository which contains the codebase for Alibaba DAMO Conversational AI.

Language:PythonLicense:MITStargazers:1180Issues:0Issues:0

CrisperWhisper

Verbatim Automatic Speech Recognition with improved word-level timestamps and filler detection

Language:PythonLicense:NOASSERTIONStargazers:195Issues:0Issues:0

MambaInLlama

Official Repository of The Mamba in the Llama: Distilling and Accelerating Hybrid Models

Language:PythonLicense:Apache-2.0Stargazers:144Issues:0Issues:0

QAnything

Question and Answer based on Anything.

Language:PythonLicense:AGPL-3.0Stargazers:11533Issues:0Issues:0

MaxKB

🚀 基于大语言模型和 RAG 的知识库问答系统。开箱即用、模型中立、灵活编排,支持快速嵌入到第三方业务系统。

Language:PythonLicense:GPL-3.0Stargazers:10506Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:902Issues:0Issues:0

GitHubDaily

坚持分享 GitHub 上高质量、有趣实用的开源技术教程、开发者工具、编程网站、技术资讯。A list cool, interesting projects of GitHub.

Stargazers:32090Issues:0Issues:0

VideoLingo

Netflix级字幕切割、翻译、对齐、甚至加上配音,一键全自动视频搬运AI字幕组

Language:PythonLicense:Apache-2.0Stargazers:2966Issues:0Issues:0

ultimatevocalremovergui

GUI for a Vocal Remover that uses Deep Neural Networks.

Language:PythonLicense:MITStargazers:17697Issues:0Issues:0

GPT-SoVITS-Inference

Inference Specialization

Language:PythonLicense:MITStargazers:317Issues:0Issues:0

GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Language:PythonLicense:MITStargazers:33478Issues:0Issues:0

whisper_streaming

Whisper realtime streaming for long speech-to-text transcription and translation

Language:PythonLicense:MITStargazers:1874Issues:0Issues:0

whisper-medusa

Whisper with Medusa heads

Language:PythonLicense:MITStargazers:785Issues:0Issues:0

MinerU

A one-stop, open-source, high-quality data extraction tool, supports PDF/webpage/e-book extraction.一站式开源高质量数据提取工具,支持PDF/网页/多格式电子书提取。

Language:PythonLicense:AGPL-3.0Stargazers:11876Issues:0Issues:0

llama-cpp-python

Python bindings for llama.cpp

Language:PythonLicense:MITStargazers:7832Issues:0Issues:0
Language:PythonStargazers:168Issues:0Issues:0

BitNet

Implementation of "BitNet: Scaling 1-bit Transformers for Large Language Models" in pytorch

Language:PythonLicense:MITStargazers:1554Issues:0Issues:0

kaldi

kaldi-asr/kaldi is the official location of the Kaldi project.

Language:ShellLicense:NOASSERTIONStargazers:14170Issues:0Issues:0

vosk

VOSK Speech Recognition Toolkit

Language:CLicense:Apache-2.0Stargazers:378Issues:0Issues:0

vosk-android-demo

Offline speech recognition for Android with Vosk library.

Language:JavaLicense:Apache-2.0Stargazers:740Issues:0Issues:0

piper

A fast, local neural text to speech system

Language:C++License:MITStargazers:5994Issues:0Issues:0

MiniCPM-V

MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone

Language:PythonLicense:Apache-2.0Stargazers:12139Issues:0Issues:0