Bayartsogt Yadamsuren (bayartsogt-ya)

bayartsogt-ya

Geek Repo

Company:The Home Depot

Location:United States

Home Page:https://bayartsogt-ya.github.io

Twitter:@_tsogoo_

Github PK Tool:Github PK Tool

Bayartsogt Yadamsuren's starred repositories

whisper

Robust Speech Recognition via Large-Scale Weak Supervision

Language:PythonLicense:MITStargazers:68355Issues:574Issues:0

llama

Inference code for Llama models

Language:PythonLicense:NOASSERTIONStargazers:55798Issues:521Issues:961

nanoGPT

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Language:PythonLicense:MITStargazers:36475Issues:371Issues:315

fairseq

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Language:PythonLicense:MITStargazers:30225Issues:428Issues:4186

datasets

🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools

Language:PythonLicense:Apache-2.0Stargazers:19082Issues:279Issues:2900

label-studio

Label Studio is a multi-type data labeling and annotation tool with standardized output format

Language:JavaScriptLicense:Apache-2.0Stargazers:18411Issues:174Issues:2226

haystack

:mag: AI orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. With advanced retrieval methods, it's best suited for building RAG, question answering, semantic search or conversational agent chatbots.

Language:PythonLicense:Apache-2.0Stargazers:16981Issues:138Issues:3526

gpt-3

GPT-3: Language Models are Few-Shot Learners

mamba

Mamba SSM architecture

Language:PythonLicense:Apache-2.0Stargazers:12712Issues:101Issues:512

faster-whisper

Faster Whisper transcription with CTranslate2

Language:PythonLicense:MITStargazers:11635Issues:121Issues:694

seamless_communication

Foundational Models for State-of-the-Art Speech and Text Translation

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:10801Issues:140Issues:350

text-generation-inference

Large Language Model Text Generation Inference

Language:PythonLicense:Apache-2.0Stargazers:8848Issues:99Issues:1314

TinyLlama

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Language:PythonLicense:Apache-2.0Stargazers:7700Issues:108Issues:156

chat-ui

Open source codebase powering the HuggingChat app

Language:TypeScriptLicense:Apache-2.0Stargazers:7314Issues:79Issues:567

bitsandbytes

Accessible large language models via k-bit quantization for PyTorch.

Language:PythonLicense:MITStargazers:6120Issues:50Issues:1014

whisper-jax

JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:4378Issues:43Issues:179

parler-tts

Inference and training library for high-quality TTS models.

Language:PythonLicense:Apache-2.0Stargazers:4282Issues:55Issues:97

swift-coreml-diffusers

Swift app demonstrating Core ML Stable Diffusion

Language:SwiftLicense:Apache-2.0Stargazers:2528Issues:40Issues:64

Ax

Adaptive Experimentation Platform

Language:PythonLicense:MITStargazers:2353Issues:69Issues:735

setfit

Efficient few-shot learning with Sentence Transformers

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:2177Issues:21Issues:311

dl-translate

Library for translating between 200 languages. Built on 🤗 transformers.

Language:PythonLicense:MITStargazers:438Issues:7Issues:29

inseq

Interpretability for sequence generation models 🐛 🔍

Language:PythonLicense:Apache-2.0Stargazers:364Issues:10Issues:82

audio-transformers-course

The Hugging Face Course on Transformers for Audio

Language:MDXLicense:Apache-2.0Stargazers:318Issues:32Issues:45

transliterate

Bi-directional transliterator for Python. Transliterates (unicode) strings according to the rules specified in the language packs.

rVAD

Matlab and Python libraries for an unsupervised method for robust voice activity detection (rVAD), as in the paper rVAD: An Unsupervised Segment-Based Robust Voice Activity Detection Method.

Language:MATLABLicense:NOASSERTIONStargazers:126Issues:7Issues:7

text-preprocessing

A python package for text preprocessing task in natural language processing.

Language:PythonLicense:BSD-3-ClauseStargazers:63Issues:1Issues:9

sliceguard

A library for detecting problematic data segments in structured and unstructured data with few lines of code.

Language:PythonLicense:MITStargazers:61Issues:5Issues:2

2022SegmentationST

SIGMORPHON 2022 Shared Task on Morpheme Segmentation

Language:Jupyter NotebookStargazers:23Issues:7Issues:16

megacov

Mega-COV: A Billion-Scale Dataset of 100+ Languages for COVID-19

sonicpy

Sonic python binding

Language:CLicense:Apache-2.0Stargazers:2Issues:1Issues:0