Lien Le (lltlien)

Location: Ho Chi Minh, Vietnam

Lien Le's starred repositories

TransformerLens

A library for mechanistic interpretability of GPT-style language models

Language: Python | License: MIT | Stargazers: 1158 | Issues: 0

haystack

LLM orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. With advanced retrieval methods, it's best suited for building RAG, question answering, semantic search or conversational agent chatbots.

Language: Python | License: Apache-2.0 | Stargazers: 14486 | Issues: 0

basic-pitch

A lightweight yet powerful audio-to-MIDI converter with pitch bend detection

Language: Python | License: Apache-2.0 | Stargazers: 3111 | Issues: 0

ViHateT5

Repository for the paper "ViHateT5: Enhancing Hate Speech Detection in Vietnamese with A Unified Text-to-Text Transformer Model" (ACL'2024 - Findings)

Language: Python | Stargazers: 2 | Issues: 0

VnCoreNLP

A Vietnamese natural language processing toolkit (NAACL 2018)

Language: Java | License: NOASSERTION | Stargazers: 567 | Issues: 0

PhoWhisper

PhoWhisper: Automatic Speech Recognition for Vietnamese (2024)

License: Apache-2.0 | Stargazers: 95 | Issues: 0

ASR

End-to-End Vietnamese Speech Recognition using wav2vec 2.0

Stargazers: 86 | Issues: 0

PaddleSpeech

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.

Language: Python | License: Apache-2.0 | Stargazers: 10534 | Issues: 0

XPhoneBERT

XPhoneBERT: A Pre-trained Multilingual Model for Phoneme Representations for Text-to-Speech (INTERSPEECH 2023)

Language: Python | License: MIT | Stargazers: 289 | Issues: 0

gpt-researcher

GPT based autonomous agent that does online comprehensive research on any given topic

Language: Python | License: MIT | Stargazers: 13008 | Issues: 0

iwslt-2022

Systems submitted to IWSLT 2022 by the MT-UPC group.

Language: Python | License: MIT | Stargazers: 8 | Issues: 0

BT4ST

Code for ACL 2023 main conference paper "Back Translation for Speech-to-text Translation Without Transcripts".

Language: Python | Stargazers: 12 | Issues: 0

SpeechTransProgress

Tracking the progress in end-to-end speech translation

License: CC0-1.0 | Stargazers: 249 | Issues: 0

dragonfly

A modern replacement for Redis and Memcached

Language: C++ | License: NOASSERTION | Stargazers: 24470 | Issues: 0

video-subtitle-extractor

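Extracts hard-coded subtitles from videos and generates srt files. No third-party API is required; text recognition runs locally. A deep-learning-based video subtitle extraction framework that includes subtitle region detection and subtitle content extraction. A GUI tool for extracting hard-coded subtitles (hardsub) from videos and generating srt files.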

Language: Python | License: Apache-2.0 | Stargazers: 5220 | Issues: 0

video-splitter

Simple Python script to split video into equal length chunks or chunks of equal size, duration, etc.

Language: Python | License: Apache-2.0 | Stargazers: 449 | Issues: 0

generative-ai-for-beginners

18 Lessons, Get Started Building with Generative AI 🔗 https://microsoft.github.io/generative-ai-for-beginners/

Language: Jupyter Notebook | License: MIT | Stargazers: 52056 | Issues: 0

ColossalAI

Making large AI models cheaper, faster and more accessible

Language: Python | License: Apache-2.0 | Stargazers: 38285 | Issues: 0

awesome-generative-ai

A curated list of modern Generative Artificial Intelligence projects and services

License: CC0-1.0 | Stargazers: 5192 | Issues: 0

100-Days-Of-ML-Code

100 Days of ML Coding

License: MIT | Stargazers: 43870 | Issues: 0

Data-Science-For-Beginners

10 Weeks, 20 Lessons, Data Science for All!

Language: Jupyter Notebook | License: MIT | Stargazers: 26961 | Issues: 0

Awesome-Diffusion-Models

A collection of resources and papers on Diffusion Models

Language: HTML | License: MIT | Stargazers: 10383 | Issues: 0

label-studio

Label Studio is a multi-type data labeling and annotation tool with standardized output format

Language: JavaScript | License: Apache-2.0 | Stargazers: 17294 | Issues: 0

MLE-Flashcards

200+ detailed flashcards useful for reviewing topics in machine learning, computer vision, and computer science.

License: GPL-3.0 | Stargazers: 1898 | Issues: 0

the-algorithm

Source code for Twitter's Recommendation Algorithm

Language: Scala | License: AGPL-3.0 | Stargazers: 61686 | Issues: 0

data-science-road-map

A roadmap for those looking to start or expand a career in the data community

Language: HTML | License: MIT | Stargazers: 267 | Issues: 0

ann-benchmarks

Benchmarks of approximate nearest neighbor libraries in Python

Language: Python | License: MIT | Stargazers: 4730 | Issues: 0

densecap

Dense image captioning in Torch

Language: Jupyter Notebook | License: MIT | Stargazers: 1574 | Issues: 0

ImageCaptioning.pytorch

I decided to sync up this repo and self-critical.pytorch. (The old master is kept in the old master branch as an archive.)

Language: Python | License: MIT | Stargazers: 1423 | Issues: 0

annotated_deep_learning_paper_implementations

🧑‍🏫 60 Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans (cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠

Language: Python | License: MIT | Stargazers: 51175 | Issues: 0