horton2009's repositories

regmix

🧬 RegMix: Data Mixture as Regression for Language Model Pre-training

License:MITStargazers:0Issues:0Issues:0

whisper

Robust Speech Recognition via Large-Scale Weak Supervision

License:MITStargazers:0Issues:0Issues:0

faiss

A library for efficient similarity search and clustering of dense vectors.

License:MITStargazers:0Issues:0Issues:0

abctools

ABC Transcription tools based on abcjs

License:MITStargazers:0Issues:0Issues:0

ChatTTS

ChatTTS is a generative speech model for daily dialogue.

License:NOASSERTIONStargazers:0Issues:0Issues:0

gpt-2

Code for the paper "Language Models are Unsupervised Multitask Learners"

License:NOASSERTIONStargazers:0Issues:0Issues:0

CLIP

CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image

License:MITStargazers:0Issues:0Issues:0

DeepSeek-VL

DeepSeek-VL: Towards Real-World Vision-Language Understanding

License:MITStargazers:0Issues:0Issues:0

TinyGPT-V

TinyGPT-V: Efficient Multimodal Large Language Model via Small Backbones

License:BSD-3-ClauseStargazers:0Issues:0Issues:0

FullLLM

Full stack LLM (Pre-training/finetuning, PPO(RLHF), Inference, Quant, etc.)

License:MITStargazers:0Issues:0Issues:0

llm-decontaminator

Code for the paper "Rethinking Benchmark and Contamination for Language Models with Rephrased Samples"

License:Apache-2.0Stargazers:0Issues:0Issues:0

Auto-GPT

An experimental open-source attempt to make GPT-4 fully autonomous.

License:MITStargazers:0Issues:0Issues:0

BMTools

Tool Learning for Big Models, Open-Source Solutions of ChatGPT-Plugins

License:Apache-2.0Stargazers:0Issues:0Issues:0

PaddleOCR

Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices)

License:Apache-2.0Stargazers:0Issues:0Issues:0

PGL

Paddle Graph Learning (PGL) is an efficient and flexible graph learning framework based on PaddlePaddle

License:Apache-2.0Stargazers:0Issues:0Issues:0

ERNIE

An Implementation of ERNIE For Language Understanding (including Pre-training models and Fine-tuning tools)

License:Apache-2.0Stargazers:0Issues:0Issues:0

Paddle

PArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice (『飞桨』核心框架,深度学习&机器学习高性能单机、分布式训练和跨平台部署)

License:Apache-2.0Stargazers:0Issues:0Issues:0

gpt-2-Pytorch

Simple Text-Generator with OpenAI gpt-2 Pytorch Implementation

License:MITStargazers:0Issues:0Issues:0

12306

12306智能刷票,订票

Language:PythonStargazers:0Issues:0Issues:0

darts-clone

A clone of Darts (Double-ARray Trie System)

Language:C++License:BSD-2-ClauseStargazers:0Issues:0Issues:0

decagon

Graph convolutional neural network for multirelational link prediction

Language:Jupyter NotebookLicense:MITStargazers:0Issues:0Issues:0

pytorch-cn

Pythrch-CN文档地址

Stargazers:0Issues:0Issues:0

pytorch

Tensors and Dynamic neural networks in Python with strong GPU acceleration

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

torch7

http://torch.ch

Language:CLicense:NOASSERTIONStargazers:0Issues:0Issues:0

python-crfsuite

A python binding for crfsuite

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

naive-rete

Python RETE algorithm

Language:PythonStargazers:0Issues:0Issues:0

ansj_seg

ansj分词.ict的真正java实现.分词效果速度都超过开源版的ict. 中文分词,人名识别,词性标注,用户自定义词典

Language:JavaLicense:Apache-2.0Stargazers:0Issues:0Issues:0

anthelion

Anthelion is a plugin for Apache Nutch to crawl semantic annotations within HTML pages

Language:JavaLicense:Apache-2.0Stargazers:0Issues:0Issues:0

test_git

test git for mac

Stargazers:0Issues:0Issues:0

svdlibc

A fork of Doug Rohde's SVD C Library.

Language:CStargazers:0Issues:0Issues:0