dongjicheng

dongjicheng

Geek Repo

Github PK Tool:Github PK Tool

dongjicheng's repositories

ann-benchmarks

Benchmarks of approximate nearest neighbor libraries in Python

License:MITStargazers:0Issues:0Issues:0

awesome-pretrained-chinese-nlp-models

Awesome Pretrained Chinese NLP Models,高质量中文预训练模型&大模型&多模态模型&大语言模型集合

License:MITStargazers:0Issues:0Issues:0

bytepiece

更纯粹、更高压缩率的Tokenizer

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

chat-dataset-baseline

人工精调的中文对话数据集和一段chatglm的微调代码

Stargazers:0Issues:0Issues:0

ChatLM-mini-Chinese

中文对话0.2B小模型(ChatLM-Chinese-0.2B),开源所有数据集来源、数据清洗、tokenizer训练、模型预训练、SFT指令微调、RLHF优化等流程的全部代码。支持下游任务sft微调,给出三元组信息抽取微调示例。

License:Apache-2.0Stargazers:0Issues:0Issues:0

ChatTTS

ChatTTS is a generative speech model for daily dialogue.

License:NOASSERTIONStargazers:0Issues:0Issues:0

CLIP

CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image

License:MITStargazers:0Issues:0Issues:0

CRIS.pytorch

An official PyTorch implementation of the CRIS paper

License:MITStargazers:0Issues:0Issues:0

deep-tempest

Restoration for TEMPEST images using deep-learning

License:NOASSERTIONStargazers:0Issues:0Issues:0

evaluate

🤗 Evaluate: A library for easily evaluating machine learning models and datasets.

License:Apache-2.0Stargazers:0Issues:0Issues:0

FlagEmbedding

Retrieval and Retrieval-augmented LLMs

License:MITStargazers:0Issues:0Issues:0

FunASR

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

License:NOASSERTIONStargazers:0Issues:0Issues:0

gpt4all

gpt4all: open-source LLM chatbots that you can run anywhere

License:MITStargazers:0Issues:0Issues:0

labelImg

LabelImg is now part of the Label Studio community. The popular image annotation tool created by Tzutalin is no longer actively being developed, but you can check out Label Studio, the open source data labeling tool for images, text, hypertext, audio, video and time-series data.

License:MITStargazers:0Issues:0Issues:0

learnopencv

Learn OpenCV : C++ and Python Examples

Stargazers:0Issues:0Issues:0

llama3-Chinese-chat

Llama3 中文仓库(聚合资料,各种网友及厂商微调、魔改版本有趣权重 & 训练、推理、评测、部署教程视频 & 文档)

Stargazers:0Issues:0Issues:0

LLM-from-scratch

一些 LLM 方面的从零复现笔记

License:Apache-2.0Stargazers:0Issues:0Issues:0

LLM101n

LLM101n: Let's build a Storyteller

Stargazers:0Issues:0Issues:0

LLMTest_NeedleInAHaystack

Doing simple retrieval from LLM models at various context lengths to measure accuracy

License:NOASSERTIONStargazers:0Issues:0Issues:0

matmulfreellm

Implementation for MatMul-free LM.

License:Apache-2.0Stargazers:0Issues:0Issues:0

nlp-tutorial

Natural Language Processing Tutorial for Deep Learning Researchers

License:MITStargazers:0Issues:0Issues:0

segment-anything

The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

License:Apache-2.0Stargazers:0Issues:0Issues:0

stablediffusion

High-Resolution Image Synthesis with Latent Diffusion Models

License:MITStargazers:0Issues:0Issues:0

Synonyms

:herb: 中文近义词:聊天机器人,智能问答工具包

License:NOASSERTIONStargazers:0Issues:0Issues:0

tantivy-py

Python bindings for Tantivy

License:MITStargazers:0Issues:0Issues:0

tiktoken

tiktoken is a fast BPE tokeniser for use with OpenAI's models.

License:MITStargazers:0Issues:0Issues:0

timesfm

TimesFM (Time Series Foundation Model) is a pretrained time-series foundation model developed by Google Research for time-series forecasting.

License:Apache-2.0Stargazers:0Issues:0Issues:0

trlx

A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)

License:MITStargazers:0Issues:0Issues:0

TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

License:MPL-2.0Stargazers:0Issues:0Issues:0

tts-gan

TTS-GAN: A Transformer-based Time-Series Generative Adversarial Network

License:Apache-2.0Stargazers:0Issues:0Issues:0