qiguanjie

followers

following

stars

China

https://qiguanjie.blog.csdn.net

亓官劼's starred repositories

fuzi.mingcha

夫子•明察司法大模型是由山东大学、浪潮云、**政法大学联合研发，以 ChatGLM 为大模型底座，基于海量中文无监督司法语料与有监督司法微调数据训练的中文司法大模型。该模型支持法条检索、案例分析、三段论推理判决以及司法对话等功能，旨在为用户提供全方位、高精准的法律咨询与解答服务。

Language:PythonApache-2.022900

DISC-LawLLM

DISC-LawLLM, an intelligent legal system utilizing large language models (LLMs) to provide a wide range of legal services

Language:PythonApache-2.045800

LexiLaw

LexiLaw - 中文法律大模型

Language:PythonMIT58500

autocut

用文本编辑器剪视频

Language:PythonApache-2.0634600

alpaca-lora

Instruct-tune LLaMA on consumer hardware

Language:Jupyter NotebookApache-2.01831200

LoRA

Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"

Language:PythonMIT951000

ICD

Code & Data for our Paper "Alleviating Hallucinations of Large Language Models through Induced Hallucinations"

Language:PythonMIT5200

ICD

Code & Data for our Paper "Alleviating Hallucinations of Large Language Models through Induced Hallucinations"

MIT100

langchain

🦜🔗 Build context-aware reasoning applications

Language:PythonMIT8699300

SRILM

Mirror of SRILM

Language:RoffNOASSERTION4900

MCWS

Language:Python700

sentencepiece

Unsupervised text tokenizer for Neural Network-based text generation.

Language:C++Apache-2.0971100

llama

Inference code for Llama models

Language:PythonNOASSERTION5385500

Megatron-LLaMA

Best practice for training LLaMA models in Megatron-LM

Language:PythonNOASSERTION55800

OpenTransformer

A No-Recurrence Sequence-to-Sequence Model for Speech Recognition

Language:PythonMIT36900

WeTextProcessing

Text Normalization & Inverse Text Normalization

Language:PythonApache-2.038600

PunctuationModel

中文标点符号模型，可以给文本添加标点符号。

Language:PythonApache-2.011900

PPASR

基于PaddlePaddle实现端到端中文语音识别，从入门到实战，超简单的入门案例，超实用的企业项目。支持当前最流行的DeepSpeech2、Conformer、Squeezeformer模型

Language:PythonApache-2.078100

Cross-Domain-Chinese-Punctuation-Prediction

CDCPP: Cross-Domain Chinese Punctuation Prediction

GPL-3.0900

whisper

Robust Speech Recognition via Large-Scale Weak Supervision

Language:PythonMIT6293400

gensim-data

Data repository for pretrained NLP models and NLP corpora.

Language:PythonLGPL-2.195700

FunASR

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

Language:PythonNOASSERTION415200

FunClip

Open-source, accurate and easy-to-use video speech recognition & clipping tool, LLM based AI clipping intergrated.

Language:PythonMIT239400

vits

VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech

Language:PythonMIT640900

uroman

Universal Romanizer that can convert any unicode script to roman (latin) script

Language:PerlNOASSERTION12500

vqwordseg

Unsupervised phone and word segmentation using dynamic programming on self-supervised VQ features.

Language:Jupyter NotebookMIT3300

DSegKNN

Language:Python100

kaldi

kaldi-asr/kaldi is the official location of the Kaldi project.

Language:ShellNOASSERTION1386900

pytorch-softdtw-cuda

Fast CUDA implementation of (differentiable) soft dynamic time warping for PyTorch

Language:PythonMIT59500

tiktoken

tiktoken is a fast BPE tokeniser for use with OpenAI's models.

Language:PythonMIT1076500