亓官劼's starred repositories

fuzi.mingcha

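Fuzi-Mingcha (夫子•明察) is a Chinese judicial large language model jointly developed by Shandong University, Inspur Cloud, and ** University of Political Science and Law. Built on ChatGLM as its base model, it is trained on a large corpus of unsupervised Chinese judicial text and on supervised judicial fine-tuning data. The model supports statute retrieval, case analysis, syllogistic reasoning for judgments, and judicial dialogue, and aims to give users comprehensive, highly accurate legal consultation and answers.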

Language: Python | License: Apache-2.0 | Stargazers: 229 | Issues: 0

DISC-LawLLM

DISC-LawLLM, an intelligent legal system utilizing large language models (LLMs) to provide a wide range of legal services

Language: Python | License: Apache-2.0 | Stargazers: 458 | Issues: 0

LexiLaw

LexiLaw - a Chinese legal large language model

Language: Python | License: MIT | Stargazers: 585 | Issues: 0

autocut

Cut videos with a text editor

Language: Python | License: Apache-2.0 | Stargazers: 6346 | Issues: 0

alpaca-lora

Instruct-tune LLaMA on consumer hardware

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 18312 | Issues: 0

LoRA

Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"

Language: Python | License: MIT | Stargazers: 9510 | Issues: 0

ICD

Code & Data for our Paper "Alleviating Hallucinations of Large Language Models through Induced Hallucinations"

Language: Python | License: MIT | Stargazers: 52 | Issues: 0

ICD

Code & Data for our Paper "Alleviating Hallucinations of Large Language Models through Induced Hallucinations"

License: MIT | Stargazers: 1 | Issues: 0

langchain

🦜🔗 Build context-aware reasoning applications

Language: Python | License: MIT | Stargazers: 86993 | Issues: 0

SRILM

Mirror of SRILM

Language: Roff | License: NOASSERTION | Stargazers: 49 | Issues: 0

Language: Python | Stargazers: 7 | Issues: 0

sentencepiece

Unsupervised text tokenizer for Neural Network-based text generation.

Language: C++ | License: Apache-2.0 | Stargazers: 9711 | Issues: 0

llama

Inference code for Llama models

Language: Python | License: NOASSERTION | Stargazers: 53855 | Issues: 0

Megatron-LLaMA

Best practice for training LLaMA models in Megatron-LM

Language: Python | License: NOASSERTION | Stargazers: 558 | Issues: 0

OpenTransformer

A No-Recurrence Sequence-to-Sequence Model for Speech Recognition

Language: Python | License: MIT | Stargazers: 369 | Issues: 0

WeTextProcessing

Text Normalization & Inverse Text Normalization

Language: Python | License: Apache-2.0 | Stargazers: 386 | Issues: 0

PunctuationModel

A Chinese punctuation model that adds punctuation marks to text.

Language: Python | License: Apache-2.0 | Stargazers: 119 | Issues: 0

PPASR

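End-to-end Chinese speech recognition implemented with PaddlePaddle, covering everything from first steps to hands-on practice, with very simple introductory examples and practical enterprise projects. Supports the currently most popular models: DeepSpeech2, Conformer, and Squeezeformer.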

Language: Python | License: Apache-2.0 | Stargazers: 781 | Issues: 0

Cross-Domain-Chinese-Punctuation-Prediction

CDCPP: Cross-Domain Chinese Punctuation Prediction

License: GPL-3.0 | Stargazers: 9 | Issues: 0

whisper

Robust Speech Recognition via Large-Scale Weak Supervision

Language: Python | License: MIT | Stargazers: 62934 | Issues: 0

gensim-data

Data repository for pretrained NLP models and NLP corpora.

Language: Python | License: LGPL-2.1 | Stargazers: 957 | Issues: 0

FunASR

A fundamental end-to-end speech recognition toolkit with open-source SOTA pretrained models, supporting speech recognition, voice activity detection, text post-processing, and more.

Language: Python | License: NOASSERTION | Stargazers: 4152 | Issues: 0

FunClip

Open-source, accurate, and easy-to-use video speech recognition and clipping tool, with LLM-based AI clipping integrated.

Language: Python | License: MIT | Stargazers: 2394 | Issues: 0

vits

VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech

Language: Python | License: MIT | Stargazers: 6409 | Issues: 0

uroman

Universal romanizer that can convert any Unicode script to Roman (Latin) script

Language: Perl | License: NOASSERTION | Stargazers: 125 | Issues: 0

vqwordseg

Unsupervised phone and word segmentation using dynamic programming on self-supervised VQ features.

Language: Jupyter Notebook | License: MIT | Stargazers: 33 | Issues: 0

Language: Python | Stargazers: 1 | Issues: 0

kaldi

kaldi-asr/kaldi is the official location of the Kaldi project.

Language: Shell | License: NOASSERTION | Stargazers: 13869 | Issues: 0

pytorch-softdtw-cuda

Fast CUDA implementation of (differentiable) soft dynamic time warping for PyTorch

Language: Python | License: MIT | Stargazers: 595 | Issues: 0

tiktoken

tiktoken is a fast BPE tokeniser for use with OpenAI's models.

Language: Python | License: MIT | Stargazers: 10765 | Issues: 0