kg-nlp

A repository that contains models, datasets, and fine-tuning techniques for DB-GPT, with the purpose of enhancing model performance, especially in Text-to-SQL.

Language: Python | License: MIT | 000

FinGLM

Language: Python | 000

k8s_images

k8s image repository

Language: Dockerfile | 000

kserve

Serverless Inferencing on Kubernetes

Language: Python | License: Apache-2.0 | 000

kubeflow_pytorch

010

Match-Ignition

Language: Python | 000

MrDoc

MrDoc (觅思文档) is an online document and knowledge base system developed in Python. It is suitable for individuals and small to medium-sized teams to manage documents, wikis, knowledge, and notes.

Language: JavaScript | License: GPL-3.0 | 000

NLP-Loss-Pytorch

Implementations of several loss functions for imbalanced data, such as focal_loss, dice_loss, DSC Loss, and GHM Loss, among others.

Language: Python | License: MIT | 000

public-apis

A collective list of free APIs

Language: Python | License: MIT | 000

pytorch-loss

label-smooth, amsoftmax, partial-fc, focal-loss, triplet-loss, lovasz-softmax. Maybe useful

Language: Python | License: MIT | 000

Scorecard-Bundle

A High-level Scorecard Modeling API | Everything for scorecard modeling in one place

Language: Python | License: BSD-3-Clause | 000

scrapy

Scrapy, a fast high-level web crawling & scraping framework for Python.

Language: Python | License: BSD-3-Clause | 000

sentence-transformers

Multilingual Sentence & Image Embeddings with BERT

Language: Python | License: Apache-2.0 | 000

Tabular-LLM

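This project aims to collect open-source datasets for table intelligence tasks (such as table question answering and table-to-text generation), convert the raw data into instruction-tuning format, and fine-tune LLMs on it. The goal is to strengthen LLMs' understanding of tabular data and ultimately build large language models dedicated to table intelligence tasks.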

000

text2vec

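text2vec, text to vector. A text vector representation tool that converts text into vector matrices. It implements text representation and text similarity models including Word2Vec, RankBM25, Sentence-BERT, and CoSENT, and works out of the box.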

Language: Python | License: Apache-2.0 | 000

text_classification

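Text classification with rnn, lstm, gru, fasttext, textcnn, dpcnn, rnn-att, and lstm-att. Compatible with huggingface/transformers, and also supports using transformers as the word-embedding model followed by cnn, rnn, attention, and similar layers. Includes a comparison of the models.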

Language: Python | 000

torchrec

PyTorch domain library for recommendation systems

Language: Python | License: BSD-3-Clause | 000

unilm

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Language: Python | License: MIT | 000

vocab-coverage

Analysis of the Chinese-language cognitive ability of language models

Language: Python | License: Apache-2.0 | 000

xgboost

Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Dask, Flink and DataFlow

Language: C++ | License: Apache-2.0 | 000