AI (jiangnanboy)

jiangnanboy

User data from Github https://github.com/jiangnanboy

Company:HaiJiaTech

Location:China

Home Page:https://jiangnanboy.github.io

GitHub:@jiangnanboy

AI's repositories

intelligent_medical

intelligent medical,智慧医疗,包括疾病搜索、相关推荐、疾病医疗问答以及智能疾病诊断等功能。

java-springboot-paddleocr

本项目利用java加载paddle-ocr的C++编译的exe文件,并利用springboot进行web部署访问。This project loads the C++ compiled version of paddle-ocr in java and makes use of springboot for web deployment.

jcorrector

jcorrector 中文文本纠错工具, Text Error Correction Tool,Spelling Check

Language:JavaLicense:Apache-2.0Stargazers:62Issues:1Issues:7

llm_corpus_quality

大模型预训练中文语料清洗及质量评估 Large model pre-training corpus cleaning

Doc-Image-Tool

文档图像处理工具(Document image processing tool),包括漂白 / 文字方向矫正 / 清晰增强 / 笔记去噪美化 / 去阴影 / 扭曲矫正 / 切边增强(DocBleach / TextOrientationCorrection / DocSharpening / HandwritingDenoisingBeautifying / DocShadowRemoval / document_image_dewarping / DocTrimmingEnhancement)。

Language:PythonLicense:MITStargazers:47Issues:1Issues:10

java-springboot-paddleocr-v2

本项目利用JNI加载paddle-ocr的C++编译的dll库,并利用springboot进行web部署访问。This project uses JNI to load the C++ compiled dll libraries of paddle-ocr, and uses springboot for web deployment

table_structure_recognition

利用Swin-Unet(Swin Transformer Unet)实现对文档图片里表格结构的识别,Swin-unet (Swin Transformer Unet) is used to identify the document table structure

Language:PythonLicense:MITStargazers:25Issues:1Issues:3

text_security_audit

text security audit 安全审核-语义模型过滤 敏感内容检测系统

llm_security

利用分类法和敏感词检测法对生成式大模型的输入和输出内容进行安全检测,尽早识别风险内容。The input and output contents of generative large model are checked by classification method and sensitive word detection method to identify content risk as early as possible.

Language:JavaLicense:MITStargazers:16Issues:1Issues:0

pdf_invoice_parser

pdf invoice parser,pdf-ofd发票解析。

Language:JavaStargazers:16Issues:2Issues:0

Image_KIE_LLM

利用llm大语言模型提取卡证票据关键信息。Key Information Extraction from Image with LLM(large language model).Basically, it can extract key information from all bills and documents.

Language:PythonLicense:MITStargazers:15Issues:2Issues:1

ad_detect_textcnn

textcnn for advertising detection,广告检测

Language:PythonStargazers:10Issues:1Issues:0

movie_llm_agent

langchain agent,chatglm,neo4j实现movie qa。

Language:PythonLicense:MITStargazers:10Issues:1Issues:2

pdf_multimodal_rag

pdf multimodal rag 【pdf多模态rag问答】

Language:PythonStargazers:7Issues:0Issues:0

DataFine

数据清洗,文本审核。DataFine mainly includes a number of data processing methods including rule cleaning, sensitive word filtering, advertisement filtering, de-duplication and sensitive content functions, providing safe and reliable data for the training of Chinese corpus.

Language:PythonStargazers:4Issues:0Issues:0

paper_read_note

论文阅读笔记,paper reading notes

License:Apache-2.0Stargazers:4Issues:1Issues:0

customer_support_assistant

customer support assistant是智能客服支持助手项目,利用LLM对Query的理解,去调用相应函数,实现智能客服功能。

Language:PythonStargazers:3Issues:0Issues:0

docimg_tool

复杂背景图像漂白,文字方向矫正,清晰增强,笔记去噪美化,去阴影,扭曲矫正,去黑点以及切边增强。complex background image bleaching, text direction correction, clarity enhancement, note to blur beautification, shadow removal, distortion correction, black spots removal and cutting edge enhancement。

jiangnanboy.github.io

80+个AI人工智能实践项目,欢迎大家使用,并提出批评意见。

Language:CSSStargazers:3Issues:1Issues:0

CPPCorrector

CPPCorrector 中文文本纠错工具, Text Error Correction Tool,Spelling Check

Language:C++License:Apache-2.0Stargazers:2Issues:1Issues:0

dbnet_crnn_java

java with dbnet, crnn for ocr.本项目利用java,javacv,onnx以及djl矩阵计算等技术加载文本检测模型dbnet与文本识别模型crnn,完成ocr的识别推理。

Language:JavaLicense:Apache-2.0Stargazers:2Issues:1Issues:0

llm_agent_math

This project uses chatglm6b to implement a Chinese version of arithmetic and reasoning function, aiming to explore the arithmetic and reasoning ability of llm agent.

Language:PythonStargazers:2Issues:1Issues:0

text_security_detection

text security detection

Language:JavaStargazers:2Issues:1Issues:0

ad_detection

advertising detection,广告检测

Language:JavaStargazers:1Issues:1Issues:0

bert_text_classification_onnx

bert text classification using onnx of(bert,albert,roberta,macbert and so on).

Language:JavaStargazers:1Issues:1Issues:0

chinese_offensive_language_detection_onnx

Chinese Offensive Language Detection using onnx model

Language:PythonStargazers:1Issues:1Issues:0

jiangnanboy

my github readme

pediatrics_llm_qa

Small model of pediatric consultation

Language:PythonLicense:MITStargazers:1Issues:1Issues:0

JiaJiaOCR

java ocr,this project implements ocr functionality entirely in java source code, without invoking dll or exe files(此项目完全用java源代码实现ocr功能,无需调用dll或者exe文件).

Stargazers:0Issues:1Issues:0

llm_dataset_generation

利用大模型LLM生成训练数据,Use LLM to generate training data

Language:PythonStargazers:0Issues:0Issues:0