leododo's starred repositories
stanford_alpaca
Code and documentation to train Stanford's Alpaca models, and generate the data.
PaddleCloud
PaddlePaddle Docker images and K8s operators for PaddleOCR/Detection developers to use on public/private cloud.
haystack
:mag: LLM orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. With advanced retrieval methods, it's best suited for building RAG, question answering, semantic search or conversational agent chatbots.
self-instruct
Aligning pretrained language models with instruction data generated by themselves.
LawCrimeMining
Text mining of court judgments and criminal case texts, based on domain corpus construction and NLP content analysis.
P-tuning-v2
An optimized deep prompt tuning strategy comparable to fine-tuning across scales and tasks
ChuanhuChatGPT
GUI for ChatGPT API and many LLMs. Supports agents, file-based QA, GPT finetuning and query with web search. All with a neat UI.
LLaMA-Factory
Unify Efficient Fine-Tuning of 100+ LLMs
awesome-chinese-legal-resources
📝 An awesome collection of Chinese legal datasets and relevant resources, dedicated to gathering comprehensive Chinese legal data sources.
langchain-wenxin
LangChain wrapper for Baidu Wenxin Workshop.
getui-pushapi-java-client-v2
Official Getui server-side push SDK (Java), based on the new REST API V2 (https://docs.getui.com/getui/server/rest_v2/introduction/).
jpush-hbuilder-demo
Official JPush sample code for HBuilder, for quickly integrating the JPush SDK into an HBuilder project.
html2pdf.js
Client-side HTML-to-PDF rendering using pure JS.
huanhuan-chat
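Chat-甄嬛 (Chat-Zhen Huan) is a chat language model that imitates Zhen Huan's tone, obtained by LoRA fine-tuning ChatGLM2 on all of Zhen Huan's lines and utterances from the script of 《甄嬛传》 (Empresses in the Palace).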
duckduckgo
DuckDuckGo Instant Answer Infrastructure
duckduckgo_search
Search for words, documents, images, videos, news, maps and text translation using the DuckDuckGo.com search engine. Also downloads files and images to a local hard drive.
ChatGLM2-6B
ChatGLM2-6B: An Open Bilingual Chat LLM
Chinese-LLaMA-Alpaca-2
Chinese LLaMA-2 & Alpaca-2 LLMs, phase two of the project, with 64K long-context models.
company-crawler
Crawlers for Tianyancha and Qichacha that scrape company information for specified keywords.
fake-useragent
Up-to-date, simple user-agent faker with a real-world database.
proxy_pool
Python proxy pool for web spiders.