jaykay233

jaykay233

Geek Repo

Company:Bytedance

Location:China

Home Page:https://blog.nowcoder.net/zyx233

Github PK Tool:Github PK Tool

jaykay233's starred repositories

Language:TypeScriptLicense:MITStargazers:490Issues:0Issues:0

infinity

The AI-native database built for LLM applications, providing incredibly fast hybrid search of dense vector, sparse vector, tensor (multi-vector), and full-text

Language:C++License:Apache-2.0Stargazers:2364Issues:0Issues:0

json_repair

A python module to repair invalid JSON, commonly used to parse the output of LLMs

Language:PythonLicense:MITStargazers:621Issues:0Issues:0

Pix2Text

An Open-Source Python3 tool for recognizing layouts, tables, math formulas (LaTeX), and text in images, converting them into Markdown format. A free alternative to Mathpix, empowering seamless conversion of visual content into text-based representations. 80+ languages are supported.

Language:Jupyter NotebookLicense:MITStargazers:1717Issues:0Issues:0

PDF-Extract-Kit

A Comprehensive Toolkit for High-Quality PDF Content Extraction

Language:PythonLicense:Apache-2.0Stargazers:4379Issues:0Issues:0

Langchain-Chatchat

Langchain-Chatchat(原Langchain-ChatGLM)基于 Langchain 与 ChatGLM, Qwen 与 Llama 等语言模型的 RAG 与 Agent 应用 | Langchain-Chatchat (formerly langchain-ChatGLM), local knowledge based LLM (like ChatGLM, Qwen and Llama) RAG and Agent app with langchain

Language:TypeScriptLicense:Apache-2.0Stargazers:30833Issues:0Issues:0

feffery-antd-components

Dash + Ant Design = 😍

Language:JavaScriptLicense:MITStargazers:289Issues:0Issues:0

MinerU

A one-stop, open-source, high-quality data extraction tool, supports PDF/webpage/e-book extraction.一站式开源高质量数据提取工具,支持PDF/网页/多格式电子书提取。

Language:PythonLicense:AGPL-3.0Stargazers:9840Issues:0Issues:0

yao

:rocket: A performance app engine to create web services and applications in minutes.Suitable for AI, IoT, Industrial Internet, Connected Vehicles, DevOps, Energy, Finance and many other use-cases.

Language:GoLicense:Apache-2.0Stargazers:7107Issues:0Issues:0

Image-Downloader

Download images from Google, Bing, Baidu. 谷歌、百度、必应图片下载.

Language:PythonLicense:MITStargazers:2177Issues:0Issues:0

markdowner

A fast tool to convert any website into LLM-ready markdown data. Built by https://supermemory.ai

Language:TypeScriptLicense:MITStargazers:720Issues:0Issues:0

spider-flow

新一代爬虫平台,以图形化方式定义爬虫流程,不写代码即可完成爬虫。

Language:JavaLicense:MITStargazers:9426Issues:0Issues:0

agents

An Open-source Framework for Data-centric, Self-evolving Autonomous Language Agents

Language:PythonLicense:Apache-2.0Stargazers:5118Issues:0Issues:0

wagtail

A Django content management system focused on flexibility and user experience

Language:PythonLicense:BSD-3-ClauseStargazers:17794Issues:0Issues:0

FlexFlow

FlexFlow Serve: Low-Latency, High-Performance LLM Serving

Language:C++License:Apache-2.0Stargazers:1622Issues:0Issues:0

Hypernets

A General Automated Machine Learning framework to simplify the development of End-to-end AutoML toolkits in specific domains.

Language:PythonLicense:Apache-2.0Stargazers:261Issues:0Issues:0

mesh

Mesh TensorFlow: Model Parallelism Made Easier

Language:PythonLicense:Apache-2.0Stargazers:1576Issues:0Issues:0

wiseflow

Wiseflow is an agile information mining tool that extracts concise messages from various sources such as websites, WeChat official accounts, social platforms, etc. It automatically categorizes and uploads them to the database.

Language:JavaScriptLicense:NOASSERTIONStargazers:3278Issues:0Issues:0

pdftochat

Chat with your PDFs with AI

Language:TypeScriptLicense:MITStargazers:947Issues:0Issues:0

memfree

MemFree - Hybrid AI Search Engine

Language:TypeScriptLicense:MITStargazers:245Issues:0Issues:0

wukong-recommendation

Implements the paper "Wukong: Towards a Scaling Law for Large-Scale Recommendation" from Meta.

Language:PythonLicense:MITStargazers:34Issues:0Issues:0

beir

A Heterogeneous Benchmark for Information Retrieval. Easy to use, evaluate your models across 15+ diverse IR datasets.

Language:PythonLicense:Apache-2.0Stargazers:1532Issues:0Issues:0

cube-studio

cube studio开源云原生一站式机器学习/深度学习/大模型AI平台,支持sso登录,多租户,大数据平台对接,notebook在线开发,拖拉拽任务流pipeline编排,多机多卡分布式训练,超参搜索,推理服务VGPU,边缘计算,serverless,标注平台,自动化标注,数据集管理,大模型微调,vllm大模型推理,llmops,私有知识库,AI模型应用商店,支持模型一键开发/推理/微调,支持国产cpu/gpu/npu芯片,支持RDMA,支持pytorch/tf/mxnet/deepspeed/paddle/colossalai/horovod/spark/ray/volcano分布式

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:3389Issues:0Issues:0

GPT_API_free

Free ChatGPT API Key,免费ChatGPT API,支持GPT4 API(免费),ChatGPT国内可用免费转发API,直连无需代理。可以搭配ChatBox等软件/插件使用,极大降低接口使用成本。国内即可无限制畅快聊天。

Language:PythonLicense:MITStargazers:20366Issues:0Issues:0

Awesome-LLM-Compression

Awesome LLM compression research papers and tools.

License:MITStargazers:986Issues:0Issues:0

Awesome-LLM-Inference

📖A curated list of Awesome LLM Inference Paper with codes, TensorRT-LLM, vLLM, streaming-llm, AWQ, SmoothQuant, WINT8/4, Continuous Batching, FlashAttention, PagedAttention etc.

License:GPL-3.0Stargazers:2294Issues:0Issues:0

LargeBatchCTR

Large batch training of CTR models based on DeepCTR with CowClip.

Language:PythonLicense:Apache-2.0Stargazers:161Issues:0Issues:0

dspy

DSPy: The framework for programming—not prompting—foundation models

Language:PythonLicense:MITStargazers:16054Issues:0Issues:0

LumberChunker

This repository presents the original implementation of LumberChunker: Long-Form Narrative Document Segmentation by André V. Duarte, João Marques, Miguel Graça, Miguel Freire, Lei Li and Arlindo L. Oliveira (under review for EMNLP 2024)

Language:Jupyter NotebookStargazers:22Issues:0Issues:0

SpeculativeDecodingPapers

📰 Must-read papers and blogs on Speculative Decoding ⚡️

License:Apache-2.0Stargazers:322Issues:0Issues:0