Charles Ching (win4r)

win4r

Geek Repo

Company:Fitneess king Company

Location:1600 Pennsylvania Ave NW, Washington, DC 20500, United States

Github PK Tool:Github PK Tool

Charles Ching's starred repositories

openai-realtime-console

React app for inspecting, building and debugging with the Realtime API

Language:JavaScriptLicense:MITStargazers:2059Issues:0Issues:0

ultravox

A fast multimodal LLM for real-time voice

Language:PythonLicense:MITStargazers:1105Issues:0Issues:0

pocketpal-ai

An app that brings language models directly to your phone.

Language:TypeScriptLicense:MITStargazers:1077Issues:0Issues:0

InternVL

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型

Language:PythonLicense:MITStargazers:6009Issues:0Issues:0

LongWriter

LongWriter: Unleashing 10,000+ Word Generation from Long Context LLMs

Language:PythonLicense:Apache-2.0Stargazers:1476Issues:0Issues:0

koboldcpp

Run GGUF models easily with a KoboldAI UI. One File. Zero Install.

Language:C++License:AGPL-3.0Stargazers:5264Issues:0Issues:0

TensorRT-LLM

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.

Language:C++License:Apache-2.0Stargazers:8659Issues:0Issues:0

web-llm

High-performance In-browser LLM Inference Engine

Language:TypeScriptLicense:Apache-2.0Stargazers:13617Issues:0Issues:0

lmdeploy

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

Language:PythonLicense:Apache-2.0Stargazers:4642Issues:0Issues:0

mistral.rs

Blazingly fast LLM inference.

Language:RustLicense:MITStargazers:4452Issues:0Issues:0

Vision-language-models-VLM

vision language models finetuning notebooks & use cases (paligemma - florence .....)

Language:Jupyter NotebookStargazers:4Issues:0Issues:0

DB-GPT-Hub

A repository that contains models, datasets, and fine-tuning techniques for DB-GPT, with the purpose of enhancing model performance in Text-to-SQL

Language:PythonLicense:MITStargazers:1444Issues:0Issues:0

notebooks

Collection of notebook guides created by the Brev.dev team!

Language:Jupyter NotebookLicense:MITStargazers:1667Issues:0Issues:0

VARAG

Vision-Augmented Retrieval and Generation (VARAG) - Vision first RAG Engine

Language:PythonStargazers:349Issues:0Issues:0

onnx

Open standard for machine learning interoperability

Language:PythonLicense:Apache-2.0Stargazers:17927Issues:0Issues:0

copa

CoPa: LLM Prompting Templating

Language:TypeScriptLicense:MITStargazers:26Issues:0Issues:0

dataloader

DataLoader is a generic utility to be used as part of your application's data fetching layer to provide a consistent API over various backends and reduce requests to those backends via batching and caching.

Language:JavaScriptLicense:MITStargazers:12895Issues:0Issues:0

datasetGPT

A command-line interface to generate textual and conversational datasets with LLMs.

Language:PythonStargazers:293Issues:0Issues:0

NotionNext

使用 NextJS + Notion API 实现的,支持多种部署方案的静态博客,无需服务器、零门槛搭建网站,为Notion和所有创作者设计。 (A static blog built with NextJS and Notion API, supporting multiple deployment options. No server required, zero threshold to set up a website. Designed for Notion and all creators.)

Language:JavaScriptLicense:MITStargazers:7875Issues:0Issues:0

llama-stack

Composable building blocks to build Llama Apps

Language:PythonLicense:MITStargazers:4566Issues:0Issues:0

harbor

Effortlessly run LLM backends, APIs, frontends, and services with one command.

Language:TypeScriptLicense:Apache-2.0Stargazers:505Issues:0Issues:0

agenta

The all-in-one LLM developer platform: prompt management, evaluation, human feedback, and deployment all in one place.

Language:PythonLicense:MITStargazers:1274Issues:0Issues:0

FlashRAG

⚡FlashRAG: A Python Toolkit for Efficient RAG Research

Language:PythonLicense:MITStargazers:1324Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:6731Issues:0Issues:0

GRIN-MoE

GRadient-INformed MoE

License:NOASSERTIONStargazers:258Issues:0Issues:0

Qwen2.5

Qwen2.5 is the large language model series developed by Qwen team, Alibaba Cloud.

Language:ShellStargazers:9613Issues:0Issues:0

Controllable-RAG-Agent

This repository provides an advanced Retrieval-Augmented Generation (RAG) solution for complex question answering. It uses sophisticated graph based algorithm to handle the tasks.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:815Issues:0Issues:0

GenAI_Agents

This repository provides tutorials and implementations for various Generative AI Agent techniques, from basic to advanced. It serves as a comprehensive guide for building intelligent, interactive AI systems.

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:4097Issues:0Issues:0

auto-cot

Official implementation for "Automatic Chain of Thought Prompting in Large Language Models" (stay tuned & more will be updated)

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:1547Issues:0Issues:0

tree-of-thoughts

Plug in and Play Implementation of Tree of Thoughts: Deliberate Problem Solving with Large Language Models that Elevates Model Reasoning by atleast 70%

Language:PythonLicense:Apache-2.0Stargazers:4344Issues:0Issues:0