kiminh

followers

following

stars

Ramsey's repositories

AlphaCLIP

Alpha-CLIP: A CLIP Model Focusing on Wherever You Want

Language:Jupyter NotebookApache-2.0000

BannerGen

Language:PythonApache-2.0000

Causal-Recommender-Systems

An index of causal inference based recommendation algorithms.

000

DiskANN

Graph-structured Indices for Scalable, Fast, Fresh and Filtered Approximate Nearest Neighbor Search

Language:C++NOASSERTION000

FlagEmbedding

Dense Retrieval and Retrieval-augmented LLMs

Language:PythonMIT000

ForestDiffusion

Generating and Imputing Tabular Data via Diffusion and Flow XGBoost Models

Language:Python000

GhostFaceNets

Language:PythonMIT000

gpt-4v-distribution-shift

Code for "How Well Does GPT-4V(ision) Adapt to Distribution Shifts? A Preliminary Investigation"

Language:Jupyter NotebookMIT000

hiclass

A python library for hierarchical classification compatible with scikit-learn

Language:PythonBSD-3-Clause000

hikyuu

Hikyuu Quant Framework 基于C++/Python的开源量化交易研究框架

Language:C++Apache-2.0000

imitater

Imitate OpenAI with Local Models

Language:PythonApache-2.0000

LaCLIP

[NeurIPS 2023] Text data, code and pre-trained models for paper "Improving CLIP Training with Language Rewrites"

Language:PythonBSD-2-Clause000

LayoutNUWA

Language:PythonMIT000

LLM-UM-Reading

A list of large language models for user modeling (LLM-UM) papers.

000

LLMLingua

To speed up LLMs' inference and enhance LLM's perceive of key information, compress the prompt and KV-Cache, which achieves up to 20x compression with minimal performance loss.

Language:PythonMIT000

Long-Context-Data-Engineering

Implementation of paper Data Engineering for Scaling Language Models to 128K Context

Language:Python000

LongContext_vs_RAG_NeedleInAHaystack

Comparing retrieval abilities from GPT4-Turbo and a RAG system on a toy example for various context lengths

Language:Jupyter Notebook000

LongLoRA

Code and documents of LongLoRA and LongAlpaca

Language:PythonApache-2.0000

metahuman-stream

Real time streaming digital human based on nerf

Language:PythonMIT000

NeuralNLP-NeuralClassifier

An Open-source Neural Hierarchical Multi-label Text Classification Toolkit

Language:PythonNOASSERTION000

OpenP5

OpenP5: An Open-Source Platform for Developing, Training, and Evaluating LLM-based Recommender Systems

Language:PythonApache-2.0000

operateGPT

🌟 Revolutionize Your Operations with One Sentence Automation: Utilizing large language models and Multi-Agents to generate operational copy, images, and videos with one-line requirements.

Language:PythonMIT000

pgbm

Probabilistic Gradient Boosting Machines

Language:PythonApache-2.0000

pika

Pika is a NoSQL database compatible with redis which is developed by Qihoo's infrastructure team.

Language:C++BSD-3-Clause000

PowerInfer

High-speed Large Language Model Serving on PCs with Consumer-grade GPUs

Language:CMIT000

robustness_metrics

Language:Jupyter NotebookApache-2.0000

Selective_Context

Compress your input to ChatGPT or other LLMs, to let them process 2x more content and save 40% memory and GPU time.

Language:Python000

sklearn-genetic

Genetic feature selection module for scikit-learn

Language:PythonLGPL-3.0000

TAG-Benchmark

Benchmark

Language:Python000

TravelPlanner

Dataset and code for the paper "TravelPlanner: A Benchmark for Real-World Planning with Language Agents"

Language:PythonMIT000