stdKonjac

stdKonjac

Geek Repo

Company:Tsinghua University

Location:Shenzhen, Guangdong, China

Home Page:https://www.stdkonjac.icu/

Twitter:@stdKonjac

Github PK Tool:Github PK Tool

stdKonjac's starred repositories

gpt-2

Code for the paper "Language Models are Unsupervised Multitask Learners"

Language:PythonLicense:NOASSERTIONStargazers:21455Issues:635Issues:260

openai-python

The official Python library for the OpenAI API

Language:PythonLicense:Apache-2.0Stargazers:20717Issues:282Issues:685

peft

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Language:PythonLicense:Apache-2.0Stargazers:14550Issues:109Issues:925

DiT

Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"

Language:PythonLicense:NOASSERTIONStargazers:5417Issues:46Issues:73

Anima

33B Chinese LLM, DPO QLORA, 100K context, AirLLM 70B inference with single 4GB GPU

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:3402Issues:98Issues:131

opencompass

OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.

Language:PythonLicense:Apache-2.0Stargazers:2949Issues:21Issues:373

self-rag

This includes the original implementation of SELF-RAG: Learning to Retrieve, Generate and Critique through self-reflection by Akari Asai, Zeqiu Wu, Yizhong Wang, Avirup Sil, and Hannaneh Hajishirzi.

Language:PythonLicense:MITStargazers:1543Issues:15Issues:73

AnglE

Train and Infer Powerful Sentence Embeddings with AnglE | 🔥 SOTA on STS and MTEB Leaderboard

Language:PythonLicense:MITStargazers:382Issues:8Issues:40
Language:PythonLicense:NOASSERTIONStargazers:347Issues:21Issues:10

DreamLLM

[ICLR 2024 Spotlight] DreamLLM: Synergistic Multimodal Comprehension and Creation

Language:PythonLicense:Apache-2.0Stargazers:333Issues:18Issues:18

AutoCompressors

[EMNLP 2023] Adapting Language Models to Compress Long Contexts

visdial

[CVPR 2017] Torch code for Visual Dialog

Language:LuaLicense:NOASSERTIONStargazers:227Issues:18Issues:29

Owl

A Large Language Model for IT Operations

Language:PythonLicense:Apache-2.0Stargazers:219Issues:12Issues:4

visdial-challenge-starter-pytorch

Starter code in PyTorch for the Visual Dialog challenge

Language:PythonLicense:BSD-3-ClauseStargazers:194Issues:12Issues:28

SeViLA

[NeurIPS 2023] Self-Chained Image-Language Model for Video Localization and Question Answering

Language:PythonLicense:BSD-3-ClauseStargazers:166Issues:3Issues:24

llama2-lora-fine-tuning

llama2 finetuning with deepspeed and lora

Language:PythonLicense:MITStargazers:152Issues:3Issues:15

WSDM-Cup-2024

1st Solution For Conversational Multi-Doc QA Workshop & International Challenge @ WSDM'24 - Xiaohongshu.Inc

Retrieval-Augmented-Visual-Question-Answering

This is the official repository for Retrieval Augmented Visual Question Answering

Language:PythonLicense:GPL-3.0Stargazers:117Issues:4Issues:38

NExT-QA

NExT-QA: Next Phase of Question-Answering to Explaining Temporal Actions (CVPR'21)

Language:PythonLicense:MITStargazers:104Issues:2Issues:27

recomp

RECOMP: Improving Retrieval-Augmented LMs with Compression and Selective Augmentation.

Language:PythonLicense:MITStargazers:52Issues:4Issues:7

MAC

Online Adaptation of Language Models with a Memory of Amortized Contexts

Language:PythonLicense:MITStargazers:38Issues:4Issues:1

ADSampling

[SIGMOD 2023] High-Dimensional Approximate Nearest Neighbor Search: with Reliable and Efficient Distance Comparison Operations

MM23-MISSRec

The code for the paper "MISSRec: Pre-training and Transferring Multi-modal Interest-aware Sequence Representation for Recommendation" (ACM MM'23).

NExT-OE

NExT-QA: Next Phase of Question-Answering to Explaining Temporal Actions (CVPR'21)

Language:PythonLicense:MITStargazers:21Issues:2Issues:3

ReMuQ

a multimodal retrieval dataset

Language:Jupyter NotebookStargazers:18Issues:1Issues:2