Chong Chen (chenchongthu)

chenchongthu

Geek Repo

Company:Tsinghua University

Home Page:https://chenchongthu.github.io

Github PK Tool:Github PK Tool

Chong Chen's starred repositories

Chinese-CLIP

Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.

Language:PythonLicense:MITStargazers:3829Issues:0Issues:0

T2Ranking

T2Ranking: A large-scale Chinese benchmark for passage ranking.

Language:PythonStargazers:138Issues:0Issues:0

Alpaca-CoT

We unified the interfaces of instruction-tuning data (e.g., CoT data), multiple LLMs and parameter-efficient methods (e.g., lora, p-tuning) together for easy use. We welcome open-source enthusiasts to initiate any meaningful PR on this repo and integrate as many LLM related technologies as possible. 我们打造了方便研究人员上手和使用大模型等微调平台,我们欢迎开源爱好者发起任何有意义的pr!

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:2502Issues:0Issues:0

ce_pretrain

预训练中英文混合bert模型

Language:PythonStargazers:1Issues:0Issues:0

zuowen-dataset-pt1

:paper: 作文数据集 - 第 1 部分

Stargazers:11Issues:0Issues:0

colbert

colbert for dense retrieval, including multi view version, dureader-retrieval as an example

Language:PythonLicense:Apache-2.0Stargazers:6Issues:0Issues:0

OpenMatch

An Open-Source Package for Information Retrieval

Language:PythonLicense:MITStargazers:140Issues:0Issues:0

haystack-search-engine

A Semantic Search Engine Built on Arxiv dataset from Kaggle.

Language:Jupyter NotebookStargazers:7Issues:0Issues:0

haystack

:mag: LLM orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. With advanced retrieval methods, it's best suited for building RAG, question answering, semantic search or conversational agent chatbots.

Language:PythonLicense:Apache-2.0Stargazers:14257Issues:0Issues:0

RetroMAE

Codebase for RetroMAE and beyond.

Language:PythonLicense:Apache-2.0Stargazers:209Issues:0Issues:0

contriever

Contriever: Unsupervised Dense Information Retrieval with Contrastive Learning

Language:PythonLicense:NOASSERTIONStargazers:623Issues:0Issues:0

Fengshenbang-LM

Fengshenbang-LM(封神榜大模型)是IDEA研究院认知计算与自然语言研究中心主导的大模型开源体系,成为中文AIGC和认知智能的基础设施。

Language:PythonLicense:Apache-2.0Stargazers:3936Issues:0Issues:0

awesome-machine-unlearning

Awesome Machine Unlearning (A Survey of Machine Unlearning)

Language:Jupyter NotebookLicense:MITStargazers:630Issues:0Issues:0

LEVEN

Source code and dataset for ACL2022 Findings Paper "LEVEN: A Large-Scale Chinese Legal Event Detection dataset"

Language:PythonStargazers:103Issues:0Issues:0

BERT-PLI

bert-pli应用于LeCaRD

Language:PythonStargazers:14Issues:0Issues:0
Language:Jupyter NotebookStargazers:4Issues:0Issues:0

LegalPLMs

Source code and checkpoints for legal pre-trained language models.

Language:PythonStargazers:166Issues:0Issues:0

WebTable

A python package that takes tables from a web page and processes them to get high quality tables

Language:PythonLicense:GPL-3.0Stargazers:52Issues:0Issues:0
Language:PythonLicense:MITStargazers:283Issues:0Issues:0

LeCaRD

A Chinese legal case retrieval dataset.

Language:PythonLicense:MITStargazers:110Issues:0Issues:0

DirectAU

KDD'2022: Towards Representation Alignment and Uniformity in Collaborative Filtering

Language:PythonLicense:MITStargazers:64Issues:0Issues:0

PromptPapers

Must-read papers on prompt-based tuning for pre-trained language models.

Stargazers:3966Issues:0Issues:0

XAI

Towards Explainable Artificial Intelligence

Stargazers:5Issues:0Issues:0

Human-XAI

Human-centered Explainable AI

Stargazers:5Issues:0Issues:0
Language:PythonStargazers:35Issues:0Issues:0

thuthesis

LaTeX Thesis Template for Tsinghua University

Language:TeXLicense:LPPL-1.3cStargazers:4459Issues:0Issues:0

U-GCN

Source code of "NeurIPS21 - Universal Graph Convolutional Networks"

Language:PythonStargazers:19Issues:0Issues:0
Language:PythonStargazers:13Issues:0Issues:0

isvd

Official implementation of NeurIPS'21: Implicit SVD for Graph Representation Learning

Language:PythonLicense:CC0-1.0Stargazers:19Issues:0Issues:0
Language:PythonStargazers:28Issues:0Issues:0