Soo (simonjisu)

simonjisu


Company: Seoul National University

Home Page: https://simonjisu.github.io



Organizations
DSS-MSY

Soo's starred repositories

NPEET

Non-parametric Entropy Estimation Toolbox

Language: Python | License: MIT | Stargazers: 348 | Issues: 0

loguru

Python logging made (stupidly) simple

Language: Python | License: MIT | Stargazers: 18581 | Issues: 0

CRAG

Corrective Retrieval Augmented Generation

Language: Python | Stargazers: 230 | Issues: 0

QuoteSum

QuoteSum is a textual QA dataset containing Semi-Extractive Multi-source Question Answering (SEMQA) examples written by humans, based on Wikipedia passages.

Language: Python | License: CC-BY-SA-4.0 | Stargazers: 9 | Issues: 0

Probabilistic_ML

Material for the "Probabilistic Machine Learning" Course at the University of Tübingen, Summer Term 2023

Stargazers: 105 | Issues: 0

awesome-self-supervised-learning-for-tabular-data

A collection of research materials on SSL for non-sequential tabular data (SSL4NSTD)

Stargazers: 128 | Issues: 0

Awesome-LLM

Awesome-LLM: a curated list of Large Language Models

License: CC0-1.0 | Stargazers: 15679 | Issues: 0

self-rag

This includes the original implementation of SELF-RAG: Learning to Retrieve, Generate and Critique through self-reflection by Akari Asai, Zeqiu Wu, Yizhong Wang, Avirup Sil, and Hannaneh Hajishirzi.

Language: Python | License: MIT | Stargazers: 1576 | Issues: 0

hllama

hllama is a library which aims to provide a set of utility tools for large language models.

Language: Python | License: Apache-2.0 | Stargazers: 10 | Issues: 0

langchain-kr

A Korean-language tutorial based on the official LangChain documentation, the Cookbook, and other practical examples. Through this tutorial, you can learn how to use LangChain more easily and effectively.

Language: Jupyter Notebook | Stargazers: 714 | Issues: 0

mmkb

Several data modalities for KBs (visual, numerical, temporal, etc.)

Language: Python | License: BSD-3-Clause | Stargazers: 359 | Issues: 0

BIG-bench

Beyond the Imitation Game collaborative benchmark for measuring and extrapolating the capabilities of language models

Language: Python | License: Apache-2.0 | Stargazers: 2730 | Issues: 0

LogicKor

A multi-domain reasoning benchmark for Korean language models

Language: Python | Stargazers: 113 | Issues: 0

llm_training_handbook

An open collection of methodologies to help with successful training of large language models.

Language: Python | License: CC-BY-SA-4.0 | Stargazers: 420 | Issues: 0

thefuzz

Fuzzy String Matching in Python

Language: Python | License: MIT | Stargazers: 2591 | Issues: 0

prompt-tuning

Original Implementation of Prompt Tuning from Lester, et al, 2021

Language: Python | License: Apache-2.0 | Stargazers: 632 | Issues: 0

Awesome-Text2SQL

Curated tutorials and resources for Large Language Models, Text2SQL, Text2DSL, Text2API, Text2Vis and more.

License: MIT | Stargazers: 1250 | Issues: 0

entity-recognition-datasets

A collection of corpora for named entity recognition (NER) and entity recognition tasks. These annotated datasets cover a variety of languages, domains and entity types.

Language: Python | License: MIT | Stargazers: 1457 | Issues: 0

Language: HTML | Stargazers: 97 | Issues: 0

sentence-transformers

Multilingual Sentence & Image Embeddings with BERT

Language: Python | License: Apache-2.0 | Stargazers: 14260 | Issues: 0

Awesome-LLM-Reasoning

Reasoning in Large Language Models: Papers and Resources, including Chain-of-Thought, Instruction-Tuning and Multimodality.

License: MIT | Stargazers: 1218 | Issues: 0

sec-insights

A real world full-stack application using LlamaIndex

Language: TypeScript | License: MIT | Stargazers: 2179 | Issues: 0

ragas

Evaluation framework for your Retrieval Augmented Generation (RAG) pipelines

Language: Python | License: Apache-2.0 | Stargazers: 5465 | Issues: 0

LLaMA2-Accessory

An Open-source Toolkit for LLM Development

Language: Python | License: NOASSERTION | Stargazers: 2593 | Issues: 0

ML-Papers-Explained

Explanation to key concepts in ML

Stargazers: 6748 | Issues: 0

FinQA

Data and code for EMNLP 2021 paper "FinQA: A Dataset of Numerical Reasoning over Financial Data"

Language: Python | License: MIT | Stargazers: 209 | Issues: 0

vectra

Vectra is a local vector database for Node.js with features similar to pinecone but built using local files.

Language: TypeScript | License: MIT | Stargazers: 317 | Issues: 0

GenAI_LLM_timeline

ChatGPT, GenerativeAI and LLMs Timeline

Stargazers: 933 | Issues: 0

FinanceDatabase

This is a database of 300.000+ symbols containing Equities, ETFs, Funds, Indices, Currencies, Cryptocurrencies and Money Markets.

Language: Jupyter Notebook | License: MIT | Stargazers: 3011 | Issues: 0