EunCheolChoi0123's starred repositories

unsloth

Finetune Llama 3.1, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory

Language:PythonLicense:Apache-2.0Stargazers:13863Issues:0Issues:0

elasticsearch-labs

Notebooks & Example Apps for Search & AI Applications with Elasticsearch

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:532Issues:0Issues:0

xsv

A fast CSV command line toolkit written in Rust.

Language:RustLicense:UnlicenseStargazers:10250Issues:0Issues:0

vtuber-livechat-dataset

📊 VTuber 1B: Billion-scale Live Chat and Moderation Event Dataset

Language:PythonLicense:MITStargazers:74Issues:0Issues:0

TM_graph

Topic Modeling Web Graph

Language:PythonStargazers:4Issues:0Issues:0

wannadb

WannaDB: Ad-hoc SQL Queries over Text Collections

Language:PythonLicense:NOASSERTIONStargazers:6Issues:0Issues:0

box2go

Code & information on the Box2Go project, published at The Web Conf 2024

Stargazers:1Issues:0Issues:0

llm_for_css

Guide on how to use LLMs for computational social science research

Language:PythonLicense:MITStargazers:21Issues:0Issues:0

llama.cpp

LLM inference in C/C++

Language:C++License:MITStargazers:63124Issues:0Issues:0
Language:PythonStargazers:1Issues:0Issues:0

llm.c

LLM training in simple, raw C/CUDA

Language:CudaLicense:MITStargazers:22499Issues:0Issues:0

arctic_shift

Making Reddit data accessible to researchers, moderators and everyone else. Interact with the data through large dumps, an API or web interface.

Language:TypeScriptStargazers:210Issues:0Issues:0

css-selector-tool

A low-code data extractor for websites with built in proxy and parsing capabilities. Great for testing and debugging css selectors

Language:JavaScriptStargazers:117Issues:0Issues:0

LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Language:PythonLicense:Apache-2.0Stargazers:18567Issues:0Issues:0

TikTokScraperScripts

Scripts to extract video and comments metadata from TikTok.

Language:PythonStargazers:9Issues:0Issues:0

detoxify

Trained models & code to predict toxic comments on all 3 Jigsaw Toxic Comment Challenges. Built using ⚡ Pytorch Lightning and 🤗 Transformers. For access to our API, please email us at contact@unitary.ai.

Language:PythonLicense:Apache-2.0Stargazers:900Issues:0Issues:0
Language:ShellLicense:CC0-1.0Stargazers:1Issues:0Issues:0

moondream

tiny vision language model

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:4683Issues:0Issues:0

Parrot_Paraphraser

A practical and feature-rich paraphrasing framework to augment human intents in text form to build robust NLU models for conversational engines. Created by Prithiviraj Damodaran. Open to pull requests and other forms of collaboration.

Language:PythonLicense:Apache-2.0Stargazers:866Issues:0Issues:0

COVID-19-TweetIDs

The repository contains an ongoing collection of tweets IDs associated with the novel coronavirus COVID-19 (SARS-CoV-2), which commenced on January 28, 2020.

Language:PythonLicense:NOASSERTIONStargazers:714Issues:0Issues:0

LLaMA-Factory

A WebUI for Efficient Fine-Tuning of 100+ LLMs (ACL 2024)

Language:PythonLicense:Apache-2.0Stargazers:28527Issues:0Issues:0
Language:Jupyter NotebookStargazers:2Issues:0Issues:0