koalazf99

Fan's repositories

tacube

The data and code for EMNLP 2022 paper "TaCube: Pre-computing Data Cubes for Answering Numerical-Reasoning Questions over Tabular Data"

16 4 1

CS385Projects

Independent Projects for SJTU CS385

Language: Python | 4 20

amber-train

Pre-training code for Amber 7B LLM

Language: Python | Apache-2.0 | 0 0 0

awesome-llm-powered-agent

Awesome things about LLM-powered agents. Papers / Repos / Blogs / ...

MIT | 0 0 0

CodeQwen1.5

CodeQwen1.5 is the code version of Qwen, the large language model series developed by Qwen team, Alibaba Cloud.

Language: Python | 0 0 0

cs2916

Language: Python | 0 0 0

datasets

🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools

Language: Python | Apache-2.0 | 0 0 0

datatrove

Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.

Language: Python | Apache-2.0 | 0 0 0

dbt-test

Language: Python | 0 1 0

koalazf99.github.io

Personal Page

Language: SCSS | MIT | 0 2 0

LLM-Agent-Survey

0 0 0

openai-cookbook

Examples and guides for using the OpenAI API

Language: Jupyter Notebook | MIT | 0 0 0

dspy

DSPy: The framework for programming—not prompting—foundation models

MIT | 0 0 0

k2-train

Apache-2.0 | 0 0 0

LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Apache-2.0 | 0 0 0

open-interpreter

OpenAI's Code Interpreter in your terminal, running locally

MIT | 0 0 0

prismatic-vlms

A flexible and efficient codebase for training visually-conditioned language models (VLMs)

MIT | 0 0 0

sailcraft

Data Toolkit for Sailor Language Models

0 0 0

TinyLlama

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Language: Python | Apache-2.0 | 0 0 0