timeflies99's starred repositories

how-to-train-tokenizer

How to train an LLM tokenizer

Language: Python | Stargazers: 121 | Issues: 0

LeetcodeTop

A collection of the high-frequency LeetCode problems most often asked in interviews at major internet companies 🔥

Stargazers: 18491 | Issues: 0

Interview

Interview = resume guide + algorithm problems + standard interview questions (八股文) + source code analysis

Language: Jupyter Notebook | License: NOASSERTION | Stargazers: 8645 | Issues: 0

CS-Notes

:books: Essential fundamentals for technical interviews: Leetcode, operating systems, computer networks, system design

Stargazers: 174347 | Issues: 0

SeaLLMs

[ACL 2024 Demo] SeaLLMs - Large Language Models for Southeast Asia

Language: JavaScript | Stargazers: 139 | Issues: 0

Llama-Chinese

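Llama Chinese community: the Llama3 online demo and fine-tuned models are now available, and the latest Llama3 learning resources are compiled in real time. All code has been updated for Llama3. The goal is to build the best Chinese Llama large model, fully open source and available for commercial use.

Language: Python | Stargazers: 13457 | Issues: 0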

Firefly

Firefly: a large-model training tool that supports training Qwen2, Yi1.5, Phi-3, Llama3, Gemma, MiniCPM, Yi, Deepseek, Orion, Xverse, Mixtral-8x7B, Zephyr, Mistral, Baichuan2, Llama2, Llama, Qwen, Baichuan, ChatGLM2, InternLM, Ziya2, Vicuna, Bloom, and other large models

Language: Python | Stargazers: 5526 | Issues: 0

vocab-coverage

Analysis of language models' Chinese-language recognition ability

Language: Python | License: Apache-2.0 | Stargazers: 232 | Issues: 0

PytorchNetHub

Project annotations + paper reproductions + algorithm competitions + PyTorch practice

Language: Jupyter Notebook | License: MIT | Stargazers: 612 | Issues: 0

Dive-into-DL-PyTorch

This project converts the MXNet implementations in the original book Dive into Deep Learning (《动手学深度学习》) to PyTorch.

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 18120 | Issues: 0

CleanTransformer

an implementation of transformer, bert, gpt, and diffusion models for learning purposes

Language: Python | License: MIT | Stargazers: 139 | Issues: 0

llm-universe

This project is a large-model application development tutorial for beginner developers. Read it online at: https://datawhalechina.github.io/llm-universe/

Language: Jupyter Notebook | Stargazers: 4169 | Issues: 0

pytorch-widedeep

A flexible package for multimodal-deep-learning to combine tabular data with text and images using Wide and Deep models in Pytorch

Language: Python | License: Apache-2.0 | Stargazers: 1273 | Issues: 0

Whisper-Finetune

Fine-tune the Whisper speech recognition model to support training without timestamp data, training with timestamp data, and training without speech data. Accelerate inference and support Web deployment, Windows desktop deployment, and Android deployment

Language: C | License: Apache-2.0 | Stargazers: 786 | Issues: 0

stable-diffusion

A latent text-to-image diffusion model

Language: Jupyter Notebook | License: NOASSERTION | Stargazers: 67189 | Issues: 0

CLIP

CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image

Language: Jupyter Notebook | License: MIT | Stargazers: 24292 | Issues: 0

InternVL

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. An open-source multimodal chat model with performance approaching GPT-4o.

Language: Python | License: MIT | Stargazers: 5148 | Issues: 0

cudf

cuDF - GPU DataFrame Library

Language: C++ | License: Apache-2.0 | Stargazers: 8160 | Issues: 0

RWKV-LM

RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best of RNN and transformer - great performance, fast inference, saves VRAM, fast training, "infinite" ctx_len, and free sentence embedding.

Language: Python | License: Apache-2.0 | Stargazers: 12130 | Issues: 0

audiolm-pytorch

Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in Pytorch

Language: Python | License: MIT | Stargazers: 2353 | Issues: 0

whisper-jax

JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU.

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 4316 | Issues: 0

fairseq

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Language: Python | License: MIT | Stargazers: 30043 | Issues: 0

whisper.cpp

Port of OpenAI's Whisper model in C/C++

Language: C | License: MIT | Stargazers: 33921 | Issues: 0

promptbase

All things prompt engineering

Language: Python | License: MIT | Stargazers: 5315 | Issues: 0

mall

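The mall project is an e-commerce system comprising a storefront and an admin back end, built on SpringBoot + MyBatis and deployed in Docker containers. The storefront includes modules for the home portal, product recommendations, product search, product display, shopping cart, order flow, member center, customer service, and help center. The admin back end includes modules for product management, order management, member management, promotion management, operations management, content management, statistical reports, financial management, permission management, and settings.

Language: Java | License: Apache-2.0 | Stargazers: 77017 | Issues: 0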

objectdetection_script

Code for some ideas on improving object detection scripts; see readme.md for details

Language: Python | Stargazers: 4960 | Issues: 0

Awesome-CVPR2024-CVPR2021-CVPR2020-Low-Level-Vision

A Collection of Papers and Codes for CVPR2024/CVPR2021/CVPR2020 Low Level Vision

Stargazers: 915 | Issues: 0

gcForest

This is the official implementation for the paper 'Deep forest: Towards an alternative to deep neural networks'

Language: Python | Stargazers: 1308 | Issues: 0

json-logging-python

Cloud-native distributed Python logging library to emit JSON log that can be easily indexed by logging infrastructure

Language: Python | License: Apache-2.0 | Stargazers: 301 | Issues: 0

openai

OpenAI .NET sdk - Azure OpenAI, ChatGPT, Whisper, and DALL-E

Language: C# | License: MIT | Stargazers: 2876 | Issues: 0