Dave Morrissey (mcyph)

Location: Melbourne

Dave Morrissey's starred repositories

langchain-kr

A Korean-language tutorial based on the official LangChain documentation, the Cookbook, and other practical examples. Through this tutorial, you can learn how to use LangChain more easily and effectively.

Language: Jupyter Notebook | Stargazers: 785 | Issues: 0

top-github-repositories-which-everyone-should-look

This repository contains a list of important and useful GitHub repos that a developer, coder, or student should not miss.

License: MIT | Stargazers: 415 | Issues: 0

Language: Python | License: MIT | Stargazers: 392 | Issues: 0

fabric

fabric is an open-source framework for augmenting humans using AI. It provides a modular framework for solving specific problems using a crowdsourced set of AI prompts that can be used anywhere.

Language: Python | License: MIT | Stargazers: 18833 | Issues: 0

awesome-generative-ai-guide

A one stop repository for generative AI research updates, interview resources, notebooks and much more!

License: MIT | Stargazers: 66 | Issues: 0

TurkicTTS

A multilingual text-to-speech synthesis system for ten lower-resourced Turkic languages: Azerbaijani, Bashkir, Kazakh, Kyrgyz, Sakha, Tatar, Turkish, Turkmen, Uyghur, and Uzbek.

Language: Python | Stargazers: 42 | Issues: 0

wsd_gloss_bert

Making sense of words sense

Language: Jupyter Notebook | Stargazers: 1 | Issues: 0

es-hangul

A modern JavaScript library for handling Hangul characters.

Language: TypeScript | License: MIT | Stargazers: 1096 | Issues: 0

kan-gpt

The PyTorch implementation of Generative Pre-trained Transformers (GPTs) using Kolmogorov-Arnold Networks (KANs) for language modeling

Language: Python | License: MIT | Stargazers: 659 | Issues: 0

Taiwan-LLM

Traditional Mandarin LLMs for Taiwan

Language: Python | License: Apache-2.0 | Stargazers: 1132 | Issues: 0

minbpe

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

Language: Python | License: MIT | Stargazers: 8708 | Issues: 0

unilm

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Language: Python | License: MIT | Stargazers: 19124 | Issues: 0

gordicaleksa

GitHub's new feature: repo with the same name as your GitHub name initialized with README.md will show on your landing page!

Stargazers: 12 | Issues: 0

node-kortype

Korean/English keyboard-typing conversion in JS, with Hangul conversion for Windows/Mac

Language: TypeScript | License: MIT | Stargazers: 3 | Issues: 0

Most-powerful-NLP-library

Gemini, as capable as GPT-4, provides a free API with limited access. I tested it with the help of prompt engineering and found that it can solve almost any NLP task you want to tackle.

Language: Jupyter Notebook | Stargazers: 30 | Issues: 0

XNLG

AAAI-20 paper: Cross-Lingual Natural Language Generation via Pre-Training

Language: Python | Stargazers: 128 | Issues: 0

Language: Python | License: MIT | Stargazers: 36 | Issues: 0

Language: Python | License: GPL-3.0 | Stargazers: 8 | Issues: 0

scikit-cuda

Python interface to GPU-powered libraries

Language: Python | License: NOASSERTION | Stargazers: 974 | Issues: 0

CogNet

CogNet: a large-scale, high-quality cognate database for 338 languages, 1.07M words, and 8.1 million cognates

Stargazers: 40 | Issues: 0

knowledge_graph

Convert any text to a graph of knowledge. This can be used for Graph Augmented Generation or Knowledge Graph based QnA

Language: Jupyter Notebook | Stargazers: 1175 | Issues: 0

xling-eval

Code and resources for evaluating cross-lingual embedding spaces

Language: Python | Stargazers: 27 | Issues: 0

KISS-Korean-english-Idioms-in-Sentences-dataSet

KISS : Korean-english Idioms in Sentences dataSet

Stargazers: 5 | Issues: 0

idioms

Work for idiom translation project

Language: Python | Stargazers: 1 | Issues: 0

qdrant

Qdrant - High-performance, massive-scale Vector Database for the next generation of AI. Also available in the cloud https://cloud.qdrant.io/

Language: Rust | License: Apache-2.0 | Stargazers: 18817 | Issues: 0

Language: TeX | Stargazers: 4 | Issues: 0

lancedb

Developer-friendly, serverless vector database for AI applications. Easily add long-term memory to your LLM apps!

Language: Python | License: Apache-2.0 | Stargazers: 3534 | Issues: 0

DiskANN

Graph-structured Indices for Scalable, Fast, Fresh and Filtered Approximate Nearest Neighbor Search

Language: C++ | License: NOASSERTION | Stargazers: 962 | Issues: 0

myanmar-tokenizer

A Rule-based Syllable Segmentation of Myanmar Text

Language: Python | License: Apache-2.0 | Stargazers: 6 | Issues: 0

CourseraParallelCorpusMining

Coursera Corpus Mining and Multistage Fine-Tuning for Improving Lectures Translation

Language: Python | License: Apache-2.0 | Stargazers: 12 | Issues: 0