zhaoxjmail's starred repositories
java-diff-utils
Diff Utils is an open-source library for comparison and diff operations between texts or other kinds of data: computing diffs, applying patches, generating or parsing unified diffs, producing diff output for later display (such as a side-by-side view), and so on.
llama3-from-scratch
llama3 implementation one matrix multiplication at a time
llama2-fine-tune
Scripts for fine-tuning Llama2 via SFT and DPO.
LLMs-from-scratch
Implementing a ChatGPT-like LLM in PyTorch from scratch, step by step
Linux-Malware-Samples
Linux Malware Sample Archive including various types of malicious ELF binaries and viruses. Be careful!
plain-text-table
A plain text table formatter
ExtracTable
Extract tables from Plain-Text Files.
Context-Free-Grammers-CFGs
A random sentence generator driven by a sample CFG rule set in Chomsky Normal Form (CNF), plus a CYK parser used as a recognizer that tells whether a given sentence is grammatically correct.
OntoNotes-5.0-NER-BIO
A BIO formatted Named Entity Recognition data set extracted from the OntoNotes 5.0 release.
pyimagesearch
Diving into PyImageSearch
spacy-stanza
💥 Use the latest Stanza (StanfordNLP) research models directly in spaCy
LayoutLMV3
This repo contains the code discussed in the Medium blog post.
Transformers-Tutorials
This repository contains demos I made with the Transformers library by HuggingFace.
Python-Natural-Language-Processing-Cookbook
Python Natural Language Processing Cookbook, published by Packt
FreeAskInternet
FreeAskInternet is a completely free, PRIVATE and LOCALLY running search aggregator and answer generator using MULTIPLE LLMs, with no GPU needed. The user asks a question, and the system runs a multi-engine search, passes the combined search results to an LLM, and generates an answer based on those results. It's all FREE to use.