Andrei Paraschiv's starred repositories
sentence-transformers
Multilingual Sentence & Image Embeddings with BERT
LibreChat
Enhanced ChatGPT Clone: Features OpenAI, Assistants API, Azure, Groq, GPT-4 Vision, Mistral, Bing, Anthropic, OpenRouter, Vertex AI, Gemini, AI model switching, message search, langchain, DALL-E-3, ChatGPT Plugins, OpenAI Functions, Secure Multi-User System, Presets, completely open-source for self-hosting. More features in development
romanian-nlp-datasets
A list of Romanian NLP Datasets
article-extraction-dataset
Article title, authors, date and body extraction dataset.
newspaper4k
đź“° Newspaper4k a fork of the beloved Newspaper3k. Extraction of articles, titles, and metadata from news websites.
haystack
:mag: LLM orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. With advanced retrieval methods, it's best suited for building RAG, question answering, semantic search or conversational agent chatbots.
carcassonne
Carcassonne implementation in python
TAADpapers
Must-read Papers on Textual Adversarial Attack and Defense
CEF4Delphi
CEF4Delphi is an open source project to embed Chromium-based browsers in applications made with Delphi or Lazarus/FPC for Windows, Linux and MacOS.
Graph-Bert
Source code of Graph-Bert
nlp-architect
A model library for exploring state-of-the-art deep learning topologies and techniques for optimizing Natural Language Processing neural networks
KK-s-Paperlist
A list of papers for machine learning, reinforcement learning, NLP or something interesting
BERT-related-papers
BERT-related papers
awesome-public-datasets
A topic-centric list of HQ open datasets.
transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
TF2-albert-NER
wrapping albert via bert-for-tf2, implementing NER task
text_classification
all kinds of text classification models and more with deep learning
A_Pipeline_Of_Pretraining_Bert_On_Google_TPU
A tutorial of pertaining Bert on your own dataset using google TPU
entity-recognition-datasets
A collection of corpora for named entity recognition (NER) and entity recognition tasks. These annotated datasets cover a variety of languages, domains and entity types.
keras-LAMB-Optimizer
Implementation of the LAMB optimizer for Keras from the paper "Reducing BERT Pre-Training Time from 3 Days to 76 Minutes"