VenkteshV

VenkteshV

Geek Repo

Company:Postdoc @tu-delft, PhD @ADS-AI, Reviewer @openjournals

Location:Delhi, india

Home Page:https://www.linkedin.com/in/venktesh-v-78099a135/

Github PK Tool:Github PK Tool

VenkteshV's starred repositories

al-folio-homepage

A beautiful, simple, clean, and responsive Jekyll theme for academics

Language:HTMLLicense:MITStargazers:2Issues:0Issues:0
Language:Jupyter NotebookStargazers:1Issues:0Issues:0

docprompting

Data and code for "DocPrompting: Generating Code by Retrieving the Docs" @ICLR 2023

Language:PythonLicense:Apache-2.0Stargazers:230Issues:0Issues:0

OLMo-Eval

Evaluation suite for LLMs

Language:PythonLicense:Apache-2.0Stargazers:285Issues:0Issues:0

interactive-clustering

Python package used to apply NLP interactive clustering methods.

Language:PythonLicense:NOASSERTIONStargazers:10Issues:0Issues:0

ALCE

[EMNLP 2023] Enabling Large Language Models to Generate Text with Citations. Paper: https://arxiv.org/abs/2305.14627

Language:PythonLicense:MITStargazers:428Issues:0Issues:0

Child-Vocab-Development

This project was originally my term project for a computational linguistics course at Pitt. It was turned into a research project later and I am working on publishing the work.

Language:Jupyter NotebookLicense:GPL-3.0Stargazers:5Issues:0Issues:0

self-rag

This includes the original implementation of SELF-RAG: Learning to Retrieve, Generate and Critique through self-reflection by Akari Asai, Zeqiu Wu, Yizhong Wang, Avirup Sil, and Hannaneh Hajishirzi.

Language:PythonLicense:MITStargazers:1683Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:5Issues:0Issues:0

polyfuse

Fusion for TREC run files with popular fusion techniques

Language:CLicense:MITStargazers:22Issues:0Issues:0

MSMARCO-Passage-Ranking

MS MARCO(Microsoft Machine Reading Comprehension) is a large scale dataset focused on machine reading comprehension, question answering, and passage ranking. A variant of this task will be the part of TREC and AFIRM 2019. For Updates about TREC 2019 please follow This Repository Passage Reranking task Task Given a query q and a the 1000 most relevant passages P = p1, p2, p3,... p1000, as retrieved by BM25 a succeful system is expected to rerank the most relevant passage as high as possible. For this task not all 1000 relevant items have a human labeled relevant passage. Evaluation will be done using MRR

Language:Jupyter NotebookLicense:MITStargazers:288Issues:0Issues:0

ranking-utils

Miscellaneous utilities for ranking models

Language:PythonLicense:MITStargazers:9Issues:0Issues:0

llm-efficiency-challenge.github.io

Website for NeurIPS Large Language Model Efficiency Challenge: 1 LLM + 1 GPU + 1 Day

Language:HTMLLicense:MITStargazers:7Issues:0Issues:0

sentence-transformers

Multilingual Sentence & Image Embeddings with BERT

Language:PythonLicense:Apache-2.0Stargazers:14631Issues:0Issues:0

SCANING

[CIKM'23] Code and data for our paper 'James ate 5 oranges = Steve bought 5 pencils': Structure-Aware Denoising for Paraphrasing Word Problems

Language:PythonStargazers:2Issues:0Issues:0

asr-scoring

Common scripts for scoring JSALT 2023 ASR systems

Language:PythonLicense:Apache-2.0Stargazers:1Issues:0Issues:0

jose-reviews

Reviews for the Journal of Open Source Education (JOSE)

License:CC0-1.0Stargazers:34Issues:0Issues:0

LLM-Blender

[ACL2023] We introduce LLM-Blender, an innovative ensembling framework to attain consistently superior performance by leveraging the diverse strengths of multiple open-source LLMs. LLM-Blender cut the weaknesses through ranking and integrate the strengths through fusing generation to enhance the capability of LLMs.

Language:PythonLicense:Apache-2.0Stargazers:840Issues:0Issues:0

Efficient-Fact-checking

Master thesis on supporting fact extraction on large data collections for a more efficient fact-checking process in real-world applications.

Language:Jupyter NotebookStargazers:1Issues:0Issues:0
Language:Jupyter NotebookStargazers:2Issues:0Issues:0
Language:PythonStargazers:933Issues:0Issues:0

joss-papers

Accepted JOSS papers

Language:HTMLLicense:CC-BY-4.0Stargazers:242Issues:0Issues:0

simple-llm-finetuner

Simple UI for LLM Model Finetuning

Language:Jupyter NotebookLicense:MITStargazers:2042Issues:0Issues:0

RWKV-LM-LoRA

RWKV is a RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best of RNN and transformer - great performance, fast inference, saves VRAM, fast training, "infinite" ctx_len, and free sentence embedding.

Language:PythonLicense:Apache-2.0Stargazers:405Issues:0Issues:0

FasterTransformer

Transformer related optimization, including BERT, GPT

Language:C++License:Apache-2.0Stargazers:5712Issues:0Issues:0

removestar

Tool to automatically replace 'import *' in Python files with explicit imports

Language:PythonLicense:MITStargazers:170Issues:0Issues:0

retomaton

PyTorch code for the RetoMaton paper: "Neuro-Symbolic Language Modeling with Automaton-augmented Retrieval" (ICML 2022)

Language:PythonLicense:MITStargazers:68Issues:0Issues:0

Whisper

Whisper applications

Language:Jupyter NotebookStargazers:78Issues:0Issues:0