Yifan Zhang's repositories
AutoMathText
Official implementation of the DPFM @ ICLR 2024 paper "AutoMathText: Autonomous Data Selection with Language Models for Mathematical Texts" (Hugging Face Daily Papers: https://huggingface.co/papers/2402.07625)
Matrix-SSL
Official implementation of ICML 2024 paper "Matrix Information Theory for Self-supervised Learning" (https://arxiv.org/abs/2305.17326)
Kernel-InfoNCE
Official implementation of ICLR 2024 paper "Contrastive Learning Is Spectral Clustering On Similarity Graph" (https://arxiv.org/abs/2303.15103)
RelationMatch
Official implementation of the paper "RelationMatch: Matching In-batch Relationships for Semi-supervised Learning" (https://arxiv.org/abs/2305.10397)
syntax-semantics
TemplateMath: Syntactic Data Generation for Mathematical Problems
StackMathQA
StackMathQA: A Curated Collection of 2 Million Mathematical Questions and Answers Sourced from Stack Exchange
Chain-of-ThoughtsPapers
A collection of papers following the trend that started with "Chain of Thought Prompting Elicits Reasoning in Large Language Models".
MathAnalysis
MathAnalysis: An Extensive Collection of Challenging Mathematical Analysis Problems
Matrix-LLM
Official implementation of ICML 2024 paper "Matrix Cross-Entropy for Large Language Models" (https://arxiv.org/abs/2305.17326)
WikipediaMath
WikipediaMath: A specialized dataset curated from Wikipedia, focusing on mathematical content for the advancement of language models
github-cheat-sheet
A list of cool features of Git and GitHub.