we-srds-eval is a tool that calculates the correlation between the cosine similarity of word embeddings and human judgments of semantic relations. It processes datasets from punuy-datasets to evaluate how well word embeddings align with human semantic understanding.
- Computes cosine similarity for word pairs.
- Calculates the correlation between cosine similarity scores and human-judged scores.
- Supports multiple datasets and languages.
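The two computations listed above can be sketched as follows. This is a minimal illustration, not the code in src/lib/we-srd-corr.ts: the function names `cosineSimilarity` and `pearson` are hypothetical, and the choice of Pearson correlation is an assumption, since the kind of correlation the project reports (for example Pearson or Spearman) is not stated here.

```typescript
// Cosine similarity of two equal-length vectors: dot(a, b) / (|a| * |b|).
// Hypothetical helper; returns 0 when either vector has zero length.
function cosineSimilarity(a: number[], b: number[]): number {
  if (a.length !== b.length) {
    throw new Error("Vectors must have the same number of dimensions");
  }
  let dot = 0;
  let normA = 0;
  let normB = 0;
  for (let i = 0; i < a.length; i++) {
    dot += a[i] * b[i];
    normA += a[i] * a[i];
    normB += b[i] * b[i];
  }
  const denom = Math.sqrt(normA) * Math.sqrt(normB);
  return denom === 0 ? 0 : dot / denom;
}

// Pearson correlation coefficient between two equal-length series.
// Assumption: Pearson is used only for illustration.
// Returns NaN when either series has zero variance.
function pearson(xs: number[], ys: number[]): number {
  if (xs.length !== ys.length || xs.length < 2) {
    throw new Error("Series must have the same length and at least 2 points");
  }
  const n = xs.length;
  const meanX = xs.reduce((sum, x) => sum + x, 0) / n;
  const meanY = ys.reduce((sum, y) => sum + y, 0) / n;
  let cov = 0;
  let varX = 0;
  let varY = 0;
  for (let i = 0; i < n; i++) {
    const dx = xs[i] - meanX;
    const dy = ys[i] - meanY;
    cov += dx * dy;
    varX += dx * dx;
    varY += dy * dy;
  }
  return cov / Math.sqrt(varX * varY);
}

// Usage with made-up toy data: one embedding pair per human-scored word pair.
const pairs: Array<{ a: number[]; b: number[]; human: number }> = [
  { a: [1, 0, 0], b: [0.9, 0.1, 0], human: 9.0 },
  { a: [1, 0, 0], b: [0.5, 0.5, 0], human: 6.0 },
  { a: [1, 0, 0], b: [0, 0, 1], human: 1.0 },
];
const similarities = pairs.map((p) => cosineSimilarity(p.a, p.b));
const correlation = pearson(similarities, pairs.map((p) => p.human));
console.log(correlation);
```

A value close to 1 would indicate that the embedding similarities rank word pairs much as the human annotators did.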
To install the necessary dependencies, run:
npm install
To process all datasets and calculate correlations, execute the following command:
tsx src/bin/proc-datasets.ts
This will output the correlation results for each dataset.
- src/bin/proc-datasets.ts: Entry point for processing datasets.
- src/lib/we-srd-corr.ts: Core library functions for calculating cosine similarity and correlation.
- src/lib/utils.ts: Utility functions used across the project.
- src/lib/ds-adapters.ts: Adapters for handling different dataset formats.
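To illustrate the adapter idea, here is a rough sketch of how one dataset format might be normalized into a common shape. Everything in it is an assumption made for illustration: the `WordPairScore` interface, the `DatasetAdapter` type, the `tsvAdapter` function, and the tab-separated input format are hypothetical and are not taken from src/lib/ds-adapters.ts or from punuy-datasets.

```typescript
// Hypothetical common shape that every adapter would produce.
interface WordPairScore {
  word1: string;
  word2: string;
  score: number; // human-judged score, on whatever scale the dataset uses
}

// Hypothetical adapter signature: raw dataset content in, normalized pairs out.
type DatasetAdapter<T> = (raw: T) => WordPairScore[];

// Example adapter for an assumed format of tab-separated lines:
// "word1<TAB>word2<TAB>score". Blank lines are skipped.
const tsvAdapter: DatasetAdapter<string> = (raw) =>
  raw
    .split(/\r?\n/)
    .map((line) => line.trim())
    .filter((line) => line.length > 0)
    .map((line) => {
      const [word1, word2, rawScore] = line.split("\t");
      const score = Number(rawScore);
      if (!word1 || !word2 || Number.isNaN(score)) {
        throw new Error(`Malformed line: ${line}`);
      }
      return { word1, word2, score };
    });

// Usage with made-up data.
const normalized = tsvAdapter("cat\tdog\t7.5\n\ncar\tauto\t9");
console.log(normalized.length);
```

Keeping each format behind a function of the same type would let the processing code treat all datasets uniformly, whatever their original layout.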
This project uses TypeScript for type safety and Vitest for testing. To run tests, use:
npm test
This project is licensed under the ISC License.
André Santos