kylebgorman / cliqs

Crosslinguistic investigations in quantitative syntax


Crosslinguistic Investigations in Quantitative Syntax (CLIQS)


This is code for studying quantitative syntax using dependency corpora.

It is written for Python 3.5 and has also been tested on Python 2.7.

Dependencies

Run pip install -r requirements.txt for basic functionality. Additionally, run pip install -r optrequirements.txt to install the optional dependencies used for parallelization and visualization.

Example

The list of available languages can be found in corpora.ud_langs.

To compare dependency length in some languages to random and minimal baselines, run:

    python run_mindep.py run lang1 lang2 ... langn > result_raw.csv

Then postprocess the resulting CSV:

    python run_mindep.py postprocess result_raw.csv > result.csv
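The run and postprocess steps can also be chained from a small driver script. The following is only a sketch, assuming Python 3.5 or later and that it is launched from the repository root where run_mindep.py lives. The helper names mindep_commands and run_pipeline are hypothetical and not part of this repository, and the language codes in the usage comment are placeholders.

```python
import subprocess
import sys


def mindep_commands(langs, script="run_mindep.py"):
    """Build the argv lists for the two steps (hypothetical helper)."""
    run_cmd = [sys.executable, script, "run"] + list(langs)
    post_cmd = [sys.executable, script, "postprocess", "result_raw.csv"]
    return run_cmd, post_cmd


def run_pipeline(langs):
    """Run both steps, redirecting stdout to the CSV files named above."""
    run_cmd, post_cmd = mindep_commands(langs)
    # Equivalent to: python run_mindep.py run ... > result_raw.csv
    with open("result_raw.csv", "w") as raw:
        subprocess.run(run_cmd, stdout=raw, check=True)
    # Equivalent to: python run_mindep.py postprocess result_raw.csv > result.csv
    with open("result.csv", "w") as out:
        subprocess.run(post_cmd, stdout=out, check=True)


# Usage (placeholder language codes; see corpora.ud_langs for real ones):
# run_pipeline(["lang1", "lang2"])
```

Passing check=True makes the driver stop with an exception if the first step fails, so the postprocess step never runs on a partial result_raw.csv.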

Then you can run the various R scripts whose names start with mindep_ to analyze the results and generate figures.
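To make the quantity being compared concrete, here is a minimal, self-contained sketch of dependency length and two simple baselines on a toy sentence. It is an illustration only and does not use or reproduce the code in this repository: the function names are hypothetical, the random baseline shuffles word order without any constraint, and the minimal baseline is found by brute force, which is only feasible for very short sentences. The baselines computed by run_mindep.py may be defined differently.

```python
import itertools
import random


def dependency_length(heads, order=None):
    """Sum of linear distances between each word and its head.

    heads[i] is the index of word i's head, or None for the root.
    order[i] is the linear position of word i (defaults to i).
    """
    if order is None:
        order = list(range(len(heads)))
    return sum(abs(order[i] - order[h])
               for i, h in enumerate(heads) if h is not None)


def random_baseline(heads, n_samples=2000, seed=0):
    """Mean dependency length over unconstrained random word orders."""
    rng = random.Random(seed)
    positions = list(range(len(heads)))
    total = 0
    for _ in range(n_samples):
        rng.shuffle(positions)
        total += dependency_length(heads, positions)
    return total / n_samples


def minimal_baseline(heads):
    """Smallest dependency length over all word orders (brute force)."""
    return min(dependency_length(heads, perm)
               for perm in itertools.permutations(range(len(heads))))


# Toy tree for "The dog chased the cat": "chased" (index 2) is the root.
heads = [1, 2, None, 4, 2]
observed = dependency_length(heads)   # 5
minimal = minimal_baseline(heads)     # 4
rand = random_baseline(heads)         # about 8 in expectation
print(observed, minimal, rand)
```

In this toy case the observed order falls between the minimal and random baselines.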



Languages

Python 92.7%, R 7.0%, Shell 0.3%