mdda / worldtree_corpus

Worldtree Corpus helper files, and sample solutions

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

worldtree_corpus

Worldtree Corpus helper files, and sample solutions

Note: To view submission code for :

Base code

The idea is to let you :

%load_ext autoreload
%autoreload 2

and

import os
if not os.path.isdir('worldtree_corpus'):
  ! git clone https://github.com/mdda/worldtree_corpus
import worldtree_corpus as wtc

at the top of a notebook, and have a bunch of useful stuff ready-to-go (you can choose the name under which to import it, so as to avoid collisions with your existing code).

Text cleansing

wtc.preprocess.convert_texts(["Which of these will most likely increase?", "Habitats support animals."])

Preprocessing

import pandas
df_exp = wtc.preprocess.XYZ()

About

Worldtree Corpus helper files, and sample solutions

License:MIT License


Languages

Language:Python 100.0%