mjr27 / chategw

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Data preparation

Dotnet data preparation

$ cd src/ChatEgw.UI.Indexer
export EGW_SEARCH_DSN="Server=localhost;Database=search;Port=15432;Username=postgres;Password=password"

Create database

$ dotnet run -- migrate

Import base data

$ dotnet run -- import egw -f "Host=localhost;Username=user;Password=password;Database=database"

Export data to file for python postprocessing

$ dotnet run -- export tsv paragraphs-raw.tsv

Extract tagging from raw data

$ cd cuda-backend

About


Languages

Language:Jupyter Notebook 40.5%Language:C# 38.4%Language:Python 12.1%Language:HTML 8.4%Language:CSS 0.3%Language:Dockerfile 0.3%