guxd / deep-code-search

DeepCS: Deep Code Search

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

How to convert .txt file to .h5 file

XiaoXiaoYi123 opened this issue · comments

ese /indices and /phrases?

请问可以提供将txt文本序列转为对应的.h5文件的实现代码吗?

@li-car-fei
https://github.com/guxd/DialogBERT/blob/master/prepare_data.py
参考这个,用pytable 把dialog (a list of sequences)转成earray.

@li-car-fei https://github.com/guxd/DialogBERT/blob/master/prepare_data.py 参考这个,用pytable 把dialog (a list of sequences)转成earray.

在deepcs中,你是使用了哪里的tokenizer呢?

deepcs没有用第三方tokenizer,是自己实现的简易的tokenizer. 直接把字符转换成字典里的序号