vinhkhuc / JFastText

Java interface for fastText

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

我爬取了2g大的训练数据,训练完成后jre会报错不支持小型转储

zhounan-n opened this issue · comments

JFastText jft = new JFastText();
// Train supervised model
jft.runCmd(new String[]{
"supervised",
"-input", "/resource/trainseg2.txt",
"-output", "/resource/trainseg2.model",
"-bucket", "100",
"-minCount", "1"
});
我训练的代码是这样的,数据有一个g的时候是不报错的,超过一定容量就报错了,
是我的参数不对还是其他原因?日志文件上传不成功,我复制到下面

A fatal error has been detected by the Java Runtime Environment:

EXCEPTION_ACCESS_VIOLATION (0xc0000005) at pc=0x00007fff89122c38, pid=6256, tid=0x000000000000126c

JRE version: Java(TM) SE Runtime Environment (8.0_102-b14) (build 1.8.0_102-b14)

Java VM: Java HotSpot(TM) 64-Bit Server VM (25.102-b14 mixed mode windows-amd64 compressed oops)

Problematic frame:

C [jniFastTextWrapper.dll+0x12c38]

Failed to write core dump. Minidumps are not enabled by default on client versions of Windows

If you would like to submit a bug report, please visit:

http://bugreport.java.com/bugreport/crash.jsp

The crash happened outside the Java Virtual Machine in native code.

See problematic frame for where to report the bug.

help...

same problem ...

maybe you can set the bucket smaller

I'd suggest you train with the official Python binding or just in the terminal. Then you can load the trained model in Java.