dotnet / machinelearning-modelbuilder

Simple UI tool to build custom machine learning models.

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Invalid call to 'GetValue' when Training Named Entity Recognition Data

Hargravejason opened this issue · comments

System Information (please complete the following information):

  • Model Builder Version (available in Manage Extensions dialog): 17.18.2.2415501
  • Visual Studio Version : 17.9.4

Describe the bug

  • On which step of the process did you run into an issue: Train
  • Clear description of the problem: When training data, the system processes the data, then throws an error "Invalid call to 'GetValue'"

To Reproduce
Steps to reproduce the behavior:

  1. Select scenario
  2. Select Environment (both CPU and GPU have error)
  3. Add data
  4. Train

Expected behavior
Training complete and moved to Evaluate section

Screenshots
image

Additional context
Error log:
at Microsoft.ML.DataViewSchema.Annotations.GetValue[TValue](String kind, TValue& value)
at Microsoft.ML.Data.SchemaAnnotationsExtensions.GetKeyValues[TValue](Column column, VBuffer1& keyValues) at Microsoft.ML.TorchSharp.NasBert.NerTrainer.Trainer.GetRowCountAndSetLabelCount(IDataView input) at Microsoft.ML.TorchSharp.TorchSharpBaseTrainer2.TrainerBase..ctor(TorchSharpBaseTrainer2 parent, IChannel ch, IDataView input, String modelUrl) at Microsoft.ML.TorchSharp.NasBert.NasBertTrainer2.NasBertTrainerBase..ctor(TorchSharpBaseTrainer2 parent, IChannel ch, IDataView input, String modelUrl) at Microsoft.ML.TorchSharp.NasBert.NerTrainer.CreateTrainer(TorchSharpBaseTrainer2 parent, IChannel ch, IDataView input)
at Microsoft.ML.TorchSharp.TorchSharpBaseTrainer2.Fit(IDataView input) at Microsoft.ML.Data.EstimatorChain1.Fit(IDataView input)
at Microsoft.ML.AutoML.SweepablePipelineRunner.Run(TrialSettings settings)
at Microsoft.ML.AutoML.SweepablePipelineRunner.RunAsync(TrialSettings settings, CancellationToken ct)
at Microsoft.ML.AutoML.AutoMLExperiment.d__24.MoveNext()
--- End of stack trace from previous location where exception was thrown ---
at System.Runtime.ExceptionServices.ExceptionDispatchInfo.Throw()
at System.Runtime.CompilerServices.TaskAwaiter.HandleNonSuccessAndDebuggerNotification(Task task)
at Microsoft.ML.ModelBuilder.AutoMLService.Experiments.NamedEntityRecognitionExperiment.d__13.MoveNext() in //src/Microsoft.ML.ModelBuilder.AutoMLService/Experiments/NamedEntityRecognitionExperiment.cs:line 133
--- End of stack trace from previous location where exception was thrown ---
at System.Runtime.ExceptionServices.ExceptionDispatchInfo.Throw()
at System.Runtime.CompilerServices.TaskAwaiter.HandleNonSuccessAndDebuggerNotification(Task task)
at Microsoft.ML.ModelBuilder.AutoMLEngine.d__21.MoveNext() in /
/src/Microsoft.ML.ModelBuilder.AutoMLService/AutoMLEngineService/AutoMLEngine.cs:line 199

Files:
NERKeyMap.txt
NER_Data.txt

Note: Data shortened to 1 training item for example, training data has 2000+ items over all, issue occurs on any number of items.

@zewditu Can you take a look