komodojp / tinyld

Simple and Performant Language detection library for NodeJS

Home Page:https://komodojp.github.io/tinyld/

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

TinyLD

npm npm CDN Download License

logo

πŸŽ‰ Description

Tiny Language Detector, simply detect the language of a unicode UTF-8 text:

  • Pure JS, No api call, No dependencies (Node and Browser compatible)
  • Blazing fast and low memory footprint (unlike ML methods)
  • Train with dataset from Tatoeba and UDHR
  • Support 62 languages (24 for the web version)
  • Reliable even for really short texts (chatbot, keywords, ...)
  • Support both ISO-639-1 & ISO-639-2
  • Available for NodeJS (CommonJS and ESM), Deno and Browser

Links


πŸ’Ύ Getting Started

Install

yarn add tinyld # or npm install --save tinyld

Install Documentation


πŸ“„ TinyLD API

import { detect, detectAll } from 'tinyld'

// Detect
detect('γ“γ‚Œγ―ζ—₯本θͺžγ§γ™.') // ja
detect('and this is english.') // en

// DetectAll
detectAll('ceci est un text en francais.')
// [ { lang: 'fr', accuracy: 0.5238 }, { lang: 'ro', accuracy: 0.3802 }, ... ]

API Documentation


πŸ“Ž TinyLD CLI

tinyld This is the text that I want to check
# [ { lang: 'en', accuracy: 1 } ]

More Information


πŸ“ˆ Performance

Here is a comparison of Tinyld against other popular libraries.

SVG Graph

To summary in one sentence:

Better, Faster, Smaller

More Benchmark Information


Developer

You want to Contribute or Open a PR, it's recommend to take a look at the dev documentation

About

Simple and Performant Language detection library for NodeJS

https://komodojp.github.io/tinyld/

License:MIT License


Languages

Language:TypeScript 58.9%Language:JavaScript 28.9%Language:Vue 11.0%Language:HTML 0.8%Language:Shell 0.2%Language:CSS 0.1%