This project tries to recognize the font of a text in a photo using a set of algorithms and libraries. The goal is to obtain accurate results with the image as only input avoiding other manual processes. This is the only open source project of its kind.
For now it works only in browser environment.
Import the module then call the main function like in the following script
import {Typefont} from "src/index";
Typefont("path/image.png").then((res) => console.log(res));
the first argument can be: a string with the path of the image, a string with the base64 data of the image, the instance of a canvas or the instance of a image. The function returns a Promise. You can compile the whole project using Webpack or other tools.
Text on the cover of a book (texts are in italian because I live in Italy).
Text on the cover of another book.
Screenshot of a text on a video from the web.
Each font in the result has a percentage of similarity with the input image and a piece of information about the font.
I had just discovered the version of Tesseract written in JavaScript and I noticed that he was also trying to identify the font, I wondered how to improve this process then I used Tesseract to extract the letters from the input image, I created a new system that uses the Jimp image processing library to compare the extracted letters with the fonts stored in a dedicated library. I'm writing a dedicated algorithm for comparison in order to obtain more accurate results.
You can pass an object with options as second argument.
Option | Type | Description | Default |
---|---|---|---|
progress |
Function |
A function which is called every time the comparison with a font is completed. | undefined |
minSymbolConfidence |
Number |
The minimum confidence that a symbol must have to be accepted in the comparison queue (the confidence value is assigned by the OCR engine). | 30 |
analyticComparisonThreshold |
Number |
The threshold for the pixel based image comparison. | 0.52161 |
sameSizeComparison |
Boolean |
Scale the symbols to the same size before comparison? | true |
fontsDirectory |
String |
The URL of the directory containing the fonts. | storage/fonts/ |
fontsData |
String |
The name of the file containing the JSON data of a font. | data.json |
fontsIndex |
String |
The URL of the fonts index JSON file. | storage/index.json |
The input image is passed to the optical character recognition after some filters based on its brightness. Then the symbols (letters) are extracted from the input image and compared with the symbols of the fonts in the database using a perceptual comparison and a pixel based comparison in order to obtain a percentage of similarity.
The symbols of fonts are just a JSON structure with letters as keys and the base64 of the image of the letter as value. If you want to add a new font you must follow this structure.
{
"meta": {
"name": "name",
"author": "author",
"uri": "uri",
"license": "license",
"key": "value",
...
},
"alpha": {
"a": "base64",
"b": "base64",
"c": "base64",
...
}
}
Each key of the meta object is included in the final result.
MIT License.