Arael
An all-in-one multilingual text analysis pipeline.
What we mean by multilingual
Arael supports more than 170 languages and can process terabytes of data as a stream. For languages such as Chinese, Japanese, Korean, and Thai, Arael uses machine learning to resolve overlapping ambiguity and to handle out-of-vocabulary words and phrases.
Easy to integrate into workflows
Arael segments text into graphemes, words/phrases, and sentences, and it can also be configured with analyzers and filters to form customized workflows.
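To make the idea of chaining a segmenter with filters concrete, here is a minimal sketch in plain Python. It does not use Arael's actual API: every name in it (`run_pipeline`, `whitespace_segmenter`, `lowercase`, `drop_short`) is hypothetical, and the whitespace segmenter is only a stand-in for a real multilingual one. Generators keep the pipeline lazy, so input can be consumed as a stream.

```python
from typing import Callable, Iterable, Iterator, List

# Hypothetical types for illustration only; this is not Arael's real API.
Segmenter = Callable[[str], Iterable[str]]
Filter = Callable[[Iterable[str]], Iterator[str]]


def whitespace_segmenter(text: str) -> Iterable[str]:
    """Stand-in segmenter: splits on whitespace only."""
    return text.split()


def lowercase(tokens: Iterable[str]) -> Iterator[str]:
    """Filter that lowercases every token."""
    for tok in tokens:
        yield tok.lower()


def drop_short(tokens: Iterable[str]) -> Iterator[str]:
    """Filter that drops single-character tokens."""
    for tok in tokens:
        if len(tok) > 1:
            yield tok


def run_pipeline(
    lines: Iterable[str], segment: Segmenter, filters: List[Filter]
) -> Iterator[str]:
    """Segment each incoming line, then pass the tokens through each filter in order."""
    for line in lines:
        tokens: Iterable[str] = segment(line)
        for f in filters:
            tokens = f(tokens)
        yield from tokens
```

Because `run_pipeline` accepts any iterable of lines, it could be fed a file object or a network stream as easily as a list, and filters can be reordered or swapped without touching the segmenter.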
Handles emojis perfectly
In addition, Arael implements Unicode® Standard Annex #29 (Unicode Text Segmentation) and Unicode Technical Standard #51 (Unicode Emoji). In other words: never get tripped up by emojis!
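The sketch below illustrates why standards-based segmentation matters for emoji. A single visible emoji can be made of several code points, so splitting text by code point tears it apart. The `rough_clusters` helper is a deliberately simplified, hypothetical stand-in. It is neither a full UAX #29 implementation nor Arael's code: it only handles zero-width joiners, variation selector-16, skin-tone modifiers, and combining marks, and it ignores cases such as flag sequences and Hangul syllables.

```python
import unicodedata
from typing import List

ZWJ = "\u200d"   # ZERO WIDTH JOINER: glues emoji into one sequence
VS16 = "\ufe0f"  # VARIATION SELECTOR-16: requests emoji presentation


def is_extender(ch: str) -> bool:
    """True if ch attaches to the preceding character (simplified rule)."""
    if ch in (ZWJ, VS16):
        return True
    if 0x1F3FB <= ord(ch) <= 0x1F3FF:  # emoji skin-tone modifiers
        return True
    return unicodedata.combining(ch) != 0  # combining marks such as U+0301


def rough_clusters(text: str) -> List[str]:
    """Group code points into approximate user-perceived characters."""
    clusters: List[str] = []
    join_next = False  # set after a ZWJ so the next code point is attached
    for ch in text:
        if clusters and (join_next or is_extender(ch)):
            clusters[-1] += ch
        else:
            clusters.append(ch)
        join_next = ch == ZWJ
    return clusters


# Family emoji: man + ZWJ + woman + ZWJ + girl, rendered as one glyph.
family = "\U0001F468\u200d\U0001F469\u200d\U0001F467"
print(len(family))                   # number of code points
print(len(rough_clusters(family)))   # number of approximate clusters
```

Here the family emoji counts as five code points but forms a single cluster, which is the unit a user would expect to select, delete, or count as one character.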
Also available in:
中文