Main Menu

Powered by <TEI:TOK>
Maarten Janssen, 2014-

Language Identification

To determine which language an unknown piece of text is written in, you can paste it into the text box below. From 25 character onwards, the system will attempt to detect the language of the text. If there is more than one similar language, the system will show them all. Languages are detected by so-called trigram models, and CWALI uses trigrams from text collections from around 800 different languages.

  • Click here to see a list of the languages currently detected.
  • Click here for more information about the language detection algorithm.

Provided text is not sufficiently long