Thai Character Cluster

To cluster Thai text into undividable units. Character cluster is defined to be the smallest recognizable unit. The character string is clustered for the sake of avoiding the processing of invalid Thai character units. (download)


KWIC for Thai text

KWIC (KeyWord In Context) for both segmented or unsegmented Thai text. It is used to create concordance of Thai text for studying the occurrence of words in question. (download)

Sirindhorn International Institute of Technology (SIIT)
Thammasat University

Musashino University, Ariake Campus, Japan

  • Facebook Clean Grey
  • Twitter Clean Grey
  • LinkedIn Clean Grey

© 2017 by Virach. Proudly created with