Thai Character Cluster

To cluster Thai text into undividable units. Character cluster is defined to be the smallest recognizable unit. The character string is clustered for the sake of avoiding the processing of invalid Thai character units. (download)

KWIC for Thai text

KWIC (KeyWord In Context) for both segmented or unsegmented Thai text. It is used to create concordance of Thai text for studying the occurrence of words in question. (download)

Faculty of Data Science, Musashino University, Ariake Campus, Japan

Faculty of Engineering, 

Thammasat University, Thailand

  • Facebook Clean Grey
  • Twitter Clean Grey
  • LinkedIn Clean Grey

Research University Network (RUN)

© 2017 by Virach. Proudly created with