» tagged pages
» logout

sorted by: recent | see : popular
Content Tagged with data-mining + prodei

TCatNG Toolkit :: Text Categorization via N-Grams

"The TCatNG Toolkit is a Java package that you can use to apply N-Gram analysis techniques to the process of categorizing text files. [Namely] categorizing documents by topic, detecting the author of a text, or recognizing the language [...]"

open-source: del.icio.us tag/open-source