KH Coder

KH Coder
KH Coder
Developer	Koichi Higuchi
Stable release	2.00f / Dec 2015
Preview release	3.Beta.01 / Mar 2020
Repository	github.com/ko-ichi-h/khcoder ;
Operating system	Microsoft Windows, Linux, macOS
Type	Qualitative data analysis, Text mining, Content analysis
License	GPL2 license
Website	khcoder.net/en/

Last updated December 30, 2025

KH Coder is an open source software for computer assisted qualitative data analysis, particularly quantitative content analysis and text mining. It can be also used for computational linguistics. It supports processing and etymological information of text in several languages, such as Japanese, English, French, German, Italian, Portuguese and Spanish. Specifically, it can contribute factual examination co-event system hub structure, computerized arranging guide, multidimensional scaling and comparative calculations.^[1] Word frequency statistics, part-of-speech analysis, grouping, correlation analysis, and visualization (including histograms and clustering maps) are among the features offered by KH Coder.^[2]

It is well received by researchers worldwide and used in a large number of disciplines, including neuroscience, sociology, psychology, public health, media studies, education research and computer science. There are more than 500 English research papers listed in Google scholar.^[3] More than 3500 academic research papers were published that use KH Coder according to a list compiled by the author.^[4]

KH Coder has been reviewed as a user friendly tool "for identifying themes in large unstructured data sets, such as online reviews or open-ended customer feedback"^[5] and has been reviewed in comparison to WordStat.^[6]

Features

Its features include:

on word-level: Searching, KWIC concordance, collocation statistics, and correspondence analysis.
on category-level: Development of categories or dictionaries, cross tabulation, and correspondence analysis.
on word- and category-level: Frequency lists, multi-dimensional scaling, co-occurrence network, and hierarchical cluster analysis.
on document-level: Searching, clustering, and Naive Bayes classifier

KH Coder allows for further search and statistical analysis functions using back-end tools such as Stanford POS Tagger, the natural language processing toolkit FreeLing, Snowball stemmer, MySQL and R.

Alternatives

qdap (Windows, Linux, macOS) for quantitative analysis of qualitative transcripts and natural language processing.

References

↑ S. N. Vinithra, S.N; Arun Selvan, S.J.; Anand Kumar, M.; Soman, K.P. (2015): Simulated and Self-Sustained Classification of Twitter Data based on its Sentiment. Indian Journal of Science and Technology. Vol. 8, Issue 24
↑ Public Opinion Mining on Construction Health and Safety: Latent Dirichlet Allocation Approach. Buildings 2023, 13, 927. https://doi.org/10.3390/buildings13040927
↑ Google Scholar search using Keywords "KH Coder" and "KHCoder"
↑ Higuchi, Koichi (2017): Scholarly research using KH Coder
↑ Towler, Will (2014): Text Analytics For Everyone. UX Magazine, July 31, 2014.
↑ Huirong, Cheng;Guobin, Huang; Lin, Zheng (2015): Comparison of Software for Unstructured Text Analysis:KH Coder vs. Wordstat Archived 2017-11-07 at the Wayback Machine . 图书与情报, 2015(04): 110-117.

External links

This page is based on this Wikipedia article
Text is available under the CC BY-SA 4.0 license; additional terms may apply.
Images, videos and audio are available under their respective licenses.

[1] S. N. Vinithra, S.N; Arun Selvan, S.J.; Anand Kumar, M.; Soman, K.P. (2015): Simulated and Self-Sustained Classification of Twitter Data based on its Sentiment. Indian Journal of Science and Technology. Vol. 8, Issue 24

[2] Public Opinion Mining on Construction Health and Safety: Latent Dirichlet Allocation Approach. Buildings 2023, 13, 927. https://doi.org/10.3390/buildings13040927

[3] Google Scholar search using Keywords "KH Coder" and "KHCoder"

[4] Higuchi, Koichi (2017): Scholarly research using KH Coder

[5] Towler, Will (2014): Text Analytics For Everyone. UX Magazine, July 31, 2014.

[6] Huirong, Cheng;Guobin, Huang; Lin, Zheng (2015): Comparison of Software for Unstructured Text Analysis:KH Coder vs. Wordstat Archived 2017-11-07 at the Wayback Machine . 图书与情报, 2015(04): 110-117.

[1]

[2]

[3]

[4]

[5]

[6]

v t e Computer-assisted qualitative data analysis software
Open source software	Aquad Cassandre CLAN Coding Analysis Toolkit Compendium ELAN KH Coder Quantitative Discourse Analysis Package (qdap) Requal RQDA Taguette
Proprietary software	ATLAS.ti Dedoose Dovetail MAXQDA NVivo QDA Miner Qiqqa Quirkos Transana
Societyportal

v t e R (programming language)
Features	Sweave
Implementations	Distributed R Microsoft R Open (Revolution R Open) Renjin
Packages	Bibliometrix easystats qdap lumi RGtk2 Rhea Rmetrics rnn RQDA Shiny SimpleITK Statcheck tidyverse ggplot2 dplyr knitr
Interfaces	Emacs Speaks Statistics Java GUI for R KH Coder Rattle GUI R Commander RExcel RKWard RStudio
People	Roger Bivand Jenny Bryan John Chambers Peter Dalgaard Dirk Eddelbuettel Robert Gentleman Ross Ihaka Friedrich Leisch Thomas Lumley Martin Maechler Brian D. Ripley Julia Silge Luke Tierney Hadley Wickham Yihui Xie
Organisations	R Consortium R Foundation for Statistical Computing Revolution Analytics R-Ladies Posit PBC (formerly RStudio PBC)
Publications	The R Journal

KH Coder

Contents

Features

Alternatives

See also

References

External links