Tree-sitter (parser generator)

Last updated
Tree-sitter
Original author(s) Max Brunsfeld
Initial release2018;7 years ago (2018)
Stable release
0.24.7 [1]   OOjs UI icon edit-ltr-progressive.svg / 12 January 2025;39 days ago (12 January 2025)
Repository
Written in Rust, C
Platform Cross-platform
Type Parser generator
License MIT License
Website tree-sitter.github.io/tree-sitter/

In computing, Tree-sitter is a parser generator and incremental parsing library.

Contents

Details

It is used to parse source code into concrete syntax trees usable in compilers, interpreters, text editors, and static analyzers. [2] [3] It is specialized for use in text editors, as it supports incremental parsing for updating parse trees while code is edited in real time, [4] and provides a built-in S-expression query system for analyzing code. [5]

Text editors which have official integrations with Tree-sitter include Atom, [6] GNU Emacs, [7] Neovim, [8] Lapce, [9] Zed, [10] and Helix. [11] Language bindings allow it to be used from programming languages including Go, Haskell, Java, JavaScript (with Node.js and WASM), Kotlin, Lua, OCaml, Perl, Python, Ruby, Rust, and Swift. Tree-sitter parsers have been written for these languages and many others. [12] GitHub uses Tree-sitter to support in-browser symbolic code navigation in Git repositories. [13]

Tree-sitter uses a GLR parser, a type of LR parser. [14] [15] [13]

Tree-sitter was originally developed by GitHub for use in the Atom text editor, where it was first released in 2018. [16] [6]

See also

References

  1. "Release 0.24.7". 12 January 2025. Retrieved 30 January 2025.
  2. "Introductory to Treesitter". Blog Teknologi Umum. Retrieved 2023-07-30.
  3. Petersen, Mickey. "How to Get Started with Tree-Sitter". Mastering Emacs. Retrieved 2023-10-30.
  4. "TreeSitter - the holy grail of parsing source code". symflower.com. Retrieved 2023-07-30.
  5. Petersen, Mickey. "Tree Sitter and the Complications of Parsing Languages". Mastering Emacs. Retrieved 2023-07-30.
  6. 1 2 Brunsfeld, Max (2018-10-31). "Atom understands your code better than ever before". The GitHub Blog. Retrieved 2023-07-30.
  7. "GNU Emacs NEWS -- history of user-visible changes".
  8. "Treesitter - Neovim docs". neovim.io. Retrieved 2023-07-30.
  9. "lapce/CHANGELOG.md at f4747fbd306a4b8fda6927e37593bf23f4a1584b · lapce/lapce". GitHub. Retrieved 2023-07-30.
  10. "Zed - Code at the speed of thought". Zed. Retrieved 2023-07-30.
  11. "Helix". helix-editor.com. Retrieved 2023-07-30.
  12. "Tree-sitter|Introduction". tree-sitter.github.io. Retrieved 2023-07-30.
  13. 1 2 Clem, Timothy; Thomson, Patrick (2021-08-31). "Static Analysis at GitHub: An experience report". Queue. 19 (4): 42–67. doi: 10.1145/3487019.3487022 . ISSN   1542-7730. S2CID   238412787.
  14. Nadeem, Ayman (2020-08-04). "CodeGen: Semantic's improved language support system". The GitHub Blog. Retrieved 2023-07-30.
  15. "Tree-sitter - a new parsing system for programming tools" by Max Brunsfeld, 14 October 2018, retrieved 2023-07-30. See 22:30 for Wagner influence and 29:27 for GLR implementation.
  16. Krill, Paul (2018-03-16). "What's new in GitHub's Atom text editor". InfoWorld. Retrieved 2023-10-30.