IBM SystemT

Last updated
IBM SystemT
Developer(s) IBM
Written in AQL  [ de ], Java
Operating system Linux, macOS, Windows
Type Information Extraction, Text mining
Website SystemT

IBM SystemT is a declarative information extraction system. It was first built in 2005, as a research project at IBM's IBM Almaden Research Center. Its name is partially inspired by System R, a seminal project from the same research center.

SystemT [1] comprises the following three main components: (1) AQL, a declarative rule language with a similar syntax to SQL; (2) Optimizer, which accepts AQL statements as input and generates high-performance algebraic execution plans; and (3) Executing engine, which executes the plan generated by the Optimizer and performs information extraction over input documents.

SystemT is available as part of IBM BigInsights, [2] and has also been taught in multiple universities around the globe. A version of SystemT was available (starting in September 2016) as a companion to a sequence of online courses in Text Analytics. [3] [4]

References

  1. Chiticariu, Laura; Krishnamurthy, Rajasekar; Li, Yunyao; Raghavan, Sriram; Reiss, Frederick R.; Vaithyanathan, Shivakumar (2010-01-01). "SystemT: An Algebraic Approach to Declarative Information Extraction". Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics. ACL '10. Uppsala, Sweden: Association for Computational Linguistics: 128–137.
  2. IBM BigInsights
  3. Text Analytics: Getting Results with SystemT
  4. Chiticariu, Laura; Danilevsky, Marina; Li, Yunyao; Reiss, Frederick; Zhu, Huaiyu (2018). "SystemT: Declarative Text Understanding for Enterprise". Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 3 (Industry Papers). Association for Computational Linguistics. pp. 76–83. doi:10.18653/v1/N18-3010.