Type | Private |
---|---|
Industry | Software |
Founded | 2003 |
Headquarters | Boston, MA |
Key people | Jeff Catlin, CEO; Mike Marshall, former CTO |
Products | Text analytics |
Website | www.lexalytics.com |
Lexalytics, Inc. provides sentiment and intent analysis to an array of companies using SaaS and cloud-based technology. [1] [2] Salience 6, the engine behind Lexalytics, was built as an on-premises, multilingual text analysis engine. It is leased to other companies, which use it to power filtering and reputation management programs. In July 2014, Lexalytics acquired Semantria to serve as a cloud option for its technology. [3] In September 2021, Lexalytics was acquired by the customer experience (CX) company InMoment. [4]
Lexalytics was formed in January 2003 out of a content management startup called Lightspeed. [3] When Lightspeed consolidated its operations on the West Coast of the United States, Jeff Catlin, a Lightspeed General Manager, and Mike Marshall, a Lightspeed Principal Engineer, convinced investors to give them the East Coast operation so as to avoid shutdown costs. [5] Catlin and Marshall renamed the operation Lexalytics.
Catlin took on the role of Chief Executive Officer, with Marshall working as Chief Technology Officer. [5] Lexalytics opted not to accept venture capital. Instead, the company initially shared sales and marketing expenses with the U.K.-based document management company Infonic. The two companies formed a joint venture in July 2008, which was later dissolved.
Since then, Lexalytics has worked with many other companies, including Bottlenose, [6] Salesforce, [6] Thomson Reuters, [7] Oracle [8] and DataSift. [9] In relationships with social media monitoring companies such as DataSift, Lexalytics’ Salience engine is typically built into the partner's product itself. [1] The engine is used in a similar way to monitor sentiment as it relates to stock trading. [10]
In December 2014, Lexalytics announced the latest iteration of its sentiment analysis engine, Salience 6. [11] Earlier that year, Lexalytics acquired Semantria in a bid to appeal to a wider variety of business models. Created by former Lexalytics Marketing Director Oleg Rogynskyy, [12] Semantria is a SaaS text mining service, offered as an API and an Excel-based plugin, that measures sentiment. [1] The goal of the acquisition, which cost Lexalytics less than US$10 million, was to expand the customer base both within the United States and abroad through multilingual support. [1]
Salience, the engine that powers Semantria, is built around deep learning. One example is its concept matrix, which gives Salience an understanding of concepts and the relationships between them, based on a detailed reading of the entire content of Wikipedia. [13] This matrix allows Salience to use Wikipedia for automatic categorization. [14] In addition to features like the concept matrix, Salience supports 16 languages. [15] The engine earned Lexalytics a spot on EContent’s “Top 100 Companies in the Digital Content Industry” list for 2014-2015. [16] In September 2018, Lexalytics entered the document data extraction market, using natural language processing (NLP). [17] [18]
Text mining, also referred to as text data mining and similar to text analytics, is the process of deriving high-quality information from text. It involves "the discovery by computer of new, previously unknown information, by automatically extracting information from different written resources." Written resources may include websites, books, emails, reviews, and articles. High-quality information is typically obtained by devising patterns and trends through means such as statistical pattern learning. According to Hotho et al. (2005), three perspectives on text mining can be distinguished: information extraction, data mining, and a knowledge discovery in databases (KDD) process. Text mining usually involves structuring the input text, deriving patterns within the structured data, and finally evaluating and interpreting the output. 'High quality' in text mining usually refers to some combination of relevance, novelty, and interest. Typical text mining tasks include text categorization, text clustering, concept/entity extraction, production of granular taxonomies, sentiment analysis, document summarization, and entity relation modeling.
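The three stages described above (structuring the input, deriving patterns, and interpreting the output) can be illustrated with a minimal sketch. It is not drawn from any particular product; the stopword list, the sample reviews, and the function names `tokenize` and `document_frequency` are all assumptions made for illustration.

```python
import re
from collections import Counter

# Hypothetical, deliberately tiny stopword list for the example.
STOPWORDS = {"the", "a", "is", "and", "of", "to", "was", "but", "it"}

def tokenize(text):
    """Stage 1: structure raw text into lowercase word tokens, minus stopwords."""
    return [w for w in re.findall(r"[a-z']+", text.lower()) if w not in STOPWORDS]

def document_frequency(docs):
    """Stage 2: derive a simple pattern, the number of documents containing each term."""
    df = Counter()
    for doc in docs:
        df.update(set(tokenize(doc)))  # count each term once per document
    return df

# Made-up sample documents.
docs = [
    "The battery life is great and the screen is sharp.",
    "Battery drains fast, but the screen is great.",
    "Great screen, poor battery.",
]

df = document_frequency(docs)

# Stage 3: interpret the output, here the terms shared by every document.
top_terms = sorted(term for term, n in df.items() if n == len(docs))
print(top_terms)  # ['battery', 'great', 'screen']
```

Real text mining systems replace each stage with something far richer (linguistic parsing, statistical models, human evaluation), but the overall shape of the pipeline is the same.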
R is a programming language and free software environment for statistical computing and graphics supported by the R Core Team and the R Foundation for Statistical Computing. It is widely used among statisticians and data miners for developing statistical software and data analysis. Polls, data mining surveys, and studies of scholarly literature databases show substantial increases in R's popularity; as of August 2021, R ranks 14th in the TIOBE index, a measure of popularity of programming languages.
Unstructured data is information that either does not have a pre-defined data model or is not organized in a pre-defined manner. Unstructured information is typically text-heavy, but may contain data such as dates, numbers, and facts as well. This results in irregularities and ambiguities that make it difficult to understand using traditional programs as compared to data stored in fielded form in databases or annotated in documents.
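As a rough illustration of why such text is harder to process than fielded data, the sketch below pulls dates and currency amounts out of a free-text note using regular expressions. The note, the patterns, and the `extract_fields` helper are hypothetical, and the patterns only cover ISO-style dates and simple dollar amounts.

```python
import re

# Assumed formats for this example only: YYYY-MM-DD dates and $-prefixed amounts.
DATE_RE = re.compile(r"\b\d{4}-\d{2}-\d{2}\b")
AMOUNT_RE = re.compile(r"\$\d+(?:\.\d{2})?")

def extract_fields(text):
    """Return the dates and amounts found in a block of free text."""
    return {
        "dates": DATE_RE.findall(text),
        "amounts": AMOUNT_RE.findall(text),
    }

# Made-up free-text note mixing prose with dates and numbers.
note = "Invoice sent 2021-09-01; customer paid $125.50 on 2021-09-14 after a $10 discount."
fields = extract_fields(note)
print(fields["dates"])    # ['2021-09-01', '2021-09-14']
print(fields["amounts"])  # ['$125.50', '$10']
```

In a database, the same values would already sit in typed columns; here they must be recovered from the text, and any date or amount written in a different style would be missed.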
Sentiment analysis is the use of natural language processing, text analysis, computational linguistics, and biometrics to systematically identify, extract, quantify, and study affective states and subjective information. Sentiment analysis is widely applied to voice of the customer materials such as reviews and survey responses, online and social media, and healthcare materials for applications that range from marketing to customer service to clinical medicine.
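One of the simplest approaches to quantifying sentiment is lexicon-based scoring, sketched below. This is an illustrative toy, not the method used by Salience or any other commercial engine; the word lists, the one-word negation rule, and the `sentiment_score` function are assumptions made for the example.

```python
import re

# Hypothetical miniature sentiment lexicons.
POSITIVE = {"great", "good", "love", "excellent", "helpful"}
NEGATIVE = {"bad", "poor", "hate", "slow", "broken"}
NEGATORS = {"not", "never", "no"}

def sentiment_score(text):
    """Sum word polarities; a negator directly before a word flips its sign."""
    tokens = re.findall(r"[a-z']+", text.lower())
    score = 0
    for i, tok in enumerate(tokens):
        # +1 for a positive word, -1 for a negative word, 0 otherwise.
        polarity = (tok in POSITIVE) - (tok in NEGATIVE)
        if polarity and i > 0 and tokens[i - 1] in NEGATORS:
            polarity = -polarity
        score += polarity
    return score

print(sentiment_score("The support team was great and helpful"))    # 2
print(sentiment_score("The app is not good and the sync is slow"))  # -2
print(sentiment_score("It arrived on Tuesday"))                     # 0
```

A positive total suggests favorable sentiment, a negative total unfavorable, and zero neutral or mixed. Production systems typically go well beyond this, handling sarcasm, domain-specific vocabulary, and sentiment attached to specific entities.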
Oracle Data Mining (ODM) is an option of Oracle Database Enterprise Edition. It contains several data mining and data analysis algorithms for classification, prediction, regression, associations, feature selection, anomaly detection, feature extraction, and specialized analytics. It provides means for the creation, management and operational deployment of data mining models inside the database environment.
IBM Netezza is a subsidiary of American technology company IBM that designs and markets high-performance data warehouse appliances and advanced analytics applications for uses including enterprise data warehousing, business intelligence, predictive analytics and business continuity planning.
Kirix Strata is a discontinued specialty web browser designed for data analytics. Strata offers a browser's ability to view web pages, but also includes additional tools to perform data analysis and create reports based on structured data from local files, external relational databases and the Web.
Lightspeed Venture Partners is a global venture capital firm focusing on multi-stage investments in the enterprise technology, consumer, and health sectors. Over the past two decades, the firm has backed more than 400 companies, including Snapchat, Affirm, MuleSoft, and AppDynamics.
General Sentiment, Inc. was a Long Island-based social media and news media analytics company.
Light Reading Inc. is a telecommunications industry information company based in New York City. Its activities include publishing, data analysis, market research, and events management.
Topsy Labs was a social search and analytics company based in San Francisco, California. The company was a certified Twitter partner and maintained a comprehensive index of tweets, numbering in the hundreds of billions, dating back to Twitter's inception in 2006.
NetOwl is a suite of multilingual text and identity analytics products that analyze big data in the form of text data – reports, web, social media, etc. – as well as structured entity data about people, organizations, places, and things.
Bottlenose.com, also known as Bottlenose, is an enterprise trend intelligence company that analyzes big data and business data to detect trends for brands. It helps Fortune 500 enterprises discover and track emerging trends that affect their brands. The company uses natural language processing, sentiment analysis, statistical algorithms, data mining and machine learning heuristics to determine trends, and has a search engine that gathers information from social networks. KPMG Capital has invested a "substantial amount" in the company.
Semantria is owned by sentiment analysis company Lexalytics, from which it was spun out in 2011. Semantria offers text analysis via an API and an Excel plugin. It differs from Lexalytics in this delivery model, and in that it incorporates a bigger knowledge base and uses deep learning.
Google Cloud Platform (GCP), offered by Google, is a suite of cloud computing services that runs on the same infrastructure that Google uses internally for its end-user products, such as Google Search, Gmail, file storage, and YouTube. Alongside a set of management tools, it provides a series of modular cloud services including computing, data storage, data analytics and machine learning. Registration requires a credit card or bank account details.
Social media analytics is the process of gathering and analyzing data from social networks such as Facebook, Instagram, LinkedIn and Twitter. It is commonly used by marketers to track online conversations about products and companies. One author defined it as "the art and science of extracting valuable hidden insights from vast amounts of semi-structured and unstructured social media data to enable informed and insightful decision making."
Luminoso is a Cambridge, MA-based text analytics and artificial intelligence company that was spun out of the MIT Media Lab and its crowd-sourced Open Mind Common Sense (OMCS) project.
Apache Kylin is an open source distributed analytics engine designed to provide a SQL interface and multi-dimensional analysis (OLAP) on Hadoop and Alluxio supporting extremely large datasets.
ThoughtSpot, Inc. is a technology company that produces business intelligence analytics search software. The company is based in Sunnyvale, California, and was founded in 2012.