| | |
| Company type | Private |
|---|---|
| Industry | |
| Founded | 2016 |
| Founders |
|
| Headquarters | , US |
Key people | Jonathan Ross (CEO), Sunny Madra (COO), Andrew S. Rappaport (Board Member), Chamath Palihapitiya (Investor), John Yetimoglu (Board Member/Investor) |
| Products | Language Processing Unit (LPU) |
| Revenue | US$3.2 million (2023) [1] |
| US$−88 million (2023) [1] | |
Number of employees | 250 (2023) |
| Website | groq |
Groq, Inc. is an American artificial intelligence (AI) company that builds an AI accelerator application-specific integrated circuit (ASIC). The architecture was originally introduced as a Tensor Streaming Processor (TSP) but was later rebranded as a Language Processing Unit (LPU) following the widespread adoption of large language models after the breakthrough of ChatGPT. The company also develops related computer hardware and software to accelerate AI inference performance.
Examples of the types of AI workloads that run on Groq's LPU are: large language models (LLMs), [2] [3] image classification, [4] and predictive analysis. [5] [6]
Groq is headquartered in Mountain View, CA, and has offices in San Jose, CA, Liberty Lake, WA, Toronto, Canada, London, U.K. and remote employees throughout North America and Europe.
In December 2025, Nvidia and Groq announced an agreement reportedly valued at approximately US$20 billion to license Groq's AI inference technology and to transfer several senior Groq executives to Nvidia. [7] [8] Groq stated that it would continue to operate as an independent company. [9]
Groq was founded in 2016 by a group of former Google engineers, led by Jonathan Ross, one of the designers of the Tensor Processing Unit (TPU), an AI accelerator ASIC, and Douglas Wightman, an entrepreneur and former engineer at Google X (known as X Development), who served as the company’s first CEO. [10] [1]
Groq received seed funding from Social Capital's Chamath Palihapitiya, with a $10 million investment in 2017 [11] and soon after secured additional funding.
In April 2021, Groq raised $300 million in a series C round led by Tiger Global Management and D1 Capital Partners. [12] Current investors include: The Spruce House Partnership, Addition, GCM Grosvenor, Xⁿ, Firebolt Ventures, General Global Capital, and Tru Arrow Partners, as well as follow-on investments from TDK Ventures, XTX Ventures, Boardman Bay Capital Management, and Infinitum Partners. [13] After Groq’s series C funding round, it was valued at over $1 billion, making the startup a unicorn. [14]
On March 1, 2022, Groq acquired Maxeler Technologies, a company known for its dataflow systems technologies. [15]
On August 16, 2023, Groq selected Samsung Electronics' foundry in Taylor, Texas to manufacture its next-generation chips, on Samsung's 4-nanometer (nm) process node. This was the first order at this new Samsung chip factory. [16]
On February 19, 2024, Groq soft-launched a developer platform, GroqCloud, to attract developers into using the Groq API and rent access to their chips. [17] [1] On March 1, 2024 Groq acquired Definitive Intelligence, a startup known for offering a range of business-oriented AI solutions, to help with its cloud platform. [18]
Groq raised $640 million in a series D round led by BlackRock Private Equity Partners in August 2024, valuing the company at $2.8 billion. [1] [19]
On February 10, 2025, Groq announced that it had secured a US$1.5 billion commitment from the Kingdom of Saudi Arabia to expand delivery of its LPU-based AI inference infrastructure, tied to a new GroqCloud data center in Dammam, Saudi Arabia. [20] [21]
As of 2025, Groq has established a dozen data centers across the U.S., Canada, the Middle East, and Europe, with its technology. [22]
In December 2025, Nvidia agreed to purchase assets from Groq for approximately US$20 billion, which is a record for Nvidia. [23] Groq has described this as a non-exclusive licensing deal. [9] As part of the deal, Groq founder Ross and Groq president Sunny Madra would join Nvidia. [7] [8]
Groq's initial name for their ASIC was the Tensor Streaming Processor (TSP), but later rebranded the TSP as the Language Processing Unit (LPU). [2] [24] [25]
The LPU features a functionally sliced microarchitecture, where memory units are interleaved with vector and matrix computation units. [26] This design facilitates the exploitation of dataflow locality in AI compute graphs, improving execution performance and efficiency. The LPU was designed off on two key observations:
In addition to its functionally sliced microarchitecture, the LPU can also be characterized by its single-core, deterministic architecture. [26] [27] The LPU can achieve deterministic execution by avoiding the use of traditional reactive hardware components (branch predictors, arbiters, reordering buffers, caches) [26] and by having all execution explicitly controlled by the compiler thereby guaranteeing determinism in execution of an LPU program. [26]
The first generation of the LPU (TSP) yields a computational density of more than 1TeraOp/s per square mm of silicon for its 25×29 mm 14nm chip operating at a nominal clock frequency of 900 MHz. [26] The second generation of the LPU (LPU v2) will be manufactured on Samsung's 4nm process node. [16]
Groq hosts open-source large language models running on its LPUs for public access. [28] Access to these demos is available through Groq's website.