AI Stack
Capabilities and model layers highlighted by the company.
- LPU
- GroqCloud

AI inference chip company founded in 2016 by former Google TPU engineers, pioneering the Language Processing Unit (LPU) architecture for ultra fast AI inference. In December 2025, Nvidia agreed to acquire Groq's assets for approximately $20 billion.
Groq designs Language Processing Units (LPUs), custom AI chips optimized for ultra-fast, low-latency inference. Founded in 2016 by Jonathan Ross, inventor of Google's TPU. In December 2025, NVIDIA announced a deal reportedly worth ~$20 billion to license Groq's inference technology.
Platform maturity, autonomy stack, and flagship-system specifications in one view.
Capabilities and model layers highlighted by the company.
In December 2025, NVIDIA announced a ~$20 billion deal to license Groq's inference technology, validating that inference-optimized hardware represents an essential category in AI computing.
No news articles mentioning Groq yet.
Follow Groq's valuation growth through its funding rounds.
Groq designs Language Processing Units (LPUs), custom AI chips optimized for ultra-fast, low-latency inference. Founded in 2016 by Jonathan Ross, inventor of Google's TPU. In December 2025, NVIDIA announced a deal reportedly worth ~$20 billion to license Groq's inference technology.
Groq was founded in 2016 by Jonathan Ross, who had created Google's TPU as a side project. The company designed a novel statically-scheduled architecture delivering predictable, ultra-low latency performance.
Funding included $300 million Series C in 2021, $640 million Series D in August 2024 at $2.8 billion, and $750 million Series E in September 2025 at $6.9 billion. Total raised: ~$1.75 billion.
Groq gained attention in early 2024 with cloud API speeds of 500+ tokens/second, far outpacing GPU alternatives. The speed difference went viral among AI developers.
In December 2025, NVIDIA announced a ~$20 billion deal to license Groq's inference technology, validating that inference-optimized hardware represents an essential category in AI computing.