AI Accelerator Groq(TM) Adapts and Runs LLaMA, the Meta(TM) Chatbot Model and Competitor to ChatGPT, for Its Systems
Sunday, April 16, 2023
MOUNTAIN VIEW, Calif., March 15, 2023 /PRNewswire/ -- Groq, a leading artificial intelligence (AI) and machine learning (ML) systems innovator, last week announced it adapted a new large language model (LLM), LLaMA-chatbot technology from Meta and a proposed alternative to ChatGPT-to run on its systems.
Facebook® parent, Meta, released LLaMA, which can be used by chatbots to generate human-like text, on February 24th. Three days later the Groq team downloaded the model and within a few days had it running on a production GroqNode(TM) server, including eight GroqChip(TM) inference processors. This is a rapid time-to-functionality; a development task that can often take a larger team of engineers weeks to months to complete, while Groq executed with just a small group from its compiler team.
Jonathan Ross, CEO and founder of Groq said, "This speed of development at Groq validates that our generalizable compiler and software-defined hardware approach is keeping up with the accelerating pace of LLM innovation-something traditional kernel-based approaches struggle with."
The rapid LLaMA bring-up by Groq is a particularly unique and noteworthy milestone because Meta researchers originally developed LLaMA for NVIDIA(TM) chips. With Groq engineers successfully running a cutting-edge model on its technology, they demonstrated GroqChip as a ready-to-use alternative to incumbent technology. Generative AI is carving out a place for itself in the market, and as transformers continue to advance the pace of LLM development, customers will need solutions that provide tangible time-to-production advantages, reducing developer complexity for fast iteration.
Bill Xing, Tech Lead Manager, ML Compiler at Groq said, "The complexity of computing platforms is permeating into user code and slowing down innovation. Groq is reversing this trend. Since we're working on models that were trained on Nvidia GPUs, the first step of porting customer workloads to Groq is removing non-portable, vendor-specific code targeted for specific vendors and architectures. This might include replacing vendor-specific code calling kernels, removing manual parallelism or memory semantics, etc. The resulting code ends up looking a lot simpler and more elegant. Imagine not having to do all that 'performance engineering' in the first place to achieve stellar performance! This also helps by not locking a business down to a specific vendor."
If you would like to discuss your AI strategy and solutions with a technology expert at Groq, please reach out to contact@groq.com. For press inquiries about this story or Groq technology please contact pr-media@groq.com.
About Groq Groq is a technology company delivering ultra-low latency performance and record-breaking inference results for the next era of compute in AI, ML, and HPC. Read our latest customer news in cybersecurity, pharma, and finance. For more information, visit www.groq.com.
Groq, the Groq logo, and other Groq marks are trademarks of Groq, Inc. Other names and brands may be claimed as the property of others. Reference to specific trade names, trademarks or otherwise, does not necessarily constitute or imply its endorsement or recommendation by Groq.
Copyright © 2023 Groq Inc. All rights reserved.
View original content to download multimedia:https://www.prnewswire.com/news-releases/ai-accelerator-groq-adapts-and-runs-llama-the-meta-chatbot-model-and-competitor-to-chatgpt-for-its-systems-301772554.html
SOURCE Groq
|
|
|
|
|
 |
Energy Toolbase Launches Energy Storage Partnership with Sungrow to Support PowerStack 255CS and PowerTitan 2.0 | Jan 22, 2026
|
 |
Einride and IonQ Partnership Uses Quantum Computing to Optimize the Logistics of Electric and Autonomous Freight | Jan 22, 2026
|
 |
RS now offers Phoenix Contact's pioneering new NearFi technology | Jan 22, 2026
|
 |
SCAILIUM Debuts "AI Production Layer" to Overcome GPU Starvation and Slash AI Energy Waste | Jan 22, 2026
|
 |
MetaOptics to Showcase Five Breakthrough Metalens-Powered Products at CES 2026 | Jan 22, 2026
|
 |
No Assembly Required: Barrett Distribution Centers Powers Maxwood Furniture's West Coast DTC Expansion | Jan 22, 2026
|
 |
Quantum Art Raises $100 Million in Series A Round to Drive Scalable, Multi-Core Quantum Computing | Jan 22, 2026
|
 |
Hesai Recognized as the Only Lidar Company on Morgan Stanley's "Humanoid Tech 25" of Global Robotics Leaders | Jan 22, 2026
|
 |
TESSAN to Redefine Global Mobility at CES 2026 with '100 Travelers' Initiative and Flagship Voyager 205 | Jan 22, 2026
|
 |
1inch Named Exclusive Swap Provider at Launch for Ledger Multisig | Jan 22, 2026
|
|
|