FuriosaAI Unveils RNGD, A Leading AI Inference Chip
Tuesday, September 3, 2024
SANTA CLARA, Calif., Aug. 26, 2024 /PRNewswire/ -- FuriosaAI, an emerging leader in the AI semiconductor space, today announced the unveiling of RNGD (pronounced "Renegade"), a leading AI accelerator, at Hot Chips 2024. RNGD is positioned to be the most efficient data center accelerator for high-performance large language model (LLM) and multimodal model inference, disrupting an AI hardware landscape long defined by legacy chipmakers and high-profile startups. Founded in 2017 by three engineers with backgrounds at AMD, Qualcomm, and Samsung, the company has pursued a strategy focused on rapid innovation and product delivery which has resulted in the unveiling and fast development of RNGD.
Furiosa successfully completed the full bring-up of RNGD after receiving the first silicon samples from their partner, TSMC. This achievement reinforces the company's track record of fast and seamless technology development. With their first-generation chip, introduced in 2021, Furiosa submitted their first MLPerf benchmark results within 3 weeks of receiving silicon and achieved a 113% performance increase in the next submission through compiler enhancements.
Early testing of RNGD has revealed promising results with large language models such as GPT-J and Llama 3.1. A single RNGD PCIe card delivers 2,000 to 3,000 tokens per second throughput performance (depending on context length) for models with around 10 billion parameters.
"The launch of RNGD is the result of years of innovation, leading to a one-shot silicon success and exceptionally rapid bring-up process. RNGD is a sustainable and accessible AI computing solution that meets the industry's real-world needs for inference," said June Paik, Co-Founder and CEO of FuriosaAI. "With our hardware now starting to run LLMs at high performance, we're entering an exciting phase of continuous advancement. I am incredibly proud and grateful to the team for their hard work and continuous dedication."
June will present performance benchmarks at Hot Chips today in a presentation titled, "Furiosa RNGD: A Tensor Contraction Processor for Sustainable AI Computing" which further underscores RNGD's exceptional capabilities, leaving industry experts eagerly anticipating what comes next. He will offer a first hands-on look at the fully functioning RNGD card along with a live demo at the Furiosa booth.
RNGD's key innovations include:
-- A non-matmul, Tensor Contraction Processor (TCP) based architecture that
enables a perfect balance of efficiency, programmability and
performance.
-- Programmability through a robust compiler co-designed to be optimized
for TCP that treats entire models as single-fused operations.
-- Efficiency, with a TDP of 150W compared to 1000W+ for leading GPUs
-- High-performance, with 48GB of HBM3 memory delivering the ability to run
models like Llama 3.1 8B efficiently on a single card.
What our industry partners have to say:
"The Furiosa RNGD AI Inference solution drives the adoption of green computing with Supermicro. By integrating Furiosa's technology, Supermicro systems can reduce power consumption per card while still delivering exceptional inference performance," said Vik Malyala, SVP, Technology and AI; President and Managing Director, EMEA of Supermicro.
"The collaboration between GUC and FuriosaAI to deliver RNGD with exceptional performance and power efficiency hinges on meticulous planning and execution. Achieving this requires a deep understanding of modern AI software and hardware. FuriosaAI has consistently demonstrated excellence from design to delivery, creating the most efficient AI inference chips in the industry," said Aditya Raina, CMO of GUC.
The chip is currently sampling to early access customers, with broader availability expected in early 2025.
For more details on RNGD's architecture and capabilities, please visit FuriosaAI's blog.
About FuriosaAI
FuriosaAI is a semiconductor company dedicated to creating sustainable AI computing solutions that make powerful AI accessible to all. With its innovative Tensor Contraction Processor architecture, FuriosaAI is revolutionizing the AI hardware landscape, offering unparalleled efficiency and programmability for the most demanding AI workloads. For more information, please visit https://furiosa.ai/.
View original content to download multimedia:https://www.prnewswire.com/news-releases/furiosaai-unveils-rngd-a-leading-ai-inference-chip-302230196.html
SOURCE FuriosaAI
|
|
|
|
|
 |
Energy Toolbase Launches Energy Storage Partnership with Sungrow to Support PowerStack 255CS and PowerTitan 2.0 | Jan 22, 2026
|
 |
RS now offers Phoenix Contact's pioneering new NearFi technology | Jan 22, 2026
|
 |
Quantum Art Raises $100 Million in Series A Round to Drive Scalable, Multi-Core Quantum Computing | Jan 22, 2026
|
 |
MetaOptics to Showcase Five Breakthrough Metalens-Powered Products at CES 2026 | Jan 22, 2026
|
 |
Fresco Raises EUR15m Series C to Power the Future of AI-Driven Cooking and the Connected Kitchen Ecosystem | Jan 22, 2026
|
 |
No Assembly Required: Barrett Distribution Centers Powers Maxwood Furniture's West Coast DTC Expansion | Jan 22, 2026
|
 |
SCAILIUM Debuts "AI Production Layer" to Overcome GPU Starvation and Slash AI Energy Waste | Jan 22, 2026
|
 |
Einride and IonQ Partnership Uses Quantum Computing to Optimize the Logistics of Electric and Autonomous Freight | Jan 22, 2026
|
 |
Hesai Recognized as the Only Lidar Company on Morgan Stanley's "Humanoid Tech 25" of Global Robotics Leaders | Jan 22, 2026
|
 |
Ekinops New C700HC Chassis Efficiently Connects the Data Center and the Central Office | Jan 22, 2026
|
|
|