WebsiteGear Logo Log In
New User? Sign Up
About | Contact | FAQ
  Home News Website Related Website Development Saturday, March 7, 2026 
Add Press Release News | News Feeds Feeds | Email This News Email


Llama 4 Live Day-Zero on Groq at Lowest Cost
Friday, April 11, 2025

MOUNTAIN VIEW, Calif., April 5, 2025 /PRNewswire/ -- Groq, the pioneer in AI inference, has launched Meta's Llama 4 Scout and Maverick models, now live on GroqCloud(TM). Developers and enterprises get day-zero access to the most advanced open-source AI models available.

That speed is possible because Groq controls the full stack--from our custom-built LPU to our vertically integrated cloud. The result: models go live with no delay, no tuning, and no bottlenecks--and run at the lowest cost per token in the industry, with full performance.

"We built Groq to drive the cost of compute to zero," said Jonathan Ross, CEO and Founder of Groq. "Our chips are designed for inference, which means developers can run models like Llama 4 faster, cheaper, and without compromise."

Lowest Cost Per Token -- Without Compromise

With Llama 4 models live, developers can run cutting-edge multimodal workloads while keeping costs low and latency predictable.

    --  Llama 4 Scout: $0.11 / M input tokens and $0.34 / M output tokens, at a
        blended rate of $0.13
    --  Llama 4 Maverick: $0.50 / M input tokens and $0.77 / M output tokens, at
        a blended rate of $0.53

See Groq pricing here.

About the Models

Llama 4 is Meta's latest open-source model family, featuring Mixture of Experts (MoE) architecture and native multimodality.

    --  Llama 4 Scout (17Bx16E): A strong general-purpose model, ideal for
        summarization, reasoning, and code. Runs at over 460 tokens per second
        on Groq.
    --  Llama 4 Maverick (17Bx128E): A larger, more capable model optimized for
        multilingual and multimodal tasks--great for assistants, chat, and
        creative applications.

Build Fast with Llama 4 on GroqCloud

Llama 4 Scout and Maverick are accessible through:

    --  GroqChat
    --  GroqCloud Developer Console
    --  Groq API (model IDs available in-console)

Start building today at console.groq.com.

Free access is available, or upgrade for worry-free rate limits and higher throughput.

About Groq
Groq is the AI inference platform delivering low cost, high performance without compromise. Its custom LPU and cloud infrastructure run today's most powerful open-source models instantly and reliably.

Over 1 million developers use Groq to build fast and scale with confidence.

Groq Media Contact: pr-media@groq.com

View original content to download multimedia:https://www.prnewswire.com/news-releases/llama-4-live-day-zero-on-groq-at-lowest-cost-302421438.html

SOURCE Groq



Email This News Email | Submit To Slashdot Slashdot | Submit To Digg.com Digg | Submit To del.icio.us Del.icio.us | News Feeds Feeds

RELATED NEWS ARTICLES
Nav Asetek - Mandatory Notification of Trade | Jan 22, 2026
Nav Tomorrowland Brings the Magic to Shanghai for a Spectacular First Indoor Edition in China | Jan 22, 2026
Nav BC.GAME to Host "Stay Untamed" Night During Abu Dhabi's Packed Web3 Summit Week | Jan 22, 2026
Nav Rent Manager Earned Best Real Estate Software Product Award and Multiple Review Badges from G2 Platform | Jan 22, 2026
Nav Auburn University's Applied Research Institute Expands Advanced Manufacturing Capabilities with CF3D Enterprise Cell | Jan 22, 2026
Nav AMPERA ANNOUNCES LOCATION FOR GLOBAL HEADQUARTERS | Jan 22, 2026
Nav Gemmy Alerts Customers: Fake Websites Target Holiday Decorators | Jan 22, 2026
Nav Culture and tourism sectors thrive in Xiamen | Jan 22, 2026
Nav California Divorce Mediation Center Unveils Modern Website Redesign | Jan 22, 2026
Nav AMPLIFY Named Finalist in Three Categories at the 2026 Golden Gavel Awards | Jan 22, 2026
NEWS SEARCH

FEATURED NEWS | POPULAR NEWS
Submit News | View More News View More News