Groq

Name: Groq
Availability: OnlineOnly

codeFreemium

Groq provides an API for running open-source LLMs (Llama, Mistral, Gemma) at dramatically faster speeds than traditional GPU inference using their custom LPU hardware. Free tier available. The go-to when response speed matters in an AI application.

Best for

Developers who need the fastest possible LLM inference for latency-sensitive applications.