Products/AI & Machine Learning/General Compute

General Compute

AI models that run on an inference cloud optimized for speed Discussion | ...

AI & Machine Learning

Our Take

GPUs were built for graphics. General Compute was built for inference.

Look, every cloud provider out there is running your AI workloads on repurposed gaming hardware—chips designed to render Call of Duty, now awkwardly shoved into service as inference machines. General Compute said "nah." They built purpose-built ASICs from scratch, specifically engineered for one thing: fastest possible inference. Their benchmarks show 950 tokens per second on the MiniMax M2.5 model versus roughly 100 on standard NVIDIA cloud hardware. That's not a incremental improvement—that's a 10x leap. And they're claiming 7x faster inference overall.

But here's what actually matters for your bill: efficiency. Their racks pull 17 kilowatts versus 120 for equivalent GPU setups. Their energy cost is $0.035 per kWh versus the $0.13 US commercial average. That's 4x less power, and in inference where you're paying per token, thatCompound interest hits hard. They're offering $100 in free credit to test drive, and they just hit #3 Product of the Day on Product Hunt. The message is simple—GPUs carry 70 years of legacy architecture designed for rendering pixels. General Compute skipped all that. Build different, price different.

generalcompute.com →