In testing with an RTX 5090, DiffusionGemma spits out around 700 tokens per second. With a single Nvidia H100 AI accelerator, DiffusionGemma can produce 1,000+ tokens per second.
文章提供了具体的性能测试数据,声称DiffusionGemma在RTX 5090上达到700 tokens/秒,在H100上达到1000+ tokens/秒。这些关键性能数据需要独立验证,以确认Google宣称的4倍速度提升是否准确。
