Sabrina Ortiz/ZDNET via HART

Since the release of DALL-E in 2021, the first AI image-generating model to popularize the tech, AI text-to-image generators have made much progress in quality, speed, and prompt adherence. However, even the fastest image generators typically take a couple of seconds to create an image. This one is the exception.

HART, short for Hybrid Autoregressive Transformer, is an AI text-to-image generator developed by MIT, Nvidia, and Tsinghua University. It generates images with 3.1 to 5.9 times lower latency than state-of-the-art diffusion models.

The key difference? The kind of model HART is. Without getting too technical, most popular AI image generators, including OpenAI’s DALL-E and Google’s Imagen 3, are diffusion models. HART is instead an autoregressive (AR) visual generation model, the same type as OpenAI’s recently released GPT-4o image generator.