Microsoft may have an audio-to-image generator in the works, new patent shows

by Eliana Willis 14 October 2024, 19:29 169 Views

spfdigital/Getty Images

There are currently many artificial intelligence (AI) tools on the market that can take users’ text and images and transform them into images and videos that match the initial prompt. A new patent reveals that audio may soon be an input option to bring your visions to real life.

As spotted by MSPowerUser, the US Patent and Trademark Office (USPTO) posted a 20-page document filed by Microsoft on April 5, 2023, and published on October 10, 2024, that details a new AI-supported system that converts live audio into images.

Also: Adobe’s free AI video generator is here – how to try it out

This system would take an audio live stream, such as that from a meeting or lecture, and convert it into a live text transcript. The transcript would then be summarized by a large language model (LLM) and fed into a text-to-image model, where an image would be generated and output on the screen, as seen in the image below.

This system would continue to do this during the audio stream, continuously generating live images. According to Microsoft, displaying images in real-time can help make communication more effective, with visual aids keeping people more engaged and making concepts easier to understand.

<!–>

“Displaying images related to verbally communicated information can enhance the effectiveness of communication by making it more engaging, memorable, and easier to understand,” said Microsoft.

Also: The best AI chatbots of 2024: ChatGPT, Copilot, and worthy alternatives

If you’re wondering whether the feature will launch soon, the answer is most likely no. Filing a patent is a long journey between producing a product or feature, and many patents never make it into the production phase and remain an idea.

However, if Microsoft does decide to launch this feature, it would likely live in Microsoft Teams, its video conferencing meeting platform, and be accessible through its AI add-on, Copilot, such as Copilot Pro or Microsoft 365 Copilot for businesses.

Artificial Intelligence

–>

Source: Robotics - zdnet.com

Buy an Echo Dot (5th gen) with clock and get a free smart bulb

One of the best productivity laptops I’ve tested is not a ThinkPad or MacBook (and it’s on sale)

This portal is not a newspaper as it is updated without periodicity. It cannot be considered an editorial product pursuant to law n. 62 of 7.03.2001. The author of the portal is not responsible for the content of comments to posts, the content of the linked sites. Some texts or images included in this portal are taken from the internet and, therefore, considered to be in the public domain; if their publication is violated, the copyright will be promptly communicated via e-mail. They will be immediately removed.

Microsoft may have an audio-to-image generator in the works, new patent shows

Artificial Intelligence

Buy an Echo Dot (5th gen) with clock and get a free smart bulb

One of the best productivity laptops I’ve tested is not a ThinkPad or MacBook (and it’s on sale)

ITALIAN LANGUAGE

ENGLISH LANGUAGE