More stories

  • in

    I tested 3 text-to-speech AI models to see which is best – hear my results

    Elyse Betters Picaro / ZDNETZDNET’s key takeawaysThere are several AI tools available that can generate humanlike speech.Some AI voices can whisper, laugh, and perform other expressive feats.TTS tools vary in terms of level of realism and their intended audiences.Synthetic voices generated by artificial intelligence are, for better or worse, becoming commonplace. Meanwhile, the number of companies developing this technology is growing rapidly.Recent innovations in AI, such as the transformer architecture — which forms the backbone of many generative AI tools, including large language models, generative adversarial networks (GANs), and diffusion models — have led to the rise of AI systems that can convert text prompts into natural-sounding artificial speech. There are now a wide variety of these text-to-speech (TTS) systems available, each with its particular benefits and shortcomings. To gain a clearer sense of which are the most advanced, I tested three of the most popular free TTS tools currently on the market. ElevenLabs [embedded content]ElevenLabs is widely considered an industry leader in voice realism, and I found this to be a reasonably accurate assessment in my own experiments with the company’s TTS tool. But that realism feels more closely aligned with the voice of a trained voice actor or professional podcaster than it does with ordinary human conversation — it’s almost a little too polished. In that sense, however, it tends to be the preferred choice for many businesses and professionals looking for reliable automated narration. It also supports more than 20 languages, further expanding the platform’s reach and appeal. The company also released a new text-to-speech model called v3 as a research preview last month. It supports more than 70 languages, and users can spice up their AI-generated dialogue with audio tags that cause it to laugh, sigh, or speak in a whisper, to name just a few examples. Also: ElevenLabs’ new AI voice assistant can automate your favorite tasks — and you can try it for freeYou can sign up for a free account with ElevenLabs, and you’ll automatically receive 10,000 free credits. Select the “Text to Speech” option under “Playground” in the left-hand menu, and you’ll be redirected to a page where you can enter a custom prompt you’d like the AI system to narrate, select from a range of custom voices, and adjust parameters like speed and stability. Prompts are limited to 5,000 characters, and every character in each iteration of a voice generation uses a single credit. Hume AI [embedded content]Hume AI’s TTS model is another contender for the most realistic voice-generating tool. The company has positioned its proprietary Empathic Voice Interface (EVI) as an AI system that can capture and simulate the subtleties of human speech, imbuing it with a deeper layer of believability. Like ElevenLabs, Hume offers a broad set of premade AI voice characters, each with its own expressive quirks. You can also generate custom voices by describing them in natural-language prompts. To test it out, I did my best to describe the voice of Samwise Gamgee from “The Lord of the Rings,” as portrayed in the films by Sean Astin. My prompt: “Gentle but brave hobbit, with a working-class, West Country British — possibly with a hint of Welsh — accent. He should sound frightened but resolved to complete his mission.” Also: This new text-to-speech AI model understands what it’s saying – how to try it for freeAfter I prompted it to say a famous line from the film, “If I take one more step, it’ll be the furthest away from home I’ve ever been,” it produced three samples, varying in tone and emphasis. All of them were impressive; to my ear, they contained a degree of realism and emotional depth that isn’t replicable by its competitors. They didn’t sound much like Astin’s Sam, but that was undoubtedly a reflection of the admittedly imperfect description I used as a prompt. You can also pepper pauses by adding “[pause]” into your prompt, or add slangy infusions like “y’all” to enhance the believability of your custom voices. Descript [embedded content]If you’re looking for an AI voice-generating tool that offers a range of editing features, Descript is the one to choose. The company’s TTS model generates audio files in a waveform format, which you can edit just as you would in Adobe Audition or a similar platform. You can choose from a library of premade AI voices or submit a short recording of your own voice, and the system will clone it for you. I tested the voice-cloning feature by asking the system to read a short prompt: “Summers in New York City are getting brutal, and I need to invest in more high-quality air conditioning.” (Which is true.) The first time around, the AI-generated version of my voice definitely sounded like me, but there was also a mechanical quality that detracted from the realism. I decided to give it another try and re-record my voice, this time taking off my Bluetooth headphones and reading the script more slowly and deliberately. The results this time were much more realistic — a more convincing simulation of my voice, in my opinion, than a similar voice-cloning feature offered by Hume. Also: I spoke with an AI version of myself, thanks to Hume’s free tool – how to try itYou can also adjust each piece of AI-generated audio by directly editing your written prompt. It wasn’t perfect, of course; my close friends and family members would probably be able to spot the difference, but it would likely fool my more distant acquaintances. I can easily imagine using the tool to narrate my own articles or for some similar use case. For podcasters and other content creators looking to quickly polish their audio recordings, Descript also offers an AI feature that identifies and eliminates filler words, unnecessary pauses, “umms” and “uhhs,” and other unwanted bits of audio. More

  • in

    ChatGPT can no longer tell you to break up with your boyfriend

    Elyse Betters Picaro/ZDNETZDNET’s key takeawaysOpenAI adds reminders to take a break. ChatGPT will also have improved functions for mental health support.The company is working with experts, including physicians and researchers. As OpenAI prepares to drop one of the biggest ChatGPT launches of the year, the company is also taking steps to make the chatbot safer and more reliable with its latest update. Also: Could Apple create an AI search engine to rival Gemini and ChatGPT? Here’s how it could succeedOn Monday, OpenAI published a blog post outlining how the company has updated or is updating the chatbot to be more helpful, providing you with better responses in times when you need support, or encouraging a break when you use it too much:We build ChatGPT to help you thrive in the ways you choose — not to hold your attention, but to help you use it well. We’re improving support for tough moments, have rolled out break reminders, and are developing better life advice, all guided by expert input.…— OpenAI (@OpenAI) August 4, 2025

    New get off ChatGPT nudge If you have ever tinkered with ChatGPT, you are likely familiar with the feeling of getting lost in the conversation. Its responses are so amusing and conversational that it is easy to keep the back-and-forth volley going. This is especially true for fun tasks, such as creating an image and then modifying it to generate different renditions that meet your exact needs.  More

  • in

    Got a new PC? 5 apps to install first (and how they’ll improve your workflow)

    Windows 11 comes with its own screen capture app, the Snipping Tool. It’s perfectly serviceable for its purpose, but ShareX is better in just about every way.Like the Snipping Tool, ShareX supports various image capture methods. You can capture the entire screen, a specific window, or a certain region among other areas, as well as record videos. But what makes ShareX so much better is its powerful editing tools. With this set, the app is almost like having a mini-Photoshop at your disposal.Download: ShareXYou can add shapes, text, crop areas, slap on emoji stickers, and blur or pixelate key areas of a screenshot. Once you’ve finished, you can upload your finished content onto a variety of third-party platforms, like Google Drive and Imgur, directly. I remember using ShareX for the first time back in 2021, and it vastly improved my workflow.  More

  • in

    People are using ChatGPT to write their text messages – here’s how you can tell

    Kirill Stytsenko/Getty Images ZDNET’s key takeaways People are using AI to write sensitive messages to loved ones. Detecting AI-generated text is becoming more difficult as chatbots evolve. Some tech leaders have promoted this use of AI in their marketing strategies. Everyone loves receiving a handwritten letter, but those take time, patience, effort, and sometimes multiple […] More

  • in

    You can use T-Mobile’s Starlink service to send images, audio, and video now – here’s how

    T-Mobile / Elyse Betters Picaro / ZDNETZDNET’s key takeawaysT-Satellite now supports MMS on select Android devices.Send images, voice notes, and short videos via satellite.Satellite data support for optimized apps is coming October 2025.T-Mobile’s satellite service, T-Satellite, has quietly upgraded into a more capable off-grid communication tool. Originally meant to handle basic SMS, the service now supports full MMS messaging on select Android devices. Send images, voice notes, and video Your phone should automatically switch to the satellite link when it loses cell service. You’ll see the T-Mobile and SpaceX logo, which indicate a solid connection, and from there you can start texting, sending images, and recording voice notes. Our Ziff Davis sister site PCMag put MMS over T-Satellite to the test and reported that you need a clear view of the sky and shouldn’t expect tower-like speeds. Still, you can send images, audio, and even short video files in a couple of minutes. Also: How to use your iPhone’s emergency satellite features if you lose cell coverageThe new capability apparently works on compatible Samsung and Motorola handsets to start — just open your default Messages app on a supported phone, choose attach picture or attach file, and then send your JPG, GIF, voice clip, or video. Longer videos can take upward of 20 minutes to transmit, but the battery impact on your device appears modest. Satellite data for apps is coming Beyond MMS, T-Mobile has teased a broader “satellite data” feature for optimized apps. Coming in October 2025, third-party apps such as WhatsApp, AccuWeather, and X will reportedly be able to send data over Starlink. Also: T-Mobile will give you a free Samsung Galaxy S25 Plus when you trade in almost any phone – here’s how”We are also rolling out picture and voice messaging to select devices and will continue adding more in the coming months,” T-Mobile’s website reads. “Next up, we’ll deliver satellite data for optimized apps, and will continue to add more features.” More

  • in

    Anthropic wants to stop AI models from turning evil – here’s how

    Lyudmila Lucienne/Getty ZDNET’s key takeaways New research from Anthropic identifies model characteristics, called persona vectors.  This helps catch bad behavior without impacting performance. Still, developers don’t know enough about why models hallucinate and behave in evil ways.  Why do models hallucinate, make violent suggestions, or overly agree with users? Generally, researchers don’t really know. But […] More

  • in

    This Linux distro makes Slackware easier than ever

    Jack Wallen / Elyse Betters Picaro / ZDNETZDNET’s key takeawaysSlackel is a user-friendly take on the otherwise challenging Slackware.There are four different desktop variations to choose from.Slackel is a great distribution for learning Linux.Slackware has been around since 1993 and has served as the foundation for many Linux distributions, such as the early iterations of SUSE. What sets Slackware apart from other Linux distributions is that it tries to be more UNIX-like than not. Slackware makes as few modifications as possible to upstream software packages, does not anticipate use cases, and provides no GUI installer.It’s that lack of a GUI installer that keeps some users from giving Slackware a try. That’s a shame, because Slackware is a remarkably stable operating system. Fortunately, there are always distributions based on Slackware, one of which is called Slackel.Slackel borrows some tools from Salix (another distribution based on Slackware) and has a user-friendly GUI installer. Slackel can be installed to an internal drive, to external media (such as USB drives), or run as a live instance. In other words, Slackel is the most Linux-like of the UNIX-like distributions. Make sense?Also: Want to save your old computer? Try one of these 8 Linux distros for freeThink of it this way: Slackel is to Slackware what Ubuntu is to Linux. This distribution aims to make what might otherwise be challenging much less so, and it succeeds.First off, the Slackel installer makes installing a Slackware-based distribution much easier. It’s a point-and-click affair, but do know this: you have to take a few extra steps, as Slackel requires you to prepare your drive first. The preparation is done outside of the installer, but Slackel does include the GParted tool, which makes this easy. You’ll first have to create an MS-DOS partition table, and then create your ext4 partition that will be used for the installation. Outside of that, the installation is as easy as any Linux installer.I hope I haven’t scared you off yet.  More

  • in

    Gemini adds powerful new Deep Think model – what it does and who can try it

    Google / Elyse Betters Picaro / ZDNETTwo weeks ago, Google and OpenAI touted their models’ award-winning performance at the International Math Olympiad (IMO). Now, Google is making a version of its model available to the public. Also: This one feature could make GPT-5 a true game changer (if OpenAI gets it right)On Friday, Google launched Deep Think in the Gemini app for Google Ultra subscribers, a premium subscription tier that costs $250 per year or $125 for the first three months. Although the model is a variation of the one that achieved the gold-medal standard at IMO, it is faster for everyday tasks. Internal evaluations suggested the model reaches bronze-level on the 2025 IMO benchmark. More